Adam Sampson's statistics bookmarks

PyTables - Getting the most *out* of your data

"PyTables is a package for managing hierarchical datasets and designed to efficiently and easily cope with extremely large amounts of data." Might be overkill for my temperature sensors!

to data database python statistics ... on 28 April 2014

STABILIZER: statistically sound performance evaluation

Neat trick: this uses some LLVM instrumentation to shuffle memory layout around in a program while it's running, to randomise the effects of layout on performance. As a result of the central limit theorem, this tends to normalise the distribution of timing errors too (provided your program runs long enough to have been thoroughly shuffled).

to benchmarking compiler llvm performance research statistics timing ... on 01 April 2014

Sixteen is not magic: Comment on Friston (2012) | [citation needed]

Review of "Ten ironic rules for non-statistical reviewers". Read the original paper first, since it's got some good points -- particularly on exactly what the limitations on normality are, and why you need to be careful about very large studies -- but it probably overstates its case a bit, as this review suggests.

to hypothesis normality research statistics testing ... on 01 April 2014

Welcome to the Evaluate Collaboratory! | Evaluate Collaboratory

Tomas and Richard are involved in this project for empirical measurement in CS. Their position paper would be sensible reading for students; it explains some of the common pitfalls of performance measurement.

to ag0803 benchmarking cs empirical performance research statistics ... on 26 March 2014

Publication: Quantifying Performance Changes with Effect Size Confidence Intervals - School of Computing - University of Kent

Tech report with more details of the statistics behind Tomas/Richard's approach. In particular, this describes how to do the same thing in either parametric or non-parametric ways, and gives some description of how badly the parametric approach performs when the underlying data isn't normally distributed (not very badly, as it turns out).

to benchmarking confidence effect-size non-parametric performance statistics ... on 26 March 2014

Rigorous Benchmarking in Reasonable Time - Kent Academic Repository

Tomas Kalibera and Richard Jones' paper on how to do benchmarking that's actually meaningful -- presenting results as confidence intervals for effect sizes, with techniques to establish i.i.d. results and work out how many repetitions you need to do. Very nice work for a pretty short paper! (I've spent most of today chasing references from this in the interests of understanding the maths behind it...)

to benchmarking compiler confidence effect-size independence java performance reproducibility statistics vm ... on 26 March 2014

Statistically rigorous Java performance evaluation

One of the papers that inspired Tomas/Richard's rigorous benchmarking work. This is a much simpler strategy, involving looking for overlapping confidence intervals -- which is statistically pretty dubious, but common in other disciplines...

to benchmarking confidence java performance research statistics ... on 26 March 2014

Producing wrong data without doing anything obviously wrong!

Lots of examples of how environmental factors (e.g. environment variable size, room temperature, link order, ASLR...) can affect experimental results, to the tune of 20% or more. Basically: why pretty much any benchmark you've seen in a paper where the effect size isn't huge is probably nonsense.

to benchmarking compiler performance reproducibility research statistics ... on 26 March 2014

The Earth is Round (p < 0.5)

An extremely grumpy study of statistical significance. This is worth reading in conjunction with Susan's stats tutorial, since it gives more examples of what significance actually means (and why it's probably not what you think it means).

to ag0803 significance statistics ... on 26 March 2014

Computing and Interpreting Effect Sizes - Springer

A fairly grumpy study of effect size measurement -- this makes some good points, though.

to effect-size significance statistics ... on 26 March 2014

	- statistics

1	academia
9	ag0803
4	amusements
1	analysis
9	benchmarking
1	biology
1	board-games
1	c++
2	cheating
2	chess
1	cmp201
1	comics
2	competition
3	compiler
4	confidence
1	critical-thinking
1	cs
4	data
1	data-analysis
1	data-mining
1	database
1	dice
3	effect-size
1	election
1	electronics
1	empirical
4	games
2	gnuplot
1	graph
1	graphics
2	graphing
3	honours
1	hypothesis
1	independence
2	java
1	language-design
1	library
1	llvm
13	maths
1	matplotlib
1	measurement
1	median
1	modelling
1	monopoly
2	non-parametric
1	normality
1	oscilloscope
7	performance
2	plotting
1	politics
1	prediction
1	programming
6	python
3	r
1	radio
1	random
2	reproducibility
9	research
1	retrotech
1	rpg
8	science
1	scipy
1	scotland
1	sheep
5	significance
1	simulation
2	software
1	spreadsheet
1	st-kilda
47	statistics
5	teaching
2	testing
1	timing
1	uk
1	ukc
1	validation
1	vision
3	visualisation
1	vm
1	wine

Tasty » Adam Sampson's statistics bookmarks

Tags related to statistics