2021
René Staritzbichler
Summarize reality and evaluate reliability
unsymmetric data, sensitive to extremes
symmetric, sensitive
unsymmetric, insensitive
Probability P that event with probability p is observed
in k out of n observations
\[ P(X=k) = \binom{n}{k} \cdot p^k \cdot (1-p)^{n-k}, \quad \binom{n}{k}=\frac{n!}{k!(n-k)!} \]
10 questions with 4 answers each
What is the probability to find 30 out of 100 man having 150-155 cm?
Two or more variables
Correlation is no causality!
\[ r_{xy} = \frac{ \sum_{i=1}^n (x_i - \bar{x}) (y_i - \bar{y}) }{\sigma_x\sigma_y} \]
Example: income of all citizens
Beggars and billionaires
Wording has significant influence:
Answers depend on previous questions
In surveys people tend to use round numbers
both are equivalent (2.5% growth rate)
https://www.tylervigen.com
Systems aim to return to their mean
Soccer: new trainer, return to normal
Speed cameras after accidents
"Reports that say that something hasn't happened are always interesting to me, because as we know, there are known knowns. There are things we know we know. We also know there are known unknowns. That is to say, we know there are some things we do not know. But there are also unknown unknowns — the ones we don't know we don't know,"
Donald Rumsfeld, 2002
4 random normal distributions (logscale)
mean: 0, stdev: 1
20, 200, 2000, 20000 samples
pit 1: IARC 2015: processed meat group I carcinogen
$\Rightarrow$ Daily Record: 'Bacon, ham and sausages have the same cancer risk as cigarettes warn experts'
$\Rightarrow$ IARC: confidence that there is an increased risk
pit 2: 50g/day: relative: 18% (abs: 6% $\rightarrow$ 7%)
$\Rightarrow$ Media used absolute: 6% $\rightarrow$ 24%
All scans were taken from:
David Spiegelhalter 'The art of statistics'