Data Science
Statistical Models
Hanspeter Pfister & Joe Blitzstein
pfister@seas.harvard.edu / blitzstein@stat.harvard.edu
[Figure: diagram linking a distribution, a random variable X, events, and numbers]
Ways to specify a distribution: CDF F; PMF (discrete) or PDF (continuous); story; name, parameters; MGF
generate: distribution -> X
function of r.v.: X, X^2, X^3, ..., g(X)
Events: X ≤ x, X = x
P: P(X ≤ x) = F(x), P(X = x)
Numbers: E(X), Var(X), SD(X); E(X), E(X^2), E(X^3), ..., E(g(X))
Inset plot: CDF F(x) for x from -6 to 6, vertical axis from 0.0 to 1.0
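The labels in the diagram can be traced through a small discrete example. The sketch below is only an illustration: the fair six-sided die and the helper names `pmf`, `cdf`, and `expect` are assumptions, not something taken from the slides. It starts from a PMF, derives the CDF F(x) = P(X ≤ x), computes E(X), Var(X), SD(X), and E(g(X)), and generates a few draws of X.

```python
import random
from fractions import Fraction

# Hypothetical example distribution (not from the slides): a fair six-sided die.
pmf = {k: Fraction(1, 6) for k in range(1, 7)}  # P(X = k)

def cdf(x):
    """F(x) = P(X <= x): sum the PMF over all support values <= x."""
    return sum((p for k, p in pmf.items() if k <= x), Fraction(0))

def expect(g=lambda k: k):
    """E(g(X)) = sum of g(k) * P(X = k) over the support."""
    return sum((g(k) * p for k, p in pmf.items()), Fraction(0))

mean = expect()                           # E(X)
var = expect(lambda k: k ** 2) - mean ** 2  # Var(X) = E(X^2) - (E(X))^2
sd = float(var) ** 0.5                    # SD(X)

# "generate": draw realizations of X from the distribution.
rng = random.Random(0)
values = list(pmf)
draws = rng.choices(values, weights=[float(pmf[k]) for k in values], k=5)

print("F(3) =", cdf(3))
print("E(X) =", mean, " Var(X) =", var, " SD(X) =", round(sd, 4))
print("draws:", draws)
```

Exact fractions are used here so that the PMF sums to exactly 1 and the moments come out without rounding; for a continuous distribution the sums would become integrals of the PDF.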
All models are wrong, but some models are useful. George Box
Jorge Luis Borges,
On Exactitude in Science
In that Empire, the Art of Cartography attained such Perfection
that the map of a single Province occupied the entirety of a
City, and the map of the Empire, the entirety of a Province. In
time, those Unconscionable Maps no longer satisfied, and the
Cartographers Guild struck a Map of the Empire whose size was
that of the Empire, and which coincided point for point with it.
[Figure: two panels, pdf and cdf, each plotted against x from 0.0 to 3.0 with vertical axis from 0.0 to 1.0]
timeline
http://www.etsy.com/shop/NausicaaDistribution
Family Tree of Parametric Distributions
[Figure: diagram connecting the distributions below; edge layout not recoverable from the text]
Distributions: HGeom; Bin (Bern); Pois; Beta (Unif); Gamma (Expo, Chi-Square); NBin (Geom); Normal; Student-t (Cauchy)
Edge labels: Limit; Conditioning; Conjugacy; Poisson process; Bank - post office
[Figure: pmf plots of Bin(10,1/2), Bin(10,1/8), Bin(100,0.03), and Bin(9,4/5), each against x from 0 to 10]
Binomial Distribution
story: X~Bin(n,p) is the number of successes in n
independent Bernoulli(p) trials.
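The story can be checked by simulation. The following is a minimal sketch, assuming the standard PMF formula P(X = k) = C(n, k) p^k (1 - p)^(n - k); the function names `binom_pmf` and `simulate_binomial`, the seed, and the number of repetitions are illustrative choices, not part of the slides. It counts successes in n independent Bernoulli(p) trials many times and compares the observed frequencies with the PMF.

```python
import random
from math import comb

def binom_pmf(k, n, p):
    """P(X = k) for X ~ Bin(n, p)."""
    return comb(n, k) * p ** k * (1 - p) ** (n - k)

def simulate_binomial(n, p, rng):
    """Follow the story: count successes in n independent Bernoulli(p) trials."""
    return sum(rng.random() < p for _ in range(n))

n, p = 10, 0.5            # Bin(10, 1/2)
rng = random.Random(42)   # fixed seed so the run is reproducible
reps = 20_000             # number of simulated values of X (arbitrary choice)

counts = [0] * (n + 1)
for _ in range(reps):
    counts[simulate_binomial(n, p, rng)] += 1

pmf_values = [binom_pmf(k, n, p) for k in range(n + 1)]
empirical = [c / reps for c in counts]

for k in range(n + 1):
    print(f"k={k:2d}  pmf={pmf_values[k]:.4f}  simulated={empirical[k]:.4f}")
```

With this many repetitions the simulated frequencies should sit close to the PMF values, which is the sense in which the story and the formula describe the same distribution.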
Wikipedia
Normal Approximation to Binomial
Wikipedia
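As a rough numerical illustration of the approximation named on this slide, the sketch below compares an exact Binomial CDF value with the Normal CDF that has the same mean np and variance np(1 - p). The values n = 100, p = 0.5, k = 55 are assumptions chosen for illustration, and the continuity correction (evaluating the Normal CDF at k + 0.5) is a common refinement that the slide itself does not mention.

```python
from math import comb, erf, sqrt

def binom_cdf(k, n, p):
    """Exact P(X <= k) for X ~ Bin(n, p)."""
    return sum(comb(n, j) * p ** j * (1 - p) ** (n - j) for j in range(k + 1))

def normal_cdf(x, mu, sigma):
    """CDF of Normal(mu, sigma^2), written with the error function."""
    return 0.5 * (1 + erf((x - mu) / (sigma * sqrt(2))))

n, p = 100, 0.5   # illustrative values, not from the slides
k = 55
mu = n * p                        # mean of Bin(n, p)
sigma = sqrt(n * p * (1 - p))     # standard deviation of Bin(n, p)

exact = binom_cdf(k, n, p)
approx = normal_cdf(k + 0.5, mu, sigma)  # continuity correction (assumption)

print(f"exact  P(X <= {k}) = {exact:.4f}")
print(f"normal approx       = {approx:.4f}")
```

The approximation tends to work best when n is large and p is not close to 0 or 1; for a case like Bin(100, 0.03) the distribution is strongly skewed and the match is typically worse.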
Bootstrap
data: 3.142 2.718 1.414 0.693 1.618
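A minimal sketch of the resampling idea, using the five data values above. The slide does not say which statistic is being bootstrapped, so the sample mean is an assumption here, as are the helper name `bootstrap_means`, the seed, and the number of resamples. Each bootstrap sample draws n values from the data with replacement, and the spread of the resulting means gives a bootstrap estimate of the standard error.

```python
import random
from statistics import mean, stdev

data = [3.142, 2.718, 1.414, 0.693, 1.618]  # data from the slide

def bootstrap_means(sample, n_boot, rng):
    """Resample with replacement n_boot times; return the mean of each resample."""
    n = len(sample)
    return [mean(rng.choices(sample, k=n)) for _ in range(n_boot)]

rng = random.Random(0)                  # fixed seed for reproducibility
boot = bootstrap_means(data, 2000, rng)  # 2000 resamples (arbitrary choice)

se_hat = stdev(boot)  # bootstrap estimate of the standard error of the mean

print(f"sample mean          = {mean(data):.3f}")
print(f"mean of boot means   = {mean(boot):.3f}")
print(f"bootstrap SE of mean = {se_hat:.3f}")
```

Any other statistic (median, standard deviation, and so on) could be substituted for `mean` inside `bootstrap_means` without changing the resampling step.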