Probability Distrubutions

Probability Distribution Summary
Dr. Mohammed Hawa

October 2007
There are many probability distributions that have been studied over the years. Each
of which was developed based on observations of a particular physical phenomenon. The
table below shows some of these distributions:
Probability distributions
Univariate Multivariate
Benford • Bernoulli • binomial • Boltzmann • categorical • compound
Poisson • discrete phase-type • degenerate • Gauss-Kuzmin • geometric • Ewens •
multinomial •
Discrete: hypergeometric • logarithmic • negative binomial • parabolic fractal •
multivariate
Poisson • Rademacher • Skellam • uniform • Yule-Simon • zeta • Zipf • Polya
Zipf-Mandelbrot
Beta • Beta prime • Cauchy • chi-square • Dirac delta function • Coxian •
Dirichlet •
Erlang • exponential • exponential power • F • fading • Fermi-Dirac • Generalized
Fisher's z • Fisher-Tippett • Gamma • generalized extreme value • Dirichlet
generalized hyperbolic • generalized inverse Gaussian • Half-Logistic • distribution .
Hotelling's T-square • hyperbolic secant • hyper-exponential • inverse-Wishart
hypoexponential • inverse chi-square (scaled inverse chi-square) • inverse • Kent • matrix
Gaussian • inverse gamma (scaled inverse gamma) • Kumaraswamy • normal •
Continuous:
Landau • Laplace • Lévy • Lévy skew alpha-stable • logistic • log-normal • multivariate
Maxwell-Boltzmann • Maxwell speed • Nakagami • normal (Gaussian) • normal •
normal-gamma • normal inverse Gaussian • Pareto • Pearson • phase-type • multivariate
polar • raised cosine • Rayleigh • relativistic Breit-Wigner • Rice • shifted Student • von
Gompertz • Student's t • triangular • truncated normal • type-1 Gumbel • Mises-Fisher •
type-2 Gumbel • uniform • Variance-Gamma • Voigt • von Mises • Wigner quasi •
Wishart
Weibull • Wigner semicircle • Wilks' lambda
In this article we will summarize the characteristics of some popular probability

distributions, which you are likely to encounter later in your study of communication
engineering. We will begin with discrete distributions then move to continuous
distributions.
A - Discrete Random Variables:

• Bernoulli distribution
The Bernoulli random variable takes value 1 with success probability p and value 0 with
failure probability q = 1 - p.
Applications:
• Coin toss.
• Success/Failure experiments.
Page 1 of 18
Probability mass function Cumulative distribution function
Parameters success probability (p is real)

Support
Probability mass function (pmf)
Cumulative distribution function (CDF)
Mean
Median N/A
Mode
Variance
Skewness
Excess kurtosis
Probability-generating function (PGF) 1 − +
Moment-generating function (MGF) 1 − +
Characteristic function 1 − +
• Binomial distribution
The binomial random variable is the sum of n independent, identically distributed
Bernoulli random variables, each of which yields success with probability p. In fact, when
n = 1, the binomial random variable is a Bernoulli random variable.
Applications:
• Obtaining k heads in n tossings of a coin.
• Receiving k bits correctly in n transmitted bits.
• Batch arrivals of k packets from n inputs at an ATM switch.
Page 2 of 18
0.3 1
p = 0.5 and n = 20
p = 0.7 and n = 20 0.9
0.25 p = 0.5 and n = 40

0.8
0.7
0.2
0.6
0.15 0.5
0.4
0.1
0.3
0.2 p = 0.5 and n = 20

0.05 p = 0.7 and n = 20
0.1 p = 0.5 and n = 40
0 0
0 5 10 15 20 25 30 0 5 10 15 20 25 30
Parameters number of trials (integer)

success probability (real)
Support

1 −

Mean
Median one of
Mode
Variance
Skewness
Excess kurtosis
Probability-generating function (PGF) 1 − +
Moment-generating function (MGF)
Characteristic function
• Geometric distribution
This is the distribution of time slots between the successes of consecutive independent
Bernoulli trails. Like its continuous analogue (the exponential distribution), the geometric
distribution is memoryless, which is an important property in certain applications.
Applications:
• To represent the number of dice rolls needed until you roll a six.
• Queueing theory and discrete Markov chains.
Page 3 of 18
1 1
p = 0.2
0.9 p = 0.5 0.9
p = 0.8
0.8 0.8
0.7 0.7
0.6 0.6 p = 0.2

p = 0.5
0.5 0.5 p = 0.8
0.4 0.4
0.3 0.3
0.2 0.2
0.1 0.1
0 0
1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10
Parameters success probability (real)

(
Support
Cumulative distribution function (CDF) 1 − 1 −
Mean
− ln2

ln1
ln −
Median
Mode 1
Variance
Skewness
Excess kurtosis

Probability generating function (PGF)
Probability-generating
1 − 1 −

generating function (MGF

Moment--generating MGF)

1 − 1 −

• Poisson distribution
The Poisson distribution describes the probability that a number k of events occur in a
fixed period of time assuming these events occur at random with a rate .
Applications:
• The number of phone calls at a call center per minute.
• The number of times a web server is accessed per minute.
Page 4 of 18
• The number of spelling mistakes one makes while typing a single page.
• The number of mutations in a given stretch of DNA after radiation.
• The number of unstable nuclei that decayed within a given period of time in a piece of
radioactive substance.
• The number of stars in a given volume of space.

0.4 1
λ=1
0.9
0.35 λ=4
λ = 10 0.8
0.3 λ=1
0.7 λ=4
λ = 10
0.25
0.6
0.2 0.5
0.4
0.15
0.3
0.1
0.2
0.05
0.1
0 0
0 2 4 6 8 10 12 14 16 18 20 0 2 4 6 8 10 12 14 16 18 20
Parameters rate (real)

Support
where Γ, ! is the Incomplete

gamma function
Mean
Median
Mode
Variance
Skewness
Excess kurtosis
Probability-generating function (PGF) "#$

' $(
Moment-generating function (MGF) "%&
)' $(
Characteristic function "%&
• Zipf distribution
The term Zipf's law refers to frequency distributions of rank data. Originally, Zipf's law
stated that, in natural language utterances, the frequency of any word is roughly inversely
proportional to its rank in the frequency table. So, the most frequent word will occur
approximately twice as often as the second most frequent word, which occurs twice as
often as the fourth most frequent word, etc.
Page 5 of 18
N=10 (log-log
( log scale).
scale Note that the function is only N=10. Note that the function is only defined at integer
defined at integer values of k. The connecting lines do values of k. The connecting lines do not indicate
not indicate continuity. continuity.
Parameters exponent (real)

number of elements (integer)
Support (rank)
where *+,, is the th
th generalized
$
harmonic number *+,, - ∑+
$ /
Cumulative distribution function

(CDF)
Mean
generating function (MGF)

Moment--generating
B - Continuous Random Variables:

Variables
• Exponential distribution
The exponential distribution represents the probability distribution of the time intervals
between successive Poisson arrivals. The exponential distribution is the continuous
counterpart of the geometric distribution,
distribution, and is the only continuous distribution that is
memoryless.
Applications:
• Similar to that of Poisson distribution (e.g., the time it takes before your next telephone
call, the distance between mutations on a DNA strand,
strand, etc).
Page 6 of 18
Probability density function Cumulative distribution function
Parameters rate or inverse scale (real)

Support
Probability density function (pdf) 0 "

Cumulative distribution function (CDF) 1 − "
Mean
Median
Mode
Variance
Skewness
Excess kurtosis

• Uniform distribution
The uniform distribution is often abbreviated U(a,, b). In this distribution, all time intervals
of the same length are equally probable.
probable.
Applications:
• To test a statistic for the simple null hypothesis.
• In simulation software packages, in which uniformly-distributed
uniformly distributed pseudo
pseudo-random
random
numbers are generated first and then converted to other types of distributions.
Page 7 of 18
Parameters lower/upper limits
Support
Probability density function (pdf)
Mean
Median
Mode any value in
Variance
Skewness
Excess kurtosis
Moment-generating function (MGF) The raw moments are:
• Triangular distribution
The Triangular distribution is used as a subjective description of a population for which
there is only limited sample data.
Applications:
• Used in business decision making and project management to describe the time to
completion of a task.
Parameters lower limit

upper limit
mode
Page 8 of 18
Support
Mean
Median
Mode
Variance
Skewness
Excess kurtosis

• Normal distribution
The normal distribution is also called the Gaussian distribution,
distribution, and is denoted by
The standard normal distribution is the normal distribution with a mean of zero
and a variance of one. It is often called the bell curve because the gra
graph of its probability
density resembles a bell.
Applications:
• Many physical phenomena (like noise)
noise) can be approximated well by the normal
distribution.
• Many
any statistical tests are based on the assumption of normal distribution
distribution.
• Normal distribution arisess as the limiting distribution of several continuous and
discrete families of distributions by the central limit theorem.
theorem
• Measurement errors are often assumed to be normally distributed.
distributed
• Financial variables such as stock values or commodity prices can be mo modeled
deled by the
normal distribution.
• Light intensity from a single source is usually assumed to be normally distributed.
Page 9 of 18
The green line is the standard normal distribution Colors match the image
image to the left
Parameters location / mean (real)

12 3 0 squared scale / variance (real)
Support
Mean
Median
Mode
Variance 12
Skewness
Excess kurtosis

• Log-normal
normal distribution
If Y is a random variable with a normal distribution, then X = exp(Y)) has a log
log-normal
normal
distribution; likewise, if X is log-normally
log normally distributed, then ln(X)
ln ) is normally distributed.
Applications:
• The
he long-term
long term return rate on a stock investment can be modeled as log
log-normal.
normal.
Page 10 of 18
µ =1 µ =1
Parameters mean
standard deviation
Support
Mean
Median 5
Mode
Variance
Skewness
Excess kurtosis
The MGF does not exist,

exist but moments do
do,, and are given by:
57 8 9 8 ⁄2
6 -
• Gamma (Erlang) distribution

A gamma distributed random variable is denoted by Γ;, 0. The Erlang distribution is a
special case of the gamma
amma distribution where the shape parameter k is an integer. In the
Gamma distribution, this parameter is a real number.
Applications:
• The
he number of telephone calls which might be made at the same time to a switching
center
center.
Page 11 of 18
Parameters shape (real)

rate (real)
(
Support
0 $ "

Γ;
D
- < ! $ = >!
where the gamma function Γ;
becomes Γ; - ; − 1! if ; is an integer..

"
< ! >!
$ =
Γ;
(CDF)
Mean
Median no simple closed form

Mode for
Variance
Skewness
Excess kurtosis

Moment--generating MGF) for
• chi-square
square distribution
The chi-square
square distribution (χ
(χ2 distribution) is a special case of the gamma distribution
where k = n/2 and λ = 1/2.
Applications:
• Used in the common chi-square
chi square tests for goodness of fit of observed data to a
theoretical distribution.
distribution
• Used in statistical significance tests.
tests One example is Friedman's analysis of variance by
ranks.
Page 12 of 18
Parameters degrees of freedom

Support
⁄2
Cumulative distribution function (CDF) < ! 2$ = >!
;
Γ

2
Mean
Median approximately
Mode if
Variance
Skewness
Excess kurtosis
generating function (MGF)

Moment--generating for
• Rayleigh distribution
The Rayleigh distribution has been used to model attenuation of wireless signals facing
path fading. The Chi,, Rice and Weibull distributions
multi-path distributions are all generalizations
generalization of the
Rayleigh distribution.
distribution
Page 13 of 18
Parameters
Support
Mean
Median
Mode
Variance
Skewness
Excess kurtosis

• Student's t distribution
The Student’s t distribution arises
arises in the problem of estimating the mean of a normally
distributed population when the sample size is small.
small
Applications:
• It is the basis of the popular Student's t-tests
t tests for the statistical significance of the difference
diffe
between two sample means.

Probability density function
Parameters deg. of freedom (real

(real)
Support
Page 14 of 18
Probability density
function (pdf)
Cumulative distribution where is the

function (CDF) hypergeometric function
Mean 0 for K 3 1,, otherwise undefined

Median 0
Mode 0
Variance
, otherwise undefined
un
Skewness 0 for K 3 3
Excess kurtosis
• Cauchy distribution
Applications:
• In spectroscopy the Cauchy distribution is the description of the line shape of spectral
lines which are broadened by collisions.
collision
• In nuclear and particle physics,
physics, the energy profile of a resonance is described by the
relativistic
elativistic Cauchy distribution.
distribution
The green line is the standard Cauchy distribution Colors match the pdf above
Parameters location (real)

scale (real)
Support
Page 15 of 18
• Pareto distribution
The Pareto distribution is characterized by a curved long tail when plotted on a linear
scale, thus is it used to describe long-range dependent phenomena.
Applications:
• Used to describe the allocation of wealth among individuals.
• Describes frequencies of words in longer texts (a few words are used often, lots of
words are used infrequently).
• File size distribution of Internet traffic which uses the TCP protocol (many smaller
files, few larger ones).
• Clusters of Bose-Einstein condensate near absolute zero.
• The values of oil reserves in oil fields (a few large fields, many small fields)
• The length distribution in jobs assigned to supercomputers (a few large ones, many
small ones).
• The standardized price returns on individual stocks.
• Areas burnt in forest fires.
xm = 1 xm = 1

shape (real)
Support
Mean
for k > 1
Median
Mode
Variance
for k > 2
Page 16 of 18
Skewness
for k > 3
Excess kurtosis
for k > 4
• Laplace distribution

scale (real)
Support
Mean
Median
Mode
Variance
Skewness
Excess kurtosis
Moment-generating function (MGF)

for
Page 17 of 18
Notes:
The expected value for a discrete random variable is:
M - NOPQ -

For a continuous random variable:
D
M - NOPQ - R S>
D
The variance of a random variable is a measure of its statistical dispersion, indicating how
its possible values are spread around the expected value. For a random variable X, the
variance is:
If the random variable is discrete the variance is equivalent to:

VarOPQ - − M2

$
The mode is the most frequent value assumed by a random variable, or occurring in a
sampling of a random variable. The mode is not necessarily unique, since the same
maximum frequency may be attained at different values.
The median is the number separating the higher half of a sample or a probability
distribution, from the lower half. The median of a finite list of numbers can be found by
arranging all the observations from lowest value to highest value and picking the middle
one. If there is an even number of observations, the median is not unique.
Skewness is a measure of the asymmetry of the probability density function.
Kurtosis (from the Greek word kurtos) is a measure of the peakedness of the probability
density function.
Page 18 of 18

Probability Distrubutions

Încărcat de

Informații document

Drepturi de autor

Formate disponibile

Partajați acest document

Partajați sau inserați document

Opțiuni de partajare

Vi se pare util acest document?

Este necorespunzător acest conținut?

Drepturi de autor:

Formate disponibile

Probability Distrubutions

Încărcat de

Drepturi de autor:

Formate disponibile

Probability Distribution Summary

Dr. Mohammed Hawa

In this article we will summarize the characteristics of some popular probability

A - Discrete Random Variables:

Parameters success probability (p is real)

Probability mass function (pmf)

Cumulative distribution function (CDF)

Probability-generating function (PGF) 1 − + 

Moment-generating function (MGF) 1 − +  

0.25 p = 0.5 and n = 40

0.2 p = 0.5 and n = 20

Parameters number of trials (integer)

Probability mass function (pmf)

Probability-generating function (PGF) 1 − + 

Moment-generating function (MGF)

0.6 0.6 p = 0.2

Parameters success probability (real)

Probability mass function (pmf)

Cumulative distribution function (CDF) 1 − 1 − 

generating function (MGF

Probability mass function Cumulative distribution function

Parameters rate (real)

Probability mass function (pmf)

where Γ, ! is the Incomplete

Probability-generating function (PGF)  "#$

Parameters exponent (real)

Cumulative distribution function

generating function (MGF)

B - Continuous Random Variables:

Parameters rate or inverse scale (real)

Probability density function (pdf) 0 "

Cumulative distribution function (CDF) 1 −  "

generating function (MGF

Probability density function Cumulative distribution function

Probability density function (pdf)

Cumulative distribution function (CDF)

Mode any value in

Moment-generating function (MGF) The raw moments are:

Probability density function Cumulative distribution function

Parameters lower limit

Probability density function (pdf)

Cumulative distribution function (CDF)

generating function (MGF

Parameters location / mean (real)

Probability density function (pdf)

Cumulative distribution function (CDF)

generating function (MGF

Probability density function (pdf)

Cumulative distribution function (CDF)

The MGF does not exist,

• Gamma (Erlang) distribution

Parameters shape (real)

becomes Γ; - ; − 1! if ; is an integer..

Median no simple closed form

generating function (MGF

Parameters degrees of freedom

Probability density function (pdf)

generating function (MGF)

Probability density function (pdf)

Cumulative distribution function (CDF)

generating function (MGF

Cumulative distribution function

Parameters deg. of freedom (real

Cumulative distribution where is the

Probability-generating function (PGF) 1 − +

Moment-generating function (MGF) 1 − +

Probability-generating function (PGF) 1 − +

Cumulative distribution function (CDF) 1 − 1 −

where Γ, ! is the Incomplete

Probability-generating function (PGF) "#$

Probability density function (pdf) 0 "

Cumulative distribution function (CDF) 1 − "

becomes Γ; - ; − 1! if ; is an integer..

VarOPQ - − M2