Documente Academic
Documente Profesional
Documente Cultură
The values that a random variable takes are on defined events on the sample
space S.
It follows that we can now say things such as P(X = 1) and P(X = 2)
without ambiguity.
Consider two populations - people who smoke and those who do not. If we
know that the probability of contracting a particular form of cancer is 0.8 for
smokers and 0.4 for non-smokers, then we can run a campaign to educate the
public in order to reduce the occurrence of this cancer.
The walking time (for me) from LT32 to Kent Ridge MRT is a constant 9.5
minutes. Taking the bus also has an average time of 9.5 minutes, but the
probability that it is longer than 15 minutes is 0.2. What should I do?
The sum of the probabilities for all the possible values equals 1.
We typically use uppercase letters to denote the random variable, and lower
case to denote the values that it has taken on.
Outcome(s) x px
(T , T ) 0 0.25
(H, T ), (T , H) 1 0.50
(H, H) 2 0.25
Outcome(s) y py
(1,1) 2 1/36
(1,2),(2,1) 3 2/36
(1,3),(2,2),(3,1) 4 3/36
(1,4),(2,3),(3,2),(4,1) 5 4/36
... ... ...
(5,6),(6,5) 11 2/36
(6,6) 12 1/36
A bar plot uses a rectangle for each possible value that X can take on.
The width of each rectangle is identical, but the height is proportional to px .
e −20 20z
pz =
z!
Visualising with a bar plot is still possible, but not for the full range of z:
P({Z = 2} ∩ {Z ≥ 1})
P(Z = 2|Z ≥ 1) =
P(Z ≥ 1)
P(Z = 2)
=
P(Z ≥ 1)
P(Z = 2)
=
1 − P(Z < 1)
P(Z = 2)
= = 4.12 × 10−7
1 − P(Z = 0)
r pr
0 0.43
1 0.30
2 0.10
3 0.10
4 0.07
E (a1 X1 + a2 X2 + · · · + an Xn ) = a1 µ1 + a2 µ2 + · · · + an µn
The random variables in the second strategy in both cases involve more
variability, or risk.
Most people choose the sure gain strategy in the 1st scenario and the risky
strategy in the 2nd scenario.
When comparing two random variables, the one with the larger standard
deviation has more variability.
We will try to arrive at more intuition about σ when we discuss the Normal
distribution later, but for now, think of σ as the average deviation of a
random variable from it’s mean.
P(Y ≥ 3) = 0.687.
The respective probabilities are 0.3679 and 0.5353. Hence the desired
probability is just
P(Y ≥ 8|Y ≥ 5) = 0.3679/0.5353 = 0.687
Topic 04 ST1232 Statistics for Life Sciences 37 / 92
Computing the Area Under a Curve
All the areas on the previous slides were computed using tables or software.
We will avoid using integration to obtain the area under a curve for this class.
P(X ≤ qp ) = p
If we were to try to explain it in terms of the pdf curve, then qp is the point
on the x-axis such that the area under the curve and to the left of qp , is
equal to p.
q0.0478 = 60
q0.280 = 73
q0.798 = 90
q0.313 = 3
q0.465 = 5
This involves counting combinations of cards selected from a group of 52. The
order in which the cards were selected does not matter.
2652/2 = 1326
I There are 1326 ways in which 2 cards can be drawn from a deck of 52.
Regarding the number of ways in which 2 Aces can be chosen:
I There are 4 ways of choosing the first card, and then 3 ways of choosing the
second.
I Hence there are 4 × 3 = 12 ways of choosing 2 Aces, if we were to
differentiate the order in which these were drawn.
I Again this double counts. Thus there are only 12/2 = 6 combinations of 2
cards from 4.
Definition 9 (Combinations)
The number of combinations of n things, taken k at a time is
n! n(n − 1)(n − 2) × · · · × (n − k + 1)
Ckn = =
k!(n − k)! k!
Suppose we sample White Blood Cells (WBC) from an individual, and test if
it is a lymphocyte or not. For this individual, this will be true only 20% of the
time. If we were to obtain 10 WBC and set W to be the total number of
lymphocytes obtained, then W ∼ Bin(10, 0.2).
for x = 0, 1, 2, . . . , n.
What would you conclude about the two sequences? Related or unrelated?
I In practice, we use the the probability of 5 or more matches instead of just the
probability of 5 matches to make our conclusion.
This is the second most frequently used discrete distribution after Binomial
distribution. It usually associates with rare events.
e −µ µk
P(X = k) = , k = 0, 1, 2, ...
k!
where e is approximately 2.71828, λ is the expected no. of events per time unit
and µ = λt is the expected no. of events over time period t.
2.3 −2.3
P(X = 1) = e = 0.231
1!
2.32 −2.3
P(X = 2) = e = 0.265
2!
2.33 −2.3
P(X = 3) = e = 0.203
3!
2.34 −2.3
P(X = 4) = e = 0.117
4!
2.35 −2.3
P(X = 5) = e = 0.054
5!
P(X ≥ 6) = 1 − (0.100 + 0.231 + 0.265 + 0.203 + 0.117 + 0.054) = 0.03
1.15 −1.15
P(Y = 1) = e = 0.364
1!
1.152 −1.15
P(Y = 2) = e = 0.209
2!
1.153 −1.15
P(Y = 3) = e = 0.08
3!
P(Y ≥ 4) = 1 − (0.317 + 0.364 + 0.209 + 0.08) = 0.03
Poisson distribution with parameter µ has the mean and variance are both equal
to µ. This could help to identify if the distribution is a Poisson distribution.
The expected number of cancer deaths from U.S. white male mortality rates
is µ = 5.8.
Let X be the number of deaths from cancer among the employees in the
study, then X is a Poisson random variable with µ = 5.8.
We need to find P(X ≥ 7).
P(X ≥ 7) = 1 − P(X ≤ 6) where P(X = k) = e −5.8 (5.8)k /(k!)
P(X ≥ 7) = 1 − 0.638 = 0.362
Clearly, the observed number of cancer deaths in the given study is not
excessive in this group.
For any real numbers a and b, if X ∼ N(µX , σX2 ), and Y ∼ N(µY , σY2 ) then
X −µ
Z= ∼ N(0, 1)
σ
Whenever we compute Z = (X − µ)/σ, we refer to Z as the Z -score of X .
Topic 04 ST1232 Statistics for Life Sciences 78 / 92
1 Random Variables
2 Discrete Random Variables
Parameter of a Discrete Probability Distribution: Mean
Parameter of a Discrete Probability Distribution: Variance
3 Continuous Random Variables
Parameters of a Continuous Probability Distribution: Mean and Variance
Parameter of a Continuous Probability Distribution: Quantiles
4 Random Variables And Data Types
5 The Binomial Distribution
Combinations
Assumptions
Example
6 The Poisson Distribution
7 The Normal Distribution
Z -Scores
Examples
8 Index of Definitions and Examples
9 Reading
Use the Normal Table to compute the values of the following, where Z ∼ N(0, 1).
P(Z > 1)
P(Z ≤ 2.3)
P(Z > 1.18)
P(Z > 1.18|Z > 0)
P(Z > 1.96 ∪ Z < −1.96)
q0.32
To summarise:
If we are given a value X and wish to find a probability, convert
Z = (X − µ)/σ and use the table to obtain.
Convert to Z -score:
Convert to Z -score:
We need to find q0.10 for X ∼ N(95, 144). This is equal to 79.6 mm Hg.
If we let Y be the time that a nurse spends on a hypertensive patient, then
Statistics: The Art and Science of Learning from Data, 3rd edition
Alan Agresti and Christine Franklin.
Read: Chapter 6