Documente Academic
Documente Profesional
Documente Cultură
1 Statistical Hypotheses
Statistical Hypotheses: Motivations
• We have discussed about how to construct a confidence interval estimate
of a parameter from sample data.
• In many applications, we need to decide which of two competing claims
or statements about some parameter is true.
• The statement is called hypothesis.
• The decision making procedure is called hypothesis testing.
Definition 1. A statistical hypothesis is a claim or assertion about the parame-
ters of one or more populations.
11.2
Statistical Hypotheses:Example
Example 2. Suppose that we are measuring the burning rate of a solid propellant
for some air crew escape system. We are interested in deciding whether or not the
mean burning rate is 50 centermeters per second. We may express this formally
as:
H0 :µ = 50 cm/sec (1)
H1 :µ 6= 50 cm/sec (2)
• The statement Eq.(1) is called the null hypothesis.
• The statement Eq.(2) is called the alternative hypothesis.
• Since the alternative statement specifies the value of µ can be either greater
or less than 50 cm/sec, this is called a two-sided alternative hypothesis.
• We can also formulate a one-sided alternative hypothesis
H0 :µ = 50 cm/sec (3)
H1 :µ < 50 cm/sec (4)
or,
H0 :µ = 50 cm/sec (5)
H1 :µ > 50 cm/sec (6)
• Note that hypotheses are always statements of the population (or distri-
bution) parameter under study, not statements about sample.
11.3
1
Probability and Statistics 2/16 Fall, 2015
Statistical Hypotheses
The value of the population parameter (ex. µ = 50 cm/sec) is usually speci-
fied in one of three ways:
1. It may result from past experience, knowledge, or experiment. The object
of hypothesis testing is usually to determine whether the parameter value
has been changed.
2. The value may be determined from some theory or model. The objective
is to verify the theory or model.
3. The value results from external considerations, such as design or specifi-
cation. The objective is conformance testing.
11.4
H0 :µ = 50 cm/sec
H1 :µ 6= 50 cm/sec
H0 : µ = 50 cm/sec; H1 : µ 6= 50 cm/sec
• Suppose that if 48.5 ≤ x̄ ≤ 51.5, we will not reject the null hypothesis H0 .
• If either x̄ ≤ 48.5 or x̄ ≥ 51.5, we will reject the null hypothesis in favor of
the alternative hypothesis H1 .
• 48.5 ≤ x̄ ≤ 51.5 is called the acceptance region.
• x̄ ≤ 48.5 or x̄ ≥ 51.5 is the critical region of the test.
• The boundaries between acceptance and critical regions are called the crit-
ical values (48.5 and 51.5 in this example).
11.6
Type I Error:
• The probability of making a type I error is denoted by α:
α = P(type I error)
(7)
= P(reject H0 when H0 is true)
The probability of type I error α is also called the significant level, the
α-error or the size of test.
• In the pollutant example, if the true mean µ = 50 cm/sec and the standard
deviation
√ σ =√ 2.5, then the sample mean is normal with standard deviation
σ / n = 2.5/ 10 = 0.79.
• The corresponding probability of type I error is
• This implies that 5.74% of all random samples would lead to rejection
of the hypothesis H0 : µ = 50 cm/sec when the true mean burning rate is
really 50 cm/sec.
• The more significant the test is, the more restrictive the test becomes and
the higher the type I error will be. √
• We can reduce α by increasing the sample size. If n = 16, then σ / n =
0.625. We have
48.5 − 50
z1 = = −2.40
0.625
51.5 − 50
z2 = = 2.40
0.625
α = P(Z < −2.40) + P(Z > 2.40)
= 0.0082 + 0.0082 = 0.0164.
11.9
Type II Error:
β = P(type II error)
(8)
= P(fail to reject H0 when H0 is false)
0.625. We have
48.5 − 52
z1 = = −5.60
0.625
51.5 − 52
z2 = = −0.80
0.625
β = P(−5.60 ≤ Z ≤ −0.80)
= P(Z ≤ −0.80) − P(Z ≤ −5.60)
= 0.2119 − 0.0000 = 0.2119.
11.11
3 P-Value
Motivations
• One way to report the results of a hypothesis test is to state that the null
hypothesis was or was not rejected at a specified α-value or level of sig-
nificance.
• For previous example, we can say that H0 : µ = 50 was rejected at the 0.05
level of significant.
• However, we have no idea about whether the computed value of the test
statistic was just barely in the rejection region or whether it was very far
into this region.
• This approach may be unsatisfactory because we might be uncomfortable
with the risks implied by α = 0.05.
11.15
P-Value
A P-value conveys how much information about the strength of evidence
against H0 , and allows an individual decision maker to draw a conclusion at any
specified significant level α.
H0 :µ = 50
H1 :µ 6= 50
• The P-value of the test is the α associated with this critical region.
• Recall that at 0.05 level of significance, the critical values of Z are −1.96
and 1.96.
• The interval derived from P-value contains the interval from 0.05 level of
significance.
• p = 0.038 < α = 0.05 in this example.
• This null hypothesis would be rejected for any 0.038 ≤ α (such as 0.05);
but not rejected for a smaller significant level 0.038 > α, such as 0.01.
11.16
Connection to CI
• Suppose that the parameter θ is of interest.
• [l, u] is the a 100(1 − α)% confident interval for θ .
• The test of size α of the hypothesis:
H0 :θ = θ0 (9)
H1 :θ 6= θ0 (10)
will lead to the rejection of H0 if and only if θ is NOT in the 100(1 − α)%
confident interval [l, u].
• In previous propellant problem, we have √ x̄ = 51.3, σ = 2.5 and n = 16. The
95% two-sided CI on µ is 51.3 ± 2.5/ 16 which is 50.075 ≤ µ ≤ 52.525.
• Because µ = 50 is not included in the interval 50.075 ≤ µ ≤ 52.525, the
null hypothesis H0 : µ = 50 is rejected.
11.17
Practical Issues
Consider the propellant example. We are testing
H0 :µ = 50 cm/sec (11)
H1 :µ 6= 50 cm/sec (12)
with known σ = 2.5. Suppose that the sample mean is x̄ = 50.5 with various
sample sizes n:
11.18
Test Statistics
• The X̄ has a normal distribution with mean µ0 and variance σ 2 /n.
• The test procedure for H0 : µ = µ0 uses the test statistic:
X̄ − µ0
Z0 = √
σ/ n
• Z0 is the standard normal distribution.
• If H0 : µ = µ0 is true, the probability is 1 − α that the test statistic Z0 falls
between −zα/2 and zα/2 .
• zα/2 is the 100α/2 percentage point.
• If the observed value of the test statistic z0 is either z0 > zα/2 or z0 < −zα/2 ,
we should reject H0 .
• We shall fail to reject H0 , if −zα/2 ≤ z0 ≤ zα/2 .
11.20
One-Sided Hypotheses
Theorem 6. Suppose that we specify the hypotheses as
H0 :µ = µ0 (15)
H1 :µ > µ0 (16)
We place the critical region in the upper tail of the standard normal distribution
and reject H0 is the computed value of z0 is too large, i.e., z0 > zα .
Similarly, to test
H0 :µ = µ0 (17)
H1 :µ < µ0 (18)
we would calculate the test statistic Z0 and reject H0 is the computed value of z0
is too small, i.e., z0 < −zα .
11.22
P-value
• If z0 is the computed value of the test statistics, the P value is
2[1 − Φ(|z0 |)] H0 : µ = µ0 , H1 : µ 6= µ0
P = 1 − Φ(z0 ) H0 : µ = µ0 , H1 : µ > µ0
Φ(z0 ) H0 : µ = µ0 , H1 : µ < µ0
11.24
Figure 6: The reference distribution for H0 : µ = µ0 with critical region for (a)
H1 : µ 6= µ0 , (b) H1 : µ > µ0 , (c) H1 : µ < µ0 .
11.27
• To test
H0 :σ 2 = σ02 (21)
H1 :σ 2 6= σ02 (22)
H0 :σ 2 = σ02 (23)
H1 :σ 2 > σ02 (24)
H0 :σ 2 = σ02 (25)
2
H1 :σ < σ02 (26)
Figure 7: Reference distribution for the test of H0 : σ 2 = σ02 with critical region
for (a) H1 : σ 2 6= σ02 , (b) H1 : σ 2 > σ02 , (c) H1 : σ 2 < σ02 .
11.29
2
• Critical region: χ0.05,19 = 30.14
2 2
• Since χ0 = 29.07 < 30.14 = χ0.05,19 , we fail to reject H0 .
• We conclude that there is no strong evidence that the variance of fill vol-
ume exceeds 0.01.
2
• Given χ0.1,19 2
= 27.2 and χ0.05,19 = 30.14 and 27.2 < 29.07 < 30.14, we
can conclude that the P-value of this example is in the interval 0.05 < P <
0.1. (P > α, fail to reject H0 ).
• The actual P-value is 0.0649.
11.30
p < p0 , p > p0 , p 6= p0
H0 :p = p0 (27)
H1 :p < p0 (28)
P = P(X ≤ x when p = p0 )
H0 :p = p0 (29)
H1 :p > p0 (30)
P = P(X ≥ x when p = p0 )
H0 :p = p0 (31)
H1 :p 6= p0 (32)
or
P = 2P(X ≥ x, when p = p0 ), if x > np0
and reject H0 in favor of H1 if this P-value is less than or equal to α.
11.31