Documente Academic
Documente Profesional
Documente Cultură
FOR MANAGEMENT
Session -10
Chapter :
RECAP
• Introduction – definition, types of statistics, levels of measurement
• Collection / compilation/ classification / tabulation
• Presentation – graphical and diagrammatic
• Measures of central tendencies
• Measures of dispersion
• Measures of skewness
• Exploratory data analysis
• Association between variables – covariance and correlation
• Probability – concepts, laws and Baye’s theorem
• Random variable- discrete and continuous
• Theoretical distributions – Binomial, Poisson and Normal
distributions
• Sampling techniques , sampling distribution, central limit theorem
and estimation theory- types ( point and interval estimate)
Learning objectives
• Hypothesis testing
• Errors in TOH
• Level of significance
• Types – parametric and non parametric
• Steps involved in TOH
• Chi square
• Independence
• Goodness of fit (uniform)
TESTING
OF
HYPOTHESIS
Hypothesis
• An assumption to be tested
• If the sample statistic differs from the population
parameter, a decision must be made as to
whether or not this difference is significant
• If it is, the hypothesis is rejected. If not, it is
accepted
Set up a Hypothesis
• H0:Null Hypothesis
• No significant difference between the sample statistic
and the population parameter
• Any difference found is accidental, arising out of
sampling fluctuations
• H1: Alternate Hypothesis
• A hypothesis that is different from the null hypothesis
• If sample info leads us to reject H0, then accept H1
In hypothesis testing, we must stated the assumed or
hypothesized value of the population before we begin
sampling. This assumption is called the null hypothesis.
Two-tailed test
Is a significance test in which it will reject the null
hypothesis if the sample mean is significantly higher
or lower then hypothesized population mean.(i.e.
there are two rejection region)
Ho : # o
Terminologies
Significance level
Complementary concepts to confidence limits.
Probability of committing a TYPE 1 error, naming
rejecting the null hypothesis when in reality it is true.
There is no single standard or universal level of
significance for testing hypothesis.
The higher the significance level, the higher the
probability of rejecting a null hypothesis when it is
true.
Set up a Significance level
• The confidence with which an experimenter
rejects or retains H0 depends on the level of
significance involved
• = 5%
• 5% chance that H0 is rejected when it should be
accepted
• 95% confident that we have made the right decision
• Willing to accept a 5% chance of being wrong to reject
H0
Suitable Test Statistic
D C O N D I T I O N
E
H0 : True H0 : False
C
I
Accept H0 Correct Decision TYPE 2 Error
S Confidence Level ()
I (1 - )
O Reject H0 TYPE 1 Error Correct Decision
N () Power of Test
(1 - )
Type 1 error,
Is the error of rejecting a null hypothesis when it is
true.
Type 11error,
Is the error of accepting a null hypothesis when it is
actually false.
Step 4 : calculation
Step 5 : inference
• Classical approach : if table value > calculated value – accept Ho
• P value approach : if p> accept Ho.
FLOW CHART FOR HYPOTHESIS TESTING
State H0 as well as H1
2 = O E 2
E
where O = the observed frequency of any value
E = the expected frequency of any value
The obtained value from the formula is
compared with the value from 2 table for a
given significance level and the number of
degrees of freedom.
Example:
Are technical support calls equal across all days of the week? (i.e., do calls follow a
uniform distribution?)
Sample data for 10 days per day of week:
( o e ) 2
2 i i
ei
• Reject H0 if
2 2
α
(with k – 1 degrees of
freedom) 0 2
Do not Reject H0
reject H0 2
Chi-Square Test Statistic
Contingency Tables
Situations involving multiple population
proportions
Used to classify sample observations according
to two or more characteristics
Also called a cross-tabulation table.
Example:
The following data concerning industrial accidents and absentees
classified according to the types of employee.
χ
2 O E 2
with d.f . (r 1)(c 1)
E
where:
O = observed frequency
E = expected frequency
r = number of rows
c = number of columns
Contingency Analysis
Example:
Left-Handed vs. Gender
Dominant Hand: Left vs. Right
Gender: Male vs. Female
for
Area
380 415
2
550 585
2
450 415
2
585 415 585 415 585 415 585 415
12250.002409 0.001709 0.002409 0.001709
i.e. 2cal =10.0891
Tabulated 2 for (2-1)(2-1)=1 d.f. at 5% level of significance is 3.841 i.e.
2tab=3.841.
Here we see that 2cal>2tab (10.0891>3.841)Ho is rejected i.e. it is highly
significant at 5% level of significance. Thus we conclude that nature of area is
related to voting preference in the election.
Alternative procedure: To calculate the value 2, we can use the following
formula:
Total
a b a+b
c d c+d
Total a+c b+d N = a+b+c+d
N = 620+380+550+450=2000
N ad bc 2000620 450 380 550
2 2
2
10.09165
a b a c b d c d 1000 1170 830 1000
What is a Hypothesis
Of a test?
• A hypothesis is an I assume the mean AGE
of this class is 50!!!
assumption about the
population parameter. Am I correct? TEST IT!
• A parameter is a
characteristic of the
population, like its
mean or variance.
• The parameter must
be identified before
analysis.
• Steps:
• State the Null Hypothesis
• State its opposite, the Alternative Hypothesis
• Hypotheses are mutually exclusive &
exhaustive
• Sometimes it is easier to form the
alternative hypothesis first.
Hypothesis Testing Process
Assume the
population
mean age is 50.
(Null Hypothesis) Population
The Sample
IsX =20 @ m =50? Mean Is 20
No, not likely!
REJECT
Sample
Null Hypothesis
Reason for Rejecting H0
Sampling Distribution
Our sample
mean (20) we reject the
falls in the null hypothesis
tails!It’s H0 that µ = 50.
not likely!
Hypotyzed
population mean.
20 µ = 50 Sample Mean
Critical
Rejection Value
Regions
α “Area” of the
Rejection region
0
Level of Significance, α and
the Rejection Region
One tail (left) test
Critical
H0: 0
H1: < 0 Value(s)
0
Rejection
Regions One tail (right) test
H0: 0
H1: > 0
0
H0: 0
Two tails test
H1: 0 /2
0
Errors in Making Decisions
• Type I Error
• Reject Null Hypothesis when it is True (“False
Positive”)
• Has Serious Consequences
• Probability of Type I Error Is α
• Called Level of Significance
• Type II Error
• Do Not Reject Null Hypothesis when it is
False (“False Negative”)
• Probability of Type II Error Is β (
Power 1- β )
& Have an Inverse
Relationship
Reduce probability of
one error and the
other one goes up.
0
• Used to Make Rejection Decision
• Used to determine optimal strategies where a decision maker faced with several
decision alternatives and an uncertain, or risky pattern of future events.
• Decision – Definition
Choosing from
alternatives
Determination of
payoff
Identification of all
courses of action
(Strategies)
S2 50 0 100 200
(10)
100 50 0 100
S3
(15)
150 100 50 0
S4
(20)
Decision Making Under Certainty
Manager knows which event will occur
pick the alternative with the best payoff
Possible Future Demand
Alternative Low High
Small facility 200 270
Large facility 160 800
Do nothing 0 0
Conservative
30 m 5m -10 m
Speculative
40 m 10 m -30 m
Counter cyclical
-10 m 0 15 m
• If the prior probabilities for improving economy, stable economy and worsening
economy are 0.1, 0.5 and 0.4, which investment would Warren consider?
Solution
E(conservative) = 0.1*30+0.5*5-0.4*10 =
E(Speculative) = 40*0.1+0.5*10 -30*0.4 =
E(counter cylic) = -10*0.1+ 0*0.5+ 10*0.4 =
Decision Trees