Documente Academic
Documente Profesional
Documente Cultură
Overview
When we generate a new statistic from a
sample, all we have is an estimate of the
population.
Our audience wants to know: how
confident can we be that this statistic
accurately reflects the population?
Fortunately, statistics can also be used to
measure the likelihood that our sample
statistics resemble the population.
Overview of Terminology
Descriptive Statistic: a statistic calculated to
describe the sample or population, such as a
mean, proportion, or correlation coefficient.
Inferential Statistic: a statistic calculated to
assess how accurately a sample statistic
matches the true population parameter.
Can we infer that the population parameter is
the same as the sample statistic?
Sampling Distribution
Hypothetical distribution of all sample
means in the population.
How is it created?
Take a sample from the population. Calculate
the mean.
Repeat many, many times.
Plot all of those means in a frequency
distribution graph, where:
The x-axis indicates, from left to right, the lowest to
the highest mean values
The y-axis indicates, from bottom to top, the lowest
to the highest frequencies of each mean.
When we take many sample means and plot them on a graph, the sample
means that are most different from the population mean are least likely;
the sample means that are most similar to the population mean are most
likely; and the distribution about the population mean is normal.
Sampling Distribution
The sampling distribution is normally distributed,
even if our sample mean is not.
The center of the sampling distribution is the
population mean.
Because it is normally distributed, we can apply
the empirical rule to determine the proportion of
values above or below any one mean.
We can therefore estimate the probability that
the population mean is different from the sample
mean based on the proportions under the curve.
Empirical Rule
Standard Error
The standard error is the standard deviation
of the sampling distribution.
Because the sampling distribution is based
on sample means, not individual scores, it
has less variance than the standard
deviation.
Because inferential statistics are based on
the sampling distribution, the standard error
is used, instead of standard deviation.
Formula:
Sample
Population
Sampling
Distribution
Mean
Standard
Deviation
The sample distribution is the distribution of values for your
sample youre measuring; the population distribution for the
population youre making inferences about; and the sample
distribution for the hypothetical distribution of means drawn
from the population in repeated sampling.
We use the Greek letter mu to refer to the population
mean; and the Greek letter sigma to refer to the
population standard deviation.
Going forward, descriptive statistics will often have a sample
symbol and a corresponding Greek letter symbolizing the
population value.
Hypothesis Test
A hypothesis is a statement about some
characteristic of a variable or a collection of
variables.
A significance test evaluates the hypothesis
by comparing the values predicted by the
hypothesis to the values we find in our data:
Data that fall far from what the hypothesis
predicts suggest that the hypothesis is false;
Data that are close to what the hypothesis
predicts suggest that the hypothesis is true.
Hypothesis Test
Hypothesis Testing is a procedure for
measuring a hypothesis against data.
The Central Limit Theorem tells us that in any
random sample, our sample statistic is most
likely to be close to the population parameter;
and it is less likely that it will be very different
from the population parameter.
Using the normal distribution, we can estimate
the probability that the population parameter
suggested by a hypothesis is true given our
sample statistic.
4. Test Statistic.
A statistic that estimates, given the standard error
and the assumption of normality, how different out
sample finding is from what wed expect to find if
the population is true.
For means and Pearson correlations: t. For ANOVA
and regression: F. For nonparametric association:
Chi-squared.
Test
Statistic
The
variable
ExpPerMonth starts
off in the left-hand
box; Ive selected it
by moving it to the
right. Under test
value, I enter 400;
this is . Then click
Options.
The Confidence Interval
Percentage is just 100%alpha. The alpha level for a
two-tailed test should be .05
(or 5%); so we should set this
at 95%. Click continue and
OK.
What is statistical
significance?
1. An assessment of the population based
on sample data.
A population parameter is always
statistically significant.
2. It is relative.
What is statistical
significance?
3. It is determined by:
What is statistical
significance?
We could hypothetically choose a
higher or lower alpha level, but there
are consequences:
If we increase alpha (say, to .10), we
are more likely to find statistical
significance, but we run the risk of a
type 1 error.
If we decrease alpha (say, to .01), we
are less likely to find statistical
significance, but we run the risk of a
type 2 error.
What is statistical
significance?
Type I Error
Correct Decision
Accept H0 that
Correct Decision
Type II Error
Expenditure Expenditure
Per Month, Per Month,
Rec February
March
Gender
1
513
575
467
450
298
373
494
525
367
389
621
664
404
390
533
610
379
449
10
513
517
11
440
481
12
475
455
Again, we can do #3
(find descriptive
statistics) and #4 (find
test statistic) at once:
go to Analyze
Compare Means
Independent Samples Ttest.
Expenditure Expenditure
Per Month, Per Month,
Rec February
March
Gender
1
513
575
467
450
298
373
494
525
367
389
621
664
404
390
533
610
379
449
10
513
517
11
440
481
12
475
455
on monthly spending on
groceries you ran first in
February, and again in
March.
Your boss suggests that
spending has not increased
from February to March.
Perform a hypothesis test
that the true population
mean for monthly
expenditure on groceries has
increased.
ANOVA
ANOVA allows you to compare the means of
several groups.
ANOVA provides a test against the H0 that more
than two means are equal.
Commonly used for experiments, where we
might compare:
A new treatment vs. a placebo treatment vs. no
treatment, or
A new treatment vs. a old treatment vs. no
treatment
treatment here refers to a subset of a sample, or
a group sharing a common characteristic.
H0 and HA
We wish to determine that there are
any statistically significant
differences between treatment
means (1, 2, 3, etc.)
H0: 1 = 2 = 3 etc.
HA: At least two of 1, 2, etc. differ.
We seek to show that at least one
pair of means are significantly
different from the others.
Variability
Between-Treatment Variability:
variability among the sample means.
Within-Treatment Variability:
variability among cases in each
sample.
ANOVA seeks to determine that
between-treatment variability is
greater than within-treatment
variability.
To get descriptive
and test statistics,
go to Analyze
Compare Means
One-Way ANOVA
2
)
3
)
4
)
Dependent List
will be the variable
you find means for
(gpa_cum); Factor
is the grouping
variable. To get
Tukey HSD, go to
Post Hoc and
select Tukey and
Games-Howell.
Also go to Options
and select
Homogeneity of
Variance Test and
2)
3)
ANOVA
What does it mean to reject H0 for
ANOVA?
At least one mean is statistically
significantly different from one other.
But F-test does not tell you which.
Post-Hoc Test: Tukey HSD. Identifies
meaningful differences among the
means.
Appropriate
Hypothesis Test
Example
Single Mean
compared with a
hypothesized mean
Comparing mean
expenditure for
February with some
prior guess
Comparison of
means, paired
samples
Comparing mean
expenditure for
February with mean
for March
Analysis of Variance
(ANOVA)
Comparing mean
expenditure for
freshmen,
sophomores, juniors,
and seniors