Documente Academic
Documente Profesional
Documente Cultură
7
Paired t-Test
Course Code: IE 301
Program: BSIE
Section:IE31FB3
Members:Madrigal, Dominic R.
1. Objective(s):
The activity aims to introduce basic ideas of power and sample size calculations for 2-sample t-Test.
2. Intended Learning Outcomes (ILOs):
The students shall be able to:
2.1
test for a difference between two population means using a 2-sample t-test
2.2
determine the sample size required to detect an effect of a given size with a given degree of
confidence.
3. Discussion:
A paired t-Test helps determine whether the mean differences between paired observations is significant.
Statistically, the paired t-test is equivalent to performing a 1-sample t-test on the differences. A paired ttest also helps you to evaluate whether the mean difference is equal to a specific value.
Paired observations are related. Examples include:
1. Weights recorded for individuals before and after an exercise program
2. Measurements of the same part taken with two different measuring devices.
Paired t-test with a random sample of paired observations. The test also assumes that the paired
differences come from a normally distributed population. However, the test is robust to violations of this
32
assumption, provided the observations are collected randomly and the data are continuous, unimodal,
and reasonably symmetric.
Why use a paired t-test?
A paired t-test answers questions such as:
1. Does a new treatment result in a difference in the product?
2. Do two different instruments provide similar measurements for the same sample?
4. Resources:
MiniTab Software/Manual
Training Data Set
Textbooks
5. Procedure:
Practice Problem: A consumer group wants to determine whether drivers can park one car more quickly
than the other. Because the data are paired (each individual parked both cars), use a paired t-test to teatv
the following hypothesis:
H0: The mean difference between paired observations in the population is zero.
H1: The mean difference between paired observations in the population is not zero.
Use the default confidence level of 95%. Display individual value plots and boxplots to help visualize the
data.
1. Open CARCTL.MPJ
2. Choose Stat Basic Statistics Paired t.
3. Complete the dialog box as shown below.
33
4. Click Graphs
5. Check Individual value plot and Boxplots of differences.
6. Click OK in each dialog box
7. Interpret the results
8. Draw conclusions.
Before checking for normality, store the pair wise differences in the worksheet.
1. Choose Stat Basic Statistics 2 Variances
2. Complete the dialog box as shown below.
3. Click OK.
4. Interpret the results.
5. Draw conclusions.
6. Data and Results:
Part 1:
Boxplot of Differences shows the mean is within the hinges of the box while the Individual Value Plot of Differences
shows the most of the red dots is within the mean, therefore, there is no mean difference between paired observations in the
population.
35
The two graphs shows that the data points approximately follow the straight line, therefore, the two products has no
Difference.
Part 3:
The individual value plot shows the mean of SupplrA increases than SupplrB. In histogram, SupplrA has a normal shape
while SupplrB is left-skewed. The individual plot of SupplrA and SupplrB do not overlap and the boxplot shows the medians are
close, therefore, the mean probability is significant.
36
CRITERIA
Laboratory Skills
Manipulative Members do not
Skills
demonstrate needed
skills.
Experimental Members are unable to
Set-up
set-up the materials.
Members occasionally
demonstrate needed
skills.
Members are able to
set-up the materials
with supervision.
Members occasionally
demonstrate targeted
process skills.
Members always
demonstrate needed
skills.
Members are able to
set-up the material with
minimum supervision.
Members always
demonstrate targeted
process skills.
Process
Skills
Members do not
demonstrate targeted
process skills.
Safety
Precautions
Work Habits
Time
Management/
Conduct of
Experiment
Cooperative
and
Teamwork
Ability to do
Members require
independent
supervision by the
work
teacher.
Other Comments/Observations:
SCORE
37
Program:BSIE
Section:IE31FB3
Members:Madrigal, Dominic R.
Description
39
Revenue
5. Click OK.
6. Interpret the results
Part 2: Calculating the correlation
11. Choose Stat Basic Statistics Correlation
40
13. Click OK
14. Interpret the results
15. Draw conclusions.
6. Data and Results:
41
The plot shows the data values of variable x and y. As the sales calls increases, the
revenue also increases.
Therefore they are directly proportional and the correlation is postive.
The correlation is equal to 0.802. The relationship between revenue to sales calls is directly proportional but the amount
is not consistent.
42
CRITERIA
Laboratory Skills
Manipulative Members do not
Skills
demonstrate needed
skills.
Experimental Members are unable to
Set-up
set-up the materials.
Members occasionally
demonstrate needed
skills.
Members are able to
set-up the materials
with supervision.
Members occasionally
demonstrate targeted
process skills.
Members always
demonstrate needed
skills.
Members are able to
set-up the material with
minimum supervision.
Members always
demonstrate targeted
process skills.
Process
Skills
Members do not
demonstrate targeted
process skills.
Safety
Precautions
Work Habits
Time
Management/
Conduct of
Experiment
Cooperative
and
Teamwork
Neatness and
Orderliness
Ability to do
Members require
independent
supervision by the
work
teacher.
Other Comments/Observations:
SCORE
43
Program:BSIE
Section:IE31FB3
Members:Madrigal, Dominic R.
1. Objective(s):
The activity aims to measure the degree of linear association between two variables using graphs and
correlation
Model the relationship between a continuous response variable and one or more predictor variables.
2. Intended Learning Outcomes (ILOs):
The students shall be able to:
2.1 Evaluate the linear relationship between two variables using scatterplot, correlation, and fitted line plot.
2.2 Analyze and interpret results and draw conclusions about the output provided by Minitab.
3. Discussion:
Simple Linear Regression examines the relationship between a continuos response variable (y) and one
predictor variable (x) . The general equation for a simple linear regression model is:
Y O 1
Where Y is the response, X is the predictor, O is the intercept (the value of Y when X equals zero), 1 is
the slope and is random error.
Use simple linear regression when you have a continuos y and one predictor , x. The following conditions
44
R (R Sq )
R 2 is the proportion of variability in the response that is explained by the equation. Acceptable values for
R
vary depending on the study. For example for engineers studying chemical reactions may require an
R 2 of 90% or more. However, someone studying human behavior ( which is more variable) may be
satisfied with much lower R 2 values.
45
R adjusted (R q (adj))
S
2
R adjusted is sensitive to the number of terms in the model and is important when comparing models
with different number of terms.
The Least Squares regression line
The coefficients for the regression equation are chosen to minimize the sum of the squared differences
between the response values observed in the sample and those predicted by the equation.
In other words the squared vertical distances between the points and line are minimized. The result is
called the Least squares regression line.
Confidence and prediction bands
Confidence bands provide the estimated range in which the mean response for a given value of the
predictor is expected to fall.
Prediction bands provide the estimated range in which a single new observation for a given value of the
predictor is expected to fall.
Analysts want to be confident that the mean and the individual points of the y-variable, Revenue, fall within
certain limits of variability.
Use the default confidence level of 95%
Confidence Interval
The 95% confidence interval defines a likely range of values for the population mean of y. For any given
value of x, you can be % confident that the population mean for y is between the indicated lines.
Prediction interval
The 95% prediction interval defines a likely range of y values for future individual observations. For any
given value of x, you can be 95% confident that the corresponding value of y for a single future observation
is between the indicated lines.
Note : The prediction interval is always wider than the confidence interval because of the added uncertainty
46
Indicates
..
47
Curvilinear
Outliers exist
Indicates
..
Additional Considerations
1. Be careful when using regression analysis to assert that changes in the predictor values were fixed
at predetermined levels in a controlled experiment. If the values of the predictors are allowed to
vary randomly, other factors may influence both the predictors and the response.
2. Do not apply regression results to values of x that are outside the sample range. The relationship
between Sales calls and Revenue may be very different for sales calls above 168.
3. Be alert for outliers when using regression procedures. Some outliers (called high leverage points)
have a large effect on the calculation of the least squares regression line. In such cases, the line
may no longer represent the rest of the data very well.
4. Time order trends in the data can violate the assumption of independence,. A run chart or individual
chart is a useful tool for detecting such efforts.
4. Resources:
48
MiniTab Software/Manual
Training Data Sets
Textbooks
5. Procedure:
Practice Problem: The sales department for a software company wants to determine whether a relationship
exists between the number of sales calls made and the revenue earned. Analysts record the number of
sales calls and the revenue earned each day for a period of 420 days.Determine the effect of Sales calls on
Revenue. Use fitted line plot to calculate and plot the regression equation.
Variable
Description
Revenue
4. Click OK.
49
4. Click OK
5. Click Graphs
6. Complete the dialog box shown below
50
51
Ho = U1 = U2=...=U420
Ha = some means are differrent
The p-value is equal to 0.000 and is less than to 0.05. We reject Ho. Therefore, the means of the number of sales calls
and the revenue earned are different.
Part 2:
Fitted Line Plot we can see that as the sales calls increases, so does the revenue.
Normal Probability Plot the data points follow the straight line and the p-value is less than 0.05, therefeore, the normal
distribution appears to fit the sample data.
Histogram the shape is normal so the data appear to be normally distributed.
Versus Fits shows that the data have a constant variance.
Versus Order shows that the data are correlated with each other.
52
CRITERIA
Laboratory Skills
Manipulative Members do not
Skills
demonstrate needed
skills.
Experimental Members are unable to
Set-up
set-up the materials.
Members occasionally
demonstrate needed
skills.
Members are able to
set-up the materials
with supervision.
Members occasionally
demonstrate targeted
process skills.
Members always
demonstrate needed
skills.
Members are able to
set-up the material with
minimum supervision.
Members always
demonstrate targeted
process skills.
Process
Skills
Members do not
demonstrate targeted
process skills.
Safety
Precautions
Work Habits
Time
Management/
Conduct of
Experiment
Cooperative
and
Teamwork
Neatness and
Orderliness
Ability to do
Members require
independent
supervision by the
work
teacher.
Other Comments/Observations:
SCORE
53