Documente Academic
Documente Profesional
Documente Cultură
Problem 1: Jennifer Dahl, supervisor of the Circle O discount chain, would like to forecast
the time it takes to check out a customer. She decides to use the following independent
variables: number of purchased items and total amount of the purchase. She collects data for
a sample of 18 customers, shown in Table P-8.
Problem 2: Table P-9 contains data on food expenditures, annual income, and family size for
a sample of 10 families.
b. Fit the regression model and interpret the least squares coefficient,
c. Compute the fitted values for each of the pairs, and plot the fitted values on the scatter
diagram. Draw straight lines through the fitted values for male drivers and female drivers,
respectively. Specify the equations for these two straight lines.
X1 X2 X3 Y
87 85 2.7 91
100 84 3.3 90
91 82 3.5 83
85 60 3.7 93
56 64 2.8 43
81 48 3.1 75
77 67 3.1 63
86 73 3.0 78
79 90 3.8 98
96 69 3.7 99
93 60 3.2 54
92 69 3.1 63
100 86 3.6 96
80 87 3.5 89
100 96 3.8 97
69 51 2.8 50
80 75 3.6 74
74 70 3.1 58
79 66 2.9 87
95 83 3.3 57
a. Fit a multiple regression model to predict the final exam score from the scores on the
within-term exams and GPA. Is the regression significant? Explain.
b. Predict the final exam score for a student with within-term exam scores of 86 and 77 and a
GPA of 3.4.
c. Compute the VIFs and examine the t statistics for checking the significance of the
individual predictor variables. Is multicollinearity a problem? Explain.
d. Compute the mean leverage. Are any of the observations high leverage points?
Problem 5: Table P-21 contains the number of accounts (in thousands) and the assets (in
billions of dollars) for 10 online stock brokerages. Plot the assets versus the number of
accounts. Investigate the possibility the relationship is curved by running a multiple
regression to forecast assets using the number of accounts and the square of the number of
accounts as independent variables.
Problem 6: The sales manager of a large automotive parts distributor, Hartman Auto
Supplies, wants to develop a model to forecast as early as May the total annual sales of a
region. If regional sales can be forecast, then the total sales for the company can be forecast.
The number of retail outlets in the region stocking the company’s parts and the number of
automobiles registered for each region as of May 1 are the two independent variables
investigated. The data appear in Table P-12.
c. Forecast the annual sales for region 12, given 2,500 retail outlets and 20.2 million
automobiles registered.
Number of
Annual Sales Number of
Region Automobiles
($ millions) Retail
X2 Registered
Y Outlets X1
($ millions)
1 52.3 2011 24.6
2 26 2850 22.1
3 20.2 650 7.9
4 16 480 12.5
5 30 1694 9
6 46.2 2302 11.5
7 35 2214 20.5
8 3.5 125 4.1
9 33.1 1840 8.9
10 25.2 1233 6.1
11 38.2 1699 9.5
Problem 7: Ms. Haight, a real estate broker, wishes to forecast the importance of four factors
in determining the prices of lots. She accumulates data on price, area, elevation, and slope
and rates the view for 50 lots. She runs the data on a correlation program and obtains the
correlation matrix given in Table P-17. Ms. Haight then runs the data on a stepwise multiple
regression program.
a. Determine which variable will enter the model first, second, third, and last.
View
Variable Price Area Elevation Slope View
Price 1 0.59 0.66 0.68 0.88
Area 1 0.4 0.64 0.41
Elevation 1 0.13 0.76
Slope 1 0.63
View 1