Documente Academic
Documente Profesional
Documente Cultură
Variance=
SD=
a (5 Points) Calculate the average number of cars per a day and describe how outliers affect the value of the mean.
b (5 Points) Calculate the mean number of cars per a day. What does this tell you and is this a better or a worse estim
number of calls per a day? Explain.
d (2.5 Points) Calculate the Standard Deviation. Explain what this tells you.
e (5 Points) Based on the data collected, is the graph skewed to the left, right, or symmetric? If so, please explain you
oning and how to interpret this information.
Car washed
879
1003
1200
1118
car washed
car washed
car washed
55.04
es that "lie outside" the other values. Hence if a number is too far from the main group, it gives a bad mean.
14643.5
121.010330138
The graph is skewed to the left (negative skew). This means that the mean is on the left of the peak.
y cars you wash on a daily basis.
Day
0-19
20-39
40-59
ric? If so, please explain your 60-80
80-99
100-110
120-139
140-159
160-180
2-c)
Here f is frequency of the group. X is the mid point of the group and x bar is total mea
Histogram
Frequency
52.4383562
Sales
SUMMARY OUTPUT
Regression Statistics
Multiple R 0.3601347911
R Square 0.1296970678
Adjusted R Square 0.1016227797
Standard Error 857089.133581316
Observations 33
ANOVA
df SS
Regression 1 3393699486846.59
Residual 31 22772655269998.3
Total 32 26166354756844.9
m = XiYi
Xi^ 2
C= Y - m X
Xi^ 2
C= Y - m X
X= $ 36,948.24
Y= $ 1,746,155.98
m= $ 46.28
C= $ 36,329.36
Y=46.28 X + 36.329.36
Sales of 50,000 Y=46.28 X 50,000+36,329.36
Y=2,350,329.36
Growth HS College
0.8299 73.5949 17.8350
0.6619 88.4557 31.9439
0.9688 73.5362 18.6198 25 Marks:
0.0821 79.1780 20.6284
0.4646 84.1838 35.2032 The data at left are m
2.1796 93.4996 41.7057 the sample, are app
customers is referre
1.8048 78.0234 28.0250
Sales ------Latest on
-0.0569 70.2949 15.0882 Income ---Median fa
-0.1577 70.6674 10.9829 Age --------Median a
0.3664 63.7395 13.2458 HS ----------Percenta
2.2256 76.9059 19.5500 College ---Percentag
1.5158 82.9452 20.8135 Growth ---Annual po
0.1413 65.2127 16.9796
-1.0400 73.4944 32.9920 Q-2a (5 Points). Co
1.6836 80.2201 22.3185
Q-2b (5 Points). As
2.3596 87.5973 24.5670
0.7840 85.3041 30.8790 Q-2c (2.5 points): Pr
0.1164 65.5884 17.4545
1.1498 80.6176 18.6356 Q-2d (2.5 points): E
0.0606 80.3790 38.3249
1.6338 79.8526 23.7780 Q-2e (10 Points). In
1.1256 81.2371 16.9300
1.4884 70.2244 19.1429
4.7937 87.1046 30.8843
1.8922 80.2057 26.5570
1.8667 75.2914 28.3600
1.7896 77.6162 19.2490
0.2707 85.1753 35.4994
3.0129 74.1792 18.6375
3.4630 81.6991 41.1130
0.7041 73.4140 17.8566
-0.1569 73.7161 26.5426
0.7084 78.6493 29.8734
Sales
Sale
Sale
MS F Significance F
3393699486846.59 4.6197811737 0.0395229519
734601782903.171
The data at left are monthly sales totals from a random sample of 33 stores in a large chain of nationwide clothing s
the sample, are approximately the same size and carry the same merchandise. The county, or in some cases coun
customers is referred to here as the customer base. For each of the 33 set are:
Sales ------Latest one month sales total (dollars)
Income ---Median family income of customer base (dollars)
Age --------Median age of customer base (years)
HS ----------Percentage of customer base with a high school diploma
College ---Percentage of customer base with a college diploma
Growth ---Annual population growth rate of customer base over the past 10 years.
Q-2a (5 Points). Construct a scatter plot, using sales as the dependent variable and median family income as the in
Q-2b (5 Points). Assuming a linear relationship, use the least-squares method to compute the regression coefficien
Q-2c (2.5 points): Predict the sales based off income of $50,000.00
Q-2d (2.5 points): Explain why it would not be appropriate to use the model to predict sales when income is $10,000
Q-2e (10 Points). Interpret the meaning of the Y-intercept, b0, and the slope, b1, in this problem.
It can be seen that the plots are scattered all over the place.
Sales
Sales
Upper 95.0%
1724732
63.327983
2-d)
At income =10,000 Sales 870489.64
Standard Error 857089.13
Predicted sales is almost the same as the error. That's why it won't be appropriate
2-e)
Sales = b1*Income +b0
Sales = 32.494*income+545545.7873
b1>0 which means that for an increase in income sales will also increase
Also, when we increase income by 1 unit, Sales increases by 32.494 unit
b0=545545.7873 which is equal to the sales generated when the income of the family =0
e chain of nationwide clothing stores. All stores in the franchise, and thus within
county, or in some cases counties, in which the store draws the majority of its
median family income as the independent variable. Discuss the scatter plot.
mpute the regression coefficients b0 and b1 and state the regression equation.
his problem.
Q-3: The cellular spinoff company Jog wants to estimate the proportion of househ
were made available with a free handset. A random sample of 500 accounts is se
purchase an additional line if the handset was free.
Construct a 99% confidence interval estimate of the population proportion of accounts that w
the table below to arrive at your answer. Points breakdown below:
(2.5 Points)- Determine N, P-bar, and Confidence Level, Square Root, and determine the Cen
(2.5 Points)- Complete the table in full.
(5 Points)- Describe the confidence interval estimate of the population proportion of accoun
Q-4 (15 Points): The amount of time it takes to take your order at the local Briarp
a standard deviation s of 0.40 minutes. If you select a random sample of 16 custo
Q-4a (5 Points): What is the probability that the mean time spent per customer is at least 3 m
Q-4b (5 Points): What is the probability that it takes between 3 minutes and 5 minutes to tak
Q-4c: (5 Points) What is the length of an order if only 1% of all orders are shorter? (Round to
Proportions
n >= 30
Interval width
1-confidence level 1%
(1-confidence level)/2 0.005
z 2.58
(p)(1-p) 0.20
s = sqrt[(p)(1-p)} 0.44
sqrt(n) 22.36
s/sqrt(n) 0.02
Check assumptions:
np>5 OK
n(1-p) > 5 OK
proportion of households that would purchase an additional cellular line if it
of 500 accounts is selected. The results indicate that 135 of the accounts would
ortion of accounts that would purchase the additional line if the handset were free. Fill in
n proportion of accounts that would purchase the additional line if the handset were free.
er at the local Briarpatch restaurant has a population mean m of 3.1 minutes and
m sample of 16 customers:
Z score at X =5 19
Z score at X =3 -1
Probability = 0.8413447
c) Probability = 1%
Z score at probability -2.326348
- P( 3-//sqrt(n))
iation/sqrt(n))
25 Points:
Q-5 (15 Points). You are the manager of a popular retail store. You want to determine w
check-out has changed in the past month from its previous population mean value of 4.2
population is normally distributed with a population standard deviation of 1.6 minutes. Y
during a one hour period. The sample mean is 4.75 minutes. Determine whether there
population mean wait time to check-out has changed in the past month from its previous
findings utilizing information from the hypothesis test you conduct.
Q-5c (5 Points): Based on the information do we reject or accept the null hypothesis. De
table below to help answer the questions.
Q-5d (5 Points): Describe what would occur if the same size was doubled.
Q-6 (5 points):
A sport preference poll yielded the following data for men and women. Use the 5% signif
gender are independent.
Sport Preference
Basketball Football Soccer
Gender Men 20 25 30 75
Women 18 12 15 45
Total 38 37 45 120
Q-7 (5 Points): Suppose that we observe a random sample of size n from a normally dis
5% significance level, is it true that we can definitely reject in favor of the appropriate one
why not?
Mean
p-value
Significance Level
(Significance Level)/2
z
sqrt(n)
s/sqrt(n)
z*s/sqrt(n)
Value of z-statistic
ail store. You want to determine whether the population mean wait time for customer to
ous population mean value of 4.25 minutes. From past experience, you can assume that the
dard deviation of 1.6 minutes. You select a sample of 43 customers wait time to check-out
nutes. Determine whether there is evidence at the 0.05 level of significance that the
the past month from its previous population mean value of 4.25 minutes. Explain your
u conduct.
or accept the null hypothesis. Describe what this means for the results of the test. Fill in the
n and women. Use the 5% significance level and test to determine is sport preference and
mple of size n from a normally distributed population. If we are able to reject in favor of at the
ect in favor of the appropriate one-tailed alternative at the 2.5% significance level? Why or
Since p value >0.05, we accept the null hypothesis and say that sports preference and gender are
Expected
Football Soccer Total
23.125 28.125 75
13.875 16.875 45
37 45 120