Documente Academic
Documente Profesional
Documente Cultură
1. (data from J&W Exercise 1.6 ) The data in air.dat are 42 measurements on air-pollution variables in
Los Angeles. The columns are x1 = wind, x2 = solar radiation, x3 = CO, x4 = NO, x5 = NO2, x6 = O3, and x7
= HC.
(a) Using notation consistent with our lessons, explain briefly what Xij and j represent in terms of the
variables here. Similarly, explain briefly what jk represents. What does it mean if jk = 0?
(b) Considering only solar radiation, CO (carbon monoxide), and NO2 (nitrogen dioxide), can these
variables reasonably be assumed to be multivariate normal? Justify your answer with appropriate plots.
(c) Provide a matrix of scatterplots for the three variables above. Also, report the numeric correlation
for each pair of variables. Is any pair significantly correlated? Answer this with separate confidence
intervals of correlation. Use a Bonferroni adjustment for multiplicity so that your confidence for all
intervals simultaneously is 95%.
Hypothesis: H0: = 0 vs. Ha : = 0 at the = 0.05
Since the p-value for CO and NO2 is 0.000 < .05, we can reject the null hypothesis and conclude that
there is a significant correlation between these two variables.
2.4962
35 : 0.628 (0.243, 1.013)
(42)
20.243 1 21.013 1
( 20.243 , ) (.238, 0.767)
+ 1 21.013 + 1
2. Measurements of biochemical oxygen demand (Y1) and suspended solids (Y2) were obtained from the
discharge of n = 11 municipal wastewater treatment plants into the rivers of Wisconsin.
Assume that these data are sampled from a bivariate normal population with mean vector and
covariance matrix .
(a) Find the sample mean vector and the sample covariance matrix.
(b) Find 95% confidence intervals for the population means using
10.45
1 : 34.64 2.634 (26.34, 42.94)
11
19.07
2 : 33.18 2.634 (18.04,48.33)
11
2(10)4.257 10.45
1 : 34.64 (24.94, 44.33)
9 11
2(10)4.257 19.07
2 : 33.18 (15.50,50.86)
9 11
(c) Compute the sample correlation r between oxygen demand and suspended solids.
12 120.37
12 = = = 0.604
12 22 109.25 363.76
(d) Test H0 : = 0 against Ha : = 0 at the = 0.01 level. What are your conclusions?
9
= 0.604 = 2.27, with 9 df and a p value of 0.0492. Since the p-value > .01, we cannot
10.604 2
reject the null hypothesis that says the correlation is equal to zero at the .01 significance level.
1 + .604
12 = .5 ln ( ) = 0.6994
1 .604
1.96
0.6994 (0.0064, 1.39)
(8)
2.0064 1 21.39 1
( 2.0064 , ) (.0064, 0.88)
+ 1 21.39 + 1
SAS CODE