Sunteți pe pagina 1din 7

A Guide to Modern Econometrics, 4th edition

Answers to selected exercises - Chapter 4


Exercise 4.1
a. The OLS results (Eviews output) are as follows:
Dependent Variable: AIRQ
Method: Least Squares
Included observations: 30
Variable

Coefficient

C
COAS
DENS
MEDI
RAIN
VALA

111.9347
-33.39830
-0.001073
0.000554
0.250699
0.000883

Std. Error
15.33179
10.45752
0.001623
0.000850
0.343518
0.002256

t-Statistic

7.300823
-3.193711
-0.661225
0.652151
0.729798
0.391543

R-squared
0.382922
Adjusted R-squared 0.254364
S.E. of regression 24.20266
Sum squared resid
14058.45
Log likelihood
-134.8149
F-statistic
2.978597
Prob(F-statistic)
0.031334

Prob.

0.0000
0.0039
0.5148
0.5205
0.4726
0.6989

Mean dependent var


S.D. dependent var
Akaike info criterion
Schwarz criterion
Hannan-Quinn criter.
Durbin-Watson stat

104.7000
28.02850
9.387659
9.667898
9.477310
1.829941

In this sample, air quality (airq) varies from 59 (best) to 165 (worst). Taking the estimated coe cients as given, the results indicate that coastal
regions, ceteris paribus, have a better air quality. Regions with a higher
value added, with more rain, or a higher household income tend to have
somewhat worse air quality. A higher population density is associated
with a somewhat better air quality, other things being equal. Statistical
signicance is ignored here (see b).
b. We test the null hypothesis that the (true) coe cient for medi is zero by
means of a t-test. The test statistic can be computed as
0:000554 0
= 0:652:
0:000850
This value does not allow us to reject the null hypothesis at any reasonable
level of signicance. (The associated p-value is 0.52). To test the joint

hypothesis that all coe cients, except the intercept, are equal to zero, we
use the F -test that is routinely provided. The F -statistic is 2.979 with a
p-value of 0.031. This indicates a marginal rejection at the 95% condence
level.
c. We perform a Breusch-Pagan test by saving the residuals from the OLS
regression of a. and computing their squares e2i : Next, we regress them
upon the full set of explanatory variables, including a constant. This
produces the following results.
Dependent Variable: RESID2
Method: Least Squares
Included observations: 30
Variable

Coefficient

C\qquad
COAS
DENS
MEDI
RAIN
VALA

220.2809
586.4857
0.375595
0.7105
570.1359
400.0307
1.425230
0.1670
-0.040621
0.062096
-0.654163
0.5192
-0.004736
0.032525
-0.145612
0.8854
-1.001574
13.14058
-0.076220
0.9399
0.000113
0.086306
0.001304
0.9990

R-squared
Adjusted R-squared
S.E. of regression
Sum squared resid
Log likelihood
F-statistic
Prob(F-statistic)

Std. Error

0.104721
-0.081795
925.8226
20571541
-244.1415
0.561458
0.728407

t-Statistic

Prob.

Mean dependent var


S.D. dependent var
Akaike info criterion
Schwarz criterion
Hannan-Quinn criter.
Durbin-Watson stat

468.6151
890.1337
16.67610
16.95634
16.76575
2.597881

The simple variant of the Breusch-Pagan test is based on computing N = 30


times the R2 of this regression and produces = 3:142 (p = 0:678): Note
that the p-value for the F -test of this auxiliary regression is 0.728. At the
95% condence level, this does not allow us to reject the null hypothesis.
Note, however, that the non-rejection may be due to a lack of power due to
the small number of observations and the general nature of the alternative
hypothesis.
d. The White test is based on including all squares and cross-products of the
regressors in the auxiliary regression of c, which leads to a large number
of regressors in the test regression. (In this case: 19 unique terms in
addition to the intercept.) Obviously, the maximum value for the White
test statistic based on N R2 is 30. Critical values for any relevant Chisquared distribution would be much larger than 30, which is very close
to the 95% critical value of a Chi-squared distribution with 19 degrees of
2

freedom. As a result, it is extremely unlikely to nd a rejection, indicating


that the use of the White test is inappropriate in this case, as the sample
size is too small (given the number of regressors). As an aside, Eviews
provides the option of computing a version of the White test without
cross-terms.
e. The auxiliary regression produces the following results.
Dependent Variable: LOGRES2
Method: Least Squares
Included observations: 30
Variable
C
COAS
MEDI

Coefficient

Std. Error

4.436858
1.571601
-5.29E-05

0.527094
0.614876
2.29E-05

R-squared
0.273084
Adjusted R-squared 0.219238
S.E. of regression 1.520931
Sum squared resid
62.45721
Log likelihood
-53.56742
Durbin-Watson stat 2.853740

t-Statistic

Prob.

8.417587
2.555965
-2.306350

0.0000
0.0165
0.0290

Mean dependent var


S.D. dependent var
Akaike info criterion
Schwarz criterion
F-statistic
Prob(F-statistic)

5.035834
1.721275
3.771161
3.911281
5.071608
0.013491

The F -statistic of this regression is 5.07 with a p-value of 0.013, indicating a


rejection of the null hypothesis at the 95% condence level.
^ 2 = exp(log res2f
f. We compute h
4:4368), where the logres2f denotes the
i
i
^ i (including
tted value from f. Subsequently, we divide all variables by h
the intercept) and re-estimate the model of a. The results are as follows.
Dependent Variable: AIRQ/H
Method: Least Squares
Sample: 1 30
Included observations: 30
Variable
ONE/H
VALA/H
RAIN/H
COAS/H
DENS/H
MEDI/H
R-squared

Coefficient

Std. Error

115.7941
0.000115
0.165046
-32.64689
-0.000503
0.000700

11.71455
0.000868
0.300249
7.743770
0.001292
0.000357
0.965137

t-Statistic
9.884637
0.132787
0.549698
-4.215891
-0.389019
1.958386

Prob.

0.0000
0.8955
0.5876
0.0003
0.7007
0.0619

Mean dependent var


3

93.67218

Adjusted R-squared
S.E. of regression
Sum squared resid
Log likelihood

0.957873
14.96537
5375.096
-120.3932

S.D. dependent var


Akaike info criterio
Schwarz criterion
Durbin-Watson stat

72.91367
8.426211
8.706451
1.796558

While the standard errors reported above are smaller than those reported
under a., note that the latter standard errors are likely to be incorrect
(given the test results of c.). If we now test whether average income
aects air quality, we nd a t-statistic of 1.96, which is on the margin
of being signicant at the 95% condence level. The F -test for the null
hypothesis that all ve slope coe cients are zero produces a value of 7.262
(based on additional calculations) and a corresponding p-value of 0.0003,
indicating a strong rejection. (The Chi-squared version of this Wald test
equals 36.31.) Note that these results are quite dierent from those based
on OLS (part b.).
g. The R2 is inated because of two reasons. First, it expresses the amount
of variation in airq/h that is explained by the model, not the variation
in air quality itself. Because observations with large values for hi are less
accurately described by the model, this increases the value of the reported
R2 . Second, because in estimation the intercept term is suppressed, Eviews adjust the denition of the R2 to its uncentred version (see eq.
(2.43)).
Exercise 4.2
We extend the specication of Table 4.9 by including cons t
sumption. The results are as follows.

1,

lagged con-

Dependent Variable: CONS


Method: Least Squares
Sample(adjusted): 2 30
Included observations: 29 after adjusting endpoints
Variable

Coefficient

C
PRICE
INCOME
TEMP
CONS(-1)

0.029105
-0.707504
0.003835
0.003323
0.098788

R-squared
Adjusted R-squared
S.E. of regression

Std. Error

t-Statistic

Prob.

0.270071
0.810356
0.001238
0.001057
0.298172

0.107767
-0.873078
3.098268
3.145391
0.331312

0.9151
0.3913
0.0049
0.0044
0.7433

0.764517
0.725270
0.034992

Mean dependent var


S.D. dependent var
Akaike info criterion
4

0.358517
0.066760
-3.711804

Sum squared resid


Log likelihood
Durbin-Watson stat

0.029387
58.82116
1.176422

Schwarz criterion
F-statistic
Prob(F-statistic)

-3.476063
19.47952
0.000000

The additional variable is statistically insignicant. Note, however, that because of the lagged dependent variable, the Durbin-Watson test statistic is no
longer appropriate (even though Eviews reports it). One can test for rst-order
serial correlation using one of the asymptotic tests discussed in Subsection 4.7.1.
This requires an auxiliary regression of the OLS residual upon its lag and all
explanatory variables from the model. The following results are obtained.
Dependent Variable: RES
Method: Least Squares
Sample(adjusted): 3 30
Included observations: 28 after adjusting endpoints
Variable
C
RES(-1)
INCOME
PRICE
TEMP
CONS(-1)

Coefficient
0.079185
0.506950
0.000581
-0.207927
0.000906
-0.327099

R-squared
Adjusted R-squared
S.E. of regression
Sum squared resid
Log likelihood
Durbin-Watson stat

Std. Error
0.260370
0.248772
0.001224
0.784520
0.001096
0.324038

0.159628
-0.031365
0.033330
0.024439
58.88266
1.697923

t-Statistic
0.304124
2.037808
0.474802
-0.265038
0.826613
-1.009448

Prob.

0.7639
0.0538
0.6396
0.7934
0.4173
0.3237

Mean dependent var


S.D. dependent var
Akaike info criterion
Schwarz criterion
F-statistic
Prob(F-statistic)

-0.000613
0.032819
-3.777333
-3.491860
0.835779
0.538434

The implied estimate for the rst-order serial correlation coe cient
is
0:51. The t-test on its signicance rejects marginally at the 95% condence
level. The test based on (T 1)R2 produces a statistic of 28 0:1596 = 4:47,
which implies a rejection (again at 95% condence). Apparently, the inclusion
of lagged consumption is insu cient to solve the autocorrelation problem.
Exercise 4.3
a. The inconclusive regionof the Durbin-Watson test is an interval of outcomes
for the dw test statistic for which the test is inconclusive (see Subsection
4.7.2). Because exact critical values are unknown, we work with upper
and lower bounds and outcomes may be such that we can neither reject
the null hypothesis nor not-reject the null hypothesis. The inconclusive
region is typically small if the sample size is large.
5

b. In general, the nding of autocorrelation means that the unobservables contained in the error term are related from one period to the next. If the
functional form of the model is incorrect, the unobservables pick up this
misspecication and may exhibit patterns of serial correlation. See Figure
4.5 for an illustration.
c. The answer here is the general version of the answer under b. For example, excluding a variable that describes seasonal patterns in monthly or
quarterly data may lead to ndings of autocorrelation.
d. In case of rst order autocorrelation, including the lagged error term as
a (hypothetically observable) regressor would eliminate the autocorrelation problem. Because the lagged error term is a function of the lagged
dependent variable and the lagged regressors, including the latter set of
variables also eliminates the serial correlation problem (see Subsection
4.10.1). Doing so, may not be preferable. First, we may be interested in
a static model rather than a dynamic one. That is, we may be interested
in Efyt jxt g; where the lagged dependent variable (and lagged regressors)
are not included in the conditioning set. Second, if we are interested in
a dynamic model the inclusion of lagged regressors and lagged dependent
error term, without any restrictions, this is not parsimonious. If rstorder autocorrelation is appropriate, imposing coe cient restrictions as in
(4.59) is preferred. (However, not imposing these restrictions generates
more exibility and robustness, and this may be preferred if the number
of observations is not too small.)
e. An overlappings samples problem arises if the horizon at which the dependent variable in a model is dened exceeds the frequency with which
it is observed. For example, annual stock market returns observed every
month. In this case, returns are observed over (partly) overlapping periods of twelve months. The problem is that this generates a moving
average autocorrelation pattern. Unexpected events in one month come
up in dierent twelve month periods and therefore lead to serial correlation. Routinely computed standard errors and tests will be incorrect and
misleading (see Subsection 4.11.3 for an illustration).
f. See Subsection 5.2.1.
g. The use of Newey-West standard errors is appropriate if autocorrelation is
suspected, for example in case of an overlapping samples problem. However, these standard errors are only asymptotically appropriate and require
that the order of autocorrelation is limited. Reasonable sample sizes are
thus required.
h. This extends the procedure discussed in Subsection 4.6.1. First, estimate the
model by OLS. Second, estimate the rst and second order autocorrelation
coe cient by regressing the OLS residual upon its rst two lags. Denoting

the true coe cient by


yt

1 yt 1

and
2 yt 2

the transformed model is given by

= (xt

1 xt 1

2 xt 2 )

+ vt :

Finally, replacing 1 and 2 by their estimates, and estimating the resulting transformed model by OLS, produces the feasible GLS estimator.
This estimator does not exploit information contained in the rst two
observations.
c 2012, John Wiley and Sons

S-ar putea să vă placă și