Documente Academic
Documente Profesional
Documente Cultură
Ani Katchova
We have time series data on ppi (producer price index). Data are quarterly from 1960 to 2002.
Summary statistics: Mean(ppi)=64, mean ((ppi)=0.464.
Graphs in Stata
Differenced variable (ppi)
1960q1
-4
20
40
-2
100
120
1970q1
1980q1
time in quarters
1990q1
2000q1
1960q1
1970q1
1980q1
time in quarters
1990q1
2000q1
Dickey-Fuller Test
Const
L1.y or yt-1
Original variable
D.y or yt
0.5036*
-0.0006
(-0.26)
With trend
D.y or yt
0.5861*
-0.0084
(-0.793)
LD.y or yt-1
-0.4452*
(-6.86)
Trend or t
Test stat
p-value
Conclusion
Differenced variable
D2.y or yt
0.2067*
0.0050
-0.26
0.398(Stata)
0.927(SAS)
Variable is
not stationary
-0.793
0.99 (SAS)
0.96 (Stata & R)
Variable is not trend
stationary
-6.86
<0.001
Differenced variable is
stationary
The Dickey Fuller test shows that the original variable is not stationary, but the differences
variable is stationary so we need to use differences d=1 in the ARIMA models.
PACF
0.999
-0.555
-0.069
-0.209
0.023
0.125
-0.153
0.114
0.210
0.049
PACF
0.555
0.066
0.203
-0.031
-0.130
0.149
-0.118
-0.213
-0.051
0.166
-1.00
Autocorrelations of ppi
-0.50
0.00
0.50
1.00
1.00
10
20
Lag
30
40
10
20
Lag
30
40
For original variable, ACF is a slow decay function (indicating non-stationarity) and PACF cuts off
at lag 1 or 2.
5
Autocorrelations of D.ppi
-0.200.00 0.20 0.40 0.60
10
20
Lag
30
40
10
20
Lag
30
40
For differenced variable, ACF tails off and PACF cuts off after lag 1 use AR(1)?
7
ARIMA Models
Const
L1.ar
L2.ar
L1.ma
L2.ma
L3.ma
AIC
BIC
* These are the Stata results. R has very similar coefficients. In the SAS output, the MA components have reverse
signs than what is reported in this table and some coefficients have different magnitudes.
We know that the variable is not stationary so we need to use differenced variable ARIMA
(p,1,q). But here we also include models with the original variable ARIMA (p,0,q).
The coefficient on the lagged dependent variable is close to 1 indicating non-stationarity.
To select a model to use, look at the significance of the coefficients and also low AIC or BIC.
Usually, there are a few models that perform similarly, so it is up to the researcher to try a few
models and decide which one to use. The recommendation is to go with the simplest model.
ARIMA(1,1,1) is a good choice based on low AIC and BIC.
ARIMA(2,1,3) is also a good choice based on the significance of the lags.
Forecasting in R
Original variable after ARIMA(1,0,1)
10