Regression Analysis 02

Quantitative Methods 2
Lecture 3
The Simple Linear
Regression Model
What is Ordinary Least Squares?
✔ Ordinary Least Squares (OLS) finds the
linear model that minimizes the sum of the
squared errors.
✔ Such a model provides the best
explanation/prediction of the data.
✔ Later we’ll show that OLS is the “Best
Linear Unbiased Estimator” (BLUE)
“Explained and “Unexplained”
Variation
yˆ = Βˆ 0 + Βˆ 1 x
û i
( yi − y )
Y
yi
ŷ i
Β̂1
Β̂ 0
Xi
X
“Explained and “Unexplained”
Variation Square this quantity
and sum across all
Square this observations and
quantity and sum we have our SST
across all (Total Sum of
Squares)
observations and û i
we have our SSR
(Residual Sum of ( yi − y )
Squares)
Y
Square this yi
quantity and sum
ŷ i
across all
observations and
we have our SSE
(Explained Sum
of Squares) Xi
X
Some Useful Properties of
Summation
n
1. xi  x1  x2  ...xn
i 1
n
2. c  nc
i 1
n n
3. cxi  c  xi
i 1 i 1
n n n
4.  axi  byi   a  xi  b  yi
i 1 i 1 i 1
  n
 
n    xi  
4.1. ( xi / yi )    i 1  
  n
 
i 1
   yi  
  i 1  
Some Useful Properties of
Summation
n n
y i x i
5. i 1  y ...and... i 1  x Proofs of 7 and 8
n n in Appendix A
n (page 709) of
6. ( xi  x )  0 Wooldridge
i 1
n n
7. ( xi  x )   x  n( x )
2 2
i
2
i 1 i 1
n n n n
8. ( xi  x )( yi  y )   xi ( yi  y )   ( xi  x ) yi   xi yi  n( x * y )
i 1 i 1 i 1 i 1
Minimizing the Sum of Squared
Errors
✔ How to put the Least in OLS?
✔ In mathematical jargon we seek to minimize
the residual sum of squares (SSR), where:
n
SSR = ∑ ( yˆ i − y i )
2
i =1
n
=∑ˆ
ui2
i=1
Picking the Parameters
✔ To Minimize SSR, we need parameter
estimates.
✔ In calculus, if you wish to know when a
function is at its minimum, you take the
first derivative.
✔ In this case we must take partial
derivatives since we have two parameters
(β0 & β1) to worry about.
Parameters that Minimize SSR
Minimize the Squared Errors
✔The SSR Function is:
n
SSR = ∑uˆ 2
i =1
SSR   ( yi  yˆ i )
Substitute in
2
our equation
for yhat.
SSR   ( yi  0  1 xi ) 2
Here comes the magic, Baby!
◆ Simplify Terms
 Partial Derivative with respect to β0
 Partial Derivative with respect to β1

Simplify Terms
Separate
SSR  ( yi  0  1 xi ) 2 terms
(A-B)
(A-B)2=
2=
First,
SSR  ( yi  (0  1 xi )) 2
22
Outside,
AA -BA-AB+B
-2AB+BLast
Inside, 2 2
(F.O.I.L)
SSR  yi  2 yi ( 0  1 xi )  (0  1 xi )
2 2
SSR  yi  20 yi  21 xi yi    20 1 xi   x

2 2
0
2 2
1 1
F.O.I.L
Multiply
-2yi
through
Partial Derivative with respect to β0
SSR  yi  20 yi  21 xi yi    20 1 xi   x

2 2
0
2 2
1 1
dSSR
 0  2 yi  0  20  21 xi  0
d 0 Take the
derivative
dSSR
 2 yi  20  21 xi only of
d 0 terms
which
include β0
dSSR
 2( yi  0  1 xi )(1)
d 0 Simplif
y
Partial Derivative with respect to β1
SSR  yi  20 yi  21 xi yi    20 1 xi   x

2 2
0
2 2
1 1
dSSR
 0  0  2 xi yi  0  2 0 xi  21 xi 2
d 1 Take the
dSSR derivative
 2 xi yi  2 0 xi  21 xi 2 only of
d 1 terms
which
dSSR include β1
 2( yi   0  1 xi )( xi )
d 1
Simplif
y
Partial Derivatives for β0 and β1
n
✔ First equation is the dSSR
partial derivative with   2( yi   0 1 xi )(1)
respect to β
dB0 i 1
0
n
dSSR
✔ Second equation is   2( yi  0  1 xi )( xi )
with respect to β1 d 1 i 1
Simplify and Set Equal to Zero
✔ First equation is for β0, second is for β1
✔ Set = 0 to find minimum point
✔ (Hats denote that parameters are estimates)
dSSR
 2 ( yi  ˆ0 ˆ1 xi )  0
dB0
dSSR
 2  xi ( yi  ˆ0 ˆ1 xi )  0
d 1
The Normal Equations
Divide equation 1 by -2  ( y  ˆ ˆ x )  0
i 0 1 i
and equation 2 by 2   x ( y  ˆ ˆ x )  0

i i 0 1 i
Multiply through by -x in  ( y  ˆ ˆ x )  0

i 0 1 i
β1 ’s equation
 ( x y  x ˆ ˆ x
i i i 0 1 i
2
)0
Separate summation terms

and rearrange to yield:
 y  (nˆ ) ˆ ( x )
i 0 1 i
 x y  ˆ ( x ) ˆ ( x
i i 0 i 1 i
2
)
Solving the Normal Equations
✔ Now we have two equations with two
unknown terms: β0 and β1
✔ These can be solved using algebra to

calculate the values of both β0 and β1
Solving for β1
✔ Multiply first normal
equation by the sum of  xi  yi  n 0  xi 1 ( xi )
ˆ ˆ 2
xi
✔ Multiply second n xi yi  ˆ0 n xi nˆ1 ( xi 2 )

normal equation by n.
Still Solving for β1 …
✔ Now subtract first equation from the second
✔ This yields:
n xi yi   xi  yi 
0  i 1  i 0 i 1  i
ˆ n x  nˆ ( x 2 )  nˆ x  ˆ ( x )2
Still Solving for β1 …
✔ Terms of ˆ0 n xi
cancel one n xi yi   xi  yi 
another out
nˆ1 ( xi 2 )  ˆ1 ( xi ) 2
✔ Then we
factor out β1 n xi yi   xi  yi 
from both
terms on the ˆ1 (n xi 2  ( xi ) 2 )
right-hand
side
✔ Then divide ˆ  n xi yi   xi  yi
through by the
n xi 2  ( xi )2
1
quantity on the
right hand side
to yield:
Tricky: Multiply both sides by
A Solution for β 1
1/n*1=1/n*n/n=n/n2
Why?
 
xi yi 
I need 
  to multiply
xi    
ythree
i  n 
Now multiply i
✔ xi yseparate  I can’t 
numerator &
ˆ
x y
i i 
n simply

 
numbers.
n   n  
split the 1/n.
denominator 1  
by 1/n

( x ) 2
Imagine

  I wanted  xi  
xi   to multiply
xi  
2 i
i  n 
2
x½(5*10)=25.

Ican’t solve
  it
n
  n 1/2(5)*1/2(10),
by multiplying   n  
✔ Recall that: which equals 12.5. That is like
multiplying by ¼. I need to
y i 
 y ...and...
xi
x ˆ
 
xi yi  n( x )( y )multiply xbyi y½*1=1/2*2/2=2/4.
i  n( x * y )
n n 1   Now,
 2

xi  n( x )( x ) 2(1/2*5)(1/2*10)=25 xi  n( x )
2 2
✔ This yields:
ˆ   ( xi  x )( yi  y )
 i
1
( x  x ) 2
Now Solving for βo
✔ Take the first normal equation

 i 0 1 i
y  ( n ˆ ) ˆ ( x )

✔ Then divide both sides by n and rearrange to
yield:
 yi  1 ( xi )  ˆ
ˆ
0
n n
A Solution for βo
✔ Now again that recall that:
 y  y ...and...  x  x
i i
n n
✔ Thus:
ˆ ˆ
 0  y  1 x
But What Does It Mean?
✔ Equation for β1 may not seem to make
intuitive sense at first
✔ But if we break it down into pieces, we can
begin to see the logic
ˆ 
 ( x  x
i )( y  yi)
 (x  x )
1 2
i
Understanding what makes β1
✔ Numerator for β1 is made of of TWO parts
– Deviations of X from its mean
– Deviations of Y from its mean
We know
– Then we multiply those deviationsthis as….
– Covarianc
And sum them up across all observations
e.
ˆ1 
 ( x  x )( y  y )
i i
 (x  x ) i
2
Understanding What Makes β1
✔ Denominator of β1 is made up of the
We know this
deviation of x from its mean times itself
as….
✔ We square this term. Variance in
✔ And sum up across all observations the
Independen
t Variable
ˆ 
 ( x ix )( y  y
i )
 (x  x )
1 2
i
✔ Thus β1 is made of of changes in x times changes in
y, divided by changes in x squared
– A.K.A “rise over run”
✔ Notice if the changes in x are EQUAL to the changes
in y, then β1 = 1
ˆ1 
 ( x  x )( y  y )
i i
 (x  x ) i
2
✔ If the changes in y are LARGER than the
changes in x, then β1 > 1
– I.E. a 1 unit change in x creates more than a 1
unit change in y
✔ If the changes in y are SMALLER than the
changes in x, then β1 < 1
– I.E. a 1 unit change in x creates less than a 1
unit change in y
✔ This corresponds to our intuitive
understanding of the slope of a line
– How much change in y do we observe for each
change in x?
✔ We can also see how β1 is calculated in
units of the dependent variable.
– It is changes in the dependent variable over
changes in the independent variable
Let’s Do An Example!
y x
8 2
2 0
5 1
26 8
14 4
17 5
26 8
Calculating β0 & β1
✔ Mean of x is 4
✔ Mean of y is 14
Y X Y - mean X - mean
8 2 -6 -2
2 0 -12 -4
5 1 -9 -3
26 8 12 4
14 4 0 0
17 5 3 1
26 8 12 4
Calculating β0 & β1
 ( x  x )( y  y ) = 186
i i
 i
( x  x ) 2
= 62
β1 =3
y x y - mean x- mean (x-x)(y-y) (x-x)(x-x)
8 2 -6 -2 12 4
2 0 -12 -4 48 16
5 1 -9 -3 27 9
26 8 12 4 48 16
14 4 0 0 0 0
17 5 3 1 3 1
26 8 12 4 48 16
Calculating βo and β1
ˆ0  y  ˆ1 x
✔ βo = mean of y - β1 (mean of x)
✔ Recall that:
– mean of y = 14 & mean of x = 4
✔ βo = 14 - 3(4)
✔ βo = 2
✔ Our equation is: y = 2 + 3x
Which Looks Like…This!
Regression of y on x
30
25
20
15
10
0
0 1 2 3 4 5 6 7 8 9
Calculating R2
✔ Let’s return to i  i 0 1i
ˆ
u 2
 ( y  ˆ  ˆ x )2

SSR
✔ Plug in βo and  i
( y  y ) 2
 1  i
2
( x  x ) 2
 i
ˆ
u 2
solve to get
SST SSE SSR
Calculating R2
✔ R2= SSE/SST
y - mean (y-mean)2
-6 36
-12 144
-9 81
2
 (x  x ) 2 12 144
0 0
R 
2 1 i
( y  y)
3 9
2
12 144
i
558
9(62)
R 
2
1
558 Our model
perfectly
explains
variation in y.

Regression Analysis 02

Încărcat de

Informații document

Descriere originală:

Drepturi de autor

Formate disponibile

Partajați acest document

Partajați sau inserați document

Opțiuni de partajare

Vi se pare util acest document?

Este necorespunzător acest conținut?

Drepturi de autor:

Formate disponibile

Regression Analysis 02

Încărcat de

Drepturi de autor:

Formate disponibile

Quantitative Methods 2

 Partial Derivative with respect to β1

SSR  yi  20 yi  21 xi yi    20 1 xi   x

SSR  yi  20 yi  21 xi yi    20 1 xi   x

SSR  yi  20 yi  21 xi yi    20 1 xi   x

and equation 2 by 2   x ( y  ˆ ˆ x )  0

Multiply through by -x in  ( y  ˆ ˆ x )  0

Separate summation terms

✔ These can be solved using algebra to

✔ Multiply second n xi yi  ˆ0 n xi nˆ1 ( xi 2 )

S-ar putea să vă placă și