Documente Academic
Documente Profesional
Documente Cultură
What We Learn ?
ρ = Population Correlation
r = Sample Correlation
y y y
x x x
r = -1 r = -.6 r=0
y y
x x
r = +.3 r = +1
Sample correlation coefficient:
r
( x x)( y y) r
n xy x y
[ ( x x ) ][ ( y y ) ]
2 2
[n( x 2 ) ( x )2 ][n( y 2 ) ( y )2 ]
where:
r = Sample correlation coefficient
n = Sample size
x = Value of the independent variable
y = Value of the dependent variable
Significance Test for Correlation :
H 0: ρ = 0 (no correlation)
HA: ρ ≠ 0 (correlation exists)
r
t Df = n -2
1 r 2
n2
Simple Linear Regression Model :
Independent
ŷ i b 0 b1x variable
Simple Linear Regression Model :
b1
( x x )( y y )
b0
x xy
y x 2
(x x) n x x
2 2 2
x
Xi
Coeficient of Determination :
SSR
R
2
y y
x x
R2 =1 R2 = +1
Standard Error of Estimate :
SSE
s SSE = Sum of squares error
n k 1 n = Sample size
k = number of independent variables in the model
sε sε
sb1
(x x) 2
(
x2 n x) 2
Variation of observed y Variation in the slope of
values from the regression regression lines from different
y line y possible samples
x x
small s small sb1
y y
x x
large s large sb1
t Test for Slope :
Df = n - 2
b1 β1
t
sb1
Interval Estimate :
Average y :
1 (x p x)2
ŷ t /2sε
n (x x)2
Individual y :
1 (x p x)
2
ŷ t /2sε 1
n (x x)2
Example :
x y xy x2 y2
3 487 1,461 9 237,169 Correlation Coefficient
5 445 2,225 25 198,025
2 272 544 4 73,984 n xy x y
r
8 641 5,128 64 410,881 [n( x 2 ) ( x) 2 ][n( y 2 ) ( y)2 ]
2 187 374 4 34,969
12(26.145) (55)(4.882)
6 440 2,640 36 193,600
7 346 2,422 49 119,716 [12(329) (55) 2 ][12(2.255.942) (4.882) 2 ]
1 238 238 1 56,644 0.8325
4 312 1,248 16 97,344
2 296 592 4 87,616
9 655 5,895 81 429,025
6 563 3,378 36 316,969
55 4,882 26,145 329 2,255,942
Example :
d.f. = 12-2 = 10 H 0: ρ = 0
HA: ρ ≠ 0
/2=.025 /2=.025
r .8325
t 4.75
1 r 2
1 .8325 2
n2 12 2
Reject H0 Do not reject H0 Reject H0
-tα/2 0 tα/2
-2.2281 2.2281
4.75
Example :
(x -
x y x - xbar y - ybar (x - xbar)(y-ybar)
xbar)2
Regression Model :
3 487 -1.58 80.17 2.51 -126.93
( x x )( y y ) 49.003
5 445 0.42 38.17 0.17 15.90
2 272 -2.58 -134.83 6.67 348.32
b1
(x x) 2 8
2
641
187
3.42
-2.58
234.17
-219.83
11.67
6.67
800.07
567.90
6 440 1.42 33.17 2.01 46.99
7 346 2.42 -60.83 5.84 -147.01
b0
y x x xy 182.235
2 1
4
238
312
-3.58
-0.58
-168.83
-94.83
12.84
0.34
604.99
55.32
n x x
2 2
2 296 -2.58 -110.83 6.67 286.32
9 655 4.42 248.17 19.51 1096.07
6 563 1.42 156.17 2.01 221.24
55 4,882 76.92 3769.17
Example :
Regression Model :
700
600
500
Sales = 182.235 + 49.003(Years)
400
Sales
300
200
100
0
0 2 4 6 8 10
Years
Example :
Coefficient of Determination :
Y (y - y
x y (y - ybar)2
regresi regresi)2
3 487 329.24 6,426.69 24,886.69 SST ( y y ) 2 269,781.67
5 445 427.25 1,456.69 315.01
SSE ( y yˆ ) 2 85,080.25
2 272 280.24 18,180.03 67.92
8 641 574.26 54,834.03 4,454.08
2 187 280.24 48,326.69 8,694.00
6 440 476.25 1,100.03 1,314.40 SSR SST SSE 184,701.42
7 346 525.26 3,700.69 32,133.38
1 238 231.24 28,504.69 45.72
4 312 378.25 8,993.36 4,388.81
SSR
R 0.6846
2 296 280.24 12,284.03 248.33 2
9 655 623.26 61,586.69 1,007.15
6 563 476.25 24,388.03 7,524.76 SST
55 4,882 4882 269,781.67 85,080.25
Example :
SSE 85,080.25
s 92.24
n k 1 12 1 1
sε 92.24
s b1 10.517
(x x) 2
76.92
Example :
Estimate Individul y :
1 (x p x) 1 (4.5 4.58) 2
2
y
ŷ b0 b1x1 b2 x 2
n 1
R 2A 1 (1 R 2 )
n k 1
F-Test for overall significance :
SSR
k MSR (numerator) D1 = k
F (denominator) D2 = (n – k - 1)
SSE MSE
n k 1
t-Test for individual significance :
bi 0
t df = (n – k - 1)
sb i
Multicollinearity
1
VIFj If, VIFj > 5, xj highly correlated with other explanatory
1 R2j variable
Qualitative Multiple Linear Regression ?
Using code 0 or 1
564 10 70
601 13 50
560 12 65
616 13 50
674 15 45
630 15 65
554 14 63
532 15 65
661 17 64
Multiple Regression Example (Excel) :
Correlation
Sales Salesman Price
Sales 1 0.518062386 -0.545613556
Salesman 0.518062386 1 -0.163554532
Price -0.545613556 -0.163554532 1
Regression
Regression
Regression
Regression
Regression
F =2.844
H0: β1 = β2 = 0
HA: β1 and β2 not both zero = .05
0
F
Do not Reject H0
reject H0 F = 5.143
.05
Multiple Regression Example (Excel) :
-1.597 1.486
Regression
a/2=.025 a/2=.025
H0: βi = 0
HA: βi ≠ 0
Regression
103 50 10
102 51 11
65 42 6
3. A real estate agent wishes to determine the selling price of residences using the
size (square feet) and whether the residence is a condominium, single-family home
or SRO. The agent believe that holiday also related to the selling price. Produce
the regression equation to predict the selling price, how much selling price
explained? A sample of 20 residences was obtained with the following results :
Selling Price Square Feet Type Holiday
US$ 269,700.00 1500 Family Yes
US$ 211,800.00 2085 Condo No
US$ 257,100.00 1450 Family Yes
US$ 224,400.00 1836 SRO Yes
US$ 245,800.00 1730 Family No
US$ 180,900.00 1726 SRO No
US$ 346,200.00 2300 Family Yes
US$ 243,600.00 1650 Condo No
US$ 289,000.00 1950 Family No
US$ 164,400.00 1545 SRO No
US$ 175,600.00 1375 Condo No
US$ 238,000.00 1825 Condo Yes
US$ 230,500.00 1650 Family No
US$ 253,300.00 1960 SRO Yes
US$ 213,200.00 1360 Condo Yes
US$ 180,200.00 1200 Condo No
US$ 277,100.00 2000 SRO Yes
US$ 297,200.00 1755 Family Yes
US$ 265,200.00 1850 SRO Yes
US$ 266,100.00 1630 Family No
Thank You