Documente Academic
Documente Profesional
Documente Cultură
Correlation
& Regression
Prof.Dhananjay M.Apte
For private circulation only. All rights reserved Prof.D.M.Apte Monday, January 01, 2007
Example
Correlation
between Age & Growth is
Scatter Diagram)
Correlation
between Age & Growth is
1/1/2007
r = close to 0
r = 0.94
r = 0.99
Negative Correlation
r =1
Example: Calculation of cell phone bill 3
Lets determine r
Prof.D.M.Apte
1/1/2007
826
826
Correlation is
Correlation
Correlation is a measure of association/relationship between two numerical variables. Correlation r measures the direction and the strength of the linear association between two numerical paired variables.
1/1/2007
1/1/2007
exercise
EXERCISES
1) Find r for Tree Problem.
10
x 65
y 67
2) Determine r using scientific calculator for the given dataMode-reg-linear. x,y M+ S-sum
66
67 67 68 69 70 72
68
65 68 72 72 69 71
r = 0.604
3) Find r for following.. x y 100 130 200 110 300 100 400 80 500 60 600 50
700 30
Ans.3) n= 7, sum xy = 178000, sum x= 2800, sum y= 560, sum x 2=1400000 Sum y2=52400.r
10
= -- 0.99
XX
Prof.D.M.Apte
1/1/2007
Regression
11
r<1
We draw (Fit) the line that is representative of all the data points. Such line is called the line of best fit, the Regression line
Thus Regression calls for estimating the Best fitted line, passing through the given data points. It is done by using Least Square Theory Hence the line is also called as least square line
12
Least sq theory
1/1/2007
( Least
13
Square Theory )
use
A new baby is born that had gestated for 30 weeks. Whats your best guess at the birth-weight?
Extrapolated line
3000
Y=birthweight (g)
30
14
How to fit the line
1/1/2007
= s xy /sxx
N = 10
= b0 + b 1 X
Monday, January 01, 2007
Whats
Error ?..
Its the deviation between given data point & respective point on Regression line
error
are the points on Regression Line y are the given points in the data set. The difference between these, (i.e. y y cap) is the error (Also called Residual) Residual Plot
16 Monday, January 01, 2007
1/1/2007
Exercise
Find the error for all the y points given below
17
Exercise Exercise
1) From the Regression Equation, find the value of y at x = 75 2) How to find the value of x at y =150 ?
Note: . ............Replace x by y in the denominator part of the Slope Equation
Regression 18
y on x
and
Regression
x on y
Monday, January 01, 2007
1/1/2007
To measure prediction quality of fitted line, we measure S.D. of residualsSe Se = Sq. Root of Syy b1 Sxy.= We can define the value of Slope (b, b1) in
Interval
Interval of Slope =
20
10
1/1/2007
21
(4) Obtain Regression Equation (both, y on x & x on y) for the data in above problems no 1 & 3 and Also find the Mean Error
21 Prof.D.M.Apte Monday, January 01, 2007
11