UNIVERSITAS ANDALAS
STATISTICS and
PROBABILITY
by:
Purnawan, PhD
Chapter 3
Describing Data Using
Numerical Measures
Chapter Goals
After completing this chapter, you should be able to:
Chapter Topics
Measures of Variation
Summary Measures
Describing Data Numerically
Center and Location: Mean, Median, Mode, Weighted Mean
Other Measures of Location: Percentiles, Quartiles
Variation: Range, Interquartile Range, Variance, Standard Deviation, Coefficient of Variation
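Measures of Center and Location
Mean (sample and population):
$$\bar{x} = \frac{\sum_{i=1}^{n} x_i}{n} \qquad \mu = \frac{\sum_{i=1}^{N} x_i}{N}$$
Weighted mean (sample and population):
$$\bar{X}_W = \frac{\sum w_i x_i}{\sum w_i} \qquad \mu_W = \frac{\sum w_i x_i}{\sum w_i}$$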
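Sample mean (n = Sample Size):
$$\bar{x} = \frac{\sum_{i=1}^{n} x_i}{n} = \frac{x_1 + x_2 + \cdots + x_n}{n}$$
Population mean (N = Population Size):
$$\mu = \frac{\sum_{i=1}^{N} x_i}{N} = \frac{x_1 + x_2 + \cdots + x_N}{N}$$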
[Figure: two dot plots on a 0 to 10 axis]
Data 1, 2, 3, 4, 5: Mean = (1 + 2 + 3 + 4 + 5) / 5 = 15 / 5 = 3
Data 1, 2, 3, 4, 10: Mean = (1 + 2 + 3 + 4 + 10) / 5 = 20 / 5 = 4
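The two mean calculations can be reproduced with a few lines of Python. This is only a minimal sketch; the function name `sample_mean` is an illustrative choice, not something from the slides.

```python
def sample_mean(values):
    """Arithmetic mean: the sum of the values divided by their count."""
    return sum(values) / len(values)

base = [1, 2, 3, 4, 5]
with_outlier = [1, 2, 3, 4, 10]   # same data, largest value replaced by 10

mean_base = sample_mean(base)             # 15 / 5
mean_outlier = sample_mean(with_outlier)  # 20 / 5

print(mean_base, mean_outlier)
```

Changing a single value from 5 to 10 moves the mean from 3 to 4, which is why the mean is described as sensitive to extreme values.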
Median
[Figure: two dot plots on a 0 to 10 axis; Median = 3 in both]
Mode
[Figure: two dot plots; on the 0 to 14 axis, Mode = 5; on the 0 to 6 axis, No Mode]
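A short Python sketch of the median and the mode. The data sets are assumptions: the slides' dot-plot values did not survive, so the median uses the two data sets from the mean example, and the mode uses made-up values chosen to give "Mode = 5" and "No Mode". The helper `modes` is hypothetical and treats a data set in which no value repeats as having no mode.

```python
from collections import Counter
from statistics import median

def modes(values):
    """Return the most frequent value(s); an empty list if nothing repeats."""
    counts = Counter(values)
    top = max(counts.values())
    if top == 1:
        return []  # every value occurs once: no mode
    return sorted(v for v, c in counts.items() if c == top)

median_base = median([1, 2, 3, 4, 5])
median_outlier = median([1, 2, 3, 4, 10])   # the outlier does not move the median

mode_example = modes([0, 1, 2, 5, 5, 5, 9, 13])   # illustrative data
no_mode_example = modes([0, 1, 2, 3, 4, 5, 6])    # illustrative data

print(median_base, median_outlier, mode_example, no_mode_example)
```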
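Weighted Mean
Example: Sample of 26 Repair Projects

Days to Complete    Frequency
5                   4
6                   12
7                   8
8                   2

$$\bar{X}_W = \frac{\sum w_i x_i}{\sum w_i} = \frac{(4 \times 5) + (12 \times 6) + (8 \times 7) + (2 \times 8)}{4 + 12 + 8 + 2} = \frac{164}{26} = 6.31 \text{ days}$$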
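The repair-project calculation can be checked with a small Python sketch. The days and frequencies are the ones that appear in the numerator and denominator of the weighted-mean formula; the function name `weighted_mean` is an illustrative assumption.

```python
def weighted_mean(values, weights):
    """Sum of weight * value divided by the sum of the weights."""
    return sum(w * x for w, x in zip(weights, values)) / sum(weights)

days = [5, 6, 7, 8]          # days to complete
frequency = [4, 12, 8, 2]    # number of projects (the weights)

total_projects = sum(frequency)
avg_days = weighted_mean(days, frequency)

print(total_projects, round(avg_days, 2))
```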
Review Example
House Prices:
$2,000,000
500,000
300,000
100,000
100,000
Summary Statistics
House Prices:
$2,000,000
500,000
300,000
100,000
100,000
Mean: Sum / n = $3,000,000 / 5 = $600,000
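As a rough check, Python's standard `statistics` module gives the summary statistics for the five house prices. Only the mean appears in the text above; the median and mode are computed here from the same data.

```python
from statistics import mean, median, mode

prices = [2_000_000, 500_000, 300_000, 100_000, 100_000]

price_mean = mean(prices)      # 3,000,000 / 5
price_median = median(prices)  # middle value of the ordered prices
price_mode = mode(prices)      # most frequent price

print(price_mean, price_median, price_mode)
```

The single $2,000,000 house pulls the mean well above the median, which is the pattern expected for right-skewed data.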
Shape of a Distribution
Symmetric or skewed
Left-Skewed
Symmetric
Right-Skewed
Quartiles
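Percentiles
The pth percentile in an ordered array of n values is the value in the ith position, where
$$i = \frac{p}{100}(n + 1)$$
Example: the 60th percentile in an ordered array of 19 values is the value in the 12th position:
$$i = \frac{60}{100}(19 + 1) = 12$$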
Quartiles
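Q1 is the 25th percentile, Q2 (the median) is the 50th percentile, and Q3 is the 75th percentile.
Example: the Q1 position falls between the 2nd and 3rd ranked values, so use the value halfway between the 2nd and 3rd values: Q1 = 12.5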
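A minimal Python sketch of the position rule i = (p/100)(n + 1). Two things here are assumptions: the ordered sample is hypothetical, chosen only so that its 2nd and 3rd values are 12 and 13, and the general linear interpolation in `value_at_position` extends the "halfway between" rule to any fractional position.

```python
import math

def percentile_position(p, n):
    """1-based position of the p-th percentile in an ordered array of n values."""
    return p * (n + 1) / 100

def value_at_position(ordered, position):
    """Value at a 1-based (possibly fractional) position, interpolating linearly."""
    lower = math.floor(position)
    if lower < 1:
        return ordered[0]
    if lower >= len(ordered):
        return ordered[-1]
    fraction = position - lower
    return ordered[lower - 1] + fraction * (ordered[lower] - ordered[lower - 1])

pos_60th = percentile_position(60, 19)   # the 19-value example

sample = [11, 12, 13, 16, 16, 17, 18, 21, 22]   # hypothetical ordered data, n = 9
q1_position = percentile_position(25, len(sample))
q1 = value_at_position(sample, q1_position)

print(pos_60th, q1_position, q1)
```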
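[Figure: box-and-whisker plot marking the Minimum, 1st Quartile, Median, 3rd Quartile, and Maximum; each of the four segments between them holds 25% of the data]
[Figure: box plots showing the relative positions of Q1, Q2, and Q3 for Symmetric and Right-Skewed distributions]
[Figure: example box-and-whisker plot with Q1, Q2, Q3, and Max marked]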
Measures of Variation
Variation: Range, Interquartile Range, Variance (Population Variance, Sample Variance), Standard Deviation (Population Standard Deviation, Sample Standard Deviation), Coefficient of Variation
[Figure: distributions with the same center, different variation]
Range
Range = largest value - smallest value
Example:
[Figure: dot plot on a 0 to 14 axis] Range = 14 - 1 = 13
[Figure: two dot plots on a 7 to 12 axis with different distributions; in both, Range = 12 - 7 = 5]
Sensitive to outliers
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,5
Range = 5 - 1 = 4
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,120
Range = 120 - 1 = 119
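The effect of a single outlier on the range can be sketched in Python using the two 25-value data sets above; `data_range` is an illustrative helper name.

```python
def data_range(values):
    """Range: largest value minus smallest value."""
    return max(values) - min(values)

typical = [1] * 11 + [2] * 8 + [3] * 4 + [4, 5]
with_outlier = [1] * 11 + [2] * 8 + [3] * 4 + [4, 120]   # last value replaced by 120

range_typical = data_range(typical)
range_outlier = data_range(with_outlier)

print(range_typical, range_outlier)
```

Only one of the 25 values differs between the two data sets, yet the range grows from 4 to 119.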
Interquartile Range
Interquartile range = Q3 - Q1
Example:
minimum = 12, Q1 = 30, Median (Q2) = 45, Q3 = 57, maximum = 70 (25% of the data lies in each of the four segments)
Interquartile range = 57 - 30 = 27
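A small Python sketch of the interquartile range. It first repeats the example (Q3 = 57, Q1 = 30), then, as an added illustration, computes the IQR of the two 25-value data sets from the range section. The quartile positions use the (n + 1) percentile rule with linear interpolation, which is an assumption; other software may use a different quartile convention.

```python
import math

def value_at_position(ordered, position):
    """Value at a 1-based (possibly fractional) position, interpolating linearly."""
    lower = math.floor(position)
    if lower < 1:
        return ordered[0]
    if lower >= len(ordered):
        return ordered[-1]
    fraction = position - lower
    return ordered[lower - 1] + fraction * (ordered[lower] - ordered[lower - 1])

def iqr(values):
    """Interquartile range Q3 - Q1, with quartiles located at p(n + 1)/100."""
    ordered = sorted(values)
    n = len(ordered)
    q1 = value_at_position(ordered, 25 * (n + 1) / 100)
    q3 = value_at_position(ordered, 75 * (n + 1) / 100)
    return q3 - q1

example_iqr = 57 - 30   # Q3 - Q1 from the example

typical = [1] * 11 + [2] * 8 + [3] * 4 + [4, 5]
with_outlier = [1] * 11 + [2] * 8 + [3] * 4 + [4, 120]
iqr_typical = iqr(typical)
iqr_outlier = iqr(with_outlier)

print(example_iqr, iqr_typical, iqr_outlier)
```

Under these assumptions the IQR is the same for both data sets, because the outlier lies outside the middle 50% of the data.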
Variance
Sample variance:
$$s^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1}$$
Population variance:
$$\sigma^2 = \frac{\sum_{i=1}^{N} (x_i - \mu)^2}{N}$$
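Standard Deviation
Sample standard deviation:
$$s = \sqrt{\frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1}}$$
Population standard deviation:
$$\sigma = \sqrt{\frac{\sum_{i=1}^{N} (x_i - \mu)^2}{N}}$$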
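Calculation Example:
Sample Standard Deviation
Sample Data (x_i): 10, 12, 14, 15, 17, 18, 18, 24
n = 8, Mean = $\bar{x}$ = 16
$$s = \sqrt{\frac{(10 - \bar{x})^2 + (12 - \bar{x})^2 + (14 - \bar{x})^2 + \cdots + (24 - \bar{x})^2}{n - 1}}$$
$$s = \sqrt{\frac{(10 - 16)^2 + (12 - 16)^2 + (14 - 16)^2 + \cdots + (24 - 16)^2}{8 - 1}} = \sqrt{\frac{130}{7}} = 4.3095$$
The squared deviations are 36, 16, 4, 1, 1, 4, 4, and 64, which sum to 130.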
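The sample standard deviation for the eight data values can be worked step by step in Python and compared with `statistics.stdev`, which also divides by n - 1. This is a sketch for checking the arithmetic: the squared deviations from the mean of 16 sum to 130, so s comes to about 4.3095.

```python
import math
from statistics import stdev

data = [10, 12, 14, 15, 17, 18, 18, 24]

n = len(data)
x_bar = sum(data) / n                                  # sample mean
squared_devs = [(x - x_bar) ** 2 for x in data]        # (x_i - x_bar)^2
sum_squares = sum(squared_devs)
s_manual = math.sqrt(sum_squares / (n - 1))            # divide by n - 1 for a sample
s_library = stdev(data)                                # standard-library equivalent

print(x_bar, sum_squares, round(s_manual, 4))
```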
[Figure: three dot plots on an 11 to 21 axis with the same mean and different standard deviations]
Data A: Mean = 15.5, s = 3.338
Data B: Mean = 15.5, s = 0.9258
Data C: Mean = 15.5, s = 4.57
Coefficient of Variation
Population:
$$CV = \left(\frac{\sigma}{\mu}\right) \cdot 100\%$$
Sample:
$$CV = \left(\frac{s}{\bar{x}}\right) \cdot 100\%$$
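Comparing Coefficient of Variation
Stock A:
Average price last year = $50
Standard deviation = $5
$$CV_A = \left(\frac{s}{\bar{x}}\right) \cdot 100\% = \frac{\$5}{\$50} \cdot 100\% = 10\%$$
Stock B:
Average price last year = $100
Standard deviation = $5
$$CV_B = \left(\frac{s}{\bar{x}}\right) \cdot 100\% = \frac{\$5}{\$100} \cdot 100\% = 5\%$$
Both stocks have the same standard deviation, but stock B is less variable relative to its price.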
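The stock comparison can be written as a short Python sketch; the function name `coefficient_of_variation` is an illustrative choice.

```python
def coefficient_of_variation(std_dev, mean):
    """Standard deviation as a percentage of the mean."""
    return 100 * std_dev / mean

cv_a = coefficient_of_variation(5, 50)    # Stock A: s = $5, mean = $50
cv_b = coefficient_of_variation(5, 100)   # Stock B: s = $5, mean = $100

print(cv_a, cv_b)
```

Because the CV is unit-free, it can also compare variability across data measured in different units.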
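[Figure: bell-shaped distribution; about 95% of the values lie within 2 standard deviations of the mean and about 99.7% within 3 standard deviations]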
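Tchebysheff's Theorem
Regardless of how the data are distributed, at least (1 - 1/k^2) of the values fall within k standard deviations of the mean.
Examples:
At least (1 - 1/1^2) = 0% within k = 1 ($\mu \pm 1\sigma$)
At least (1 - 1/2^2) = 75% within k = 2 ($\mu \pm 2\sigma$)
At least (1 - 1/3^2) = 89% within k = 3 ($\mu \pm 3\sigma$)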
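The three bounds follow directly from 1 - 1/k^2, as this minimal Python sketch shows; `tchebysheff_bound` is an illustrative name.

```python
def tchebysheff_bound(k):
    """Minimum fraction of values within k standard deviations of the mean."""
    return 1 - 1 / k ** 2

bounds = {k: tchebysheff_bound(k) for k in (1, 2, 3)}

for k, fraction in bounds.items():
    print(f"k={k}: at least {fraction:.0%}")
```

For k = 1 the bound is 0%, so the theorem only says something useful for k greater than 1.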
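A standardized data value (z score) for a population:
$$z = \frac{x - \mu}{\sigma}$$
where:
x = original data value
$\mu$ = population mean
$\sigma$ = population standard deviation
z = standard score
(number of standard deviations x is from $\mu$)
A standardized data value (z score) for a sample:
$$z = \frac{x - \bar{x}}{s}$$
where:
x = original data value
$\bar{x}$ = sample mean
s = sample standard deviation
z = standard score
(number of standard deviations x is from $\bar{x}$)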
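A minimal Python sketch of the standard score. The numbers are assumptions for illustration: the mean of $50 and standard deviation of $5 come from the Stock A example, while the observed prices of $60 and $45 are made up.

```python
def z_score(x, mean, std_dev):
    """Number of standard deviations that x lies from the mean."""
    return (x - mean) / std_dev

z_high = z_score(60, 50, 5)   # hypothetical price above the mean
z_low = z_score(45, 50, 5)    # hypothetical price below the mean

print(z_high, z_low)
```

A positive z means the value is above the mean, a negative z means it is below.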
Using Excel
Use menu choice:
Using Excel
(continued)
Click OK
Excel output
Microsoft Excel
descriptive statistics output,
using the house price data:
House Prices:
$2,000,000
500,000
300,000
100,000
100,000
Chapter Summary
Symmetric, skewed