Sunteți pe pagina 1din 12

Statistical Tools for Managers

56

Chapter 4 Measure of Central Tendency


4.1.1. a. Properties of a Good Measure of Central Tendency Easy to understand.

A good measure of central tendency should possess as far as possible the following properties: b. Simple to compute. c. Based on all observations. d. Uniquely defined. e. Possibility of further algebraic treatment. f. 4.1.2. a. b. c. 4.2. a. b. c. 4.2.1. 4.2.1.1. Not unduly affected by extreme values. Common Measures of Central Tendency Mean. Mode. Mean Arithmetic mean (AM). Geometric Mean (GM). Harmonic Mean (HM). Simple Arithmetic Mean Simple Arithmetic Mean for Ungrouped Data (AM)
n

There are three common measures of central tendency: The average value. Most occurring value. Median. The middle value.

There are three types of mean:

x1 + x 2 + x3 + ..... + xn = = N

xi i =1 N

There is a short cut method for calculations based on a simple concept that, if a constant is subtracted or added to all data points, the arithmetic mean (AM) is reduced or increased by that amount. Thus,

= A+

di i =1

Where, A = Arbitrarily selected constant value (Assumed mean).

di = Deviation of each observation from the assumed mean.


N = Number of observations.

Statistical Tools for Managers

57

Note that, when assumed mean A is exactly equal to Arithmetic mean or X , algebraic sum of all deviations is equal to zero. Thus, algebraic sum of deviations of all observations about Arithmetic Mean is zero. Or, About Arithmetic Mean, 4.2.1.2.

di = 0 i =1

Simple Arithmetic Mean for Grouped Data

Then the weighted average is calculated by dividing sum of these values of class marks with frequency as their weights, by total number of observation (sum of all frequencies). Thus for grouped data,

i =1 = n

mi fi
fi i =1

m fi i =1
i

Example 2: From the following data compute Arithmetic Mean by direct method, short cut methods and step division method. Marks No of students Solution: Let the assumed Mean be A = 35 and Step size h = 10 Calculation Table Marks Class Mark ( mi ) 0-10 10-20 20-30 30-40 40-50 50-60 a. Direct Method:
6

0-10 5

10-20 20-30 30-40 40-50 50-60 10 25 30 20 10

No. of Students ( f i) mi * fi 5 10 25 30 20 10 100 25 150 625 1050 900 550 3300

Deviation di = mi A -30 -20 -10 0 10 20 fi * di -150 -200 -250 0 200 200 - 200

Step Deviation di=(mi-A)/h -3 -2 -1 0 1 2 fi * di -15 -20 -25 0 20 20 - 20

5 15 25 35 45 55

m fi i =1
i

f i =1

3300 100

= 33

Statistical Tools for Managers

58

b. Shortcut Method:

=A+

f di i =1
i

f i =1
n

= 35 +

200 100

= 35 2 = 33

c. Step Division method

= A+

f d i i =1
i

f i =1

h = 35 +

20 100

10 = 33

Note: The answer is same irrespective the method used. 4.3.1.6. a. b. c. d. e. 4.3.1.7. a. b. c. making. d. 4.3.1.8. Cannot be determined by inspection or graphically. Arithmetic Mean of Combined Data Merits of Arithmetic Mean Easy to understand and calculate. Takes all values into account. Lends itself to further mathematical treatment. Since sum of all deviations from Arithmetic mean is zero, it is a point of balance or center of gravity. Sum of the squared deviations from arithmetic mean is always the minimum. Limitations of Arithmetic Mean Affected significantly by extreme values. Cannot be computed for open-end class distribution without some assumptions. May give fallacious conclusions if we depend totally on Arithmetic mean for decision-

=
4.3.2.

N1 1 + N 2 2 + ...... + N n N1 + N 2 + ...... + N n
n

Weighted Arithmetic Mean

There are cases where relative importance of the different items is not the same. In such a case, we need to compute the weighted arithmetic mean. The procedure is similar to the grouped data calculations studied earlier, when we consider frequency as a weight associated with the class-mark. Now suppose the data values are x1, x2, x3, , xn and associated weights are W1, W2, W3 Wn, then the weighted arithmetic mean is: Direct Method

w =
4.3.2.1.

W 1 x1 + W 2 x 2 + ...... + Wn xn W 1 + W 2 + ...... + Wn

W x W
i i

Utility of Weighted Mean

Statistical Tools for Managers

59

Some of the common applications where weighted mean is extensively used are: a. b. c. Example 4: Construction of index numbers, e.g. consumer Price Index, BSE sensex, etc. where different weights are associated for different items or shares. Comparison of results of the two companies when their sizes are different. Computation of standardized death and birth rates. Pune University MBA [2770]-104

The management of hotel has employed 2 managers, 5 cooks and 8 waiters. The monthly salaries of the manager, the cook and waiter are Rs. 3000, Rs. 1200 and Rs. 1000 respectively. Find the mean salary of the employees. (Note: Although these salaries must be 10 to 15 year old, we will take it only to learn the principle.) Solution:
Here we need to calculate waited average of salary with salaries as weights.

w =
4.3.3.

W 1 x1 + W 2 x 2 + ...... + Wn xn 2 3000 + 5 1200 + 8 1000 = W 1 + W 2 + ...... + Wn 2+5+8 = 1333.33 Rs.


Geometric Mean (GM)

It is defined as nth root of the product of N values of data. If x1, x2 x n are values of data, then Geometric Mean,

GM = n x1 x 2 ...... xn
If different values are not of equal importance and are assigned different weights say w1, w2 ...w n then weighted Geometric Mean is given by

GMw = n x1w1 x 2 w2 ...... xn wn


Geometric Mean is useful to find the average % increase in sales, production, population, etc. It is the most representative average in the construction of index numbers. Example 5: A person takes home loan with floating interest, on reducing balance of 10 year term. The interest rates as changed from year to year in percent are 5.5, 6.25, 7.5, 6.75, 8.25, 9.5, 10.5, 9, 8.25 and 7.5. Find was the average interest rate? Was it beneficial for him to take fixed interest rate on reducible balance at 7.5% per annum? Solution: Average interest rate can be found out using G.M. as follows. First we find the index by dividing % rate by 100 and then adding 1. Then we take G.M. of this index as average index. From it we can find out the average interest rate. Average index (G.M.) =
10

1.055 1.0625 1.075 1.0675 1.0825 1.095 1.105 1.09 1.0825 1.075

= 10 2.137 = 1.0789
Thus, Average Interest Rate = 7.89% Hence it was beneficial for him to take fixed interest rate on reducible balance at 7.5% per annum. 4.3.4. Harmonic Mean (HM)

Statistical Tools for Managers

60

It is defined as the reciprocal of the arithmetic mean of the reciprocal of the individual observations. Thus Harmonic Mean is, HM =

n 1 1 1 + + .... + xn x1 x 2
=
n

i =1 xi

Example 6: A relay team has four members who have to drive four laps between two fixed points. Average speeds that the members can achieve in Km/hr are 280, 360, 380 and 310. Find average speed of the team to complete the event. Solution: The average speed can be calculated as Harmonic Mean HM. Thus, average speed of the team is, HM =

1 1 1 1 1 1 1 + + + + + .... + xn 280 360 380 310 x1 x 2


Weighted Harmonic Mean
n

= 327.69

Km/hr

4.3.5.

If weight is attached with each observation then the weighted Harmonic Mean is,

w1 + w2 + ...... + wn HM = wn = w1 w2 x1 + x 2 + .... + xn

wi i =1

i =1 xi

wi

Harmonic Mean is useful in computing the average rate of increase in profits, average speed of journey, average price of articles sold, etc. For example, airplane travels distances w1, w2, w3 wn, with speeds x1, x2, x3 xn, km\hr respectively, then the average speed is equal to weighted Harmonic Mean of speeds, with weights as the distances w1, w2, w3 wn. Example 7: An aircraft travels 200 km upto border at speed 700 km/hr (economical), then 250 km upto the target in enemy territory at speed 950 km/hr, then after dropping the bombs travels at runaway speed of 1700 km/hr upto our nearest border at 150 km and then at the speed of 800 km/hr to the base at distance of 300 km. Find the average speed of the sortie. Also find the mission time. Solution: For the average speed, we need to find the weighted Harmonic Mean. Thus the average sortie speed is, HM =

w1 + w2 + ...... + wn 200 + 250 + 150 + 300 = = 889.23 wn 200 250 150 300 w1 w2 km/hr x1 + x 2 + .... + xn 700 + 950 + 1700 + 800
Median (Md) Median M d =

Mission time = 1.012 ; 1 hr approx. 4.4.

N + 1 observation. 2

th

Statistical Tools for Managers

61

If the number of observations is even, then the median is the arithmetic mean of two middle observations.

N N observation + + 1 observation Median Md = 2 2 2


In case of grouped data we first find the value
th

th

th

N . Then from the cumulative frequency we find the class 2

in which the formula: -

N item falls. Such a class is called as Median Class. Then the median is calculated by 2

N pcf Median Md = 2 L+ h f
Where, L = lower limit of Median class. Total Frequency. preceding cumulative frequency to the median class. frequency of median class. class interval of median class.
th

N = pcf = f h = =

N Let us understand the logic of the formula. Median is value of observation. But this observation 2
falls in the median class whose lower limit is L. Cumulative frequency of class preceding to the median class is pcf. Thus, the median observation is

N pcf observation in the median class (counted 2

th

from the lower limit of the median class). Now, if we consider that all f observations in the median class are evenly spaced from lower limit L to upper limit L+h, the value of the median can be found out by using ratio proportion. Example 8: Calculate the median for the following data. Age No. of Workers Solution: Age 20-25 25-30 30-35 35-40 40-45 Frequency f 14 28 33 30 20 Cumulative frequency cf 14 42 75 105 125 20-25 25-30 30-35 35-40 40-45 45-50 50-55 55-60 14 28 33 30 20 15 13 7

Statistical Tools for Managers

62

45-50 50-55 55-60 Now, Or, N = 160

15 13 7

140 153 160

N = 80 2

80th item lies in class 35-40. Hence, pcf = 75, f =30, h = 5 and L = 35 Therefore, the Median is,

N 160 pcf 75 Md = L+ 2 h = L+ 2 5 f 30
= 35.83 4.4.1. Mathematical Properties of median a. An important mathematical property of the median is the sum of the absolute deviations about the x Md is minimum. median is minimum i.e.

b. Median is affected by total number of observations rather than values of the observations. 4.4.2. a. b. c. d. data. 4.4.3. a. b. c. d. e. 4.5. Demerits of Median Need to rearranged data. For computer it is expensive operation. In case of even number of observations, median cannot be exactly determined. Less familiar than average. Does not take into account data values and their spread. It is intensive. Not capable of algebraic treatment. Quantiles Merits of Median Easy to determine and easy to explain. Less distorted than arithmetic mean. Can be computed for open-end distribution. Median is the only measure of central Tendency that can be used for qualitative ranked

Quantiles are related positional measures of Central Tendency. These are useful and frequently employed measures. Most familiar quantiles are Quartiles, Deciles, and Percentiles. We are familiar with percentile scores in competitive aptitude tests or examinations of few institutes. If your score is 90 percentile, it means that 90% of the candidates who took the test, received a score lower than yours. In incomes in your organisation if you are 95 percentile, you are in the group of top 5% highest paid employees in your company. 4.5.1. Percentile

Statistical Tools for Managers

63

Pth percentile of a group of observations is that observation below which lie P % (P percent) observations. The position of Pth percentile is given by points. Example 9: In a computerized entrance test 20 candidates appear on a particular day. Their scores are: 9, 6, 12, 10, 13, 15, 16, 14, 14, 16, 17, 16, 24, 21, 22, 18, 19, 18, 20, 17. Find 80th and 90th percentiles of data. Solution First, we order the data in ascending order. 6, 9, 10, 12, 13, 14, 14, 15, 16, 16, 16, 17, 17, 18, 18, 19, 20, 21, 22, 24. 80th percentile of the data set is the observation lying in the position: -

(n + 1) P , where n is the number of data 100

(n + 1) P (20 + 1) 80 = = 16.8 100 100


Now, the 16th observation is 19 and 17th observation is 20. Therefore 80th percentile is a point lying, 0.8 proportion away from 19 to 20, which is 19.8. The 90th percentile is similarly found as observation lying in position: -

(n + 1) P (n + 1) 90 = = 18.9 100 100


The 18th observation is 21 and 19 th observation is 22. Therefore 90 th percentile is a point 0.8 proportion away from 21 to 22, which is 21.9 4.5.2. Quartile Example 10: In a computerized entrance test 20 candidates appear on a particular day. Their scores are: 9, 6, 12, 10, 13, 15, 16, 14, 14, 16, 17, 16, 24, 21, 22, 18, 19, 18, 20, 17. Find the quartiles of data. Solution First, we order the data in ascending order. 6, 9, 10, 12, 13, 14, 14, 15, 16, 16, 16, 17, 17, 18, 18, 19, 20, 21, 22, 24. a) First quartile is the observation in position: -

(n + 1) 25 = 5.25. 100
Value of the observation corresponding to 5.25th position is 13.25 b) Second quartile or median is the observation in position: -

(n + 1) 50 = 10.5. 100
Value of the observation corresponding to 10.5th position is 16. c) Third quartile is the observation in position: -

(n + 1) 75 = 15.75. 100

Statistical Tools for Managers

64

Value of the observation corresponding to 15.75th position is 18.75 Note: 0th quartile is same as 0th percentile, which is the minimum observation. Similarly 4 th quartile is 100th percentile, which equals to the maximum observation. 4.5.3. Deciles These are the values, which divide the total number of observations in to 10 equal parts. Obviously there are 11 deciles (including 0th and 10th). Method of calculating deciles is same as percentiles. We can use the formula same as percentile by substituting P by 10, 20, 30, etc. for 1st, 2nd, 3rd, etc. deciles. 4.6. Mode The mode of a data set is the value that occurs most frequently. There are many situations in which arithmetic mean and median fail to reveal the true characteristics of a data (most representative figure), e.g. most common size of shoes, most common size of garments. In such cases mode is the best-suited measure of the central tendency. There could be multiple model values, which occur with equal frequency. In some cases the mode may be absent. For a grouped data, model class is defined as the class with the maximum frequency. Then the mode is calculated as: Mode = L + Where, L = Lower limit of model class.

1 h 1 + 2

1 = Difference between frequency of the model class and preceding class.

2 = Difference between frequency of the model class and succeeding class.


h = Size of the model class. Example 11: In a computerized entrance test 20 candidates appear on a particular day. Their scores are: 9, 6, 12, 10, 13, 15, 16, 14, 14, 16, 17, 16, 24, 21, 22, 18, 19, 18, 20, 17. Find the mode of the data. Solution: Now the value 16 occurs 3 times which is maximum for any observation. Therefore, Mode = 16 Example 12: In a computerized entrance test 20 candidates appear on a particular day. Their scores are: 9, 6, 12, 10, 13, 15, 14, 14, 16, 17, 16, 24, 21, 22, 18, 19, 18, 20, 17. Find the mode of the data. Solution: Now the values 14, 16, 17 and 18 occur 2 times which is maximum for any observation. Therefore, Modes are 14, 16, 17 and 18 (this is a multimodal distribution) Example 13: In a computerized entrance test 20 candidates appear on a particular day. Their scores are: 9, 6, 12, 10, 13, 15, 14, 16, 24, 21, 22, 19, 18, 20, 17. Find the mode of the data. Solution: Now there is no value that occurs more than 1 time. Therefore, the data has no Mode. 4.7. Relationship Among Mean, Median and Mode

Statistical Tools for Managers

65

A distribution in which the mean, the median, and the mode coincide is known as symmetrical (bell shaped) distribution. Normal Distribution is one such a symmetric distribution, which is very commonly used. If the distribution is skewed, the mean, the median and the mode are not equal. In a moderately skewed distribution distance between the mean and the median is approximately one third of the distance between the mean and the mode. This can be expressed as: Mean Median = (Mean Mode) / 3 Mode = 3 * Median 2 * Mean Thus, if we know values of two central tendencies, the third value can be approximately determined in any moderately skewed distribution. In any skewed distribution the median lies between the mean and mode. In case of right-skewed (positive-skewed) distribution which has a long right tail, Mode <Median < Mean. In case of left-skewed (negative-skewed) distribution which has along left tail, Mean < Median < Mode 4.8. Example Set Example 17: Inflation rate in percent for past six months is given as 5.5, 6.2, 7.2, 6, 6.5 and 5.9. Find average inflation rate over past six months. Solution: Average inflation rate can be found out using G.M. as follows. First we find the index by dividing % rate by 100 and then adding 1. Then we take G.M. of this index as average index. From it we can find out the average inflation rate. Average index (G.M.) =
6

1.055 1.062 1.072 1.06 1.065 1.059

= 6 1.4359 = 1.062
Pune University MBA [2875]104

Thus, Average Interest Rate = 6.2% Example 18: The expenditure of 1000 families is given below: Expenditure in Rs. Number of Families 40-59 50 60-79 -

80-99 500

100-119 -

120-139 50

The median of the distribution is Rs. 87. Calculate missing frequencies and for the complete distribution table calculate Mode. Solution: Let the missing frequency of class 60-79 be x. Since the total frequency is 1000, the frequency of the class 100-119 is (1000 50 x 500 50 ) = 400 x Since median is given as 87, the median class is 80-99. Now,

N pcf Median Md = 2 L+ h f

Statistical Tools for Managers

66

Where,

= 80

lower limit of Median class. Total Frequency. preceding cumulative frequency to the median class. frequency of median class. class interval of median class.

N = 1000 pcf = 50 + x f h Thus, = 500 = 20

87 = 80 +

500 (50 + x) 20 7 25 = 500 (50 + x) 50 + x = 325 500

Or, x = 275 Thus the missing frequency of class 60-79 is 275. Also the frequency of the class 100-119 is (400 x ) = 125 ii) Since the highest frequency is in class 80-99, it is a modal class. Now,

Mode = L + Where, L = 80

1 h 1 + 2
Lower limit of model class. Difference between frequency of the model class and preceding class. Difference between frequency of the model class and succeeding class. Size of the model class.

1 = 225

2 = 375
h = 20

Mode = 80 +

Example 20: JHU MBA [102] 2004 The following data are scores on a management examination taken by a group of 22 people. 88, 56, 64, 45, 52, 76, 54, 79, 38, 98, 69, 77, 71, 45, 60, 78, 90, 81, 87, 44, 80, 41 Find the mean, median, standard deviation, and 60th percentile.
Solution: Number of observations N = 22 a)
=
n

225 20 = 80 + 7.5 = 87.5 225 + 375

X=

xi i =1 N
22

88 + 56 + 64 + 45 + 52 + 76 + 54 + 79 + 38 + 98 + 69 + 77 + 71 + 45 + 60 + 78 + 90 + 81 + 87 + 44 + 80 + 41

= 66.9545
b) For calculating median we need to arrange the data in ascending order as follows, 38, 41, 44, 45, 45, 52, 54, 56, 60, 64, 69, 71, 76, 77, 78, 79, 80, 81, 87, 88, 90, 98 Since the number of observations is even, hence the median,

Statistical Tools for Managers

67

N N observation + + 1 observation 11th Observation + 12th Observation 2 2 Md = = 2 2 =


c)
th

th

th

69 + 71 = 70 2
th

(n + 1) P observation. P percentile = 100 ( n + 1) 60 th 60 percentile = = 13.2 observation. 100


th

th

Since it is a fraction, we need to interpolate the value between 13 th and 14th observations. Now 13th observation is 76 and 14th observation is 77. Thus by interpolating, 60th percentile = 13.2th observation = 76.2 4.9. Exercise = 58.89 Monthly salary Rs. Number of Workers Ans: Mean = 852.94 , 400-600 4 600-800 10 Mode = 850 800-1000 12 1000-1200 6 1200-1400 2

6. Calculate arithmetic mean and mode from the following:

Pune University BBA [2791]-203

S-ar putea să vă placă și