Documente Academic
Documente Profesional
Documente Cultură
Distribution: a brief description of a distribution must include its shape and numbers describing its center and
spread.
x1 + x2 +...+ xn ∑ xi
Mean = x = n = n
Example: Fuel economy MPG for 2004 vehicles (outlier: Honda Insight)
¾ The mean is sensitive to the influence of a few extreme observations that “pull it” towards
the longer tail. The mean is not a resistant measure of the center of a distribution.
¾ We can make the mean as extreme as we want by significantly changing one single
observation.
Median = M = the midpoint of a distribution, the number such that half the observations are
smaller and the other half are larger. The position of the median is n+1
2 ; if odd number of observations, the
middle observation is the median; if even number of observations, the median M is the average of the middle
two observations.
¾ The scope, quartiles, and median offer a reasonably complete description of center and
spread.
Example: Highway Gas Mileage: 13 18 23 27 32
¾ Boxplot
• Central box spans quartiles
• A line in the box marks the median M
• Lines extend from the box to the Min and Max values
¾ Interquartile Range – IQR = distance between quartiles = the range of the center half of
the data = Q3 – Q1
¾ Suspected outlier Ù it falls more than 1.5 IQR above the third quartile or below the
first quartile