Documente Academic
Documente Profesional
Documente Cultură
;,~..,tJi"l\~,
_. 'Jth.
15.1 Populations and samples ~:;~;;;''!c--~(~f,'_u
Statistics 385
r~
15.2 Diagrammatic representation of data
There are three common types of data: qualitative, discrete and
continuous. Qualitative data consists of descriptions using names,
for example red, white, blue. Discrete data consists of exact
'For continuoUs datayou C~Q
numerical values, for example 2, 3, 4. Continuous data consists of
numerical values in cases where it is not possible to make a list of :,usera0ges'of val.'es'such as .il,
all the outcomes, for example, measurements such as weight, time
.'0-4, 5:"'8,9-1t:ietc~
. ~t. ;'j -./;
:., '_~~'
"
~ :~
,'... ,~
~,
".. [;#t
and length.
Lists of numbers can be difficult to understand and interpret
straight away. Sometimes it is not easy to just look at a set of data
and make some immediate conclusions. Therefore it is often
useful to draw tables and diagrams to show what is happening.
Some ways of representing data are tally charts, stem and leaf
diagrams, frequency tables, bar charts, pictograms, line graphs and
pie charts. Most of these should be familiar to you already, and
theyare summarised here.
lO, 12, 11, lO, 14,8,9, lO, 12, 15, 10 I 1111 The qiagbnall,ine/tally used
1,~ - : ,,~;""'!I:;f, ,__, _ _ _ ~
most common from the raw data. A tally chart referred,to asa Jive-bar gate.
14 . ..' ,;.
provides a simple summary. 11
\<;orftriQlY'~use9;jt~a
',' .'. 11. '14;;;"'-' .~ .'
df' !"'
:.
j'
8 I 1
stofn~)lrgeamu'n'~f data.a
9 11
2
10 1111
4
11 11
2
12 MI 6
13 11
2
14 11
2 i~Nbt.thaHh~t9taYbfthe
';';'IS .:.;.~.' ", L'~' . _'
't~( \.'
- _ ;~~.
I
__o
386 Statistics
Bar charts
6
You can represent the number of cars sold
per week by a garage over a 20-week period, 5
as recorded in the tally chart on page 385, in
a bar chart. The lengths of the bars are ~
e
4
u
proportional to the number of observations ::J
C'
of each outcome. J: 3
Une graphs
In a line graph, the data points are plotted
on axes and joined by straight Hnes. o
8 9 10 11 12 13 14 15
The car salesdata are shown here Number of cars sold in one week
as a line graph.
16
"O 14
u 12
....
o
.g 10
8
+-,.
O 2 4 6 8 10 12 14 16 18 20
Week
Statistics 381
Pictograms
Pictograms are bar charts in which the bars are replacedby
symbols (pictures) representing the subject of the data.
They are lessprecisethan bar charts.
"_"".1'1
Example - 1
Pie charts
Exercise 15A
..............................................................................
~
c::
Frequency 6 2 5 8 14 9 4 2
Frequency 7 12 9 7 3 2
surnrnarisetheseresults.
,. . '.. ,
.
class1~i 1ffK .
."
.
.,-
9 The shoe sizesof the ehildren in a classare:
4,5,7,6,4,6,7,7,8,5,4,7,6,4,7,8,6,5,7,4
Construet a pie ehart to represent these data.
10 In a drawing eornpetition all the exhibits are awardedagrade
frorn A to E.The results are surnrnarisedin the pie ehart.
390 Statistics
-',-,-
15.3 Mode, median, mean Cb-3
There are three common waysof measuring the typical,or
average, value of a set of data. These averagesare calledthe mean,
median and mode.
The mode
The mode of a set of data is the single value that occurs most
often. If two outcomes occur with the greatest fr~quency then
there is no unique mode. The data are described as being
bimodal. If there are three or more such outcomes then the data
are described as being multimodal.
Example 1
A fair die is thrown ten times and the following results obtained:
5,4,5,1,3,5,6,2,1,4
What is t~e modal score?
...........................................................................
The score 5 occurs the most (three times) and therefore the
modal score is 5.
15 27 36 21 11 '-dR~~Efs(~e.e,pa~~3 ~?),,;yo~t~::::
Frequency 'o,": -'-=:,;-..;_:,' ,'_ '.;_'~:,.'> . ' ',...";"f, <,,-:__
The median
The median is the middle valuewhen all the observations or
outcomes are arranged in order of magnitude.
Example 3
Find the median of these sets of data.
a) In an office block the amount spent on lunch by a cross-section
of office workers on a particular Friday was recorded as
4, 14, 2, 6, 6, 4, 24, 1O, 12, to the nearest euro.
b) The ages of university students in a tutorial group were
recorded as 20, 23,18,19,28,26,22,18.
...........................................................................
:~
.....
Statistics 391
The mean
The mean is the sum of all the observedvalues divided by the total
number of observations.It is written asx and calculatedby the
formula:
_ 1",
x=- n LX
Length (x cm) 10 15 20 25
Frequency(f) 6 12 13 4
The table shows that there were 6 plants whose stems were 10 cm
-, long, to the nearest 5 cm. There were 12 plants whose stems were
15 cm long, to the nearest 5 cm, and so on.
Notice that the total number of observations/outcomes is the total
of the frequencies:
n = 6 + 12 + 13 + 4 = 35
----
392 Statistics
To find the sum of the observed values, draw the table vertically
and add another column: Th~re~is>~n.iWportant ~.istiq~q;\
be~.eenth~
; , , , '"valu~s
,,'-
xx.',r"x';;f.~!'~
." ~.,~I:'~.("
10 6 10 X 6 = 60 "entlre~P9puatb8' aod'thec
25 4 25 X 4 = 100
?;~se~~~j:ons'
,T.~1~~l~';S~~5~:;1~~
; = 35 Xi = 600 m~~s~reme.Q~ reJ.a~ir2tp.th.~~
e,ntite'Workforce ,in,a 'f~ctory.....
Therefore the wble ponUlation~:" . :;;~, ., . .
.,'~". ',~.#_~~'- F,__:_.I,.,r;:~:~'~~..:'__-:. ".,~..d~i.~~:._.r
.Altern~tiely "JI),~..'q6.~~rvatt~~s..}:,:t
"-$',,~ ti'
x= 60 + 180 + 260 + 100 = 600 = 17.1 (3 s.f.) mayrelate to'orily lO,;efDpIQy.~es'~
35 35
an.d will be usedasa!sample
/c~'r; '-'~.:~~f".','_'~" .",,~ .".'. :;o~:,~~:. (~
frorn whih coOcl,usi20S cCJ{1()e'
-:, ..' "~I" '>".,' ~':; ,.~:
Example 5
'.
Statistics 393
Exercise 158
..............................................................................
1 A statistician throws a fair die 20 times and records the
following results.
4,2,1,6,4,5,3,4,6,2,3,2,2,4,6,3,5,1,2,3
What is the modal score?
8
7
>. 6
g.. 5
'"
l 4
J,I.. 3
2
--- -
Statistics 395
Frequency 45 18 12 5
Frequency 12 14 11 9 4
Frequency 6 22 25 14 3
Example 1
Putting the data into the GDC and calculating the mean gives '.Ensuretliat ybrGDCMs)n;
28 cm. .,: ,.'.', , ,.\ r~"~i,9;
sttisti<i31 Jn(:je.,
The GDC will give you the standard deviation 5.15, from which
you can calcuIate the variance as 26.5.
Using the formula would give the mean as:
x 30 + 17 + 32 + ... + 26
- = = 28
n 8
The variance is:
a2 = ~ x2 - (~x r
1
= -(302
8
+ 172+ 322 + ... + 262) - 282
= 26.5
Therefore the standard deviation is u = Y26.5 = 5.15
The mean height is 28 cm and the standard deviation is 5.15 cm.
Example 2
Example 3
A restaurant serves a variety of wines of different prices. In a
week chosen at random the numbers of bottles of wine soId
were recorded by price and the results are shown in the table.
x ~:S; x < 5 I 5:S;x < 10 I 1O:S;x < 151 15:S;x < 20 I 20:S;x < 25
Number
of bottles 1
27 15 8 3
of wine
sold
= 25.1
Therefore the standard deviation is u = \125.1= 5.01
The mean price is 6.57 and the standard deviation is 5.01.
-.
Statistics 399
11 ~~:7;;~:.~ .t~.~~~;.~;~
fu~.;;.~-;;..
~~d.~;~~d;;;d.
d;;;~~~~~f.
eachof thesesetsof numbers.
a) 3,5,7,8,9, 10
b) 5,5,7,8,9,10,12
c) O,0,1,3,5,6,8,9
d) -2,-1,3,4,6
Frequency 4 6 7 3
Frequency 2 3 8 6 5 1
Frequency 8 14 14 10 4
3 The table shows the prices of the 25 cars which are for salein a
garage.
o
160-170 170-180 180-190 190-200 200-210
Height (cm)
-- - -- - - - -
Statistics 401
35tmtl--tHtttttttmt)'jff!:
<J
e., 30
::1
[
...... 25
., [JOinthe"pointswitha,curve: ~ .1
,
O 20-rm1:1=:.rm::::nrJ:R:1:!:1St!:I+H+H+I+H+H-
';
e::1 15
u
10
5
40
35-fftt1"I-rtt'tt1TH-j-"
<J
e., 30
::1
1., 25-,
. 20
-S 18-
e::1 15
u
10-,
5
median is 14 (approximate)
'.
-
402 Statistics
This tells you that there are 18 plants with height less than 14 cm.
The lower point, chosen so that one quarter of the values are
less than or equal to it, is called the first quartile and denoted
as Q.
Q.
I
median
I
Q3
I
~ interquartile range
The upper point, with one quarter of the values greater than it, is
the third quartile and is denoted as Q3'
(Q2 is the median.)
40_;-'""
35
>.
ue
., 30
S. 27-
25
.,
>
.=
os 20...u.J"u..!..I..u.;...J"u.u..J..u'"-'..I-/..I.u.J"u"u..,...i
'3
e= 15
u
9JQ.
".
Statistics 403
Percentiles
The distribution may be split into a greater number of parts.
A very large sample may be split into 100 parts called percentiles.
If it is divided into ten parts they are called deciles.
In Example 1, the 3rd decile is found by calculating
~10 X 36 = 10.8,and projecting down on the cumulative
frequency curve as shown.
40
35
>.
u
eO) 30-+1T-IT-I++m'"""""I.I-I+++I..H,I''j-.
=
]" 25
.~
os 20
] 15-~' ..j
8 10.8 ~10
5"tt:t::t:t:r approximately 12
! d i" T. _\ '::~. v..,.;
Example 2
In a cricket natch the 40 completed innings gave this
distribution of scores.
,
Score 0-9 10-19 20-29 30-39 40-49
Frequen tcy 8 10 6 5 6
Frequen tcy 2 O 2 O 1
..
404 Statistics
45
40
~
..
35
g' 30
..!::
.~ 25
~
"3
e
8 15
10
5
1 \
smallest largest
value value
The box represents the central 50% of the data (Q3- Q),and the
'whiskers' show the smallestand largestvalues.
Statistics 405
Illllstrate the data from Example 2 llsing a box and whisker plot.
................................................................................
Q = 10.5,Q2 = 22.5, Q3 = 43.5,lowestvaIlle= O,
highest value = 99.5
Note that the box and whisker
plot clearly illustrates where the
central 50% of the data lies
relative to the smallest and
I I + i i I + + I + 1
o 10 20 30 ~o 50 60 7U 80 90 100 largest values.
Score
Exercise 15D
.. . . . ... ... .. ..... ......... ..... ..... ....... .... ....... ... . . ........ . ..... ....... . .........
1 The masses of 800 eggs were recorded and the reslllts are
summarised in the table.
Mass (grams) 50-55 55-60 60-65 65-70 70-75 75-80
".
406 Statistics
Frequency 62 116 98 76 48
Frequency 24 34 32 28 20 22
6 A survey of the ages of 240 ears in a ear park gives these results.
Frequency 114 36 48 28 14
Frequency (Maths) 14 30 76 56 24
-. - -
Statistics 407
15.6 Histograms
Histograms are used to illustrate grouped or continuous data.
Although histograms may look the same as bar charts,
there are some important differences.
frequency
In the case of a bar chart the variable axis represents densily
discrete data and is therefore simply divided into spaces.
A histogram represents continuous data and therefore the
variable axis is a continuous number line. variable:.x
.
fr equency denslt y = f
. '..
. w ~
".
wh~ref
,..
=.frequency
. .
and
..
w = ." classwidth.
.\
~
When all the class widths are equal, a histogram is easy to construct.
Example 1~
100 students take a mathematics test which has a maximum
possible score of 40. The results are shown in the table.
Mark 1-5 6-10 11-15 16-20 21-25 26-30 31-35 36-40
Number of students 3 10 22 28 20 10 2 5
,
5 i" ...!. .. "1 -t' ... ... . -; "-1
t i
L_ ~
~
.;
= 4 .. 1, 1! ~ r... -!
f
;
...
"'"
g
...
3 .. . ~..I.. j,-, ... i, =! ....""
- '1- -.. ..-l. ... --1
:::1
r _, ; ! I I
tt 2
. ~!.. .. -f -. .'1 . "'1; ...-~ ".. .~
.. ..... J. . ..1 ... ...! -' '.1
, ,1 i i I
- I f
o
5 10 15 20 25 30 35 40
Mark
Example 2
00OO14l, "-
0000120.
?;>
o;; 000010 I+++H-H+H+I+I-H-H-t!
111111 11 1111
e
..
"'"
>.
u OoOOOS,:,
..e
g. 0.0006--
tt
O.OOO4J
000002 HfI+H-H k:":, r;f'(1nF4il'';&'WI;v.4?'H+H-I.HfI+
O
10 20 30 40 50 60 70 SO 90 100 110 120 130 140
Salary (IOOO)
Notice that it is the area, not the height, of each bar that
represents the frequency.
Exercise 15E
..............................................................................
1 A teacher records the Intelligence Quotients of the 24 children
in her class. The results are:
125,132,117,98,151,146,133,106,114,128,126,141,
137,122,152,108,137,121,142,102,148,133,136,95
a) Construct a frequency table of these results with classes
90-99,100-109,...
b) Show these results on a histogram.
"
410 Statistics