Documente Academic
Documente Profesional
Documente Cultură
statistics
By: Judy Ann I. Caminoc
Statistics is a field of mathematics that deals with the
Collection, Organization, Analysis, and Interpretation of
quantitative data.
Tables, graphs,
Organize texts
The
mean ( also called as the “average” or
“arithmetic average”, is the most commonly used
measure of central tendency. It is said to be the most
reliable measure of central tendency and has the least
probable error but does not supply information about
the homogeneity of the distribution.
Ungrouped data
Onemethod that can be used to find the mean of grouped
data is the class mark or midpoint method.
Class mark or Midpoint Method
In this method, the class mark of each interval has to be
known and then it will be multiplied to the corresponding
frequency of every class interval. The formula for the mean
using this method is
where frequency
class mark
total number of observations
1. Consider the frequency distribution below:
CI cf
75-79 5
70-74 7
65-69 8
60-64 10
55-59 8
50-54 9
45-49 5
n=50
median
A median is defined as the middle value/observation in an
organized list of numbers and falls in the middle-most position
of the whole data.
UNGROUPED DATA
The median value in an ungrouped data is determined by the
first arranging the numbers in value order from lowest to
highest or vice versa. If there is an odd amount of numbers, the
median value is the middle most number, with the same
amount of numbers below and above. If there is an even
amount of numbers in the list, the middle pair must be
determined, added together and divided by two to find the
median value. The median can be used to determine an
appropriate average.
Ex.
1. A college professor at a certain university assigns Statistics
practice problems to be worked via net. Students must use a secret
code to access the problems and the time of log-in and log-out are
automatically recorded for the professor. At the end of the week, the
professor examines the amount of time each student spent solving
the assigned problems. Find the median. The data is provided below
in minutes.
15 28 25 48 22 34 39 44 43 49 34 22 33 27 25 22 30
2. The speed of stenographers in typing per minute are as follows:
Stenographer 1 2 3 4 5 6 7 8 9 10
Speed 121 110 120 119 112 121 118 115 107 115
Grouped data
The formula for the median for grouped data is follows:
UNGROUPED DATA
To find the mode of an ungrouped data, find the frequency
of each number/value/observation in the given data set.
Then, choose the number/value/observation having the
highest frequency as the mode.
MODE= number/value/observation with the highest frequency
Ex.
1. The speed of stenographers in typing per minute are as
follows:
Stenographer 1 2 3 4 5 6 7 8 9 10
Speed 121 110 120 119 112 121 118 115 107 115
Grouped data
SAMPLING METHOD
it is the process or the method of drawing a definite number
of the individual, cases or the observation from a particular
universe, selecting part of a total group for the investigation.
(Mildred Parton)
is a procedure for selecting sample members from a
population.
PROBABILITY SAMPLING
is a sampling technique in which sample from a larger
population are chosen using a method based on the theory of
probability.
process of selecting a sampling in such a way that all individuals
in the defined population have an equal and independent
chance of being selected for the sample.
We will use the table of random numbers, and point at any number
(without looking at the table).For this number, look only at the
appropriate number of digits.
From these number, you can read the table vertically (upward or
downward); horizontally( leftward or right ward) or diagonally.
Record those numbers which are in the population list and discard
those which are not. Continuing with our example, we record those
numbers between 000 and 349 and discard those which appeared
a second or more times. We stop the selection of the random
numbers when we have 126 such numbers.
EXAMPLE
An advertising firm, interested in determining how much to
emphasize television advertising in a certain county, decides to
conduct a sample survey to estimate the average number of hours
each week that households within the county watch television. The
county contains two towns, A and B, and a rural area. Town A is built
around a factory, and most households contain factory workers with
school-age children. Town B is an exclusive suburb of a city in a
neighboring county and contains older residents with few children at
home. There are 155 households in town A, 62 in town B, and 93 in
the rural area. Discuss the merits of using stratified random
sampling in this situation.
4. Cluster Sampling
is suitable procedure if the population is spread out
over a wide geographical area.
there is the division of population into no overlapping
groups or clusters consisting of one or more elements,
and then select a sample of clusters.
Example of cluster sampling :
Suppose all the households of a city are to be
studied. Suppose also that the sample to be used
is 20 percent. The city is divided into clusters, the
number and the sizes of which are decided upon
by the researcher. Suppose there are 40 clusters
or blocks, 20 percent of which are 8 clusters. The
8 clusters are to be selected either by simple
random sampling or by systematic random
sampling.
5.Multi stage sampling
Is a more complex sampling technique a
combination of several sampling techniques that we
have discussed.
It is used especially when the subjects of an
investigation are scattered all over a big
geographical area.
This can be done in two or more stages. This method
involves the following steps:
1.Divide the population into strata. Say , the 5 colleges of
EVSU
2.Divide each stratum into clusters. Say, the courses
offered in each colleges.
3.Draw a sample from each cluster using the simple
random sampling or systematic sampling.
Slovin’s formula