Documente Academic
Documente Profesional
Documente Cultură
It
is unbiased, consistent, the most efficient estimator, and, as long as the sample is
sufficiently large, its sampling distribution can be approximated by the normal distribution.
If we know the sampling distribution of X , we can make statements about any estimate we
may make from sampling information. Let's look at a medical-supplies company that produces
disposable hypodermic syringes. Each syringe is wrapped in a sterile package and then
jumble-packed in a large corrugated carton. Jumble packing causes the cartons to contain
differing numbers of syringes. Because the syringes are sold on a per unit basis, the company
needs an estimate of the number of syringes per carton for billing purposes. We have taken a
sample 35 cartons at random and recorded the number of syringes in each carton. The following
2 2
2 1 2 ∑x nx 2 365368 35 × ( 102 )
S =
n −1
∑ ( x − x ) = n − 1 − n − 1 = 34 − 34 = 36.12.
Hence, S = 36.12 = 6.01 .
So, sample variance and sample standard deviation are respectively 36.12 and
6.01.
2) The Pizza distribution authority (PDA) has developed quite a business in some
area by delivering pizza orders promptly. PDA guarantees that its pizzas will be
delivered in 30 minutes or less from the time the order was placed and if the
delivery is late, the pizza is free. The time that it takes to deliver each pizza order
that is on time is recorded in the Official Pizza Time Block (OPTB) and the delivery
time for those pizzas that are delivered late is recorded as 30 minutes in the OPTB.
12 random entries from the OPTB are listed as
4) The Standard Chartered Bank is trying to determine the number of tellers available
during the lunch rush on Sundays. The bank has collected data on the number of people
who entered the bank during the last 3 months on Sunday from 11 A.M. to 1 P.M. Using
the data below, find point estimates of the mean and standard deviation of the
population from which the sample was drawn.
5) Electric Pizza was considering national distribution of its regionally successful product
and was compiling pro forma sales data. The average monthly sales figures (in thousands
of dollars) from its 30 current distributors are listed. Treating them as (a) a sample and
(b) a population, compute the standard deviation.
7.3 5.8 4.5 8.5 5.2 4.1 2.8 3.8 6.5 3.4
9.8 6.5 6.7 7.7 5.8 6.8 8.0 3.9 6.9 3.7
6.6 7.5 8.7 6.9 2.1 5.0 7.5 5.8 6.4 5.2
σx = σ =
10
= 0.707 m o n th .
n 200
We could now report to the director that our estimate of the life of the company's batteries is 36
months, and the standard error that accompanies this estimate is 0.707. In other words, the
actual mean life for all the batteries may lie somewhere in the interval estimate of 35.293 to
36.707 months. This is helpful but insufficient information for the director. Next, we need to
calculate the chance that the actual life will lie in this interval or in other intervals of different
widths that we might choose, ±2 σ (2 X 0.707), ±3 σ (3 X 0.707), and so on.
P r o b a b ilit y o f t h e T r u e P o p u la t io n P a ra m e t e r F a llin g w it h in t h e In t e r v a l E s t im a t e :
To begin to solve this problem, we should review relevant parts of Probability Distribution.
There we worked with the normal probability distribution and learned that specific portions
of the area under the normal curve are located between plus and minus any given number of
standard deviations from the mean.
Fortunately, we can apply these properties to the standard error of the mean and make the
following statement about the range of values used to make an interval estimate for our
battery problem.
The probability is 0.955 that the mean of a sample size of 200 will be within ±2 standard
errors of the population mean. Stated differently, 95.5 percent of all the sample means
are within ±2 standard errors from µ , and hence µ is within ±2 standard errors of 95.5
percent of all the sample means. Theoretically, if we select 1,000 samples at random
from a given population and then construct an interval of ±2 standard errors around the
mean of each of these samples, about 955 of these intervals will include the population
mean. Similarly, the probability is 0.683 that the mean of the sample will be within ± 1
standard error of the population mean, and so forth. This theoretical concept is basic to our
study of interval construction and statistical inference. In Figure 7-2, we have illustrated
the concept graphically, showing five such intervals. Only the interval constructed around
the sample mean X 4 does not contain the population mean. In words, statisticians would
describe the interval estimates represented in Figure 7-2 by saying, "The population mean
µ will be located within ±2 standard errors from the sample mean 95.5 percent of the
time."
As far as any particular interval in Figure 7-2 is concerned, it either contains the
population mean or it does not, because the population mean is a fixed
parameter. Because we know that in 95.5 percent of all samples, the interval will contain
the population mean, we say that we are 95.5 percent confident that the interval contains
the population mean.
Applying this to the battery example, we can now report to the director. Our best estimate
of the life of the company's batteries is 36 months, and we are 68.3 percent confident that
the life lies in the interval from 35.293 to 36.707 months (36 ± 2σ x ). Similarly, we are 95.5
percent confident that the life falls within the interval of 34.586 to 37.414 months (36 ±
2σ x ), and we are 99.7 percent confident that battery life falls within the interval of 33.879
to 38.121 months (36 ± 3σ x ).
Figure 7.2: A number of intervals constructed around sample means; all except one include
the population mean
W ill y o u d e liv e r t h e
About 80% 1 w eek
w a s h in g m a cwh in
it hein 1 w e e k ? 1 a m p r e tty c e r ta in it
w ill g o o u t w ithth
inis w e e k .
U s in g S a m p lin g a n d C o n fIn
id teenrcvea l E s tim a tio n :
In our discussion of the basic concepts of interval estimation, particularly in Figure 7-2, we
described samples being drawn repeatedly from a given population in order to estimate a
population parameter. We also mentioned selecting a large number of sample means from a
population. In practice, however, it is often difficult or expensive to take more than one
sample from a population. Based on just one sample, we estimate the population parameter.
We must be careful, then, about interpreting the results of such a process.
Suppose we calculate from one sample in our battery example the following confidence interval
and confidence level: "We are 95 percent confident that the mean battery life of the population
lies within 30 and 42 months." This statement does not mean that the chance is 0.95
that the mean life of all our batteries falls within the interval established from this
one sample. Instead, it means that if we select many random samples of the same
size and calculate a confidence interval for each of these samples, then in about 95
percent of these cases, the population mean will lie within that interval.
Warning: There is no free lunch in dealing with confidence levels and confidence
intervals. When you want more of one, you have to take less of the other. Hint: To
understand this important relationship, go back to Table 2. If you want the estimate of the
time of delivery to be perfectly accurate (100 percent), you have to sacrifice tightness in
the confidence interval and accept a very wide delivery promise ("sometime this year").
On the other hand, if you aren't concerned with the accuracy of the estimate, you could
get a delivery person to say "I'm 1 percent sure I can get it to you within an hour." You
can't have both at the same time.
Problems to be solved:
1) Given the following confidence levels, express the lower and upper limits of the
confidence interval for these levels in terms of X and σ x .
(a) 54 percent (b) 75 percent. (c) 94 percent (d) 98 percent.
2) Define the confidence level for an interval estimate.
3) Define the confidence interval.
4) Suppose you wish to use a confidence level of 80 percent. Give the upper limit of the
confidence interval in terms of the sample mean, X and the standard error, σ x .
5) In what way may an estimate be less meaningful because of (a) A high confidence
level? (b) A narrow confidence level.
6) Suppose a sample of 50 is taken from a population with standard deviation 27 and
that the sample mean is 86.
(a) Establish an interval estimate for the population mean that is 95.5 percent certain
to include the true population mean.
(b) Suppose, instead, that the sample size was 5,000. Establish an interval for the
population mean that is 95.5 percent certain to include the true population mean.
(c) Why might estimate (a) be preferred to estimate (b)? Why might (b) be
preferred to (a)?
8) Is the confidence level for an estimate based on the interval constructed from a
single sample?
9) Given the following confidence levels, express the lower and upper limits of the
confidence interval for these levels in terms of X and σ x .
(a) 60 percent (b) 70 percent (c) 92 percent (d) 96 percent.
10) Steve Klippers, owner of Steve's Barbershop, has built quite a reputation among
the residents of Cullowhee. As each customer enters his barbershop, Steve yells out
the number of minutes that the customer can expect to wait before getting his
haircut. The only statistician in town, after being frustrated by Steve's inaccurate
point estimates, has determined that the actual waiting time for any customer is
normally distributed with mean equal to Steve's estimate in minutes and standard
deviation equal to 5 minutes divided by the customer's position in the waiting line.
Help Steve's customers develop 95 percent probability intervals for the following
situations:
(a) The customer is second in line and Steve's estimate is 25 minutes.
(b) The customer is third in line and Steve's estimate is 15 minutes.
(c) The customer is fifth in line and Steve's estimate is 38 minutes.
(d) The customer is first in line and Steve's estimate is 20 minutes.
(e) How are these intervals different from confidence intervals?
E s tim a t e o f th e /£fr - g
fMJ
p o p u la t io n -» o r = S = /
————— V n-\
The value $950.00 is our estimate of the standard deviation of the population. We can also
symbolize this estimated value by a, which is called sigma hat.
Now we can estimate the standard error of the mean. Because we have a finite population
size of 700, and because our sample is more than 5 percent of the population, we will use the
formula for deriving the standard error of the mean of finite populations:
[6-3]
But because we are calculating the standard error of the mean using an estimate of the
standard deviation of the population, we must rewrite this equation so that it is correct
symbolically:
„ . ^ , $950.00 /700 - 50
Continuing our exam ple, we find o^ = —.— X /— — — — —
* "V b u \ /D O 1
= $950.00 /650 7.07 V699
= ($134.37X0.9643)
=
$ 1 2 9 .5 7< — E stim ate of the stand ard error of the m e an ofp oap ufin
la ite
tio n (d e r iv e d fr o m a n
e s tim a te o f t h e p o p uslatatio
n dna rd d e v ia tio n )
Next we consider the 90 percent confidence level, which would include 45 percent of the
area on either side of the mean of the sampling distribution. Looking in the body of
Appendix Table 1 for the 0.45 value, we find that about 0.45 of the area under the normal
curve is located between the mean and a point 1.64 standard errors away from the mean.
Therefore, 90 percent of the area is located between plus and minus 1.64 standard errors
away from the mean, and our confidence limits are
x + 1.64 &j = $11,800 + 1.64($129.57) = $11,800 + $212.50
= $12,012.50<- Upper confidence limit
x - 1.64as = $11,800 - 1.64($129.57) = $11,800 - $212.50
= $11,587.50«- Lower confidence limit
Our report to the social-service agency would be: With 90 percent confidence, we estimate that
the average annual income of all 700 families living in this four-square-block section falls
between $11,587.50 and $12,012.50.
Hint: It's easy to understand how to approach these exercises if you'll go back to
Figure 7-2 on page 356 for a minute. When someone states a confidence level, they are
referring to the shaded area in the figure, which is defined by how many o-j
(standard errors or standard deviations of the distribution of sample means) there are
on either side of the mean. Appendix Table 1 quickly converts any desired confidence
level into standard errors. Because we have the information necessary to calculate
one standard error, we can calculate the endpoints of the shaded area. These are the
limits of our confidence interval. Warning: When you don't know the dispersion in the
population (the population standard deviation) remember to use Equation 7-1 to
estimate it.
SC 7-6 From a population of 540, a sample of 60 individuals is taken. From this
sample, the mean is found to be 6.2 and the standard deviation 1.368.
(a) Find the estimated standard error of the mean.
(b) Construct a 96 percent confidence interval for the mean. SC 7-7 In an
automotive safety test conducted by the North Carolina Highway Safety Research
Center, the average tire pressure in a sample of 62 tires was found to be 24 pounds
per square inch, and the standard deviation was 2.1 pounds per square inch.
(a) What is the estimated population standard deviation for this population? (There
are about a million cars registered in North Carolina.)
(b) Calculate the estimated standard error of the mean.
(c) Construct a 95 percent confidence interval for the population mean.
7-27 The manager of Cardinal Electric's lightbulb division must estimate the av erage
number of hours that a lightbulb made by each lightbulb machine will last. A sample of
40 lightbulbs was selected from machine A and the aver age burning time was 1,416
hours. The standard deviation of burning time is known to be 30 hours.
(a) Compute the standard error of the mean.
(b) Construct a 90 percent confidence interval for the true population
mean.
7-28 Upon collecting a sample of 250 from a population with known standard
deviation of 13.7, the mean is found to be 112.4.
(a) Find a 95 percent confidence interval for the mean.
(b) Find a 99 percent confidence interval for the mean.
7-29 The Westview High School nurse is interested in knowing the average height of
seniors at this school, but she does not have enough time to examine the records of
all 430 seniors. She randomly selects 48 students. She finds the sample mean to be
64.5 inches and the standard deviation to be 2.3 inches.
(a) Find the estimated standard error of the mean.
(b) Construct a 90 percent confidence interval for the mean. 7-30 Jon Jackobsen, an
overzealous graduate student, has just completed a first draft of his 700-page
dissertation. Jon has typed his paper himself and is interested in knowing the average
number of typographical errors per page, but does not want to read the whole paper.
Knowing a little bit about business statistics, Jon selected 40 pages at random to read
and found that the average number of typos per page was 4.3 and the sample
standard deviation was 1.2 typos per page.
(a) Calculate the estimated standard error of the mean.
(b) Construct for Jon a 90 percent confidence interval for the true average number of
typos per page in his paper.
• 7-31 The Nebraska Cable Television authority conducted a test to determine 1
amount of time people spend watching television per week. The NCTAj veyed 84
subscribers and found the average number of hours watched ] week to be 11.6 hours
and the standard deviation to be 1.8 hours.
(a) What is the estimated population standard deviation for this [
tion? (There are about 95,000 people with cable television in NebraskaJf
(b) Calculate the estimated standard error of the mean. I
(c) Construct a 98 percent confidence interval for the population mean.
• 7-32 Joel Friedlander is a broker on the New York Stock Exchange who is curioU
about the amount of time between the placement and execution of a market' ' order.
Joel sampled 45 orders and found that the mean time to execution wt|L 24.3 minutes and
the standard deviation was 3.2 minutes. Help Joel by corff 1 structing a 95
percent confidence interval for the mean time to execution.
• 7-33 Oscar T. Grady is the production manager for Citrus Groves Inc.,'.
just north of Ocala, Florida. Oscar is concerned that the last 3 years''.
freezes have damaged the 2,500 orange trees that Citrus Groves owns. In< der
to determine the extent of damage to the trees, Oscar has sampled tab number of
oranges produced per tree for 42 trees and found that the aVeP age production was
525 oranges per tree and the standard deviation was 30 oranges per tree.
(a) Estimate the population standard deviation from the sample standard deviation.
(b) Estimate the standard error of the mean for this finite population.
(c) Construct a 98 percent confidence interval for the mean per-tree output of all 2,500
trees.
(d) If the mean orange output per tree was 600 oranges 5 years ago, whit can Oscar
say about the possible existence of damage now?
• 7-34 Chief of Police Kathy Ackert has recently instituted a crackdown on drug
dealers in her city. Since the crackdown began, 750 of the 12,368 drug deal ers in the city
have been caught. The mean dollar value of drugs found OK these 750 dealers is
$250,000. The standard deviation of the dollar value of drugs for these 750 dealers is
$41,000. Construct for Chief Ackert a 90* cent confidence interval for the mean dollar
value of drugs possessed by city's drug dealers.
a n s w e rs to S e lf-C h e c k E x e rc is e s
SC 7-6 6- = 1.368 N = 540 n = 60 x = 6.2
(b) x ± 2.05 CTS = 6.2 ± 2.05(0.167) = 6.2 ± 0.342 = (5.86,6.54) SC 7-7 s = 2.1 n
= 62 x = 24
(a) CT = s = 2.1 psi
(b) CT = CT/Vn = 2.1/V62 = 0.267 psi
T
Pascal, Inc., a computer store that buys wholesale, untested computer chips, is
7-9
considering switching to another supplier who would provide tested and guaranteed
chips for a higher price. In order to determine whether this is a cost-effective plan,
Pascal must determine the proportion of faulty chips that the current supplier
provides. A sample of 200 chips was tested and of these, 5 percent were found to be
defective.
(a) Estimate the standard error of the proportion of defective chips.
(b) Construct a 98 percent confidence interval for the proportion of defec tive chips
supplied.
7-36 General Cinema sampled 55 people who viewed GhostHunter 8 and asked them
whether they planned to see it again. Only 10 of them believed the film was worthy of a
second look.
(a) Estimate the standard error of the proportion of moviegoers who will view the
film a second time.
(b) Construct a 90 percent confidence interval for this proportion. 7-37 The product
manager for the new lemon-lime Clear 'n Light dessert topping was worried about both
the product's -poor performance and her future with Clear 'n Light. Concerned that
her marketing strategy had not prop erly identified the attributes of the product, she
sampled 1,500 consumers and learned that 956 thought that the product was a floor
wax.
(a) Estimate the standard error of the proportion of people holding this se vere
misconception about the dessert topping.
(b) Construct a 96 percent confidence interval for the true population proportion. 7-38
Michael Gordon, a professional basketball player, shot 200 foul shots and made 174 of
them.
(a) Estimate the standard error of the proportion of all foul shots Michael makes.
(b) Construct a 98 percent confidence interval for the proportion of all foul
shots Michael makes.
7-39 SnackMore recently surveyed 95 shoppers and found 80 percent of them
purchase SnackMore fat-free brownies monthly.
(a) Estimate the standard error of the proportion.
(b) Construct a 95 percent confidence interval for the true proportion of
people who purchase the brownies monthly.
7-40 The owner of the Home Loan Company randomly surveyed 150 of the com pany's
3,000 accounts and determined that 60 percent were in excellent standing.
(a) Find a 95 percent confidence interval for the proportion in excellent standing.
(b) Based on part (a), what kind of interval estimate might you give for the absolute
number of accounts that meet the requirement of excellence, keeping the same 95
percent confidence level?
7-41 For a year and a half, sales have been falling consistently in all 1,500 fran chises
of a fast-food chain. A consulting firm has determined that 31 percent of a sample of 95
indicate clear signs of mismanagement. Construct a 98 per cent confidence interval for
this proportion.
7-42 Student government at the local university sampled 45 textbooks at the
University Student Store and determined that of these 45 textbooks, 60 percent had been
marked up in price more than 50 percent over wholesale cost. Give a 96 percent
confidence interval for the proportion of books marked up more than 50 percent by the
University Student Store.
Interval Estimates using the t distribution: Use the t distribution for estimating
is required whenever the sample size is 30 or less and the population standard
deviation is not known. Furthermore, in using the t distribution, we assume that the
population is normal or approximately normal.
Characteristics of the t distribution: Both the t and normal distributions are
symmetrical, the t distribution is flatter than the normal distribution. As the sample
size gets larger, the shape of the t distribution loses its flatness and becomes
approximately equal to the normal distribution. In fact, for sample sizes of more than 30, the
t distribution is so close to the normal distribution that we will use the normal to approximate
the t.
We will use degrees of freedom when we select a t distribution to estimate a population
mean, and we will use n - 1 degrees of freedom, where n is the sample size. For example,
if we use a sample of 20 to estimate a population mean, we will use 19 degrees of
freedom in order to select the appropriate t distribution.
U s in g th e t D is t r ib u t io n T a b le :
The table of t distribution values (Appendix Table 2) differs in construction from the z table
we have used previously. The t table is more compact and shows areas and t
values for only a few percentages (10,5,2, and 1 percent). Because there is a
different t distribution for each number of degrees of freedom, a more complete table
would be quite lengthy. Although we can conceive of the need for a more complete table,
in fact Appendix Table 2 contains all the commonly used values of the / distribution.
A second difference in the t table is that it does not focus on the chance that the
population parameter being estimated will fall within our confidence interval.
Instead, it measures the chance that the population parameter we are
estimating will not be within our confidence interval (that is, that it will lie
outside it). If we are making an estimate at the 90 percent confidence level, we would
look in the ; table under the 0.10 column (100 percent - 90 percent = 10 percent). This
0.10 chance of error is symbolized by a, which is the Greek letter alpha. We would find
the appropriate t values for confidence intervals of 95 percent, 98 percent, and 99
percent under the a columns headed 0.05, 0.02, and 0.01, respectively.
A third difference in using the t table is that we must specify the degrees of
freedom with which we are dealing. Suppose we make an estimate at the 90
percent confidence level with a sample size of 14, which is 13 degrees of freedom. Look
in Appendix Table 2 under the 0.10 column until you encounter the row labeled 13. Like
a z value, the t value there of 1.771 shows that if we mark off plus and minus 1.771 dj's
(estimated standard errors of x) on either side of the mean, the area under the curve
between these two limits will be 90 percent, and the area outside these limits (the chance
of error) will be 10 percent (see Fisure 7-4).
Recall that in our chapter-opening problem, the generating plant manager wanted to estimate
the coal needed for this year, and he took a sample by measuring coal usage for 10 weeks.
The sample data are
n = 1 0 w eeks«- Sam ple size d f = 9 * ~ D e g r e e s o f f rXe e=d o11,400
m tons<- Sample meanS =
7 0 0 to n <
s - S a m p le sta n d a rd d e v ia tio n
The plant manager wants an interval estimate of the mean coal consumption, and he
wants to be 95 percent confident that the mean consumption falls within that interval. This
problem requires the use of at distribution because the sample size is less than
30, the population standard deviation is unknown, and the manager believes
that the population is approximately normal.
As a first step in solving this problem, recall that we estimate the population standard
deviation with the sample standard deviation; thus
CT = S [7-1]
= 700 tons
Using this estimate of the population standard deviation, we can estimate the standard error
of the mean by modifying Equation 7-2 to omit the finite population multiplier (be-. cause the
sample size of 10 weeks is less than 5 percent of the 5 years (260 weeks) for which data are
available):
700
VlO
700
3.16
2
= 2 2 1 .3 8 to n<s- E stim a te d sta n d a rd e rro r omf ethan eo f anin fin it e p o p u la tio n
Now we look in Appendix Table 2 down the 0.05 column (100 percent - 95 percent = 5
percent) until we encounter the row for 9 degrees of freedom (10 - 1 =9). There we see
the t value 2.262 and can set our confidence limits accordingly:
x + 2.2620; = 11,400 tons + 2.262(221.38 tons) = 11,400 + 500.76
= 11,901 tons<- Upper confidence limit
x - 2.2620; = 11,400 tons - 2.262(221.38 tons) = 11,400 - 500.76
= 10,899 tons*~ Lower confidence limit
Our confidence interval is illustrated in Figure 7-5. Now we can report to the plant
manager with 95 percent confidence that the mean weekly usage of coal lies between
10,899 and 11,901 tons, and he can use the 11,901-ton figure to estimate how much
coal to order.
The only difference between the process we used to make this coal-usage estimate and
the previous estimating problems is the use of the t distribution as the appropriate
distribution. Remember that in any estimation problem in which the sample size
is 30 or less and the standard deviation of the population is unknown and the
underlying population can be assumed to be normal or approximately normal,
we use the t distribution.
S u m m a r y o f C o n f id e n c e uLnim
d eitrs V a r io u s C o n d itio n s :
Table 7-5 summarizes the various approaches to estimation introduced in this chapter and
the confidence limits appropriate for each.
n =10
0 .0 2 5 of
a r e am d e r
SC 7-10 For the following sample sizes and confidence levels, find the appropriate t
values for constructing confidence intervals:
(a) n = 28; 95 percent.
(b) n = 8; 98 percent.
(c) n = 13; 90 percent.
(d) n = 10; 95 percent.
(e) n = 25; 99 percent.
(f) n = 10; 99 percent.
SC 7-11 Seven homemakers were randomly sampled, and it was determined that
the distances they walked in their housework had an average of 39.2 miles per week
and a sample standard deviation of 3.2 miles per week. Construct a 95 percent
confidence interval for the population mean.
7-44 For the following sample sizes and confidence lev *s, find the appropriate f
values for constructing confidence intervals:
(a) n = 15; 90 percent.
(b) n = 6; 95 percent.
(c) H = 19; 99 percent.
(d) n = 25; 98 percent.
(e) n = 10; 99 percent.
(f) n = 41; 90 percent.
7-45 Given the following sample sizes and t values used to construct confidence
intervals, find the corresponding confidence levels:
(a) n = 27; f = ±2.056.
(b) n = 5;t = ±2.132.
(c) n = 18; t = ±2.898. 7-46 A sample of 12 had a mean of 62 and a standard
deviation of 10. Construct
a 95 percent confidence interval for the population mean. 7-47 The following sample
of eight observations is from an infinite population
with a normal distribution:
7 5 .3 7 6.4 83 .2 91 .0 8 0 .1 7 7 .5 8 4.8 81 .0
(a) Find the sample mean.
(b) Estimate the population standard deviation.
(c) Construct a 98 percent confidence interval for the population mean.
Northern Orange County has found, much to the dismay of the county commissioners,
that the population has a severe problem with dental plaque.
Every year the local dental board examines a sample of patients and rates each patient's
plaque buildup on a scale from 1 to 100, with 1 representing no plaque and 100 representing a
great deal of plaque. This year, the board examined 21 patients and found that they had an
average Plaque Rating Score (PRS) of 72 and a standard deviation of 6.2. Construct for Orange
County a 98 percent confidence interval for the mean PRS for Northern Orange County.
• 7-49 Twelve bank tellers were randomly sampled and it was determined they made an
average of 3.6 errors per day with a sample standard deviation of 0.42 error. Construct a 90
percent confidence interval for the population mean of errors per day. What assumption is
implied about the number of errors bank tellers make?
• 7-50 State Senator Hanna Rowe has ordered an investigation of the large number of
boating accidents that have occurred in the state in recent summers. Acting on her
instructions, her aide, Geoff Spencer, has randomly selected 9 summer months within the
last few years and has compiled data on the number of boating accidents that occurred
during each of these months. The mean number of boating accidents to occur in these 9
months was 31, and the standard deviation in this sample was 9 boating accidents per
month. Geoff was told to construct a 90 percent confidence interval for the true mean
number of boating accidents per month, but he was in such an accident himself recently,
so you will have to do this for him.
Determining the Sample Size in estimation: In all our discussions so far, we have used for
sample size the symbol n instead of a t cific number. Now we need to know how to
determine what number to use. How i should the sample be? If it is too small, we may fail
to achieve the objective of our i sis. But if it is too large, we waste resources when we
gather the sample.
Some sampling error will arise because we have not studied the whole Whenever we
sample, we always miss some helpful information about the population. If» want a high level
of precision (that is, if we want to be quite sure of our estimate), we 1 to sample enough of
the population to provide the required information. Sampling error i controlled by selecting
a sample that is adequate in size. In general, the more precision; want, the larger the
sample you will need to take. Let us examine some methods that are i ful in determining
what sample size is necessary for any specified level of precision.