Sunteți pe pagina 1din 8

22 Chapter 1 The Nature of Probability and Statistics

To enter data into Excel by typing:

1. Click the cell at the top of the column where you want to enter data. When working with
values for a single variable, you usually will want to enter the values into a column.
2. Type each data value and press [Enter] or [Tab].

You can also enter data into Excel by opening an Excel workbook file. Follow the steps in the ex-
ample below to open the Databank file available for download at the book’s website.

Example XL1–1
1. Click File on the menu bar, then click Open . . . (i.e., select File>Open . . . ).
The Open dialog box will be displayed.
2. In the Look in box, click the folder where the workbook file is located.
3. In the folder list, double-click folders until you open the folder containing the file you want.
4. Double-click the file name in the list box. The data will be copied into Excel.

1–7 The two major areas of statistics are descriptive and inferential. Descriptive statistics in-
cludes the collection, organization, summarization, and presentation of data. Inferential
Summary statistics includes making inferences from samples to populations, hypothesis testing,
determining relationships, and making predictions. Inferential statistics is based on
probability theory.
Since in most cases the populations under study are large, statisticians use sub-
groups called samples to get the necessary data for their studies. There are four basic
methods used to obtain samples: random, systematic, stratified, and cluster.
Data can be classified as qualitative or quantitative. Quantitative data can be either
discrete or continuous, depending on the values they can assume. Data can also be
measured by various scales. The four basic levels of measurement are nominal, ordinal,
interval, and ratio.

LAFF - A - DAY

“We've polled the entire populace, Your


Majesty, and we've come up with
exactly the results you ordered!”

Source: © 1993 King Features Syndicate, Inc. World


Rights reserved. Reprinted with special permission of
King Features Syndicate.
Section 1–7 Summary 23

There are two basic types of statistical studies. They are observational studies and
experimental studies. When conducting observational studies, researchers observe what
is happening or what has happened and then draw conclusions based on these observa-
tions. They do not attempt to manipulate the variables in any way.
When conducting an experimental study, the researchers manipulate one or more of
the independent or explanatory variables and see how this manipulation influences the
dependent or outcome variable.
Finally, the applications of statistics are many and varied. People encounter them in
everyday life, such as in reading newspapers or magazines, listening to the radio, or
watching television. Since statistics is used in almost every field of endeavor, the edu-
cated individual should be knowledgeable about the vocabulary, concepts, and proce-
dures of statistics.
Today computers and calculators are used extensively in statistics to facilitate the
computations.

Important Terms
cluster sample 12 experimental study 15 observational study 15 random variable 4
confounding variable 16 explanatory variable 15 ordinal level of ratio level of
convenience sample 14 Hawthorne effect 16 measurement 9 measurement 10
continuous variables 7 hypothesis testing 6 outcome variable 15 sample 6
control group 16 independent variable 15 population 6 statistics 3
data 4 inferential statistics 7 probability 5 stratified sample 12
data set 4 interval level of qualitative variables 7 systematic sample 11
data value or datum 4 measurement 9 quantitative variables 7 treatment group 16
dependent variable 15 measurement scales 9 quasi-experimental variable 4
descriptive statistics 5 nominal level of study 15 Review Exer-
discrete variables 7 measurement 9 random sample 11 cises

Review Exercises
Note: All odd-numbered problems and even-numbered a. In the year 2010, 148 million Americans will be enrolled
problems marked with (ans) are included in the answer in an HMO (Source: USA Today, December 30, 1997).
section at the end of this book. b. Nine out of 10 on-the-job fatalities are men (Source:
Note: Problems with an asterisk (*) require more in-depth USA Weekend, January 2–4, 1998).
skills than other problems. c. Expenditures for the cable industry were 5.66 billion
dollars in 1996 (Source: USA Today, December 30, 1997).
1–1. Name and define the two areas of statistics.
d. The median household income for people ages 25–34 is
1–2. What is probability? Name two areas where $35,888 (Source: USA Today, December 8, 1997).
probability is used. e. “Allergy therapy makes bees go away.” (Prevention,
1–3. Suggest some ways statistics can be used in April 1995).
everyday life. f. Drinking decaffeinated coffee can raise cholesterol
levels by 7%. (Source: American Heart Association).
1–4. Explain the differences between a sample and a g. The national average annual medicine expenditure per
population. person is $1,052 (Source: The Tribune Review, Greensburg,
1–5. Why are samples used in statistics? Pa., December 22, 1996).
h. Experts say that mortgage rates may soon hit bottom
1–6. (ans) In each statement that follows, tell whether
(Source: USA Today, October 6, 1997).
descriptive or inferential statistics have been used.
24 Chapter 1 The Nature of Probability and Statistics

1–7. Classify each as nominal-level, ordinal-level, interval- d. Every hundredth hamburger manufactured is checked to
level, or ratio-level data. determine its fat content.
a. Horsepower of motorcycle engines. e. Mail carriers of a large city are divided into four groups
b. Ratings of newscasts in Houston (poor, fair, good, according to gender (male or female) and according to
excellent). whether they walk or ride on their routes. Then 10 are
c. Temperature of automatic popcorn poppers. selected from each group and interviewed to determine
d. Time required by drivers to complete a course. whether they have been bitten by a dog in the last year.
e. Salaries of cashiers of Day-Night grocery stores.
f. Marital status of respondents to a survey on savings 1–13. Give three examples each of nominal, ordinal,
accounts. interval, and ratio data.
g. Ages of students enrolled in a martial arts course. 1–14. For each of the following statements, define a
h. Weights of beef cattle fed a special diet. population and state how a sample might be obtained.
i. Rankings of weight lifters. a. The average cost of an airline meal in 1993 was $4.55
j. Pages in the telephone book for the city of Los Angeles. (Source: Everything Has Its Price, Richard E. Donley,
1–8. (ans) Classify each variable as qualitative or Simon and Schuster, 1995).
quantitative. b. “More than 1 in 4 United States children have
a. Colors of automobiles in the faculty parking lot. cholesterol levels of 180 milligrams or higher” (Source:
b. Number of desks in classrooms. The American Health Foundation).
c. Classification of children in a day care center (infant, c. “Every 10 minutes, 2 people die in car crashes and 170
toddler, preschool). are injured” (Source: National Safety Council estimates).
d. Weights of fish caught in Lake Emilie. d. “When older people with mild to moderate hypertension
e. Number of pages in statistics textbooks. were given the mineral salt for six months, the average
f. Capacity (in gallons) of water in selected dams. blood pressure reading dropped by eight points systolic
g. Number of off-road vehicles sold in the United States. and three points diastolic” (Source: Prevention, March
1995, p. 19).
1–9. Classify each variable as discrete or continuous. e. “The average amount spent per gift for Mom on
a. Number of loaves of bread baked each day at a local Mother’s Day is $25.95” (Source: The Gallup
bakery. Organization).
b. Water temperature of the saunas at a given health spa.
c. Incomes of single parents who attend a community 1–15. Select a newspaper or magazine article that involves
college. a statistical study and write a paper answering the
d. Lifetimes of batteries in a tape recorder. following questions.
e. Weights of newborn infants at a certain hospital. a. Is this study descriptive or inferential in nature? Explain
f. Capacity (in gallons) of water in swimming pools. your answer.
g. Number of pizzas sold last year in the United States. b. What are the variables used in the study? In your
opinion, what level of measurement was used to obtain the
1–10. Give the boundaries for each value. data from the variables?
a. 84.5 miles c. Does the article define the population? If so, how is it
b. 6 tons defined? If not, how could it be defined?
c. 0.15 milliliters d. Does the article state the sample size and how the
d. 17 inches sample was obtained? If so, explain the size of the sample
e. 8.72 ounces and how it was selected. If not, suggest a way it could have
f. 97 feet been obtained.
1–11. Name and define the four basic sampling methods. e. Explain in your own words what procedure (survey,
comparison of groups, etc.) might have been used to
1–12. (ans) Classify each sample as random, systematic, determine the study’s conclusions.
stratified, or cluster. f. Do you agree or disagree with the conclusions? State
a. In a large school district, all teachers from two buildings your reasons.
are interviewed to determine whether they believe the
students have less homework to do now than in previous 1–16. Information from research studies is sometimes
years. taken out of context. Explain why the claims of these
b. Every seventh customer entering a shopping mall is studies might be suspect.
asked to select his or her favorite store. a. The average salary of the graduates of the class of 1980
c. Nursing supervisors are selected using random numbers is $32,500.
in order to determine annual salaries. b. It is estimated that in Podunk there are 27,256 cats.
Section 1–7 Summary 25

c. Only 3% of the men surveyed read Cosmopolitan 1–18. Identify the independent variable(s) and the dependent
magazine. variable for each of the studies in the preceding exercise.
d. Based on a recent mail survey, 85% of the respondents 1–19. For each of the studies in Exercise 1–17, suggest
favored gun control. possible confounding variables.
e. A recent study showed that high school dropouts drink
1–20. In the 1980s, a study linked coffee to a higher risk of
more coffee than students who graduated; therefore, coffee
heart disease and pancreatic cancer. In the early 1990s,
dulls the brain.
studies showed that drinking coffee posed minimal health
f. Since most automobile accidents occur within 15 miles
threats. However, in 1994, a study showed that pregnant
of a person’s residence, it is safer to make long trips.
women who drank three or more cups of tea daily may be
1–17. Identify each study as being either observational or at risk for spontaneous abortion. In 1998, a study claimed
experimental: that women who drank more than a half-cup of caffeinated
a. Subjects were randomly assigned to two groups and one tea every day may actually increase their fertility. In 1998,
group was given an herb and the other group a placebo. a study showed that over a lifetime, a few extra cups of
After six months, the number of respiratory tract infections coffee a day can raise blood pressure, heart rate, and stress
each group had were compared. (Source: “Bottom Line: Is It Good for You? Or Bad?” by
b. A researcher stood at a busy intersection to see if the Monika Guttman, USA Today Weekend, February 26–28,
color of an automobile a person drives is related to running 1999, pp. 8–9). Suggest some reasons why these studies
red lights. appear to be conflicting.
c. A researcher finds that people who are more hostile *1–21. Find an article that describes a statistical study and
have higher total cholesterol levels than those who are less identify the study as observational or experimental.
hostile.
d. Subjects are randomly assigned to four groups. Each *1–22. For the article that you used in the previous
group is placed on one of four special diets—low-fat diet, a exercise, identify the independent variable(s) and
high-fish diet, a combination of low-fat diet and high-fish dependent variable for the study.
diet, and a regular diet. After six months, the blood *1–23. For the article that you selected in Exercise 1–21,
pressures of the groups are compared to see if diet has any suggest some confounding variables that may have an
affect on blood pressure. effect on the results of the study.

Are We Improving Our Diet? Revisited


Statistics Today
Researchers selected a sample of 23,699 adults in the United States, using phone num-
bers selected at random, and conducted a telephone survey. All respondents were asked
six questions:
1. How often do you drink juices such as orange, grapefruit, or tomato?
2. Not counting juice, how often do you eat fruit?
3. How often do you eat green salad?
4. How often do you eat potatoes (not including french fries, fried potatoes, or
potato chips)?
5. How often do you eat carrots?
6. Not counting carrots, potatoes, or salad, how many servings of vegetables do you
usually eat?
Researchers found that men consumed fewer servings of fruits and vegetables per
day (3.3) than women (3.7). Only 20% of the population consumed the recommended
5 or more daily servings. In addition, they found that youths and less-educated people
consume an even lower amount than the average.
Based on this study, they recommend that greater educational efforts are needed to
improve fruit and vegetable consumption by Americans and to provide environmental
and institutional support to encourage increased consumption.

Source: Mary K. Sendula, M.D., et al., “Fruit and Vegetable Intake among Adults in 16 States: Results of a
Brief Telephone Survey,” American Journal of Public Health 85, no. 2 (February 1995).
26 Chapter 1 The Nature of Probability and Statistics

Chapter Quiz
Determine whether each statement is true or false. If the 12. A study that involves no researcher intervention is called
statement is false, explain why. a. an experimental study.
b. a noninvolvement study.
1. Probability is used as a basis for inferential c. an observational study.
statistics. d. a quasi-experimental study.
2. The height of President Lincoln is an example of a 13. A variable that interferes with other variables in the
variable. study is called
3. The highest level of measurement is the interval level. a. a confounding variable.
b. an explanatory variable.
4. When the population of college professors is divided
c. an outcome variable.
into groups according to their rank (instructor, assistant
d. an interfering variable.
professor, etc.), and then several are selected from each
group to make up a sample, the sample is called a Use the best answer to complete the following
cluster sample. statements.
5. The variable of age is an example of a qualitative 14. Two major branches of statistics are and .
variable.
15. Two uses of probability are and .
6. The weight of pumpkins is considered to be a
continuous variable. 16. The group of all subjects under a study is called
a .
7. The boundary of a value such as 6 inches would be 5.9
inches to 6.1 inches. 17. A group of subjects selected from the group of all
subjects under study is called a .
Select the best answer.
18. Three reasons why samples are used in statistics are
8. The number of absences per year a worker has is an a. b. c. .
example of what type of data? 19. The four basic sampling methods are
a. Nominal a. b. c. d. .
b. Qualitative
20. A study that uses intact groups when it is not possible
c. Discrete
to randomly assign participants to the groups is called
d. Continuous
a(n) study.
9. What are the boundaries of 25.6 ounces?
21. In a research study, participants should be assigned to
a. 25–26
groups using methods if possible.
b. 25.55–25.65
c. 25.5–25.7 22. For each statement, decide whether descriptive or
d. 20–39 inferential statistics is used.
a. A recent study showed that eating garlic can lower
10. A researcher divided subjects into two groups
blood pressure.
according to gender and then selected members from
b. The average number of students in a class at White
each group for his sample. What sampling method was
Oak University is 22.6.
the researcher using?
c. It is predicted that the average number of
a. Cluster
automobiles each household owns will increase
b. Random
next year.
c. Systematic
d. Last year’s total attendance at Long Run High
d. Stratified
School’s football games was 8235.
11. Data that can be classified according to color are e. The chance that a person will be robbed in a certain
measured on what scale? city is 15%.
a. Nominal
23. Classify each as nominal, ordinal, interval, or ratio
b. Ratio
level.
c. Ordinal
a. Number of exams given in a statistics course.
d. Interval
Section 1–7 Summary 27

b. Ratings of word-processing programs as user- d. The amount of a drug injected into a rat.
friendly. e. The amount of sodium contained in a bag of potato
c. Temperatures of a sample of automobile tires tested chips.
at 55 miles per hour for six minutes. f. The number of cars stolen each week in a large city.
d. Weights of suitcases on a selected commercial 25. Give the boundaries of each.
airline flight. a. 3.2 quarts
e. Classification of students according to major field. b. 18 pounds
24. Classify each variable as discrete or continuous. c. 9 feet
a. The time it takes to drive to work. d. 0.27 centimeter
b. The number of credit cards a person has. e. 36 seconds
c. The number of employees working in a large
department store.

Critical Thinking Challenges


1. A study of the worlds’ busiest airports was conducted by b. Comment on how this study’s conclusion might have
Airports Council International. Describe three variables been reached.
that one could use to determine which airports are the c. What are the variables used in the study and how
busiest. What units would one use to measure these might they have been defined?
variables? Are these variables categorical, discrete, or d. How might the researcher measure the variables?
continuous? e. What would the population be for this study?
2. The results of a study published in Archives of General f. What factors other than dependence on a spouse
Psychiatry stated that male children born to women who might have influenced the results of the study?
smoke during pregnancy run a risk of violent and g. Do you think the gender of the patients would make
criminal behavior that lasts into adulthood. The results a difference in the results? Why or why not?
of this study were challenged by some people in the 6. The effect of living near or working on power lines has
media. Give several reasons why the results of this been a very controversial issue. The article on page 28
study would be challenged. states several conclusions.
3. The results of a study published in Neurological a. Do you think that even if the risk of both cancers
Research stated that second-graders who took piano (leukemia and brain cancer) is greater, the workers
lessons and played a computer math game more readily should be more concerned with electrocution or
grasped math problems in fractions and proportions than other accidents? Explain your answer.
a similar group who took an English class and played b. Do you think that “a big, relatively well-done study
the same math game. What type of inferential study was that doesn’t yield persuasive evidence of harm” is a
this? Give several reasons why the piano lessons could justifiable reason to build power lines near
improve a student’s math ability. residential areas? Explain your answer.
4. A study of 2958 collegiate soccer players showed that in c. Do you think that since the death rate of industrial
46 anterior cruciate ligament (ACL) tears, 36 were in workers is lower than in the general population, it is
women. Calculate the percentages of tears for each safer to work as an industrial worker than in some
gender. Can it be concluded that female athletes tear other occupation?
their knees more often than male athletes? d. Do you consider the sample size (138,905 men)
large enough to be used to state the conclusion?
5. Read the article on the next page and answer the Explain.
questions that follow. e. What does the statement “But brain cancer risk
The recent study reprinted below reached the was 50% higher for men who worked more than
following conclusion: five years as a lineman or electrician” mean
“A heart patient overly dependent on a spouse may to you?
have a harder time making necessary lifestyle changes f. Do you think the fact that the study was funded by
in diet and exercise.” the Electric Power Research Institute could have any
a. Do you agree or disagree with this statement? State effect on its findings?
your reasons.
28 Chapter 1 The Nature of Probability and Statistics

Heart to Heart
There is a flurry of new research findings • Depression and anxiety affect pain
about how heart heals the heart: perception and the capacity to function in
the face of medical symptoms.
• A heart patient overly dependent on a • In a study at Stanford University,
spouse may have a harder time making behavioral counseling after heart attack,
necessary life-style changes in diet and especially for hard-driving Type A
exercise. individuals, lowered the rate of recurrent
• For women heart attack victims, spousal heart attacks by 45%—the same as the most
support is critical—but hard to come by. powerful prescription drugs.
“The family sometimes feels abandoned,”
explains Martin Sullivan, “and they don’t Such findings have led Martin Sullivan
want the woman to take time out of her to introduce innovative techniques at the
duties as a wife and mother to make Duke Center. These include a program
important lifestyle changes. Women are known as PAIRS (Practical Application of
more willing to change for men.” Intimacy Relationship Skills), which
• For men, a heart attack may shatter the teaches couples healthy interactive skills,
sole definition of self (as family provider). and a meditation program that teaches
The introduction of larger concepts of the patients to freeze-frame a moment in time
self is therapeutic. and look at the emotional content of what
• Patients who feel a sense of self- they are experiencing. Says Sullivan of the
efficacy and control over their disease do Duke Center’s work, “We’re trying to take
better than those who don’t. the best of everything.”
Source: Reprinted with permission from PSYCHOLOGY TODAY MAGAZINE, Copyright © 1994
(Sussex Publishers, Inc.).

Power Lines’ Link to Brain Cancer


but No Tie Found with Leukemia
Long-term exposure to power lines appears though still low, has increased in recent
to slightly increase risk of brain cancer, but decades, he says.
not leukemia, among utility workers, says a The study differs from earlier ones
large study reported in the American because of its size and because it used
Journal of Epidemiology. actual measurements of exposure levels. It
Some earlier studies linked electro- looked at 138,905 men who worked for
magnetic fields to leukemia among utility power companies between 1950 and 1986,
workers, but no study has been conclusive including 20,733 who died.
on the risks. Overall death risk among the men was
But even if the risk is greater, both lower than the general population, probably
cancers remain rare enough that other because industrial workers are most likely
dangers to those workers—such as to be healthy, the researchers report.
electrocution or other work accidents— But brain cancer risk was 50% higher
should be a greater concern, says the for men who worked more than five years
University of North Carolina’s David as a lineman or electrician. Those exposed
Savitz, one of the study leaders. to the highest levels of magnetic fields had
Savitz is encouraged by “a big, more than double the risk.
relatively well-done study that doesn’t yield There was no association found between
persuasive evidence of harm.” But the amount of magnetic field exposure and
findings also may help explain why the rate leukemia. The research was funded by the
of brain cancer in the general population, Electric Power Research Institute.

Source: Doug Levy, “Power Lines’ Link to Brain Cancer.” USA Today, January 11, 1995. Copyright
1995, USA TODAY. Reprinted with permission.
Section 1–7 Summary 29

Speaking of STATISTICS
In these studies, behaviorists found that hotheadedness nature. What is the independent variable and what is the
can lead to heart disease. After reading this article, decide dependent variable for each study?
if these studies are experimental or observational in

Science sees heart trouble


for hostile personalities
Two studies published today give scientific credence to what
behaviorists have long suspected: Hotheadedness can lead to heart
disease. Researchers who studied 1,081 older men found that those with
high hostility scores (paranoid alienation, cynicism, agression, social
avoidance) were more likely to be overweight and insulin-resistant–
factors that lead to increased risk for heart disease. A smaller study, also
in the January/February Psychosomatic Medicine, found that among 80
men and women, those with high hostility ratings had a bigger jump in
blood pressure in mild, non-provoking social situations. Findings
suggest that those people who have “high levels of mistrust and
suspicion anticipate trouble in interpersonal situations, even before they
have any overt indication that there is some cause for alarm,” says
researcher Mary Davis, author of the smaller study.

Source: Copyright 2000, USA TODAY. Reprinted with permission.

S-ar putea să vă placă și