Sunteți pe pagina 1din 6

Biology 300, Biometrics Name: KEY

Key to Exam #1b, Winter Quarter 2010

PART I
A. Multiple Choice (30 points). Circle the best answer. Only one choice is “best”.

1. The standard error of the mean


(a) decreases as the sample size increases.
(b) is a measure of the error associated with a sample estimate of the population mean.
(c) is smaller than the standard deviation.
D (d) all of the above
(e) none of the above

2. Joan has a high-density lipoprotein (HDL) level that is at the 45th percentile and Isabel’s HDL
level is at the 25th percentile for women of their age. This means that
(a) Joan has a lower HDL level than Isabel.
B (b) 20% of women in their age group have a HDL level between Isabel and Joan.
(c) There is a 0.75 probability of a woman their age having a lower HDL level than Isabel.
(d) All of the above are true.
(e) None of the above are true.

3. In general, which of the following statements is true?


(a) The arithmetic mean is greater than or equal to the geometric mean.
(b) The harmonic mean is less than or equal to the geometric mean.
(c) The median is a measure of central tendency.
(d) The median is the same as the 50’th percentile.
E (e) All of the above are true.

4. A standard normal probability distribution


A (a) has a standard deviation equal to one.
(b) has a coefficient of variation equal to one.
(c) is a uniform probability distribution.
(d) is symmetric around one.
(e) All of the above are true.

5. A couple are both carriers for the recessive allele for phenylketonuria (PKU). Each of their
children will have 1/4 probability of inheriting PKU. If the couple plans on having two
children, the which of the following statements is true?
(a) From the addition rule there is a (1/4)+(1/4) = 1/2 probability that both of their children
will inherit PKU.
B (b) From the multiplication rule there is a (3/4)(3/4) = 9/16 both of their children will be
normal.
(c) From the multiplication rule there is a (1/4)(3/4) = 3/16 probability that one of their two
children will inherit PKU and the other child will be normal.
(d) All of the above are true.
(e) None of the above are true.
Biometrics — Winter 2010 Key to Exam #1b Page 2

6. A bird lays a clutch of seven eggs. If each egg has a 80% chance of hatching, then the number
of chicks produced will follow a
(a) Z-distribution.
B (b) binomial distribution.
(c) normal distribution.
(d) Poisson distribution.
(e) uniform distribution.

7. The State of Florida has records on the number of shark attacks per year from 1948-2009. If we
assume shark attacks are random events, the number of attacks per year should follow a
A (a) Poisson distribution.
(b) normal distribution.
(c) uniform distribution.
(d) binomial distribution.
(e) Z-distribution.

8. The central limit theorem states that as the sample size increases
(a) the sample median tends towards the sample mean.
(b) the standard deviation decreases.
(c) an estimated probability tends towards a binomial distribution.
D (d) the distribution of the sample mean tends towards a normal distribution.
(e) all of the above

9. Which of the following statistics is not a measure of dispersion?


(a) the range
(b) the mean deviation
(c) the standard deviation
(d) the coefficient of variation
E (e) All of the above are measures of dispersion.

10. Which of the following is an example of ordinal data?


(a) birth weight of an infant
(b) number of cousins for an infant
(c) sex of an infant
(d) color of an infant’s hair
E (e) none of the above
Biology 300, Biometrics Name: KEY
Key to Exam #1b, Winter Quarter 2010

PART II
B. Answer each question and show all intermediate calculations. Be Neat! You may use one sheet
of paper containing statistical formulas and statistical table A. Use the back sides of these sheets
if you need more space.

1. The following data represent the length (in mm) of beetle pupae:

8, 6, 11, 4, 7

(a) Compute the arithmetic mean, the geometric mean, the harmonic mean, and the median to
two decimal places. (8 points)

Rank X LN(X) 1/X Sample size: n = 5


4 8 2.0794 0.125
2 6 1.9459 0.167
Arithmetic mean: x =  X)/n = 36/5 = 7.20 mm
5 11 2.3979 0.091
1 4 1.3863 0.250
3 7 1.7918 0.143 Geometric mean: G = exp[( Ln X)/n] = exp[9.6013/5] = exp[1.92026]
SUM: 36 9.6013 0.775
= 6.82 mm

Harmonic mean: H = n/ (1/X)] = 5/[0.775] = 6.45 mm

Median: n=5 is odd, so median is the X-value with rank


(n+1)/2 = (5+1)/2 = 6/2 = 3.
median = X3rd = 11.00 mm

(b) Compute the standard deviation, the mean deviation, the range, and the 80th percentile to
two decimal places. (8 points)

Rank X X*X |X-x| Standard deviation: s = Sqrt([( X2) – ( X)2/n] / [n-1])


4 8 64 0.8 = Sqrt([(286) – (36)2/5] / [5-1])
2 6 36 1.2 = Sqrt([286 – 1296/5] / 4)
5 11 121 3.8 = Sqrt([286 – 259.2] / 4)
1 4 16 3.2 = Sqrt(26.8 / 4) = Sqrt(6.7) = 2.59 mm
3 7 49 0.2
SUM: 36 286 9.2
Mean deviation =  |X – x|)/n = 9.2 / 5 = 1.84 mm

Range = Max(X) – Min(X) = X5th – X1st = 11 – 4 = 7.00 mm

40th percentile: P=80, n=5, (nP)/100 = (5*80/100) = 800/100 = 4 which


is an integer, so the 80th percentile is the average of the X-values with
ranks 4 and 5.
40th percentile = (X4th + X5th)/2 = (8 + 11)/2 = 19/2 = 9.50 mm
Biometrics — Winter 2010 Key to Exam #1b Page 4

2. About 30% of the world’s population has blood type A+. If three people are randomly selected,
what is the probability distribution for the number of people with blood type A+. Show your
work and then write your answers in the table below to three decimal places. (12 points)

Number of blood type A+ Probability

0 0.343

1 0.441

2 0.189

3 0.027

Each person represents an independent trial and each trial has two possible outcomes: Blood type A+ or not blood type A+.
Therefore, the probabilities can be determined using the binomial distribution:

N! k N–k
Prob  x=k  = ------------------------- p q
k!  N – k !

where A = blood type A+,


B = not Blood type A+,
p = Prob(A) = probability that a person is blood type A+,
q = Prob(B) = 1–p,
x = the number of people selected with blood type A+,
N = total number of people selected.
For this problem, p = 0.3, q = 0.7, and N = 3. The probabilities are

3! 0 3–0 6
Prob  x=0  = ------------------------  0.3   0.7  = -----------  1   0.343  = 0.343
0!  3 – 0 ! 16

3! 1 3–1 6
Prob  x=1  = ------------------------  0.3   0.7  = -----------  0.3   0.49  = 0.441
1!  3 – 1 ! 12

3! 2 3–2 6
Prob  x=2  = ------------------------  0.3   0.7  = -----------  0.09   0.7  = 0.189
2!  3 – 2 ! 21

3! 3 3–3 6
Prob  x=3  = ------------------------  0.3   0.7  = -----------  0.027   1  = 0.027
3!  3 – 3 ! 61
Biometrics — Winter 2010 Key to Exam #1b Page 5

3. The following table represents the number of lamprey scars per fish in a random sample of fish
from the Mississippi:

Number of scars
Number of fish Probability Expected values
per fish

0 20 0.4066 16.26

1 9 0.3659 14.64

2 6 0.1647 6.59

3 5 0.0628 2.51

(a) Compute the mean, standard deviation, and median for the number of lamprey scars per
fish to two decimal places. (10 points)
X f f*X f*X^2 Ranks Sample size: n =  f = 40
0 20 0 0 1-20
1 9 9 9 21-29 Arithmetic mean: x = ( f*X)/n = 36/40 = 0.90 scars/fish
2 6 12 24 30-35
3 5 15 45 36-40
SUM: 40 36 78 Std. Dev. = s = Sqrt([ ( f*X2) – ( f*X)2/n ]/[n–1])
= Sqrt([ (78) – (36)2/40 ]/[40–1])
= Sqrt([ (78) – (1296)/40 ]/[39])
= Sqrt([ 78 – 32.4 ]/[39])
= Sqrt(45.6/39) = Sqrt(1.169231) = 1.08 scars/fish

Median: n=40 is even, so median is the average of the X-values with


ranks (n/2)=20 and 1+(n/2)=21.
Median = (0+1)/2 = 0.5 scars/fish.

(b) Fit a Poisson distribution to these data using the sample mean computed in part (a). Enter
the probabilities (to four decimal places) and expected values (to two decimal places) for
the number of lamprey scars per fish into the table above. (16 points)
The probabilities can be determined using the Poisson distribution: Prob(X=k) = (e–)k )/(k!), where the sample mean
x= 0.9 is used in place of .

Prob(X=0) = exp(–0.90) (0.90)0/(0!) = (0.406570) (1)/(1) = 0.4066 Expt = n Prob(X=0) = 40 (0.4066) = 16.26
1
Prob(X=1) = exp(–0.90) (0.90) /(1!) = (0.406570) (0.90)/(1) = 0.3659 Expt = n Prob(X=1) = 40 (0.3659) = 14.64
2
Prob(X=2) = exp(–0.90) (0.90) /(2!) = (0.406570) (0.81)/(2) = 0.1647 Expt = n Prob(X=2) = 40 (0.1647) = 6.59

Prob(X3) = 1 – Prob(X2) = 1 – [ Prob(X=0) + Prob(X=1) + Prob(X=2) ]


= 1 – [ 0.4066 + 0.3659 + 0.1647 ] = 1 – 0.9372 = 0.0628
Expt = n Prob(X3) = 40 (0.0628) = 2.51
Biometrics — Winter 2010 Key to Exam #1b Page 6

4. Soil samples from a construction site had a mean PCB concentration of  = 27.3 mg/kg and a
standard deviation of  = 5.0 mg/kg.
(a) What is the percentile (to two decimal places) for an PCB concentration of 22.0 mg/kg? (5
points)
For this problem, = 27.3 mg/kg,  = 5.0 mg/kg, and X1 = 22.0 mg/kg. The percentile is given by

percentile = Prob(X < X1) (100%)


= Prob(Z < Z1 = (X1 – )/) (100%)
= Prob(Z < (22.0 – 27.3)/5.0) (100%)
= Prob(Z < –5.3 / 5.0) (100%)
= Prob(Z < –1.06) (100%)
= [0.5 – Prob(0 < Z < 1.06)] (100%)
= [0.5 – A] (100%)
= [0.5 – 0.3554] (100%) = (0.1446) (100%) = 14.46% or 14th percentile.

(b) What PCB concentration (to one decimal place) corresponds to the 72nd percentile? (5
points)
For this problem, = 27.3 mg/kg,  = 5.0 mg/kg, and the percentile = 72%. The corresponding value of X1 is the solution
to Prob(X < X1) = 0.72. From the normal probability table, (0.5+A) = 0.72, so A = 0.72–0.5 = 0.22. This gives a value of Z1
= 0.58 from Table A.

0.72 = Prob(Z < Z1 = (X1 – )/)


= Prob(X < X1 =  + Z1)
= Prob(X < X1 = 27.3 + (5.0) (0.58))
= Prob(X < X1 = 27.3 + 2.9) = Prob(X < X1 = 30.2)

Thus, X1 = 30.2 mg/kg is the 72nd percentile.

(c) In what proportion (to four decimal places) of soil samples can one expect PCB
concentrations that are between 20.0 mg/kg and 30.0 mg/kg? (6 points)
For this problem, = 27.3 mg/kg,  = 5.0 mg/kg, X1 = 20.0 mg/kg and X2 = 30.0 mg/kg.

Prob(X1 < X < X2) = Prob( (X1 – )/ < Z < (X2 – )/)
= Prob( (20.0 – 27.3)/5.0 < Z < (30.0 – 27.3)/5.0) )
= Prob( –7.3/5.0 < Z < 2.7/5.0 )
= Prob( –1.46 < Z < 0.54 )
= Prob( –1.46 < Z < 0 ) + Prob( 0 < Z < 0.54 )
= A1 + A2 = 0.4279 + 0.2054 = 0.6333.

Thus, 63.33% of the soil samples are predicted to have PCB concentrations between 20.0 mg/kg and 30.0 mg/kg.

S-ar putea să vă placă și