Documente Academic
Documente Profesional
Documente Cultură
Math 1040
Term Project Skittles data
In this project we are comparing data collected by the class for 2.17oz bags
of Skittles. We are comparing the different charts to look at the data
collected. Such as Pareto charts, pie charts, histograms, and boxplots. We
are looking at the mean, standard deviation, 5 number summaries,
proportions, and the different tables of collected data.
The total sample size is 2365 candies.
The proportion of red candies is .217,
The proportion of orange is .185,
The proportion of yellow is .188,
The proportion of green is .206,
The proportion of purple is.204.
Pareto chart for Skittles data
520
500
480
460
440
420
400
RED
GREEN
PURPLE
YELLOW ORANGE
red
orange
yellow
green
purple
Orange
9
10
Yellow
17
Green
13
Purple
14
Orange
438
Yellow
444
Green
487
Purple
483
Reflection: The difference between categorical and quantitative data are that
categorical is something your comparing that doesnt add up or the numbers
dont mean anything, such as eye color or social security numbers.
Quantitative data is something that consists of numbers that can represent
counts or measures. The Pareto and Pie charts are what we are using for the
categorical data, and the Histogram and Boxplot are what we used for the
quantitative data. For Histogram and Boxplots you use measures and
numbers to chart the data., for Pie charts and Pareto charts you are charting
the colors by how many are present. You cant measure the colors. The
calculation being used for Categorical data are the proportions and tables
showing amounts of proportions. The calculations being used for Quantitative
data are the mean, standard deviation, and 5 number summaries, because
they are measuring something that is measurable. Numbers that you can
count
Confidence Interval Estimates: Confidence Interval estimate is an interval
estimate, or a range of values that are used to estimate a true population
parameter.
-Construct a 99% confidence interval estimate for the true proportion of
yellow candies, using the calculator functions of 1-prop-int, to find this
interval (0.167, 0.208)
-Construct a 95% confidence interval estimate for the true mean number of
candies, using the calculator function of T-interval (57.945, 60.255)
-Construct a 98% confidence interval estimate for the standard deviation of
the number of candies per bag. Using the formula for finding the standard
deviation we found the interval to be (2.8, 4.8)
Hypothesis Tests: A hypothesis is a claim or statement about a property of a
population. A hypothesis test (or test of significance) is a procedure for
testing a claim about a property of a population.
-Use a 0.05 significance level to test claim that 20% of all skittles candies are
red.
Claim = Null hypothesis: p= 0.20
Alternative hypothesis: p 0.20
Test statistic: Z= 2.0563
P-Value: p= 0.0398
Since the P-value is less than 0.05 significance level we will reject the Null
Hypothesis.
In other words we do not have sufficient evidence to support the claim that
20% of all candies are red.
Use a 0.01 significance level to test the claim that the mean number of
candies in a bag is greater than 55.
Null Hypothesis: Mean= 55
professions there are many applications for these things we have learned in
this class also.
These skills will definitely help me in the rest of my education. This summer I
will start The RN-BSN program to finish my degree and I can see how this will
help me with the rest of the classes I will be taking.