Documente Academic
Documente Profesional
Documente Cultură
002636-0089
1
Muhammad Rifqi Dwi Budianto
002636-0089
Table of Contents
Introduction 3
Sample of Questionnaire 4
Data Collection 5
Simple Mathematical Process 10
Sophisticated Mathematical Process 15
Validity 23
Conclusion 23
2
Muhammad Rifqi Dwi Budianto
002636-0089
Introduction
“Is there any correlation between grade and calorie intake in a day?”
During the time of adolescence, we often ignore the amount of food we consume.
We neglect the calories intake per meal causing teenagers to have malnutrition
or obese. Often as grade level is increasing, the amount of work also increases
therefore the necessity to keep the energy up is higher for higher grade level.
Also with higher grade level there will be more activities to do meaning that the
amount of nutrition we take is crucial. The idea came up when a healthy diet is
know whether the amount of calories consumed daily have a relationship with the
age of an individual. The hypothesis for this investigation is that age and calorie
students at the age 13,14,15,16,17 from the grade level 8 to 12. There are equal
amount of students per age and per grade level. In the simple mathematical
processes, I will be using mean, median and mode and for more sophisticated
the two variable between age and calorie intake are correlated or not. On the
other hand the chi squared test is used to test the relationship between age and
calorie intake. Lastly I will be collecting and analyzing the outlier that will affect
3
Muhammad Rifqi Dwi Budianto
002636-0089
Sample of Questionnaire
Name:
13 14 15 16 17
8 9 10 11 12
Signature
________________________
4
Muhammad Rifqi Dwi Budianto
002636-0089
The survey was conducted by The data collected a below took 2 weeks to complete involving
grade level from 8 to 12 with the range of age of 13 -17. The data collected was about the
correlation of age and calorie intake in a day. WRITE A FEW LINE EXPLAINING BELOW
5
Muhammad Rifqi Dwi Budianto
002636-0089
6
Muhammad Rifqi Dwi Budianto
002636-0089
7
Muhammad Rifqi Dwi Budianto
002636-0089
8
Muhammad Rifqi Dwi Budianto
002636-0089
The first process that I’m going to make and grouped the data collected from the survey. This
process takes quite sometime to make and it used as a part of this internal assessment. For this
assessment. From the data gathered, I made several graphics that will show the grouping of the
data where it is divided into 4 categories, they are 300-1300, 1301-2300, 2301-3300, >3300.
Where the first category shows the number of people who consumed less than should and the
fourth categories shows people who consumes over than they should.
Grade
No Calories 8 9 10 11 12
1 300-1300 12 8 4 14 9
2 1301-2300 15 16 10 14 15
3 2301-3300 3 5 11 1 6
4 > 3300 0 1 5 1 0
Total 30 30 30 30 30
From the data obtained above, it can be seen the grade level that consumes the most calories
lies in grade 10 where there are 11 people that got… calories between 2301 and 3300. When it
is reviewed statistically, therefore grade 10 suffers from obese or excessive nutrition intake.
Where as grade 11 is the grade level; who only got a tiny amount of calorie intake. It can be see
that 14 people who consumes the calorie of 300-1300 meaning that grade 11 didn’t get enough
nutrition that causes fatigue during activities throughout the day.
9
Muhammad Rifqi Dwi Budianto
002636-0089
10 300-1300
8 1301-2300
6
2301-3300
4
2 > 3300
0
8 9 10 11 12
Grade
If we look further from the graph above it can be seen that grades 8 and 12 do not consume
excessive calories in the range below 3,300 calories where we can assume that grade 8 the
grade level that consumes the least sufficient amount of food, considering the needs for grade
12 it is expected that with the amount of work and sleep they have been getting they should
consume more.
Other than bar chart, I also tried to make a pie chart for each of the grade level. This help me to
know the distribution of calories distributed for each class shown with the calculations below.
FOR REFERENCE BECAUSE THE AGE AND THE GRADE IS CONSTANT, THE WORD
GRADE SHOWS THE AGE OF THE SUBJECTS.
Grade 8 :
1. 12/30 *100%= 40
2. 15/30 * 100%= 50
3. 3/30 *100%= 10
4. 0/30*100%= 0
Grade 9: 8/30 *100%= 26.7
16/30 *100%= 53.3
5/30*100%= 16.7
1/30*100%= 0.033
Grade 10:
1. 4/30*100%=13.3
2. 10/30*100%=33.3
3. 11/30*100%=36.7
4. 5/30*100%= 16.7
Grade 11
10
Muhammad Rifqi Dwi Budianto
002636-0089
1. 14/30*100%= 0.47
2. 14/30*100%= 0.47
3. 1/30*100%=0.033
4. 1/30*100%=0.033
Grade 12
1. 9/30*100%= 30
2. 15/30*100% 15/30*=9.9
3. 6/30*100%=25
4. 0/30*100%= 0
Grade
No Calories 8 9 10 11 12
1 300-1300 12 8 4 14 9
2 1301-2300 15 16 10 14 15
3 2301-3300 3 5 11 1 6
4 > 3300 0 1 5 1 0
10%
300-1300
40%
1301-2300
2301-3300
50% > 3300
11
Muhammad Rifqi Dwi Budianto
002636-0089
17% 13%
300-1300
1301-2300
33% 2301-3300
37% > 3300
300-1300
47% 1301-2300
2301-3300
47%
> 3300
12
Muhammad Rifqi Dwi Budianto
002636-0089
20%
30% 300-1300
1301-2300
2301-3300
> 3300
50%
From all the graph above, we can see that grade 11 or the student age of 16 consumes the
most calories in a day. Its also clear that grade 9 or the student at the age of 14 consumes the
less amount of calorie in a day. The data also shows that the calories intake for grade 8 is well
balance and they are living in a healthy diet. At the same time the data from grade 12 shows a
well balanced nutrition in take in a day considering the amount of work that they required to do.
13
Muhammad Rifqi Dwi Budianto
002636-0089
To show different complete processes in statistic, Box and whisker plot also has been made
where it shows the summary of the 5 most important data which is the minimum, Q1 (lower
quartile), median, Q3 (upper quartile) and maximum value from the graph. From the graph
above, the following data can be extracted:
1. Minimum:450
2. Q1: 1250
3. Median: 1650
4. Q3:2250
5. Maximum:4400
Then I calculate the outlier where the outliers are unnatural data in statistical calculations and
must be issued so as not to cause data failure, where the outlier formula is
Outlier upper boundary = 1.5 x IQR +Q3 = 1.5 x 1000 + 2250 = 3750
Outlier lower boundary= Q1 -1.5 X IQR = 1250 – 1.5 x 1000 = -250
And after I analyzed there were two outliers, they were grade 10 as many as 4,300
calories and 4,400 calories, where the two data were unnatural data and had to be issued in
statistical calculations. I tried to use the method so that my statistical calculations are precise
and accurate
14
Muhammad Rifqi Dwi Budianto
002636-0089
- Correlation Coefficient
The correlation coefficient is according to
(https://mathbits.com/MathBits/TISection/Statistics2/correlation.htm) measures the strength
and the direction of a linear relationship between two variables. This shows the two variables
between grade and number of calories with the data shown below (try making more sentences)
∑(𝑥− 𝑥̅ )2
Sx = √ is the standard deviation of x According to
𝑛
(https://www.robertniles.com/stats/stdev.shtml ) standard deviation is a measure
of dispersement in statics. Dispersement tells you how much your data is spread
out.
∑(𝑦− 𝑦̅)2
Sy = √ is the standard deviation of y
𝑛
And also I use line best fit to predict grade level below grade 8 so that the data
needed can be analyzed from grades 8-12 with line best fit
15
Muhammad Rifqi Dwi Budianto
002636-0089
X mean = 10
Y mean = 1814.8
Sx = 1.4142
Sy = 805.1647
m = gradient = r* Sx/Sy = 0.018 X1.4142/805.1647 = 10.367
m is positive hence the curve goes upward
b = (y mean) - (m*x mean)
b = 1814.8 – 10.367 *10 = 1711.13
Age vs Calories
5000
4500
4000 y = 10.367x + 1711.1
R² = 0.0003
3500
3000
Calories
2500
Calories (y)
2000
1500 Linear (Calories (y))
1000
500
0
8 9 10 11 12 13
Grade
Calories Mean
Grade (x) Mean (xi) (y) of (yi) (x-xi) (y-yi) (xi-x)(yi-y) (x-xi)2 (y-yi)2
12 10 650 1814.8 2 -1164.8 -2329.6 4 1356759
12 10 1250 1814.8 2 -564.8 -1129.6 4 318999
12 10 2250 1814.8 2 435.2 870.4 4 189399
12 10 1950 1814.8 2 135.2 270.4 4 18279.04
12 10 1750 1814.8 2 -64.8 -129.6 4 4199.04
12 10 1750 1814.8 2 -64.8 -129.6 4 4199.04
12 10 1650 1814.8 2 -164.8 -329.6 4 27159.04
12 10 2450 1814.8 2 635.2 1270.4 4 403479
16
Muhammad Rifqi Dwi Budianto
002636-0089
17
Muhammad Rifqi Dwi Budianto
002636-0089
18
Muhammad Rifqi Dwi Budianto
002636-0089
19
Muhammad Rifqi Dwi Budianto
002636-0089
Mean:
𝑋 = 10, 𝑌 = 1814.8
300
Sx = √ = 1.412
150
97243544
Sy = √ = 805.164
150
3110
Sxy = = 20.73
150
3110
r = = 0.018208
√300 𝑥 97243544
r2 = 0.000331542
20
Muhammad Rifqi Dwi Budianto
002636-0089
Hypothesis test in this research will be using comparing chi square and critical value in the
table. If the chi square is lower than the critical value, then the null hypothesis will be accepted.
If chi square is higher than critical value, the the null hypothesis will be rejected and the
hypothesis will be accepted.5% is used as the significance level for testing the null and
alternative hypothesis. .5% significance level means the 95% of time the null hypothesis will be
correct.5% significance level means the 95% of the time the alternative hypothesis will be
correct.
Observed
No Calories 8 9 10 11 12 Total
1 300-1300 12 8 4 14 9 47
2 1301-2300 15 16 10 14 15 70
3 2301-3300 3 5 11 1 6 26
4 > 3300 0 1 5 1 0 7
Total 30 30 30 30 30 150
Expected
No Calories 8 9 10 11 12 Total
1 300-1300 9.4 9.4 9.4 9.4 9.4 47
2 1301-2300 14 14 14 14 14 70
3 2301-3300 5.2 5.2 5.2 5.2 5.2 26
4 > 3300 1.4 1.4 1.4 1.4 1.4 7
Total 30 30 30 30 30 150
Observed- Expected
No Calories 8 9 10 11 12 Total
1 300-1300 2.6 -1.4 -5.4 4.6 -0.4 47
2 1301-2300 1 2 -4 0 1 70
3 2301-3300 -2.2 -0.2 5.8 -4.2 0.8 26
21
Muhammad Rifqi Dwi Budianto
002636-0089
p = 0.05 df = 12 21.026
Looking at the table above, as we can see the degrees of freedom is 12 therefore with the 5%
significance level, the critical value will be 21.03. With the value of 31.07809 I can conclude that
that the null hypothesis which stated that the calorie intake in a day and age are independent is
rejected / not accepted.
22
Muhammad Rifqi Dwi Budianto
002636-0089
Validity
chi square calculation < Chi
Square Critical Value This investigation turned out to be not what I expected to be, the
21.03 < 31.07809 chi squared the graph shows how both variable has no correlation.
therefore, we must reject Also from the graph we can see that the connection between the
our null hypothesis Ho and two variables are very weak. The null hypothesis or Ho turned out
conclude that calorie intake
to be true with using the mathematical process such as comparing
in a day and age are not
independent. the result of critical value and chi squared value.
In the near future, it is good to get more reliable data. Unlike the
recent data, it is not very helpful to use to investigate because no
subjects tested were aware of their calorie intake in a day. Processing the data, we can see a
lot of biases, such things happened because of their lack of knowledge of the investigation and
the variable itself. From the look of it, most of my questionnaire were filled honestly because I
guided my test subject on calculating the amount of calories they take per meal and add them
all.
Conclusion
From the investigation that has been conducted, it is proven that the result supports the
hypothesis. The results show that there was no correlation between the two variables which is
the age and the calorie intake in a day. Errors was found in this investigation mostly occurs in
the gathering data process. Talking about correlation, it is found that during testing the Ho with
the chi squared test and critical value, the chi square is lower than critical value meaning that
the null hypothesis is accepted. The null hypothesis for this investigation is age and calorie
intake in a day are independent. Some limitations that can be found is that people are not
actually aware of how much calorie they consume in a day. The data would be more accurate if
a chart of food with the quantity of calorie is provided to the test subjects.
To be precise the data shows that the degrees of freedom is 12 therefore with the 5%
significance level, the critical value will be 21.03. With the value of 31.07809 I can conclude that
that the null hypothesis which stated that the calorie intake in a day and age are independent is
rejected / not accepted.
23