Documente Academic
Documente Profesional
Documente Cultură
FINAL REPORT
Executive Summary
The two variables are time taken for exercise per week in minutes and weight of
student. These two variables are both quantitative variables. From the two
variables, the independent variable is time taken for exercise in minutes and the
dependent variable is the weight of students in kg. The method of data
collection used is survey with questionnaires. The questionnaires have been
design and distributed to the 20 students in class 1C2, foundation of commerce
and arts. The samples are selected by using drawing from hats.
We tabulate data collected from the questionnaires. We also draw graph such as
bar chart, pie chart, histogram, frequency polygon and ogive due to the data.
From the results of the survey, we have made calculation and conclusion
regarding the two variables. We also made analysis and assumption on the
results of those students.
1
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Set X Set Y
Variables/Analysis (Time taken for (Weight of student
exercise in minute) in kg)
Median 38 54.875
Covariance 259.19
2
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
3
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
1. The table below shows the simple frequency distribution table of the
amount of males and females in our samples.
30%
70%
The pie chart above shows that the number of male and female students who
selected as sample and taking part in the survey questionnaire is almost the
same. The ratio between male and female students is approximately 3:7. This
shows that the number of female students is more than male students. The
sampling technique used is simple random sampling .
4
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
2. The table below shows the simple frequency distribution table of the age of
20 students in class 1C2, foundation of commerce and arts.
8
6
4
2
0
15-16 17-18 19-20 21-22 23-24
Age
From the histogram, we know that the majority of the participants are 17 to 18
years old. This is followed by participants of 19 to 20 years old. There is no
21years old and 22 years old participants. We have also participant between 23
to 24 years old. This bar chat tell us that population of students in 1C2 where
mostly the age of 17 to 18 years old. There are student who are 19 years old and
above.
5
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
3. The table below shows the simple frequency distribution table of the number
of student who have obesity or not.
10
8
6
4
2
0
Yes No
Obesity
The bar chart above shows that the number of non-obese students is more than
number of obese students. There are 3 students who have obesity. 17 of the
students dont have obesity. This also shows that most of the students have good
health.
6
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
4. The table below shows the simple frequency distribution table of the amount
of student who like exercise or not.
Like Exercise Frequency, f Percentage Relative
Frequency (%) Frequency
Yes 10 50 0.50
No 10 50 0.50
Total 20 100 1.00
Yes
50% 50%
No
From the pie chart, the percentage of students who like to exercise and dislike
exercise is represented. 50% of the students like to exercise and 50% of students
dislike exercise. This again shows that all students in 1C2 have equal interest in
exercising.
7
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
5. The table below shows the simple frequency distribution table of the amount
of time spend for exercise(minutes) per week.
Times Spend For Upper Frequency, f Cumulative Frequency
Exercise Per Week (min) Boundary
0 0 0 0
0-30 30.5 8 8
31-45 45.5 5 13
46-60 60.5 4 17
61-80 80.5 0 17
81-90 90.5 3 20
Total 20
25
20
Cumulative Frequency
15
10
0
0 20 40 60 80 100
Time,mins
The ogive shows the time spend by students for exercise per week. From the
ogive, 8 students have spent less than 30minutes for exercise per week. 12
students have spent more than 30minutes for exercise per week. 17 students
have spent 60minutes for exercise per week. 3 students had spent more than
60minutes for exercise per week.
8
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
6. The table below shows the simple frequency distribution table of the number
of student who weight decrease after exercise consecutively per week.
Weight Decrease After Exercise Frequency, f Percentage Relative
Consecutively A Week Frequency (%) Frequency
Yes 10 50 0.5
No 10 50 0.5
Total 20 100 1.00
10
8
6
4
2
0
Yes No
The above bar chart show the number of students who successfully decrease
their weight after exercise consecutively a week. The bar chat show that 10 of
the students weight decreases after exercise consecutively a week. 10 of the
students did not lost their weight after exercise consecutively a week. This may
because of amount of food intake is different between them.
9
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
7. The table below shows the simple frequency distribution table of the weight
of student in kg.
6
Frequency
0
37.5 45 55.5 65.5 75.5 83 88
Weight,kg
The above frequency shows that most of the students have weight class of 51-
60kg with midpoint of 55.5kg that is around 8 students. This is following by
frequency of 7students who weight 40-50kg with midpoint of 45 as shown in
the above frequency polygon. Midpoint of 65.5kg and 75.5kg which is weight
about 61-70kg and 71-80kg has the same amount of students of two. There is no
students weight lower than 40kg and more than 85kg. This is show as the
frequency of midpoint of 37.5 and 88 is 0.This frequency polygon also show
that the students have a balance weight with their age.
10
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
8. The table below shows the simple frequency distribution table of the number
of student who eat diet pills or not.
Eating Diet Pills Frequency, f Percentage Relative
Frequency (%) Frequency
Yes 1 5 0.05
No 19 95 0.95
Total 20 100 1.00
5%
95%
From the pie chart, most of the students in 1C2 did not eat diet pills is showed.
19 students didnt eat diet pills. Only 1 student eat diet pills to keep fit. As the
students know that eating too much diet pills maybe harmful to the body health,
so most of them did not prefer to eat diet pills.
11
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Analysis Results
A) The table below shows the simple frequency distribution table of times
spend by student for exercise per week in minutes (X).
Times Spend For Exercise Per Week, Frequency (f) Class Midpoint ( ) Cumulative
minutes Frequency
0-30 8 15.0 8
31-45 5 38.0 13
46-60 4 53.0 17
61-80 0 70.5 17
81-90 3 85.5 20
Mean
k
fx i i
Mean, x i 1
k
f
i 1
i
(815)+(538)+(453)+(070.5)+(385.5)
=
20
778.5
=
20
= 38.925
Median
x n 1
Median position = 2
20+1
=
2
=10.5
Median Class = 31-45
45+46
Upper Boundary, U B =
2
= 45.5
30+31
Lower Boundary, LB =
2
=30.5
12
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Class Width = U B - LB
= 45.5-30.5
=15
n 1
( FB )
Median= LB C 2
fm
10.58
=30.5+15( )
5
=30.5+15(0.5)
=38
Mode
Modal class = 0-30
Lower Boundary, LB (for mode) =0
Upper Boundary. U B (for mode) =30.5
B
Mode = LB C ( )
B A
80
= 0+30.5 ( )
(80)+(85)
8
= 0+30.5 ( )
8+3
=22.1818
Frequency
13
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Variance
n
f x
2
i i
Variance, s x2 i 1
n
(x) 2
f
i 1
i
42186.75
= - (38.925) 2
20
= 594.1819
Standard Deviation
n
f x
2
i i
Standard Deviation, sx i 1
n
(x) 2
f
i 1
i
= 594.1819
= 24.3758
Skewness
3( x median)
Skewness, sk =
s
3(38.92538)
=
24.3758
= 0.1138
14
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
B) The table below shows the simple frequency distribution table of weight
of student in kg (Y).
Mean
k
fx i i
Mean, x i 1
k
f
i 1
i
(745)+(855.5)+(265.5)+(275.5)+(183)
=
20
1124
=
20
= 56.2
Median
Median position = x n1
2
20+1
=
2
=10.5
Median class = 51-60
61+60
Upper Boundary, U B (for median) =
2
= 60.5
51+50
Lower Boundary, LB (for median) =
2
=50.5
15
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Class Width= U B - LB
= 60.5-50.5
=10
n 1
( FB )
Median= LB C 2
fm
10.57
=50.5+10( )
8
=50.5+10(0.4375)
=54.875
Mode
Frequency
Weight (kg)
Mean Median Mode
16
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Variance
f x
2
i i
Variance, s x2 i 1
n
(x) 2
f
i 1
i
65687
= - (56.2) 2
20
= 125.91
Standard Deviation
f x
2
i i
Standard Deviation, sx i 1
n
(x) 2
f
i 1
i
= 125.91
= 11.2210
Skewness
3( x median)
Skewness, sk=
s
3(56.254.875)
=
11.2210
= 0.3542
17
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Quartiles
Table of Times spend for exercise per week in minutes, x and weight of
students in kg, y.
Weight of student in kg, y 45 45 45 45 45 45 45 55.5 55.5 55.5 55.5 55.5 55.5 55.5 55.5 65.5 65.5 75.5 75.5 83
Minimum value = 15
Maximum value = 85.5
1
Lower quartile position, Q1 = (20+1)
4
21
=
4
= 5.25 th
Difference between 5th and 6th values = 0.25 (15-15)
=0
Lower quartile, Q1 = 0+15
=15
1
Median position, Q 2 = (20+1)
2
= 10.5th
Difference between 10th and 11th values = 0. 5 (38-38)
=0
Median, Q 2 = 0+38
= 38
18
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
3
Upper quartile position, 3 = (20+1)
4
= 15.75th
Difference between 15th and 16th values = 0.75 (53-53)
=0
Upper quartile, 3 = 0+53
= 53
Interquartile Range = 3 - 1
= 53-15
= 38
Hence, -42 < Outlier < 110. Therefore, there is no outlier in the data.
19
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Minimum value = 45
Maximum value = 83
1
Lower quartile position, Q1 = (20+1)
4
21
=
4
= 5.25 th
Difference between 5th and 6th values = 0.25 (45-45)
=0
Lower quartile, Q1 = 0+45
=45
1
Median position, Q 2 = (20+1)
2
= 10.5th
Difference between 10th and 11th values = 0. 5 (55.5-55.5)
=0
Median, Q 2 = 0+55.5
= 55.5
3
Upper quartile position, 3 = (20+1)
4
= 15.75th
Difference between 15th and 16th values = 0.75 (65.5-55.5)
= 0.7510
= 7.5
Upper quartile, 3 = 7.5+55.5
= 63
Interquartile Range = 3 - 1
= 63-45
= 18
20
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
21
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Box Plot X
80
70
60
50
Upper Quartile = 53
40
30
Median = 38
20
Lower Quartile = 15
10
0
Minimum value = 15
22
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Box Plot Y
90
Maximum value = 83
80
70
60
Upper Quartile = 63
50
Median = 55.5
40
Lower Quartile = 45
30
Minimum value = 45
20
10
23
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Measure of Association
Weight of student in kg, y 45 45 45 45 45 45 45 55.5 55.5 55.5 55.5 55.5 55.5 55.5 55.5 65.5 65.5 75.5 75.5 83
70
60
50
40
30
20
10
0
0 10 20 30 40 50 60 70 80 90
Time spend for exercise per week in minutes
Covariance is positive because the graph line is upward sloping: X and Y are positively
related. That means when X increases, Y also increases. Correlation Coefficient is strong
because the points are closely packed together to each other.
24
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Mean
(158)+(385)+(534)+(85.53)
Mean of x, =
20
778.5
=
20
= 38.925
(745)+(855.5)+(265.5)+(275.5)+(183)
Mean of y, =
20
1124
=
20
= 56.2
Variance
42186.75
= - (38.925) 2
20
= 594.1819
65687
= - (56.2) 2
20
= 125.91
Standard Deviation
= 24.3758
25
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Covariance
n
x y i i
Covariance, sxy xy
i 1
n
=
(45157)+(1555.5)+(3855.53)+(5355.52)+(5365.52)+(85.575.52)+(85.583)
20
(38.925)(56.2)
48935.5
= (38.925)(56.2)
20
= 259.19
Correlation Coefficient
s xy
Correlation Coefficient, rxy
sx s y
259.19
= (24.375811.2210)
= 0.95
Positive covariance shows positive relationship. In a positive relationship, as X variable
increases, the Y variable also increases. In the other way, when X variable decreases, the Y
variable also decreases. The correlation coefficient is near to +1, which shows that there is
strong positive linear relationship between the X and Y variables.
26
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Mean of x, = 38.925
Mean of y, = 56.2
Variance of x, 2 = 594.1819
Variance of y, 2 = 125.91
Covariance, = 259.19
Covariance
b, Slope =
Variance of x
259.19
=
594.1819
= 0.4362
=56.2 16.9791
=39.22
27
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
There are two factors that affect the accuracy of prediction of a linear regression
line, that are:
Regression can be used for prediction. With the equation y = 39.22 + 0.4362x,
prediction can be easily made.
(a) The stronger the relationship between two variables, the higher the
accuracy of the prediction. When the correlation coefficient is higher, the
relationship is stronger. This mean when the correlation coefficient is
higher, the prediction will be more accurate and reliable. The correlation
coefficient is 0.95, which is near to +1. This show there is a strong
positive relationship between the two variables. Therefore, the value
predicted is accurate and reliable.
y = 39.22 + 0.4362x
Interpolated value = 38
y = 39.22 + 0.4362(38)
= 55.7956
Since 38 is the interpolated value which is in the range of 15< x <85.5,
the prediction is very accurate and reliable.
28
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
70
60
50
40
30
20
10
0
0 10 20 30 40 50 60 70 80 90
Time spend for exercise per week in minutes
29
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Subject x y 2
A 15 45 45.763 -0.763 0.5822
B 15 45 45.763 -0.763 0.5822
C 15 45 45.763 -0.763 0.5822
D 15 45 45.763 -0.763 0.5822
E 15 45 45.763 -0.763 0.5822
F 15 45 45.763 -0.763 0.5822
G 15 45 45.763 -0.763 0.5822
H 15 55.5 45.763 9.737 94.8092
I 38 55.5 55.7956 -0.2956 0.0874
J 38 55.5 55.7956 -0.2956 0.0874
K 38 55.5 55.7956 -0.2956 0.0874
L 38 55.5 55.7956 -0.2956 0.0874
M 38 55.5 55.7956 -0.2956 0.0874
N 53 55.5 62.3386 -6.8386 46.7664
O 53 55.5 62.3386 -6.8386 46.7664
P 53 65.5 62.3386 3.1614 9.9944
Q 53 65.5 62.3386 3.1614 9.9944
R 85.5 75.5 76.5151 -1.0151 1.0304
S 85.5 75.5 76.5151 -1.0151 1.0304
T 85.5 83 76.5334 6.4666 41.8169
Total 778.5 1124 1124 0 256.7209
30
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Mean of y, = 56.2
Subject x y ( )2
A 15 45 -11.2 125.44
B 15 45 -11.2 125.44
C 15 45 -11.2 125.44
D 15 45 -11.2 125.44
E 15 45 -11.2 125.44
F 15 45 -11.2 125.44
G 15 45 -11.2 125.44
H 15 55.5 -0.7 0.49
I 38 55.5 -0.7 0.49
J 38 55.5 -0.7 0.49
K 38 55.5 -0.7 0.49
L 38 55.5 -0.7 0.49
M 38 55.5 -0.7 0.49
N 53 55.5 -0.7 0.49
O 53 55.5 -0.7 0.49
P 53 65.5 9.3 86.49
Q 53 65.5 9.3 86.49
R 85.5 75.5 19.3 372.49
S 85.5 75.5 19.3 372.49
T 85.5 83 26.8 718.24
Total 778.5 1124 0 2518.2
= 2518.2-256.7209
= 2261.4791
31
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Goodness of Regression, 2 =
2261.4791
=
2518.2
= 0.8981
The higher the value of2 , the better is the regression. Since the 2 close to +1,
the regression is good. Therefore, the predictions using this model are accurate
and reliable. Around 93% of the variation in the Weight of student in kg (y) can
be explained by the number of time student consumes fast food per week (x).
89% of variation in y can be explained accurately and reliably using this model.
32
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Appendix
Male Female
Yes No
Yes No
State reason:
5) How much time you have spent during exercise per week?
Yes No
Yes No
33
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
Appendix
Gender Frequency
Female 14
Male 6
Total 20
Age Frequency
15-16 0
17-18 15
19-20 4
21-22 0
23-24 1
Total 20
3) The table below show number of student who have obesity or not.
Obesity Frequency
Yes 3
No 17
4) The table below shows amount of student who like exercise or not.
34
BUSINESS STATISTICS 061 - FP-036
FINAL REPORT
5) The table below shows the amount of time student spent for exercise per
week.
6) The table below shows number of student who weight decrease or not after
exercise consecutively per week.
Weight of Frequency
student, kg
40-50 7
51-60 8
61-70 2
71-80 2
81-85 1
Total 20
8) The table below shows number of student who eat diet pills or not.
35