(BST)
Dr. Pritha Guha
Session: 1,2
Teaching and Grading
Pritha Guha (email: pritha@xlri.ac.in)
Text Book: Bowerman B., O’Connell R., Murphy E., Business Statistics in Practice, 8th ed.,
McGraw Hill Education (India)
Grading:
Mid term (based on session 1-8): 25%
End term (based on all the sessions): 75%, Take home exam
Students are required to bring their laptops to the sessions. For the midterm they must
bring non-programmable scientific calculators; laptops will not be allowed.
Scales of Measurement
Examples:
• Gender: Male____, Female_____
• Students in a college are classified by the department in which they are enrolled
using a nonnumeric label such as Architecture and Planning(AP), Management(M),
Law(L), Technology(T) and so on.
• Alternatively, assign a numeric code to the school variable (e.g., 1 denotes Architecture
and Planning, 2 denotes Management, 3 denotes Law, 4 denotes Technology).
Examples:
• Mili has a GATE score of 1205 while Kiran has a GATE score of 1090. Mili scored 115
points more than Kiran.
• The temperature in Jamshedpur today at 8 AM was 22°C; in Shimla it was
19°C.
Variables
• Non-numeric: Nominal, Ordinal
• Quantitative (numeric): Interval, Ratio
• If we record the total number of cars sold by a particular car salesperson in each
of the years 2011, 2012, 2013, 2014 and 2015, are the data
a) Time series  b) Cross-sectional  c) Panel data?
1 6 4 4 2 4
2 7 3 4 3 3
3 4 2 6 4 2
4 3 3 6 6 2
5 4 6 2 5 2
6 6 3 5 5 2
33 Basic Statistics, June 2019
A new bottle design for a popular soft drink: Consumer reaction
6 3 1 7 5 2 5 2 3 7
7 2 5 3 6 7 7 5 7 6
4 7 2 1 6 2 2 3 1 1
3 5 7 4 2 7 4 6 5 7
4 2 5 7 6 4 5 7 3 5
6 4 7 7 6 7 4 6 1 4
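A frequency distribution is a natural first summary of ratings like these. The sketch below tallies the 60 values listed above. It is written in Python for illustration (the course itself uses R), and the function name `frequency_table` is a hypothetical helper, not something from the slides. The meaning of the rating scale is not stated on the slide, so the code only counts values.

```python
from collections import Counter

# The 60 ratings from the slide, entered row by row.
ratings = [
    6, 3, 1, 7, 5, 2, 5, 2, 3, 7,
    7, 2, 5, 3, 6, 7, 7, 5, 7, 6,
    4, 7, 2, 1, 6, 2, 2, 3, 1, 1,
    3, 5, 7, 4, 2, 7, 4, 6, 5, 7,
    4, 2, 5, 7, 6, 4, 5, 7, 3, 5,
    6, 4, 7, 7, 6, 7, 4, 6, 1, 4,
]


def frequency_table(values):
    """Return (value, count, relative frequency) rows, sorted by value."""
    counts = Counter(values)
    n = len(values)
    return [(v, counts[v], counts[v] / n) for v in sorted(counts)]


for value, count, rel in frequency_table(ratings):
    print(f"{value}: {count:2d}  ({rel:.1%})")
```

The relative frequencies sum to 1, so the same table can feed a bar chart or a percentage summary.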
Computations Using R
• Download R from https://cran.r-project.org/
                         Total
Bond      15    3   12      30
Stock     24    2    4      30
Taxdef     1   15   24      40
Total     40   20   40     100
• Sample Mean: $\bar{x} = \frac{\sum_{i=1}^{n} x_i}{n}$
• Population Mean: $\mu = \frac{\sum_{i=1}^{N} x_i}{N}$
• Sensitive to outliers
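A small numeric check makes the sensitivity concrete. This is a minimal sketch in Python (for illustration; the course itself uses R) with made-up data. The values and the name `sample_mean` are assumptions, not taken from the slides.

```python
from statistics import median


def sample_mean(values):
    """Arithmetic mean: sum of the observations divided by their count."""
    return sum(values) / len(values)


# Hypothetical data, chosen only to illustrate the point.
data = [4, 5, 6, 7, 8]
with_outlier = data + [60]

print(sample_mean(data), median(data))                  # mean and median agree
print(sample_mean(with_outlier), median(with_outlier))  # the mean jumps, the median barely moves
```

One extreme observation pulls the mean from 6 to 15, while the median only shifts from 6 to 6.5.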
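• First Quartile (Q1) = 25th Percentile: $Q_1 = \frac{n+1}{4}$-th ranked value
• Second Quartile (Q2) = 50th Percentile = Median
• Third Quartile (Q3) = 75th Percentile: $Q_3 = \frac{3(n+1)}{4}$-th ranked value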
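The rank rule can be sketched in a few lines of Python (for illustration; the course itself uses R). The slides do not say what to do when the rank is not a whole number; the sketch assumes linear interpolation between the two neighbouring ranked values, which is one common convention. The names `ranked_value` and `quartiles` are hypothetical.

```python
def ranked_value(sorted_values, rank):
    """Value at a 1-based rank; fractional ranks are linearly interpolated (an assumption)."""
    n = len(sorted_values)
    rank = min(max(rank, 1), n)   # keep the rank inside 1..n
    lower = int(rank)             # whole part of the rank
    frac = rank - lower           # fractional part of the rank
    if frac == 0:
        return sorted_values[lower - 1]
    below = sorted_values[lower - 1]
    above = sorted_values[lower]
    return below + frac * (above - below)


def quartiles(values):
    """Q1, Q2, Q3 at ranks (n+1)/4, (n+1)/2 and 3(n+1)/4 of the sorted data."""
    xs = sorted(values)
    n = len(xs)
    return (
        ranked_value(xs, (n + 1) / 4),
        ranked_value(xs, (n + 1) / 2),
        ranked_value(xs, 3 * (n + 1) / 4),
    )


# Hypothetical data with n = 7, so the ranks are 2, 4 and 6.
print(quartiles([7, 1, 3, 9, 5, 11, 13]))
```

With n = 8 the ranks become 2.25, 4.5 and 6.75, which is where the interpolation assumption matters.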
Mode
• The mode is another measure of central location.
• The most frequently occurring value in a data set
• Used to summarize qualitative data
• A data set can have no mode, one mode (unimodal), or many modes
(multimodal).
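The three cases (no mode, one mode, several modes) can be illustrated with a short Python sketch (for illustration; the course itself uses R). The helper name `modes` is hypothetical, and treating "every value occurs exactly once" as "no mode" is a convention assumed here.

```python
from collections import Counter


def modes(values):
    """Return the most frequent value(s), or [] when every value occurs only once."""
    counts = Counter(values)
    if not counts:
        return []
    top = max(counts.values())
    if top == 1 and len(counts) > 1:
        return []  # all values are distinct: treated as "no mode" (assumed convention)
    return sorted(v for v, c in counts.items() if c == top)


print(modes([1, 2, 3]))                    # no mode
print(modes([1, 2, 2, 3]))                 # unimodal
print(modes(["AP", "M", "M", "L", "L"]))   # multimodal, qualitative labels
```

The last call uses department labels like those in the scales-of-measurement examples, since the mode is the one location measure that works for qualitative data.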
84 IBS 2017
Variance
• The variance is a measure of variability that utilizes all the data.
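• Sample Variance: $s^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n-1}$
• Population Variance: $\sigma^2 = \frac{\sum_{i=1}^{N} (x_i - \mu)^2}{N}$
• Sample SD: $s = \sqrt{s^2} = \sqrt{\frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n-1}}$
• Population SD: $\sigma = \sqrt{\sigma^2} = \sqrt{\frac{\sum_{i=1}^{N} (x_i - \mu)^2}{N}}$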
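The only difference between the sample and population versions is the divisor (n − 1 versus N). A minimal Python sketch (for illustration; the course itself uses R) with made-up data shows both side by side. The function names and the data are assumptions.

```python
import math


def sample_variance(values):
    """Sum of squared deviations from the mean, divided by n - 1."""
    n = len(values)
    xbar = sum(values) / n
    return sum((x - xbar) ** 2 for x in values) / (n - 1)


def population_variance(values):
    """Sum of squared deviations from the mean, divided by N."""
    n = len(values)
    mu = sum(values) / n
    return sum((x - mu) ** 2 for x in values) / n


# Hypothetical data: mean 5, squared deviations sum to 32.
data = [2, 4, 4, 4, 5, 5, 7, 9]

print(sample_variance(data), math.sqrt(sample_variance(data)))          # s^2 and s
print(population_variance(data), math.sqrt(population_variance(data)))  # sigma^2 and sigma
```

Which divisor applies depends on whether the data are treated as a sample or as the whole population.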
Summarizing Grouped Data
• When data are grouped or aggregated, we use these formulas:
• $m_i$ = midpoint of the i-th class
• $f_i$ = frequency of the i-th class
• $n$ = total frequency = $\sum_i f_i$
Mean: $\bar{x} = \frac{\sum_i f_i m_i}{n}$
Variance: $s^2 = \frac{\sum_i f_i (m_i - \bar{x})^2}{n-1}$
Standard Deviation: $s = \sqrt{s^2}$
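The grouped-data formulas can be checked on a tiny example. The Python sketch below (for illustration; the course itself uses R) uses three hypothetical classes; the midpoints, frequencies, and function name `grouped_summary` are assumptions made for this example.

```python
import math


def grouped_summary(midpoints, frequencies):
    """Return (mean, sample variance, sample SD) from class midpoints and frequencies."""
    n = sum(frequencies)  # total frequency
    mean = sum(f * m for m, f in zip(midpoints, frequencies)) / n
    variance = sum(f * (m - mean) ** 2 for m, f in zip(midpoints, frequencies)) / (n - 1)
    return mean, variance, math.sqrt(variance)


# Hypothetical classes 0-10, 10-20, 20-30 with frequencies 2, 5, 3.
midpoints = [5, 15, 25]
frequencies = [2, 5, 3]

mean, variance, sd = grouped_summary(midpoints, frequencies)
print(mean, variance, sd)
```

Because every observation in a class is replaced by the class midpoint, these values approximate the ungrouped mean and variance.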
Five Number Summary and Box Plot
Five Number Summary
Box Plots
• A box plot allows you to:
• Graphically display the distribution of a data set.
• Compare two or more distributions.
• Identify outliers in a data set.
[Figure: box plot with the box marked at Q1, Q2, and Q3, whiskers on either side, and outliers shown as asterisks]
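The numbers behind a box plot can be sketched as follows, in Python for illustration (the course itself uses R). The five-number summary is the minimum, Q1, median, Q3, and maximum. The slides do not state an outlier rule; the sketch assumes the common convention of flagging points more than 1.5 × IQR beyond the quartiles, and it assumes linear interpolation for fractional ranks. All names and data are hypothetical.

```python
def ranked_value(sorted_values, rank):
    """Value at a 1-based rank; fractional ranks are linearly interpolated (an assumption)."""
    n = len(sorted_values)
    rank = min(max(rank, 1), n)
    lower = int(rank)
    frac = rank - lower
    if frac == 0:
        return sorted_values[lower - 1]
    return sorted_values[lower - 1] + frac * (sorted_values[lower] - sorted_values[lower - 1])


def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) using the (n+1)-based rank rule."""
    xs = sorted(values)
    n = len(xs)
    q1 = ranked_value(xs, (n + 1) / 4)
    q2 = ranked_value(xs, (n + 1) / 2)
    q3 = ranked_value(xs, 3 * (n + 1) / 4)
    return xs[0], q1, q2, q3, xs[-1]


def outliers(values, k=1.5):
    """Points beyond k * IQR from the quartiles (k = 1.5 is an assumed convention)."""
    _, q1, _, q3, _ = five_number_summary(values)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [x for x in sorted(values) if x < low or x > high]


# Hypothetical data with one unusually large value.
data = [10, 12, 13, 15, 16, 18, 40]
print(five_number_summary(data))
print(outliers(data))
```

Here the fences fall at 3 and 27, so only the value 40 is flagged.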
Outliers
5 Number Summary for Payment Time Data