Documente Academic
Documente Profesional
Documente Cultură
Submitted By:
Harshit Rastogi
Vivek Kumar Jha
VGSoM, IIT Kharagpur
Contents
Case Objectives
Slide# 3
Slide# 4-5
Slide# 6
Slide# 7
Slide# 8
Slide# 9
Slide# 10
Slide# 11-12
Summary
Slide# 13
Slide# 14
Case Objectives
To draw inferences from the data on the basis of
Descriptive analysis of the dataset.
To come up with a model showing dependencies on the
various factors that contribute to booking of cars from
CalTaxi.
To predict number of extra cars that should be bought to
reduce dependency on external parties.
3
Sunday
Saturday
Friday
Thursday
Tuesday
Wednesday
Monday
90
79
80
78
70
77
60
76
75
50
74
40
73
30
72
20
71
10
0
70
Holiday
Non-Holiday
69
Cloudy
Rainy
Sunny
Determination of Response
Variable
Variable cnt gives the value of total bookings (sum of
ac_cars & non_ac_cars) each day.
cnt shows the engagement of cars every day and hence,
is the Response variable.
As the Maximum rent time is 24 hours, a car can be booked
multiple times a day.
7
Determination of Predictor
Variables
Correlation table between cnt and other variables of the
dataset*
cnt
Climat
e
Weekd
ay
Holiday Temp
aTemp
Hum
0.1151
44
0.52736
9
0.56022
5
0.21507
8
0.2973
84
0.401956 0.00451
6
0.11536
8
0.36554
2
Holiday
Temp
Atemp
Hum
Windspeed
Avg_dist
8
A mean and media value of around 75 proves the validity of the model as this is
the approximate values that we got from Descriptive Analysis earlier.
10
2. If any car has been booked for more than 10 hours in a day, it wont be
booked again (considering its maintenance and performance).
Using
this
assumption,
we
calculated
average
number
of
visits
per
car
(visits_per_car)
11
12
CalTaxi
should
proceed
with
buying
8-10
cars
to
reduce
15