Sunteți pe pagina 1din 4

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/326416741

Prediction of Live Cricket Score and Winning

Article · July 2018

CITATIONS READS

0 1,603

3 authors, including:

Pramila M. Chawan
Veermata Jijabai Technological Institute
136 PUBLICATIONS   228 CITATIONS   

SEE PROFILE

Some of the authors of this publication are also working on these related projects:

Cloud based Automated Attendance Monitoring System using RFID and IOT View project

Decision Tree View project

All content following this page was uploaded by Pramila M. Chawan on 16 July 2018.

The user has requested enhancement of the downloaded file.


International Journal of Trend in Research and Development, Volume 5(4), ISSN: 2394-9333
www.ijtrd.com
Prediction of Live Cricket Score and Winning
1
Rameshwari A. Lokhande and 2Pramila M. Chawan,
1
Student, 2Professor,
1,2
Computer and IT Dept, Veermata Jeejabai Technological Institute, Mumbai, India

Abstract— Winning is the goal of any sport. Cricket is one the down the list of attributes to only important ones and then the
most watched sport now a days. Winning in Cricket depends on data mining algorithms which can be applied on those attributes.
various factors like home crowd advantage, performances in the The game prediction problem that I am studying does not take
past, experience in the match, performance at the specific venue, into consideration the player’s performance but it does take into
performance against the specific team and the current form of consideration the team’s past performance at a high level extent
the team and the player. During the past few years lot of work along with the other factors like toss winner, toss decision, home
and research papers have been published which measure the support, etc.
player performance and their winning predictions. In this work a
B. Factors
model has been proposed that has two methods, first predicts the
score of first innings not only on the basis of current run rate but Cricket winning can be predicted like all other games.
also considers number of wickets fallen, venue of the match and We need to find the best attributes or factors that influence the
batting team. The second method predicts the outcome of the match outcome. The result of a cricket match depends on more
match in the second innings considering the same attributes as of of in-game and pre-game attributes. Pre-game attributes like
the former method along with the target given to the batting Pitch, Team Strength, Weather, Venue etc. and in-game
team. These two methods have been implemented using Linear attributed like run rate, total run, strike rate, wickets in hand etc.
Regression Classifier or Q-Learning base decision tree approach influence a match result predominantly. Below are the attributes
and Naïve Bayes Classifier for first innings and second innings that decide outcome of the cricket match:
respectively. In both methods, 5 over intervals have been made
Pitch:
from 50 overs of the match and at each interval above mentioned
attributes have been recorded of all non-curtailed matches Unlike other sports, cricket stadium’s size and shape is
played between 2002 and 2014 of every team independently. not fixed except the dimensions of the inner circle and pitch
which are 30 yards and 22 yards respectively. Outfield
Index Terms—Regression; Naïve Bayes; Reinforcement
variations and pitch can have a substantiate effect on bowling
algorithm: Data Mining; Projected Score; Winning
and batting. The spin of the ball, seam movement and the bounce
Probability.
depends on the nature of the pitch. It depends on how wet is the
I. INTRODUCTION pitch. The more wet the pitch, the slower it will play. On the off
chance that it is drying out, those balls will change significantly,
A. Overview
yet all it will get less difficult those drier it gets
Cricket is being played in many countries all around the world.
Green pitches tend to get easier to bat on. Wickets can
There are a lot of domestic and international tournaments being
get significantly more dry or wet (on the off chance that it
held in many countries which play cricket. Cricket is a game
downpours). They might start to break up if they are soft (which
played between two teams comprising of 11 players in each
would help the bowlers).
team. The result is either a win, loss or a tie. However,
sometimes due to bad weather conditions the game is also Toss:
washed out as Cricket is a game which cannot be played in rain.
According to cricket analysts, there is sure measure of
Moreover, this game is also extremely unpredictable because at
advantage for a team if it wins the toss. This might not be
every stage of the game the momentum shifts to one of the teams
deciding factor in a match but it would give the team the
between the two. A lot of times the result gets decided on the last
opportunity of choosing “what they want”.
ball of the match where the game gets really close. Considering
all these unpredictable scenarios of this unpredictable game, Team strength:
there is a huge interest among the spectators to do some
The team strength should be balanced for winning a
prediction either at the start of the game or during the game.
match. Captainship in a team is also a deciding factor. Past
Many spectators also play betting games to win money. So,
Records: The past team performances can be considered to
keeping in mind all these possibilities, this report aims at
predict the outcome of a match. History of games at that venue
studying the problem of predicting the game results before the
how did the teams perform, performance at that specific venue,
game has started based on the statistics and data available from
Performance against the specific opposition and experience at
the data set. There are different ways to do the prediction. The
the specific venue.
prediction can be done taking into consideration the player’s
performance as well as the team performance. There are many Home Ground Advantage:
unpredictable things that happen in a cricket game like matches
This is another attribute which determines the winner in
being washed out due to rain, a key player getting injured before
the match. If you are playing in the home ground conditions
the game, players changing their teams, etc. Sometimes a key
everything would be in your hands like climatic factors, pitch
player also gets inured during the game and hence is not able to
nature and major role is played by the home crowd. Home Team
take further part in the game. All these factors do affect the
gets better motivation.
prediction to some extent. The report discusses a methodology
that I followed for the game result prediction. The methodology
consists of first the attribute selection algorithms which trim

IJTRD | July – Aug 2018


Available Online@www.ijtrd.com 91
International Journal of Trend in Research and Development, Volume 5(4), ISSN: 2394-9333
www.ijtrd.com
II. PROBLEM STATEMENT distinguishes reinforcement learning from supervised learning is
that only partial feedback is given to the learner about the
In this to design a system that can be provide the Score
learner's predictions. Further, the predictions may have long
and Winning Prediction in Cricket match, the system can
term effects through influencing the future state of the controlled
analyze multiple parameters like winning toss, batting side, DL
system. Thus, time plays a special role. The goal in
approach, Home ground advantages, player wise performance
reinforcement learning is to develop efficient learning
etc.While declaring a time for the particular championship it is
algorithms, as well as to understand the algorithms' merits and
very important to select the best team so that the chances of the
limitations. Reinforcement learning is of great interest because
team winning the championship become easy. This problem had
of the large number of practical applications that it can be used
to be solved to generate the best players from both the teams for
to address, ranging from problems in artificial intelligence to
the best battle. To solve this problem we have collected the
operations research or control engineering. In this book, we
historical data of all some team like (India , Pakistan, Australia ,
focus on those algorithms of reinforcement learning that build
New Zealand etc.), and using prediction algorithm like Naive
on the powerful theory of dynamic programming. We give a
Bayesian algorithm we are predicting the best starting players
fairly comprehensive catalog of learning problems, describe the
for both the teams that can be used in fantasy league for winning
core ideas together with a large number of state of the art
the maximum points.
algorithms, followed by the discussion of their theoretical
III. METHODOLOGY properties and limitations.
A. System Flow
I have followed the following methodology in the course of my
project. The methodology consists of 5 different phases as
shown in Figure 1 i.e. Data Set Generation, Data Cleaning,
Attribute Selection, Data Mining and Analysis of Results. Each
of these phases was part of my project milestones submitted and
I had created a Gantt chart to keep track of the timeline. I will be
discussing each phase in the following sections in detail.

Figure 3.3: The proposed reinforcement learning scenario


Algorithm 1 The function implementing the tabular TD(0)
algorithm. This function mustbe called after each transition.
Function TD0 (X;R; Y; P; V )
Input: X is the last state,
Y is the next state,
R is the immediate reward associated with this transition,
B. Dataset P is the immediate penalty associated with this transition,
V is the array storing the current value estimates
The data was collected from the http://cricsheet.org [1] website. Step 1: δ R +Y* V [Y ] - V [X]
The website has data about all 8 seasons (from 2008 to 2016) of Step 2: V [X]  V [X] + a*δ
domestic tournaments held in India i.e. the Indian Premiere Step 3: return V
League. It is a 20-20 format of tournament. It means that each
team bats or bowls for maximum 20 overs each and the result of Algorithm 2
the game is decided at the end of the total 40 overs played. The The function that implements the every-visit Carlo
data set downloaded from this website was of two formats; one algorithm to estimate value functions in episodic MDPs. This
was the ball by ball detail of each match in the .csv format and routine must be called at the end of each episode with the
the other data was the high level data about the match in the .json state-reward sequence collected during the episode. Note that
format. I used the Java classes File and the FileWriter to read the the algorithm as shown here has linear time- and
.csv files and write the contents to a new file. The .csv files had space-complexity in the length of the episodes.
data about 577 different matches and the Java code basically
copied the important data from all the 577 files and combined Function EveryVisitMC (X0;R1;X1;R2; : : : ;XT-1;RT ; V )
them into a single file. The data from the .csv file consisted of Input: Xt is the state at time t,
many irrelevant information like gender, date, umpire Rt+1 is the reward associated with the tth transition,
information, etc. which were all discarded and the java code T is the length of the episode,
copied only data about the team, season, venue, toss, V is the array storing the current value function estimate
toss_winner, player_of_match and the ball by ball data. The .csv Step 1: sum  0
file had data as shown in following figure. Step 2: for t T1 downto 0 do
Step 3: sum Rt+1 + Y* sum
C. Algorithms Step 4: target[Xt]  sum
1: Q-Learning Approach Step 5: V [Xt]  V [Xt] +a* (target [Xt] - V [Xt])
Step 6: end for
IT is a learning paradigm concerned with learning to 7: return V
control a system so as to maximize a numerical performance
measure that expresses a long-term objective. What

IJTRD | July – Aug 2018


Available Online@www.ijtrd.com 92
International Journal of Trend in Research and Development, Volume 5(4), ISSN: 2394-9333
www.ijtrd.com
Base on the proposed algorithms we will implement the [3] Predicting the Winner in One Day International Cricket
current research work. We first design the simulation with the Ananda Bandulasiri, Ph.D.
consistence input dataset. The intermediate process will learn [4] Ananda Bandulasiri, “Predicting the Winner in One Day
input parameters and then predict the output, it should be reward International Cricket” Journal of Mathematical Sciences &
as well penalty. Mathematics Education.
[5] Tejinder Singh, Vishal Singla and Parteek Bhatia, “Score
The system having the ability to predict the first batting
and Winning Prediction in Cricket through Data Mining” 8
score using historical trained data, we will use N to denote the
October 2015.
total number of matches in the training dataset. Recall, n denotes
[6] Amal Kaluarachchi and Aparna S Varde, ”CricAI: A
the segment up to which match state is known. The first feature
classification based tool to predict the outcome in ODI
is calculated by dividing the total runs scored by the given team
cricket”.
across the number of matches it played.
[7] Viraj Phanse, Sourabh Deorah,”Evaluation & extension to
the Duckworth Lewis method: A dual application of data
mining techniques”.
[8] K. Raj and P. Padma. Application of association rule
mining: A case study on team India. In International
2: Naïve Bayes Classification Conference on Computer Communication and Informatics
Input: User input file data record which contains {CScore, (ICCCI), pages 1–6, 2013.
Overs, WF}, segment of average score from train database of k [9] T. B. Swartz, P. S. Gill, and S. Muthukumarana. Modelling
to n over’s. and simulation for one-day cricket. Canadian Journal of
Output: Projected score Sc Statistics.
Step 1: Read R {Runs, Overs, WF} from current parameters. [10] M. Bailey and S. R. Clarke. “Predicting the match outcome
Step 2: Map with train features with each sample. in one-day international cricket matches, while the game is
Step 3 : calculate average score of train DB with same in progress”. Journal of sports Science and Medicine.
evidences [11] Sohail Akhtar and Philip Scarf,“An analysis of strategy in
the first three innings in test cricket: declaration and the
follow-on “.
[12] D. Roy Choudhury, Preeti Bhargava, Reena and Samta
Kain, “Use of Artificial Neural Networks for Predicting the
Outcome of Cricket Tournaments “.
Step 4 : PreScore = (CScore + ) [13] O.B. “Issue of the effect of umpiring errors in cricket
Statistician”,1997
Step 5: Return PreScore
CONCLUSION
The Fantasy Cricket has the problem of churning of
customers within the duration of the season. This problem is
therefore addressed by using interactive models of Predictions
where a user predicts the results of each game in order to be
rewarded which would further help him strengthen his Fantasy
squad. The project thus, aims not only to attract more users to
this game that is Fantasy Cricket, but also aims at improving the
general attraction to the Premier League. This happens because
in a predictive model, a user makes a prediction on every game,
and ends up watching that game to check if his prediction is
going right Thus our project will not only improve the existing
system of Fantasy Cricket, but will also augment the reach of
Cricket in India.
Future work
To implement a ensemble module with the help of
multiple learning algorithm which can be applicable for all type
of cricket matches like 20-20, ODI and test cricket also.
References
[1] Auto-play: A Data Mining Approach to ODI Cricket
Simulation and Prediction Vignesh Veppur
Sankaranarayanan, Junaed Sattar and Laks V. S.
Lakshmanan Department of Computer Science University
of British Columbia Vancouver, B.C. Canada V6T 1Z4.
[2] Analysis and Prediction of Cricket Statistics using Data
Mining Techniques Anurag Gangal VESIT, Mumbai
Abhishek Talnikar VESIT, Mumbai Aneesh Dalvi VESIT,
Mumbai Vidya Zope VESIT, Mumbai Aadesh Kulkarni
VESIT, Mumbai.

IJTRD | July – Aug 2018


Available Online@www.ijtrd.com 93
View publication stats

S-ar putea să vă placă și