Documente Academic
Documente Profesional
Documente Cultură
Rules of baseball
THE FIELD
BASEBALL PYTHAGOREAN THEOREM
Artisan of Framing
Catcher: Receiving skills
Pitcher: Ability to locate the pitch
Batter: Body language
Umpire: Personal standards
SIGNIFICANT VARIABLES
Pitch Location
Ball/Strike Count
Command
Pitcher Handedness
Batter Handedness
Home/Road
Pitch Type
VARIABLES ANALYSIS
PLATE SIDE
Plate Side (Pitcher's Plus/Minus
POV) Bat Side Strike
RHB 1.70%
Left Half
LHB -4.00%
RHB -2.80%
Right Half
LHB 2.00%
CALCULATE STRIKE ZONE PLUS/MINUS
Calculation:
Total points allotted to players involved = Plus-0.57
We treat all four artisan to be equally involved
Thus, Individual contribution: Plus-0.1425
After iteration 1:
Until recently, ERA was the most frequently used technique for evaluating the
performance of pitchers
Any of the base runners who score or would have scored if Pitchers team made no
fielding miscues causes the pitcher to be charged with an earned run
For e.g. If Joe gives up a triple with two outs in an innings and the next batter hits a
single and a run is scored, Joe is charged with an earned run
Now suppose instead of a single the next batter hits a ball to the shortstop, who
misplays the ball and is charged with an error. If the runner scores, this is an unearned
run
ERA= (earned runs allowed)*9/innings pitched
PROBLEMS WITH ERA
Errors are subjective: Official scorers are biased in favor of the home team
When a starting pitcher is pulled from the game and there is atleast one base
runner, the number of earned runs he gives up depends greatly on the
performance of the relief pitcher
A pitcher with good fielders behind him will clearly give up fewer earned runs
than a pitcher with a leaky defense
Starting pitchers are often evaluated on the basis of their win-loss record
which depends on the batting support the pitcher receives
EVALUATING RELIEF PITCHERS
Relief pitchers are often evaluated on the basis of how many saves they have given in a season. Most
saves credited to relief pitchers are given to a relief pitcher who faces a batter representing the tying
run. The following extract provides the official definition of a save.
The official scorer shall credit a pitcher with a save when such pitcher meets all four of the following
conditions:
1. He is the finishing pitcher in a game won by his team
2. He is not the winning pitcher
3. He is credited with at least a third of an inning pitched; and
4. He satisfies one of the following conditions:
a. He enters the game with a lead of no more than three runs and pitches at least one inning;
b. He enters the game, regardless of the count, with the potential tying run either on base, or AB or on deck; or
c. He pitches for at least three innings
USING PAST ERA TO PREDICT FUTURE ERA
For all the pitchers who pitched at least 100 innings during two consecutive
seasons
(next years ERA)= 2.8484 + .353*(last years ERA)
y = 0.353x + 2.8484
r = 0.34
R-square = 0.1158
VOROS MCCRACKEN STUNS THE BASEBALL
WORLD
McCracken observed that a pitchers effectiveness is primarily based on the following:
1. The fraction of BFP (Batters Faced by Pitchers) that results in balls in play (a ball in play is a plate
appearance that results in a ground out, error, single, double, triple, fly out or line out)
2. The fraction of balls in play that result in hits (i.e. BABIP or Batting Average on Balls in Play)
3. The outcome of BFP that do not result in balls in play. What fraction of BFP that do not yield a ball
in play result in strikeouts, walks HBP or HR?
. McCrackens brilliant insight was that a pitchers future performance with regard to the situations
outlined in (1) and (3) can be predicted fairly well from past performance because these results are
independent of the teams fielding ability (DIPS or Defense Independent Pitching Statistics), but it is
very difficult to predict (2)
. He concluded, The pitchers who are the best at preventing hits on balls in play one year are often
the worst at it the next.
DICE: A BETTER MODEL FOR PREDICTING
A PITCHERS FUTURE PERFORMANCE
DICE = 3.00 + (13HR + 3(BB + HBP) 2K)/IP
For e.g., in 1997 Roger Clemens had the following statistics:
68 BB
7 HBP
292 K
9 HR
264 IP
DICE = 2.08
Clemenss actual ERA in 1998 was 2.05
EVALUATING FIELDERS
Number of strike-outs
Left handed pitchers v/s Right handed batters (platoon effect). Right handed
batters are believed to hit more ground balls to shortstop than are left-
handed batters. Thus will have more hits near him resulting in more RF
THE FIELDING BIBLE: A GREAT LEAP
FORWARD
John Dewan determine the chance (based on all plays during a season) that a
ball hit at a particular speed to a zone would be successfully fielded
For e.g., they might find that 20% of the balls hit softly over second base are
successfully fielded by shortstops
A shortstop who successfully fields such a ball has prevented one hit
So this SS has prevented 1 - .2 = .8 hits more than average SS
If not then, SS has given 0 - .2 = -.2 hits more than average SS
CONVERTING FIELDERS SCORES TO RUNS
State Avera Number of Plate State Avera Number of Plate
ge Appearances for this ge Appearances for this
Runs Situation Runs Situation
0000 .54 46180 0100 .93 11644
1000 .29 32821 1100 .55 13483
2000 .11 26009 2100 .25 13588
0001 1.46 512 0101 1.86 1053
1001 .98 2069 1101 1.24 2283
2001 .38 3129 2101 .54 3117
0010 1.17 3590 0110 1.49 2786
1010 .71 6168 1110 .97 4978
2010 .34 7709 2110 .46 6545
0011 2.14 688 0111 2.27 805
1011 1.47 1770 1111 1.6 1926
2011 .63 1902 2111 .82 2380
CONVERTING FIELDERS SCORES TO RUNS
Suppose a SS fails to field a ball with 0 outs and bases empty. Before this ball was hit
the state was 0000 and an average team was expected to score 0.54 runs
If SS gives up a hit to the next batter the new state is 0100 and the average batting
team is expected to score 0.93 runs
If the SS turns a potential hit into an out, then the new state is 1000 and the average
batting team is expected to score only 0.29 runs
Thus, in this situation the SSs failure to prevent a hit cost his team 0.93-0.29=.64 runs
If we average the cost of allowing an out to become a hit over all possible states, we
find that a hit allowed costs a team around .8 runs
CONVERTING RUNS SAVED BY A FIELDER
INTO WINS
Pythagorean Theorem: runs scored/(runs scored + runs allowed)= estimate
percentage of games won
R = runs scored/runs allowed
R/(R+1)= estimate percentage of games won
An average team scores 775 runs and gives up 775 runs (2000-2006) during a season
If a fielder saves 10 runs, then our average team now outscores its opponent 775-765,
which yields a scoring ratio R=775/765 or 1.013
Using Pythagorean Theorem, 162*(1.013/(1.013+1))=82.05
Therefore, 10 runs translates into around 1 game won
WHY DO THE YANKEES UNDERPERFORM?
Team total -139
Position Player Fielding Bible rating
1B Giambi -8
2B Cano -27
SS Jeter -34
3B Rodrigues 2
LF Matsui 3
CF Williams -37
RF Sheffield -38
Putt- light golf stroke made on the putting green in an effort to place the ball into the
hole.
Par- predetermined number of strokes that a scratch (or 0 handicap) golfer should
require to complete a hole, a round (the sum of the pars of the played holes), or a
tournament (the sum of the pars of each round).
Birdie, Eagle and double eagle- scoring under par on any individual golf hole.
Bogey and Double Bogey- score of over par on an individual golf hole
Greens and fairways- golf course comprises a series of holes, each consisting of a
teeing ground, a fairway, the rough and other hazards, and a green with a flagstick
("pin") and hole ("cup").
PROFESSIONAL GOLF- PGA TOUR
Loss Aversion
Losses are valued more than the commensurate gains are valued
Value function curve kinked at reference point with steeper gradient for losses
than gains
Risk Shift
Risk seeking in losses and risk averse in gains
Utility function convex in loss domain and concave in gain domain
CONCEPTUAL FRAMEWORK
Objective- to describe the influence that loss aversion may have on putting
Equations:
1. Probability of making a putt
2. Utility function
Utility of a player is value that player places on making or missing a putt weighted by
their probabilities and subtracting the cost of the effort.
V is the value function
x is the score earned on making putt and x-1 on missing the putt
Cost(e) is cost of the effort which we again assuming to be strictly increasing and
concave
3. Value function
Predictions
1. Controlling for putt characteristics, z, putts attempted for par, bogey and double
bogey will be more accurate than putts attempted for birdie and eagle
2. Controlling for putt characteristics, z, the probability of making a birdie is greater
than the probability of making an eagle putt. In addition, controlling for putt
characteristics, z, the probability of making a par putt is greater than the
probability of making a bogey putt, and the probability of making a bogey putt is
greater than the probability of making a double bogey putt
3. Controlling the putt characteristics, z, players will be more risk averse when
putting for birdie and eagle than when putting for par, bogey or double bogey
DATA AND EMPIRICAL STRATEGY
Aim was to test the predictions made on the basis of equations with the analysis
of the collected data
To test why variation in shot values exists after controlling for distance and other
factors
250 workers employed at PGA tour since 2002 to gather data and info. each week
Data from 239 tournaments completed between 2004 and 2009 is taken for
analyses
Focus on putts and dataset restricted for putts attempted for eagle, birdie, par,
bogey or double bogey.
Data only for players who have at least 1000 putts was taken which leads to data
for 2,525,161 putts and 421 professional golfers
RESULTS: MAIN EFFECTS
Overconfidence or cockiness
Positive autocorrelation between shots
Cant account for the findings
Nervousness
Valuing birdie putt more than par putt
Cant be taken into account as professional golfers attempt as many
birdie putts as par putts
Cant explain many other findings
DIFFERENCES ACROSS ROUNDS
Difference in accuracy between birdie and par shots diminishes with progress
rounds
To remove bias, analysis of players playing all the 4 rounds are also taken
Discrepancy in accuracy between birdie and par shots is not automatic
Discrepancy can exist because of reference point theory
DIFFERENCES IN RISK AVERSION
Objectives of a putt: hit the ball into the hole or limit the difficulty of
follow on shot
Risk averse putts sacrifice the likelihood of hitting the ball into the
DIFFERENCES IN RISK AVERSION
Positive numbers represents birdie putt more likely to stop in the box than par and vice-versa
Panel B is for putts attempted from a distance longer than 270 yards
Figure shows that birdie putts are hit less hard than par putts for easy follow-on shots
Increased probability of making a follow-up shot doesnt compensate for loss of accuracy in hitting birdie putts
Birdie putts are missed more often because of both risk aversion and left-right mistakes
KOSZEGI-RABIN REFERENCE POINTS
H1: Putting performance improves when playing partner is playing for birdie
H2: Larger field decreases birdie putting performance
DATA ANALYSIS
RESULTS