THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Wednesday, August 22, 2007

Career DIPS numbers

The following file shows the career BABIP of all pitchers and their teammates since 1916:


BABIP, pitcher and mates, career since 1916, min 500 PA (Google Docs).

minYear: first season
maxYear: last season
careerBIP: total BIP, calculated as BFP minus (HR+BB+HBP+SO)
BABIP: (H-HR)/BIP
mateBABIP: his teammates’ BABIP, weighted by his BIP for each season
gap: difference between BABIP and mateBABIP
OneSD: sqrt(mateBABIP*(1-mateBABIP)/careerBIP)
SD: gap / OneSD
trueGap:  gap * careerBIP/(careerBIP+3700)

The standard deviation of the SD for all the 2828 pitchers is 1.35.  We expected 1.00 if it was purely random (i.e., pitchers have no influence on outs per BIP).  This is hardly the case.  However, anything with a modicum of influence can achieve an SD = 1.35 if the “n” (BIP in this case) is large enough.  The halfway point (r=.50) is with BIP = 3700 or so. 

This means that if you have a pitcher with 3700 BIP, you regress his sample BABIP 50% towards his teammates’ BABIP.  If you have 1850 BIP, you regress two-thirds toward your teammates’ BABIP.  With 7400 BIP, you regress one-third toward your teammates’ BABIP.  For your typical pitcher with 500 BIP in a season, the regression amount is 88% (i.e., r=.12, rSquared=.01).  It’s on this basis that you will often hear that pitchers have little influence.  What’s really true is that one-season’s worth of stats is hardly indicative of his skill.

The regression equation is: 3700/(3700+BIP), and this is how I get the “trueGap”.

(13) Comments • 2010/08/08 • SabermetricsDataPitchers
Page 1 of 1 pages

<< Back to main