THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Wednesday, December 30, 2009

Fangraphs park factors for pitchers

By Tangotiger, 11:02 AM

My answer must be: Yes, you are totally right, in response to this reader mail:


I had a thought about the FIP-based pitcher ratings on Fangraphs, and was interested to see if you agreed:

Fangraphs bases its pitching win values around FIP, rather than around runs allowed.  This is an entirely defensible decision in the effort to remove fielding effects from the value of the pitcher, even if it will miss any and all non-FIP value accrued by the pitcher.

Here’s the problem:

http://www.fangraphs.com/blogs/index.php/pitcher-win-values-explained-part-six/

If I’m reading that correctly, the park adjustments they use are based on actual runs allowed.  But part of what makes a park factor is how the park affects BABIP.  You can’t give every pitcher the same theoretical BABIP and then use a park adjustment that will vary based on how the park influences balls in play, or you’ll be adjusting for the same bias twice.

Example: Rockies home/road BABIP splits over the last four years.
Hitters: .323/.286, .316/.300, .333/.311, .331/.289.  Average inflation of 29 points.
Pitchers: .311/.297, .314/.312, .300/.287, .311/.304.  Average inflation of 9 points.

(I used both hitters and pitchers to avoid being biased in either direction by home-field advantage.)

So Coors seems to inflate BABIP by roughly 19 points, which is presumably a large factor in why it’s a strong hitter’s park.  When evaluating the Rockies pitching staff, Fangraphs uses a system that ignores hits on balls in play entirely, and then gives the pitchers a bonus for toiling in what is a strong hitter’s park largely because it inflates hits on balls in play.

Two questions here, I suppose.
1. Am I right about this?
2. How big is the effect, in terms of, say, a 200-inning starter?

Basically, the FIP adjustment, that “3.2” should be calibrated not by league, but by park, if you are going to then use standard run-based park factors.  Otherwise, you need to use a FIP-based run adjustment.

Cool insight…

#1    dkappelman      (see all posts) 2009/12/30 (Wed) @ 13:57

I can put this somewhere not at the bottom of my to-do list.

For the extreme parks, the difference may be non-trivial (i’m just guessing at +/- 5 runs?), but I would think for most parks things would remain pretty similar.


#2    Colin Wyers      (see all posts) 2009/12/30 (Wed) @ 14:02

I’d say that’s fair to say. It may be that the extreme parks aren’t the intuitive ones, though - in other words, a strong home run park may not be affected at all, if the BABIP factor is neutral.


#3          (see all posts) 2009/12/31 (Thu) @ 02:30

Tom,

Thanks for the response.

Here’s what I have for park effects on BABIP over the last four years, using the same approach as I did for Coors above:
BOS +21
COL +19
KC +15
ARI +13
CHC +9
TEX +8
SF +8
LAA +7
FLA +7
BAL +5
CLE +5
PIT +4
PHI +4
HOU +3
ATL 0
DET 0
LAD 0
STL -2
MIN -4
SEA -4
CIN -5
TB -6
MIL -9
CHW -11
TOR -11
OAK -18
SD -27

Teams with time in multiple parks over the last four years:
NYM: Citi -10, Shea -17
NYY: NYS -11, OYS -6
WAS: NatPark -9, RFK -8

There’s quite a bit of year-to-year variance in those figures, so multi-year data should probably be used wherever possible.

I’m not sure how the BABIP changes convert into runs; anyone have an idea on that?


#4    Tangotiger      (see all posts) 2009/12/31 (Thu) @ 11:46

Sure, the difference times 0.8 runs or so.

So, if BABIP adds 29 points, and there are say 4200 BIP, then that’s .029 x 4200 = 121 more balls in play.  That adds 97 runs to the ledger.

As long as the HR, BB, SO effect is proportional to this, then we don’t have an issue.  If you want to make your point, look for a park that has excess hits on BIP, but is FIP-friendly for pitchers (or vice versa).

From your list above, maybe the Reds home park is one candidate?


#5          (see all posts) 2009/12/31 (Thu) @ 12:21

Using Pac Bell park, which I listed above as having a BABIP increase of 8 points:

Based on the last 4 years, the park reduces strikeouts by 2% per PA, reduces walks by 3% per PA, and reduces HR by 16% per PA.  Taking the 2009 NL averages of each of those things, I get an expected FIP reduction of 5% in Giants home games.

US Cellular goes the other way: strikeouts inflated 1%, walks inflated 6%, HR inflated 26%; using the ‘09 AL rates, that’s a 10% increase in FIP, in a park that suppressed BABIP by 11 points over the last 4 seasons.


#6    Tangotiger      (see all posts) 2009/12/31 (Thu) @ 13:22

Great job.  Ok, when I plug those numbers in some standard pitching line (4.62 runs per 9 IP), I get the following for Pac Bell: runs allowed is 4.55, but FIP is 4.38.  For US Cellular, runs allowed is 4.82, but FIP is 5.06.

Remembering that half the games are affected, we’re talking about a 0.10-0.15 runs per 9 IP difference at the extreme, or about 2 or 3 runs for a full-time starter.

It’s close to the point that it should be on a to-do list, but on a low priority.

Best thing for someone to do is to come up with the appropriate methodology for David to implement.  I think it works easier for him if the rest of us to all the heavy lifting, and he plugs in with minimal effort.

If someone wants to take a stab at it, I can vet it, and then David can put it on his todo.

Just one question right now: David, what “run park factors” are you using? Patriot’s?


#7    dkappelman      (see all posts) 2009/12/31 (Thu) @ 16:11

Yep, I’m using exact same methodology as Patriot’s 5 year regressed.


#8          (see all posts) 2010/01/03 (Sun) @ 22:54

I can try to lay the groundwork this week.  The method I have in mind would be a separate set of park factors based on FIP components.  Once I’ve got the raw data collected, I’ll come back here and we can work through the technical issues in turning BB/K/HR factors into an overall FIP factor.

One question before I get started - I assume regressed 5-year data would be the eventual goal for this as well?  Given that, and the fact that WAR is currently calculated from 2002 on, I’d plan to focus on 2000-2009, with the possibility of adding more years if they’re needed in the future.  Sound good?


#9    Sky      (see all posts) 2011/01/23 (Sun) @ 12:22

I wanted BABIP park factors and this thread was the closest I could find.  So I took the runs values from #3 and converted into expected BABIP assuming .300 league average and 4200 BIP.  I’m sharing in case anyone else is interested:

BOS .306
COL .306
KC .304
ARI .304
CHC .303
TEX .302
SF .302
LAA .302
FLA .302
BAL .301
CLE .301
PIT .301
PHI .301
HOU .301
ATL .300
DET .300
LAD .300
STL .299
MIN .299
SEA .299
CIN .299
TB .298
MIL .297
CHW .297
TOR .297
OAK .295
SD .292


#10    Sky      (see all posts) 2011/01/23 (Sun) @ 12:25

Wow, I totally misunderstood the table in #3—it’s actually point differences of BABIP, not runs.  So ignore my previous post, #9.


Page 1 of 1 pages


Name (required)
E-Mail (optional; WILL be published)
Website (optional)

<< Back to main


Latest...

COMMENTS

Feb 09 19:10
Who’s evaluating the 2011 forecasts this year?

Feb 09 18:35
MGL: Today on Clubhouse Confidential

Feb 09 17:36
New PECOTA

Feb 09 16:38
The will of the people?

Feb 09 16:25
Correlation of pitcher metrics: FIP strikes again

Feb 09 11:56
Forecaster’s Challenge: 2012?

Feb 09 11:45
When is a life entity considered a person?

Feb 09 10:08
Change in fastball velocity by going from starter to reliever

Feb 08 22:41
Batman, the webslinger?

Feb 08 22:24
When to purposefully lose the lead