Thursday, February 01, 2007
Lake Wobegon
http://www.baseballprospectus.com/unfiltered/?p=171
Read that first. Here’s what Marcel says:
For hitters, total forecasted PA, H, HR, R:
281414, 68117, 8139, 35698
Based on 862 non-pitchers, with each having at least 200 PA. Obviously, the overall forecasts are too high.
Here are the pitchers IP, H, HR, R:
59035, 61588, 7355, 32294
That’s based on 915 pitchers, with each having at least 25 IP.
Let’s baseline them to the 2006 totals of 188,052 PA and 43,258 IP. That gives us these adjusted lines for hitters of H. HR, R:
45518, 5439, 23854
And for pitchers:
45129, 5389, 23663
The RS and RA would give you a pythag win total of 81.6 wins.
A slight on the optimistic side. But just barely.
***
What if I only select the players with the most PA, which totals 188,052? All batters forecasted with at least 269 PA (425 of them) put up this line of PA, H, HR, R:
188233, 46138, 5604, 24216
Repeating for pitchers (44 pitchers, at least 54 forecasted IP), IP, H, HR, R:
43259, 44953, 5348, 23381
This time, not as good. The pythag win is 83.7 wins.
***
My problem is that I always regress toward the “league mean”. However, the “league mean” is not the average player, since the better players have a higher PA than the worse players. We are, in effect, overweighting the good players. What we should do, in regression, is regress a player’s performance to the average performance for that PA class. This way, a rookie will get weighted not to the .340 OBP league mean, but say the .320 OBP mean for that PA class.
Fun stuff.
I’ve thought about this, and wondered if projection systems would benefit from the normalization of aggregate W/L records, or total PA/IP based on previous years. Was planning to track all the major systems’ ‘07 success anyway, and maybe I’ll do their normalized versions too. Also, even when PAs or IP are baselined, their individual distributions may still not reflect reality well (Marcel has this problem at the lower end at least, but problem isn’t the right word.) Would it be more accurate still to baseline, and then fit the dstribution to something more normal? The “area under the curve” would be the same, of course.