Monday, March 01, 2010
Chone v2.0 - adding objective scouting data
Niiiiiiice. This is exactly the kind of place where you need to look to improve forecasts. What John Mayne has done here is to show that the observed past performance means more if you are a hard-tosser than a soft-tosser.
This can be an issue of selection bias: if you are a soft-tosser with a bad performance, you don’t get much chances. If you are a hard-tosser with a bad performance, you keep getting chances. So, when you look at the results, all the bad performances you have left above a minimum IP threshhold will likely be filled with hard-tossers.
So, what I would prefer is to look only at pitchers with an ERA under 4.50 in year X. This at least controls for quality in some respect.
Otherwise, fantastic work.


Tango - If I understand your critcism correctly, I don’t think selection bias is a problem in Mayne’s study. First, he doesn’t seem to be using an IP threshhold, including examples with 0 innings and 12 innings in his tables. Second, the hard tossers tend to over perform their projections, not under perform, so if anything this methodology is understating the trend. Third, Mayne seems to be using the charts just to illustrate a possible tendency and not for hard core analysis to show the methodology used to actually make an improved projection. From his comments at the end, it seems he has used analysis to create better projections, but prefers to keep the actual methodology proprietary. Fourth, it appears from looking at his charts carefully and his response to MGL’s comment at THT that he has error bars that vary by IP to set the boundries of whether a performance is actually even with, or better or worse than a performance. Fifth, Sean’s follow up analysis in the comments at THT confirm that analysis along more conventional lines supports Mayne’s conclusions.
Overall a very nice and important study.