Thursday, May 28, 2009
Simple Zone Rating
Colin gives us his fielding system. One thing I’d like to see is to have this data (plus the OF and C ones) normalized so that the plays made adds up to the total number of outs made:
1B:
.85*A + .08*PO
2B:
.85*A
SS:
.85*A
3B:
.90*A+.06*PO
That is, perhaps some team should have .86 or .83, etc.


Colin-
Using 2005-2008 data I get very different infielder assist put out to plays made conversion rates than you.
1B = .91A + .09PO
2B = .83A + .01PO
SS = .87A + .02PO
3B = .98A + .05PO
Perhaps I have made an error, but you may want to check your math just to make sure of your numbers.
Groundball rates for hitters and pitchers were regressed to the mean before use. I used the weighted average method from Tom Tango, figuring R from the method involving random and observed variance.
I am not sure why you would do this, as regression is usually only used to project what a player will do in the future. When dealing with past events the actual rates should be used. And Odds Ratio is a better method than Log 5, which can have problems with rates that differ markedly from average.
I know you want to keep it simple, but I would prefer to see 3 of the hit allocation charts; 1 for singles, 1 for doubles, 1 for triples and inside park homers. No reason to give partial hit values for triples and home runs to catchers, pitchers, 2Bs, and SSs as these almost never occur. And the allocation of doubles and singles would differ a lot as well.
Since out of the park HRs are not usually separated from inside the park HRs, I assume you must have made a decision somewhere in the transition from dead ball era to modern era to have all HRs either one or the other. Where did you make the dividing line?