Friday, September 29, 2006
Why Does Runs Created Make Sense?
Boring and with math gyrations that I typically don’t like, here’s how I see it:
Here’s the RC equation that I like to use:
Runs = .87 * PA * OBP * SLG
The .87 * PA term is better than AB, because this addresses the weakness in the RC with respect to walks.
And, we all know that true run scoring is:
Runs = Baserunners * ScoreRate + HR
Presuming that 92% of the OBP rate is baserunners, we can rewrite this equation as:
Runs = (.92 * OBP * PA) * ScoreRate + HR
The scoreRate is about 40% of (SLG+OBP), more or less. Not an exact equation of course, but will do the job for our purposes. So:
Runs = (.92 * OBP * PA) * .40 * (SLG + OBP) + HR
Which gives us:
Runs = (.368* PA * OBP * (SLG + OBP)) + HR
Compare this to Runs Created:
Runs = .87 * PA * OBP * SLG
So, we see that the main portion, the PA * OBP * something on both sides. Runs Created says it should be SLG, while in reality the HR portion should be stripped somewhat from that term. The basic essence of RC makes sense, that you want OBP multiplied by SLG somewhat.
My illustration here makes most sense if the balance of OBP and SLG is what historically happens.
When Bill James was asked about OBP + Slug, he stated you should probably multiply them instead. I recently reread that, and looked to see how accurate it was, and whether anyone was using it.
I found it to be about as accurate as the other methods out there. I also looked at game logs from
Retrosheet, and saw that on a game by game basis did as well as the other estimators.
I was surprised to see very very little use of this formula.
OTS, or PA * OBP * SlugPct * .87 (this constant has declined over the years) is a wonderful stat.
It is (1) just about as accurate as most any other
run estimating formula
(2) Converts to a rate stat (OBP * Spct * .87) very easily, making it easy to do analysis, and is better to use than OPS, which so many people use
and (3) can be shown to an average fan and he can understand it.
To me, it seems like people have really tried to fine tune things, to get maybe .5% better on estimating runs - that is like 4-5 runs a year per team - remember this is an estimation, not an equation.
I’ve been thinking about writing about the accuracy and simplicity of this.