Monday, February 08, 2010
SIERA
Matt and Eric introduce their metric. A correction:
Nate Silver invented QERA back in 2006 for Baseball Prospectus to adjust for a few issues with FIP and xFIP, and while he referred to the stat as a toy, it represented a big step upward in the methodology of estimators.... QERA has another problem of its own, in that GB% is really GB/Ball in Play (or, GB/BIP), while BB% and K% are measured per batters faced (SO/PA and BB/PA) ...Further, while QERA picks up some of the interaction between walk, strikeout, and ground-ball rates, it does not necessarily weight them correctly
Good for Matt and Eric for noting the two problems with QERA. I dispute the big/upward claim however. Also, while FIP has a glossary page, “tRA” is unlinked. A BPro reader will have no idea what it is.
Anyway, here it is:
SIERA = 6.262 – 18.055*(SO/PA) + 11.292*(BB/PA) – 1.721*((GB-FB-PU)/PA) +10.169*((SO/PA)^2) – 7.069*(((GB-FB-PU)/PA)^2) + 9.561*(SO/PA)*((GB-FB-PU)/PA) – 4.027*(BB/PA)*((GB-FB-PU)/PA)
I’ll need a few hours to test this to see why it works, when it breaks down, and how much of a gain we’re getting over FIP, xFIP and tRA.
Ok, I have a problem with the GB term. I just did a little test where I started with my baseline, and then I turned 1 GB into 1 FB (based on 1000 PA). My runs total went up by +.10 runs, pretty much what I expected.
Good so far.
If I turned 10 GB into 10 FB, I ended up with +.90 runs. Good so far.
I turned 20 GB into 20 FB and I got +1.45 runs. Not enough, but no panic yet.
I turned 30 GB into GB and I got +1.73 runs, which is not keeping pace.
I turned 40 GB into FB and I got +1.70 runs… a downturn, which is really wrong. Because those FB are becoming HR and it should always go up.
Indeed, I can get back to my baseline by turning 68 GB into 68 FB.
And if let’s say I turn 200 GB into 200 FB, I end up getting 40 less runs.
Why does this happen? It’s because of that exponent.
Whether this terms if negative or positive, squaring it will turn it the same sign:
(GB-FB-PU)/PA)
And so, what the equation says is that the more extreme you get, the more you are going to go in a negative direction.
Indeed, the idea to multiply something by using the differentials is not a good idea. The term I just quoted is a relative term, and when you use a relative term, you better be darn careful when you start multiplying with it, as it happens here.
Sorry guys, but you’re going to have problems here in some instances.