Friday, June 08, 2007
More about WPA
There’s an on-going Q&A thread over on a Cardinals forum, which I will cut/paste my comments here, but they make more sense in context there:
The first thing to remember is why does WPA exist. It exists to give you a snapshot of the game. That is, at this point in time,
- what are the chances of winning
- after this at bat, what are the chances of winning
You capture the difference, and associate it to the batter/runner/pitcher.
Now, the next temptation is to aggregate this over a number of games, or season. But, that’s not really why it exists. That is a byproduct.
Even so, let’s say we do want to aggregate it, does it tell us anything. If you look at this thread:
http://www.fangraphs.com/blogs/index.php/is-wpa-predictive-for-batters /
You will see that WPA is NOT as predictive as OBP, SLG, or OPS. But, it’s still fairly strong.
However, if you look at this post:
http://www.fangraphs.com/blogs/index.php/is-wpa-predictive-for-batters /#comment-10954
You will see that WPA divided by LI is as predictive as all the major ones.
***
In the long run, say several years, WPA will converge to the major numbers (OPS, RC, LWTS). So, it definitely means something.
WPA applies as well to pitchers as hitters.
As for “overrating” relievers, it does no such thing, any more than an average reliever having a better ERA than an average starter overrates relievers.
It simply is what it is. What you need to do is interpret it on that basis, that relievers need a different baseline, not only for WPA, but for ERA, K per IP, etc.
***
Let me ask you the question:
1. If Albert Pujols gets a few walk offs, where the Cards were down to their last out, and he pulled one out of his back pocket, turning a sure loss into a real win, is that impressive?
2. If Mariano Rivera comes in with the bases loaded, 0 outs, up by 1 in the 9th inning, and strikes out the side, is that impressive?
What WPA does is measure that. It takes the pulse of the game before the batter comes to bat, and then after, and says “here you go, this is what happened! This is why Scutaro gets a huge WPA gain and Mariano must get the exact opposite WPA loss”. If you look at it from both perspectives (hitter, pitcher) at the same time, you are forced to conclude that not all runs are created equal, without the benefit of hindsight. That the evaluation of the game, of how you feel, exists in real-time, and not in a “if I had known”. If you knew Mariano was going to give up that HR to Scutaro, why would you even bother going to the game?
In fact, if Vlad hits a grand slam against the Redsox in the playoffs to send the game into extra innings, only for the Angels to have subsequently lost the game, he may have as well struck out, since it’s the exact same results.
***
The base comparison is the sum of each WPA/LI (that is WPA divided by LI on a PA-by-PA basis). So, the question is: what does the sum(WPA) give you over the sum(WPA/LI)?
Let’s look at the best hitter in the league:
http://www.fangraphs.com/statss.aspx?playerid=1177&position=1B
In 2006, his WPA/LI is +6.1 wins. In 2005, his WPA/LI is also +6.1 wins. This measure is purely a reflection of his hitting skill with respect to the game situation, but without the extra leverage. So, a HR in the 9th inning or 1st will still be worth around +.12 wins. (It changes a little, but not enough to concern us at the moment.)
Similarly, you can use OPSwins as:
wins = PA * .025 * (1.7*OBP+SLG-1)
to get to the same results.
In 2006, Pujols would be:
634 * .025 * (1.7*.431+.671-1) = +6.4 wins
And in 2005:
700 * .025 * (1.7*.430+.609-1) = +6.0 wins
So, that’s what he is, a +6.0 to +6.4 wins hitter.
In 2006 however, he put on one of the best clutch hitting performances you will ever see. His WPA in 2006 was +9.6 wins, giving him a clutch impact (though not clutch skill) of +3.5 wins. It is a fantastically high number.
If you sort by LI:
http://www.fangraphs.com/statsp.aspx?playerid=1177&position=1B&season= 2006
You will see that in high-leverage situations, he was unbelievable. In the 10 most important at bats last year (other than the IBB), he made an out only TWO times! That is clutch.
However, in 2005, in the 10 most important at bats, he made an out EIGHT times. That’s not so clutch.
So, that’s what WPA captures. Pujols was lucky in 2006… not that he was lucky to hit HR, but lucky that he TIMED the HR when he did. What you have here is a great hitter, who was great in 2005 and great in 2006, and yet, somehow, was better able to time that greatness in 2006 when the Cards needed him the most.
While clutch hitting does exist, the amount of PA you need to even determine that someone is a clutch hitter is huge, on the order of several thousand (a career’s worth). At that point, who cares, other than for retrospective thought.
You certainly wouldn’t base your decision on whether to bring in a certain player based on his “clutch stats”. You *could* bring someone in if you believe in something beyond the numbers. But, the numbers aren’t going to help you out.
***
Yes, it makes Vlad a star, in real-time.
But, in hindsight, he may have as well struck out. That’s why, you need to decide:
1. do I care about what happens in real-time
2. do I care about what happened, in hindsight
3. do I care about what the past tells me about the future
Choose one, not all.
WPA *can* be used for all three of those things, but it is *not* the best one for each of those three things. WPA is perfect for real-time.
God, I hate WPA. On top of everything, now it confuses people.
It tells you the impact of the result of a PA on the WE of one team or the other. Nothing more and nothing less. That corresponds, more or less, to the “excitement” of the PA. By definition, a PA with a low WPA was, “ho-hum,” no big deal, like a 2 out walk with the bases empty in a 5 run game in the 9th inning. OTOH, a PA with a high WPA was mostly likely a “wow” event, or that was a great AB (for the pticher or batter) like a 2 out HR in a 1 run game (batting team losing), or even a no out walk in a one run game in the 9th.
If that floats your boat, that’s fine with me. What can it be used for? I suppose for pats on the back or for some kind of MVP type award or some fairly interesting discussion over a beer or two (whose performance really impacted their team or had the potential to impact their team?).
The problem (and I thinkit is a big one, but that is a qualitative argument, not a quantitative one) with WPA as a “wow,” MVP, or “over a beer” metric is that it does not distinguish between whether the team won or lost or even whether a run scored or not.
If a player got a lot of those no out walks with his team down a run in the 9th, but a run never scored and his team never won, NO ONE is going to remember those walks or that player, and you would be laughed out of the bar if you tried to tell everyone how great and clutch this player was. Some really sophisticated and deep thinking bar patrons might say that this player did indeed do a great job when his team needed him the most, but that his teammates let him down.
But let’s face it, in the real world people only get props for helping to cause success (wins, and important ones at that) and NOT POTENTIAL SUCCESS (WE). Only nerdy stat-heads care about that.
In any case, of what other use is WPA? None that I can think of. While it will converge with context-neutral linear weights or ERC or ERA (or any metric without a bias) in the long run, it is NOT at all useful in making real-life decisions or predicting any future events, beyond what a context-neutral metric provides.
In fact, it is ALWAYS worse, in less than an infinite number of historical PA!
I take that back. WPA probably captures what little clutch skill we think exists among MLB players, so that it will NOT quite converge with a context-neutral measure of performance. In an infinite number of PA if any or every player had some unique clutch skill, no matter how small or large, it will be captured in the difference between their linear weights and what their linear weights would suggest in WPA (given their teammates, opponents, etc.).
For example, if player A and B both had the same lwts projection, then you go with the one who has the higher WPA just in case he has a slightly higher clutch skill, which we presume he does (no matter how small).
But if player A has a +5 lwts projection and player B has a +10, you go with player B pretty much no matter what their difference in WPA is for the current year or for their careers.
I suppose that there is a point at which if their context-neutral projections are close enough and their WPA difference is large enough and their PA history is large enough that you might go with the player who has the smaller projection but the larger WPA, but those situations are very rare.
Almost everything we do as sabermetricians helps us understand the game better from the standpoint of which players and teams have the better true talent and what strategies help teams to win.
This ain’t one of them. It is kind of like an interesting (to some people) yet useless sidebar in sabermetrics.