Tuesday, September 18, 2007
Unleveraging Win Probability
More about WPA/LI (linear weights by game state, with the leverage aspect deflated). Not for the faint of heart:
I’m having a conversation at Baseball Fever, and below is a copy of post 26:
If we stick with the simpler “walk or hit wins, out sends to extra innings” situation:
- assuming a league average OBP of .333 (my definition, including all events), the expected winning probability is easy enough to calculate (1/3 wins the game, and 2/3 sends to extra innings with a 50% chance to win) as .667.
- the LI is equally easy to calculate (.333 gain in wins happens 1/3 of the time, and .167 wins downward happens 2/3 of the time, for an average absolute movement in some direction of .2222, compared to the random of .0346, or an LI of .2222/.0346 = 6.4)
***
Now, say you have an Ichiro-type, where his OBP, overall, is .400. In this case, when Ichiro is at the plate, the Mariners will have a 70% chance of winning, if he does his Ichiro thing. (40% wins the game, and 60% sends it to extra innings where they have 50% chance of winning.) However, suppose in clutch situations, Ichiro has an OBP of .460. That means, the Mariners will win .730 with Ichiro as a clutch hitter, as opposed with .700 with Ichiro as his usual self. So, he adds +.030 wins with his clutch play, per PA (in this situation), over and above his own great self. This is the method you are describing.
Let’s look at the other way. His WPA/LI would be (.730-.667)/6.4= .012. His WPA is .730-.667= .063.
That leaves his WPA minus WPA/LI as +.051 wins.
Is one way better than the other?
***
Let’s consider another example. Let’s assume that Ichiro hits .400 regardless of situation, in a league that hits .333. In this case, your method would say that Ichiro has no clutch skill at all. The other method would do the following:
WPA/LI = (.700-.667)/6.4= .005. His WPA is .700-.667=.033. This leaves him with WPA minus WPA/LI of +.028 clutch wins.
But also note that in low leverage situations, he’ll end up with a minus. For example, if the LI was 0.2, and the then his WPA/LI for some blowout situation would be something like (.110-.109)/.2= .005, and his WPA would be a tiny change (.110-.109=.001), giving him a WPA minus WPA/LI of -.004 wins.
In this simplified example, if you had 87% of the games with an LI of .2, and 13% with an LI of 6.4 (overall LI of 1.0), then Ichiro’s clutch would be:
+.028 * 13%
-.004 * 87%
= .000 (rounding error)
As you can see, this method, and your method, both agree that Ichiro, overall, gets a clutch rating of 0.
And, WPA minus WPA/LI is superfast to calculate, and doesn’t require prior knowledge of the hitter’s overall stats and how that translates for each game state! Cool, right?
***
As for what does WPA/LI represent, it’s his win impact, with the game situation unleveraged. For example, a hitter with an OBP of .400, in a league of .333 is worth about +.055 runs per PA, or roughly +.005 wins per PA, in a random situation, compared to the average hitter. In the game-ending situation in question, his WPA was +.033 and the LI was 6.4, making his WPA/LI as +.005 wins. So, what WPA/LI does is two fold:
1. It rebalances the walks, hits, HR relative to the out, based on how each of those events will impact the win probability
2. It deflates the leverage aspect
This metric (WPA/LI) is especially ideal for players like Ichiro, who can rebalance their game to take advantage of the new balance required for a game state (sometimes avoiding a K is necessary, sometimes getting a walk isn’t all that important, etc).
I’m bringing this thread forward, with a focus on the last section (WPA/LI).
In a recent chat, David Gassko said this:
http://sonsofsamhorn.net/index.php?s=&showtopic=25771
Now, David is someone who is immersed in things like this. And, if he doesn’t get it, I must figure most people don’t.
WPA/LI has nothing--nothing at all--to do with WPA. WPA is concerned completely about the chances of winning, and assigning the difference to the player, even though he had nothing to do with being there to begin with. This is like you cash in your chips, and you end up being given 10x more than you deserve because there’s a new employee dishing out the change. Sure, you get to keep the money. But, you didn’t earn it. The only thing you deserve is your money, irrespective of things outside your control, but within the context in which you earned it.
If you are in a situation where a HR and a walk are equally valuable, whether that is in a blowout game, or a bottom of the 9th, tie game, it would be insane for me to give you .13 wins for the HR and .03 wins for a walk (which is what a pure LWTS approach would say).
Do you give a QB 12 yards for a completion, regardless if he makes a 5yd screen pass, or a 30yd long pass? No, of course not! The QB responds to the context presented to him, and you evaluate him under that context.
It’s the same thing here. If a HR and walk are equally valuable in a given situation, you give them both +.05 wins. If a player knows that a walk and HR are equally valuable, he’d be crazy not to change his batting approach. It’s guaranteed that the pitcher is changing his pitching approach, even more so as he’s proceeding through the count.
The only issue is if the batter can really change his approach appreciably enough. For example, given situations where a HR and walk are very very close (say a walk is worth at least 80% as valuable as a HR), and you have two groups of hitters (Raineses and Dawsons), do their walk and HR rates change drastically enough? Do the Dawsons continue to hit HR at the same rate, regardless of the impact of those HR? Do the Raineses recalibrate their game? And what about when a HR is worth at least, say, 8 times more than a walk. What kind of differences do we have?
Certainly, after several years of data, I’ll go with the game-state Linear Weights (i.e., WPA/LI). Certainly, after a couple of months of data, I’ll stick with context-neutral Linear Weights. But, there’s a point when one becomes better than the other.
WPA/LI should not be confused, at all, with WPA. The entire problem with WPA, the leveraged aspect of it, is undone with WPA/LI.
And when evaluating a forecast of “future wins”, you are comparing it to WPA/LI. That’s what a player contributes.
***
If someone is still confused about game-state Linear Weights, please, elaborate.