THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews
If you are a media member and would like a review copy of The Book, please contact Kevin Cuddihy of Potomac Books.

Buy The Book from Amazon

MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Thursday, July 10, 2008

WPA is… WPA is not…

By Tangotiger, 04:23 PM

What WPA represents is the quantification of your feelings as the game unfolds.  Imagine if Youk hit into a triple play his first 2 AB, with the score tied 0-0 the whole time, then the Sox lead 15-0 (and he gets two outs), then he hits two HR.  How is it that you felt with Youk, if you tracked it in real time?

Well, his first two AB, you are cursing his name like there’s no tomorrow, then when the team batted around (twice), your blood pressure starts to go down, and then, with the score at 15-0, you’re probably not even watching the game any more.

That is what WPA captures.... the quantification of your feelings as the game unfolds, assigned to the players involved.

WPA is not a way to evaluate the talent of a player.  WPA is exactly the same as counting a PA 11 times when the bases loaded down by 1 with 2 outs in the bottom of the 9th and counting a PA as almost zero in a blowout game.  It is basically ridiculous to think that one PA can inform you on the talent level of a player 1000 times more in one situation than another.

WPA/LI however might be a way to evaluate the talent of a player, since now each PA is exactly worth 1 PA.  The only thing we are doing is realizing that baseball might not be a random game, and that a player might tailor his approach based on the base/out inning/score.  We don’t know how much he does tailor his approach.  That needs to be studied.


#1    Melvin Nieves      (see all posts) 2008/07/10 (Thu) @ 17:20

Would it be fair to say that WPA is a quantification of what people mean when they say Youk is or isn’t “clutch”?

Though now that I think about it, perhaps what people mean when they say “clutch” can’t be decided by anyone but the person doing the speaking.


#2    MGL      (see all posts) 2008/07/10 (Thu) @ 21:10

I would say that that is a fair statement, as WPA should track what people consider clutch.


#3    tangotiger      (see all posts) 2008/07/10 (Thu) @ 22:17

Melvin: right, Clutch is like pornography… you can’t define it, but you know it when you see it.

***

One way to define Clutch is the way MGL defined a hot player: purely by the performance.  No one talks about Juan Pierre being a “clutch” player if he bumps his overall play from 20% below league average to league average in clutch situations.  Pujols is clutch because he is good.

Another way to define it is by comparing a player to himself in the same time period.  This way, you won’t have Pujols being called a clutch player if he has a .400 OBP with a .600 SLG in the clutch, if he also does that performance in a blowout.  He’s not clutch per se, just really really good, regardless of situation.

Another way to define it is by comparing a player’s performance level in high leverage situations to his true talent level.  This way, you don’t have to be restricted to looking at his non-clutch performance in the same time period.  A guy can be a true .350 OBP hitter, who hits .400 in the clutch.  That he happens to also hit .380 or .420 or .280 in the same time period in non-clutch doesn’t take away from what he did, compared to your prior expectation as to how he should have hit. Now though you get the chance that he not only turned in a clutch performance, but he turned in an overall better performance.

Your prior expectation however is constantly being revisited (after every PA actually).  So, he may have been a .350 at the start of the season, but by now, he may be treated at a .370 hitter, based on his performance-to-date.

This last one is the one I prefer. MGL I think prefers the second one.

MGL and I have long disagreements on this issue, and you can find it by searching for the word Clutch in this blog.


#4    Jeremy      (see all posts) 2008/07/10 (Thu) @ 22:48

Fangraphs has a statistic called clutch, which measures a player against his own performance through the formula WPA / pLI - WPA/LI.


#5    tangotiger      (see all posts) 2008/07/11 (Fri) @ 07:14

Right, that was actually introduced here and is definition #2.

You take the sum of his WPA and divided by the average of his LI for term #1.

You take each individual WPA and divided by the LI for that play… then add all that up for every PA.  That’s term #2.

The difference is his clutch score.  Ortiz was out of this world for those 2 years.  Pujols was unbelievable two years ago.  ARod s-cks -ggs.


#6    cam      (see all posts) 2008/07/11 (Fri) @ 12:10

In close, late game situations, WPA is not just measuring feelings, but providing a quantifiable value on how important those situations are. If managers had not felt those situations were different, the modern day reliever would not have been developed. With WPA, an objective number can be given to all those situations, allowing a manager to understand how important they are, instead of just relying on his feelings.


#7    Tangotiger      (see all posts) 2008/07/11 (Fri) @ 12:24

Cam, actually what you are referring to is LI (leverage index).

LI is the change in Win Expectancy (WE), expected at a given time, compared to some random situation.

WPA however has decided to take the change in WE and ASSIGN it to the player(*).  It is in that assignment that those who decry WPA speak up.  You can make a strong case that the assignment of the WPA should go mostly to some “timing/luck” bucket, and just a portion to the player, since the player doesn’t control where he is.  He just happens to be there when the lottery ticket was handed out. 

It’s a question of whether a monkey winning a million$ lottery should mean anything.  It really doesn’t, but then again, he’s the one who is actually living a different life, partying it up, while we are here writing about it.

Anyway, your basic point stands, but I just wanted to get the nuanced point out there.

(*)Hidden behind all the WPA, it can be approximated, more or less, as sum(WPA) = sum(LI * staticLWTS), for each play.  In the case of relievers, his talent level will allow him to have a higher LI than an average pitcher.  But still, do we necessarily want to count one of his PAs 5x, and another 0.7x, when talking about his talent level?  No, of course not.  But, he is out there collecting the lottery ticket.


#8    cam      (see all posts) 2008/07/11 (Fri) @ 14:28

To clarify a little, you could evaluate the situation/importance/value using either LI or WE(pitcher enters game) - WE(pitcher leaves game). If a pitcher throws a shutout in the 9th, he will accumulate a lot more WPA coming in with a 1 run lead, than a 6 run lead, obviously.

Before WE and similar concepts, there was no way to numerically measure the effectiveness of relievers, other than with saves then holds. Which then led to ideas such as tough saves and easy saves and later to your concept of LI. Now the value of relievers to their teams can be looked at by comparings their WPAs.

But my point was more about how the different game situations are not just subjective feeling states, but objective value states, which WE seems to do a good job of quantifying.


#9    studes      (see all posts) 2008/07/11 (Fri) @ 18:09

What WPA represents is the quantification of your feelings as the game unfolds.

I believe I originally penned that one.  Sam Hutcheson also had a nice way of putting it over at BBTF.  WPA tells the story of how a game unfolds, numerically.  When you pull out a player’s individual WPA, you’re quantifying his role in the story of the game.

I like to call WPA the “story stat” cause it tells the story of the game and each player’s role in the game.


#10    tangotiger      (see all posts) 2008/07/11 (Fri) @ 22:19

Agreed that studes originally coined the thought.


#11    Tangotiger      (see all posts) 2008/07/24 (Thu) @ 10:50

I made a post at #18 here that simply summarizes this thread a bit:
http://www.baseballthinkfactory.org/files/newsstand/discussion/fan_graphs_cameron_rangers_mvp/


#12    Tangotiger      (see all posts) 2008/07/24 (Thu) @ 16:21

http://www.baseballthinkfactory.org/files/newsstand/discussion/tht_studeman_do_veteran_players_know_how_to_win/

More words here, which I will reprint:

Mike/13: thanks for the kind words. 

For those who haven’t checked, Fangraphs shows LI numbers for each play, game, and seasonal totals for all players going back a few decades (so far).  It’s really cool to see how the leverage numbers of pitchers change.  As an example, and focus on the column “pLI” near the bottom of the page: Keith Foulke

That 2002 season, where he was pulled from the bullpen (and you see it from his LI plumetting so much that season), and his clutch score being so negative.  But, his WPA/LI, which gives a better view as to how he was actually pitching, really shows him to be the same year after year.

That’s what WPA and LI deliver for you, something that the traditional lines don’t: the LI shows that he wasn’t used in as high leverage.  His WPA shows that he had much less impact than in his surrounding years, and yet his WPA/LI shows that he was really the same pitcher, and his clutch score shows that he pitched bad in some crucial situations, which probably led to his demotion.

***

Chris/16: “I think LI and WPA are flawed ways “

LI has nothing to do with your concern.  Your WPA concern is legitimate, but your LI one is not. 

***

Bob/21: agreed.

***

studes/25: agreed, it is harder to explain WPA/LI compared to WPA.

***

The issue of baserunning and fielding in WPA is one of implementation, not the framework.  The more granular you have the data, and the more programming you want to do, the more you are going to represent it to what it should.  Obviously, the hitter should not get penalized for the runner being thrown out.  That’s not a WPA problem, but either a data collection or programming problem.

***

As for the HR in the 1st or 9th: the assumption of win expectancy, WE, (as in life) is that everything in the future is unknown, and therefore, your expectancy is that average things will happen.  So, hitting a HR in the 3rd increases your win expectancy by, I dunno, +.130 (say from .500 to .630), because you have a certain expectancy of what will happen in the following 6 innings.  So, the WE represents how much you are willing to bet on the game’s outcome at that point in time, and the change in WE (i.e., WPA) simply pays off on that bet.

In the 9th inning, you are “all in”.  You may have say won most of the poker hands before that, but if someone goes all in, and he wins, you lose.

That’s all WPA is… an accounting of money.  In real-life terms, as studes put it, it simply tracks your emotions, and quantifies it.

So, I can see why WPA gets a bad rap… either because it is misunderstood or misapplied.  OBP doesn’t get a bad rap, even though it weights a walk and HR as exactly 1.  Its limitations are self-evident, and therefore, no explanation needed.  With WPA, we have to work harder at explaining what it is and what it isn’t.

Belittling WPA, as a stat, makes no sense, because it represents what it purports to represent.  Nothing more.


#13    Aaron      (see all posts) 2008/07/24 (Thu) @ 17:48

I’m not sure if I’ve got a handle on what WPA/LI is. Is the only difference between it and WPA that WPA factors in the importance of the inning while WPA/LI treats all innings the same?


#14    tangotiger      (see all posts) 2008/07/24 (Thu) @ 18:16

In most cases, the average safe play is worth twice that of the average out play.  So, if the average out play is worth -.027 wins, the average safe play is worth +.054 wins.  If some other situation has the average out worth -.0405 wins, the average safe play is worth +.0810 wins.

With me so far?

What WPA/LI does is simply index it so that the “range” remains constant.  That is, the -.0405 and +.0810 are prorated down, so that the -.027 to +.054 range remains intact.

In this case, we divide the higher leverage plays by 50%.  If you got a +.0810 play when the LI is 1.5, you divide that +.0810 by 1.5 to get you +.054.

In effect, we are saying that you “earned” +.054 wins, and the other +.027 wins was “timing”, of which you had no control.

In baseball, the range is even wider, because the HR is worth alot more than a walk.  So while the average safe play is worth +.054 wins, the walk is +.03 and the HR is +.13.

Again, whatever it is you did, you divide by LI, so that you get the “deleveraged” numbers: each PA counts the same.

However, let me try to give an extreme example: bases loaded, tie game, bottom 9th two outs, aka walk is as good as a HR.

So, in this particular case, the relative value of each safe play is worth the same.  So, whether it was a walk or a HR, it gives you the same “earnings”.  This play could say give you +.54 wins, and you’d get that whether it was a walk or HR.  But, if this is a LI=10 play, then you divide everything by 10, and this play now gives you +.054 wins.  The rest is treated as “timing”.

So, WPA/LI tries to center things, such that the range is kept “normal”, while also using baseball knowledge, such that in some cases a walk is a HR, and they each earn you the same wins.


#15    studes      (see all posts) 2008/07/24 (Thu) @ 18:29

Tango, I responded to your second post at that thread.  A “Neutral WPA”, in which each game’s WPA bucket is spread among players after they’ve been credited with WPA/LI, seems like an interesting concept to me.


#16    tangotiger      (see all posts) 2008/07/24 (Thu) @ 21:42

I replied back:

In some respect, I’d prefer to leave the timing bucket alone, because it is what it is. At the end of the season, you’d see how much of the whole thing was dependent on the timing.

On the other hand, WPA does serve a purpose, if you look at it from the pitcher. A pitcher is the one who puts himself in the position he finds himself. So, any bad timing, he’d have to answer to.

As for relievers, Mo sort of puts himself in LI=2 situations because he is Mo the Great.

You could in one sense carry two timing buckets, one for pitchers and one for nonpitchers. For the nonpitchers, they share the timing bucket, and for pitchers, they own theirs.

You can’t have a reliever with a 1.00 ERA who only pitches in mopup situations “earn” alot of money. He’s basically Steve Jobs in 1978… he’s not yet STEVE JOBS, even if he worked as hard in 1978 as he did in 1984. The money just wasn’t there. Or someone who keeps winning penny-ante pots, but is not put in the high-stakes games.


Page 1 of 1 pages


Name (required)
E-Mail (optional)
Website (optional)

<< Back to main


Latest...

COMMENTS

Dec 03 18:44
Sabermetric Moves of the 2009 Pre-Season

Dec 03 21:15
What would happen if the shootout period was 10 minutes, not 5?

Dec 03 20:51
Marcel 2009 is here

Dec 03 18:40
Avery being Avery

Dec 03 17:41
How to calculate the area of a baseball field

Dec 03 16:57
NYC’s 3 1/2 year mandatory jail time sentence for carrying a loaded weapon

Dec 03 14:50
The Return of the Baseball Abstract?  No, the next best thing…

Dec 03 14:48
Estimating BABIP

Dec 03 10:42
What was Pedro worth?

Dec 03 10:20
Complete Run Expectancy, Retrosheet Years