THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Thursday, April 22, 2010

“Those are the little things that I don’t think you can see in the box score, ever.”

By Tangotiger, 03:16 PM

That’s what Mike Lowell said. Actually, we can put it in.  This is what the win expectancy matrix was set up for.

Let’s look at this chart.  It’s the bottom of the 12th (look at the bottom of the 9th), the batter just flied out to make the 2nd out, and Scutaro would 99% of the time remain on 1B.  That sets the win expectancy at .562.  But Scutaro actually went to 2B, for a win expectancy of .610.  So, we credit Scutaro with +.048 wins.

There you go, Mr. Lowell, it is now officially in the boxscore.  If it’s tangible, we can measure it.  And Scutaro going to 2B is tangible and measurable. 

All we need to do is get the scorers to make a better notation in the boxscore so we can separate these plays better.  If it was a deep fly ball that any runner would have made it, we give 100% of the credit to the batter.  If it was a flyball that essentially makes the Scutaro play like a SB attempt, we give 100% of the credit to the runner.  The tough ones are the in-between plays where the split is not so easily done.

By the way, had Scutaro been thrown out, the odds go down to .500, or a drop of .062 wins.  So, it’s a very heads up play by Scutaro, as he only needed to be safe 56% of the time to breakeven.  So, the issue is not that Scutaro went for it, but rather, why don’t more runners go for it on those plays?  As Scutaro said: do or die. 

***

Also, someone else pointed out to me that they IBB the next batter, raising the win expectancy from .610 to .613.  Why did it go up, if the lead runner wins the game in either case?  And, having a runner on base now makes the outs at three bases possible.  I didn’t look into it, other than guess that it’s because two walks wins the game.


#1    Dan Novick      (see all posts) 2010/04/22 (Thu) @ 16:25

Chances of scoring one run, 1st and 2nd with 2 outs: .106
Chances of scoring one run, man on 2nd with 2 outs: .147

So Pedroia intentionally walking decreased the Red Sox chances of scoring by .041, but increased their chances of winning by .004.

Strange.


#2    Greg Rybarczyk      (see all posts) 2010/04/22 (Thu) @ 16:32

What’s the chance of scoring *at least* one run in the first case?  Is it more than .106, or is that what you meant?


#3    Steve Sommer      (see all posts) 2010/04/22 (Thu) @ 17:00

I think the numbers in #1 are from the run frequency table and are representative of exactly 1 run.  I think the corresponding of “at least one run” are .231 (1-0.769) for 1st and 2nd and .223 for 2nd only (1-0.777).  I used http://www.tangotiger.net/RE9902score.html


#4    Peter Jensen      (see all posts) 2010/04/22 (Thu) @ 17:05

Greg - Chance of scoring at least 1 run baseout state 202 is .238 2005-2009 from my empirically based RE table.  For baseout state 1202 it is .245.


#5    Tangotiger      (see all posts) 2010/04/22 (Thu) @ 17:11

You definitely need the “at least 1 run”:
http://www.tangotiger.net/RE9902score.html


#6    Dan Novick      (see all posts) 2010/04/22 (Thu) @ 17:12

My bad, I read the table wrong.


#7    Tangotiger      (see all posts) 2010/04/23 (Fri) @ 10:48

Reprinting from Fangraphs:

http://www.fangraphs.com/blogs/index.php/those-are-the-little-things-that-i-dont-think-you-can-see-in-the-box-score-ever

The win expectancy chart that I linked to is based only on the inning, score, base, out.

There is NOTHING to stop anyone from creating a win expectancy chart that includes the identity of the batters, fielders, pitchers, players in the dugout, last time a reliever was used, manager, park, the count, the time of day, the climate, and anything else you think can affect the win expectancy of a game. All you need to do is put in the time and effort to create such a system.

Win Expectancy, as a framework, is perfect. Win Expectancy, as an implementation, is limited to whatever parameters you use.

***

All Retrosheet years (I think this was 1956-2008 when I did it):
http://www.tangotiger.net/retrosheet/reports/re.htm

All Retrosheet years, win expectancy:
http://www.tangotiger.net/wins.html

The parenthetical promise was done many years back:
http://www.tangotiger.net/RE9902score.html

It’s also in The Book, which you can read for free from Amazon’s Look Inside.


#8    Ken      (see all posts) 2010/04/23 (Fri) @ 12:27

Why don’t more runners do it? The problem is that if Borbon doesn’t catch the ball, then Scutaro probably scores if he is standing near second base, while he doesn’t if he is tagging up.  In analyzing the decision to tag up or not, you have to include the Win Probabilities associated with an outfield error or the runner misjudging whether the outfielder will catch it.


#9    Greg Rybarczyk      (see all posts) 2010/04/23 (Fri) @ 12:40

Right, Ken.  The numbers we’ve been batting around presuppose that the runner knows the batted ball is a flyout, but of course he can only presume that as it flies through the air, with differing degrees of certainty based on teh situation.

If we had a lot more data, we could construct a Win Expectancy table based on “runner sees a hard hit ball at approximate angle X fly over the shortstop’s head towards left-center field”, and then look at all the possibilities (ball caught + no advance, ball caught + advance to 2nd via tag-up, ball not caught + advance to 2nd, ball not caught + advance to 3rd, ball not caught + advance to home).

One of those things for the to-do list once the complete BaseballFx data system is complete and available smile


#10          (see all posts) 2010/04/24 (Sat) @ 02:02

I think I’ve learned enough from you, Tango, to recognize that the advantages of that more complex win expectancy chart are likely to be outweighed by the disadvantages, mainly in terms of small sample size.  Bill James’s archetypal “smart” ballplayer (i.e., Joe Morgan) does instinctively work all that into the calculation, but if he starts by learning how it works in gross for thousands of games and millions of plays, he can make his adjustments from a more reasonable base beyond his own instinct.  It doesn’t take a lot of brains to know you’re not going to run on Ichiro if you have Greg Luzinski on second base, but most plays are nowhere near those endpoints.  You don’t need the data on how likely some specific middling runner and some specific middling arm will succeed, in part because it doesn’t exist and in part because it will be misleading because the sample size is too small.  You just use the basic data for nearly all cases and know enough about your own capabilities and the scouting reports on the opposing players to adjust for the more extreme cases.  That, I suspect, is a more precise and less of a luddite position than what Lowell actually said, but it’s not too far from his position.


Page 1 of 1 pages


Name (required)
E-Mail (optional; WILL be published)
Website (optional)

<< Back to main


Latest...

COMMENTS

Feb 12 05:18
Reader Mail of the Day: Why do we need X years of fielding data?  And what about outliers?

Feb 12 04:55
Who is Jeremy Lin?

Feb 12 03:15
New PECOTA

Feb 12 02:42
Whitney Houston

Feb 12 02:23
Psst… wanna intern in Canada?

Feb 12 00:40
Clutch analogy

Feb 11 20:11
Fighting leads to goals?

Feb 11 19:55
Why do players get crappy caps?

Feb 11 19:12
Hero of the month: Brittney Baxter

Feb 11 17:59
MGL: Today on Clubhouse Confidential