THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews
If you are a media member and would like a review copy of The Book, please contact Kevin Cuddihy of Potomac Books.

Buy The Book from Amazon

MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

Filter posts by...

 

Run_Win_Expectancy

Wednesday, October 08, 2008

Nate Silver meets Stephen Colbert

By Tangotiger, 08:16 AM

Now this is cool.  Forget Dan Rather, Keith Olbermann, and whoever else Nate has met.  Stephen Colbert?  Guest-speaker to the President’s dinner?  Wittiest man in America?  A one-time presidential candidate?  That’s really cool.  Nate looked nervous, unlike his very calm demeanor with the other guys.  Must be tough to be sparring with a comedian as sharp as Colbert.  Colbert was pretty good to Nate.  Nate had one good line where he tried to put a baseball analogy to the race for presidency, likening Obama to the Rays, that we’re in the 9th inning down by 2, and Palin was just picked off first base.  Must feel great to make a funny guy like Colbert laugh. 

Nate has Obama with an 89% win expectancy as of today.  Bottom of the 9th, 1 out, down by 2 is a 96% chance of winning.  Down by one though is a 90% chance.  So, that’s where we are in the race.  Bottom of the 9th, 1 out, down by 1 run, McCain batting, Obama pitching.

(39) Comments • 2008/10/12 • SabermetricsRun_Win_ExpectancyBlogging

Thursday, October 02, 2008

Situational Wins

By Tangotiger, 10:51 AM

Here is my first stab at trying to describe Situational Wins.  Please provide comments, especially as it pertains to readability.  I will then make the necessary modifications, and I’ll submit it to THT for publication for the general public to consume.

Saturday, September 20, 2008

WPA/LI

By Tangotiger, 09:55 PM

I posted this at Primer:

I’m not discussing whether WPA/LI is appropriate or not for Cy Young (I would lean toward not).  I’m asking a more general question about what Balrick thinks that WPA/LI is trying to do.

***

Thank to fret for bringing up that game.

In that particular inning, the WPA totaled -.049 wins.  Indeed, every scoreless top of the half will give the defense -.049 wins in WPA, regardless whether you had 3 or 6 batters.  People who like just runs like that.  Those who care about number of runners allowed don’t.

WPA/LI for the game in question for the top of the 1st was +.019.  (Plus is bad for the defense.) What WPA/LI does is treat each PA in isolation, unleverages WPA so that it scales the performance so that each PA is weighted at exactly 1.0, and weights each event (walk, homer, strikeout, etc) with respect to the game state (e.g., K are more impactful with runner on 3b and less than 2 outs than otherwise).

Is it necessarily a bad thing that a team that allows 2 runners out of 4 batter to reach base to show that they are below average (even though they got out of the inning scoreless)?  I don’t think it’s necessarily a bad thing.  I don’t know that it’s necessarily a good thing.

But to dismiss WPA/LI would be preferable with reasoning behind it.

WPA/LI is exactly perfect for a hitter, if you believe that it matters when a hitter strikes out with runner on 3B and less than 2 outs, and you don’t think that it’s just another out.  That is, you believe that the batter and pitcher realize that that situation requires a fairly strong change in approach.  Only WPA/LI will give you what you need, under this belief premise.  Linear Weights by the 24 base/out states is a close second.

WPA/LI is not as good for a pitcher, since a pitcher is his own team, and each PA should not necessarily be treated as if they are as impactful as any other PA.  WPA/LI forces each PA to be worth exactly 1.0.

Monday, September 08, 2008

Win% estimators

By Tangotiger, 08:45 AM

UPDATE: This post has been corrected due to a bug on my part.

=============================

Great job by Zach, but I’ll quibble on the conclusion:

What’s funny is that Clay Davenport, inventor of Pythagenport, denounced his method in favor of Pythagenpat, yet it is in reality the best method when compared to actual record.

When the RMSE of one is 3.990 and from another is 3.992, that is essentially a tie.  What I did is repeated Zach’s study, with a wrinkle of my own.  Through 2007, he has 1792 team-seasons in his study.  I figured out the Clay and Pat win%, which I then multiplied by W+L, and then rounded to the closest whole number to get a win estimate.

The results.  Out of those 1792 team-seasons, there was a tie 1713 times!  In the other 79 times, Clay did better 39 times and Patriot was 40.  If we give each of them half-wins for their ties, Clay’s head-to-head win% over Patriot is .4997.  What works in Patriot’s favor is that it is a simpler construction that doesn’t break down at the extremes.

It’s fair enough to say that they are both equals for the sample at hand (where most teams play in a 9-10 runs per game environment), while Patriot won’t break down at the extreme levels.

(6) Comments • 2008/09/08 • SabermetricsRun_Win_Expectancy

Friday, August 15, 2008

Situational Hitting and Adam Dunn

By Tangotiger, 10:49 AM

Adam Dunn is what WPA/LI was invented for.  In 4500 PA, his wins added over an average hitter, given the base/out inning/score opportunities presented to him was +18 wins.

The most similar player to Adam Dunn in terms of OBP, SLG, and PA is JD Drew.  In his career, Drew is +25 wins.

Three other players with less PA are: Teixeira (+12 situational wins), Bay (+14 wins), and Miguel Cabrera (+20 wins).  If you add up their totals (+46) to Drew’s (+25), and take 30% (to align them to the same number of PA as Dunn), you get +21 wins. 

I think there may be something to the fact that Dunn is not a good situational hitter (or that his skillset doesn’t lend itself to good situational hitting).

He’d definitely be a good case study.

Here is the comparison line, as of today:
+18 situational wins, .247 / .380 / .519 Dunn
+21 situational wins, .290 / .382 / .522 Dunn’s top 4 comps (Tex, JD, Miguel, Bay)… situational wins prorated to Dunn’s PA.

Monday, August 11, 2008

Translating Relievers into W/L records

By Tangotiger, 11:15 AM

Your reliever gets an out.  Your chances of winning go up.  Your reliever allows a runner on.  Your chances of winning go down.  You add up the deltas of all the times that your chances went up, and you add up all the deltas of all the times your chances went down.  Call the former “Win Advancement” (WA) and call the latter “Loss Advancement” (LA).  WA+LA is GA (Game Advancement).

If you start the game at zero, you are marching toward 1 Win or 1 Loss.  In a typical win, the pitching team will accumulate 1.8 WA and 0.8 LA.  The difference in WA and LA, for every win, is always 1.0.  Always. That is, on your march toward a win, you’ll accumulate some good things and some bad things.  And in a win, you’ll accumulate alot more good things than bad things.  The difference, in a win, will always be +1.  Similarly, in a typical loss, the pitching team will accumulate 0.8 WA and 1.8 LA, with the difference always being -1. 

So, in an average game, you have 1.3 WA, 1.3 LA, 0.5 wins, and 0.5 losses.  The WA and LA capture the ebb and flow of the game, on your march toward the win or loss of the game.  There is, on average, some 0.8 “wasted” WA and 0.8 “wasted” LA per game (2.6 GA minus 1 game).  In order to align WA and LA to W and L, simply subtract the waste (average of 0.8 wasted advancements on each side) from the total accumulation in each game (average of 2.6 GA) from each of WA and LA.

Before we talk about relievers, let’s look at the last generation’s four greatest starters: 

Read More

Wednesday, August 06, 2008

Bay, Ramirez, WPA

By Tangotiger, 06:18 AM

Leaderboards as of this morning on Fangraphs:

WPA: Batters
Lance Berkman 5.29
Pat Burrell 4.98
Albert Pujols 4.47
Jason Bay 4.42
Manny Ramirez 4.18

Ramirez is also #7 in WPA/LI and Bay is #9.

That is all.

(1) Comments • 2008/08/06 • SabermetricsDataRun_Win_Expectancy

Thursday, July 31, 2008

Converting WPA into Wins and Losses

By Tangotiger, 04:30 PM

I know I talked about this here at some point.  It came up again in the Mailbag, and wanted to draw your attention to my rather lengthy and technical response.

(21) Comments • 2008/08/01 • SabermetricsRun_Win_Expectancy

Wednesday, July 23, 2008

Mo, Bill James, and me

By Tangotiger, 02:49 PM

Bill James posted Mariano’s seasonal Win Shares - Loss Shares, whereby he gives him a total of 136 Win Shares and 24 Loss Shares.  I replied:

Read More

Monday, July 14, 2008

Regression, schmegression

By Tangotiger, 10:31 AM

Regression analysis is sabermetrics is probably the worst thing that has happened to its discipline.  Rather than it being treated as a starting point, it’s treated as the target point.  Even smart learned men make this goof.  Nowhere is it more evident than in the regression run value of a double, as Patriot shows us here:

Read More

(17) Comments • 2008/07/15 • SabermetricsLinear_WeightsRun_Win_Expectancy

Thursday, July 10, 2008

WPA is… WPA is not…

By Tangotiger, 03:23 PM

What WPA represents is the quantification of your feelings as the game unfolds.  Imagine if Youk hit into a triple play his first 2 AB, with the score tied 0-0 the whole time, then the Sox lead 15-0 (and he gets two outs), then he hits two HR.  How is it that you felt with Youk, if you tracked it in real time?

Well, his first two AB, you are cursing his name like there’s no tomorrow, then when the team batted around (twice), your blood pressure starts to go down, and then, with the score at 15-0, you’re probably not even watching the game any more.

That is what WPA captures.... the quantification of your feelings as the game unfolds, assigned to the players involved.

WPA is not a way to evaluate the talent of a player.  WPA is exactly the same as counting a PA 11 times when the bases loaded down by 1 with 2 outs in the bottom of the 9th and counting a PA as almost zero in a blowout game.  It is basically ridiculous to think that one PA can inform you on the talent level of a player 1000 times more in one situation than another.

WPA/LI however might be a way to evaluate the talent of a player, since now each PA is exactly worth 1 PA.  The only thing we are doing is realizing that baseball might not be a random game, and that a player might tailor his approach based on the base/out inning/score.  We don’t know how much he does tailor his approach.  That needs to be studied.

(16) Comments • 2008/07/24 • SabermetricsLeverage_IndexRun_Win_Expectancy

Tuesday, June 17, 2008

Even more about replacement level

By Tangotiger, 12:02 PM

Referencing this article on VORP, some BTF readers make some curious statements, while JC makes some puzzling ones.

I made this post at both blogs:

Read More

Saturday, June 14, 2008

Defensive Indifference: Strange thing in BOS/CIN game today

By , 06:16 PM

In the bottom of the 10th inning in today’s BOS game, CIN was down by 2 runs with 2 outs, with Jolbert Cabrera, a veteran journeyman player, on second base.  With Griffey on base and the shift on, Jolbert easily took third on the next pitch, since no one was playing near third base.

Now, obviously being on third base is no better than being on second in that situation.  In fact, you can make an argument that a foul ball occasionally hits a runner on third in fair territory, so that it is better to be on second, although I don’t think I’ve ever seen that, and it is exceedingly rare (if it has ever happened at all).  Plus, being on second base, you can possibly steal the signs or the location and relay it to the batter, although runners rarely if ever do that anymore (I am not sure why).  On the other hand (that is the last hand, I think), being on second might be a distraction to the batter, although certainly you are not going to be “jockeying” on the basepaths and you certainly don’t need to have any kind of a lead since it doesn’t matter if you score on a hit.

Now, even if he can walk to third on the pitch, which was apparently the case, there is a non-zero chance that you could trip and fall and get thrown out.  I found it curious that a guy who has been playing baseball for almost 30 years would do that (take third).

(5) Comments • 2008/06/15 • SabermetricsIn-game_StrategyRun_Win_Expectancy

Thursday, June 12, 2008

Complete Run Expectancy, Retrosheet Years

By Tangotiger, 06:43 PM

RE chart for the entirety of Retrosheet years.

To read the first record: with the bases empty and 0 outs, there have been almost 2 million PA with almost 1 million runs scoring from that point to the end of the inning.  The average (R/PA) is 0.494 runs, which is labeled as REOI (runs to end of inning).  REOI_0 is the percentage of times that there were no runs scored at all in that inning (72.7% in this case).

Excluded are all partial innings, and home halves of 9th and later innings.  I am only looking at the base/out state at the start of the PA.

Here’s the SQL

(89) Comments • 2008/10/10 • SabermetricsRun_Win_Expectancy

Friday, June 06, 2008

Hardball Times Team Stats

By Tangotiger, 09:10 AM

I don’t know when THT rolled out Fangraphs WPA stats, but I like it.  They give you the team-level totals.  The Angels for example are a total of +6.5 wins (meaning their actual wins minus losses is 13 games… as you can see, the player performances are a perfect match to their teams’ record).  That breaks down as -2.0 wins for batting, +4.7 for starters, and +3.7 for relievers.

Fangraphs is likely using a run environment that is too high, meaning it gives too much credit to pitchers.  The total for the 30 MLB teams should have batting as exactly 0.0 and starters+relievers as exactly 0.0.  Checking now… the total of the 30 MLB teams is -51 wins on offense and obviously +51 on pitching.  So, until David A. updated the run environment charts he uses for the win expectancies, you need to mentally add +51 wins per 70,000 or so PA to your hitters (i.e., about +0.1 wins per 200 PA), and remove 51 wins per 16,000 or so IP to your pitchers (about -0.2 wins per 70IP).  As you can see, no big deal at the player level so far.

Also note that starters are way behind relievers, as they always are.  Relievers get the advantage in that their run environment is being compared against a fixed point (say 4.50 runs per game), when in fact relievers, because they are relievers, should actually be compared to a lower run environment.  But, the charts I provided David doesn’t allow for that to be handled.  I can do it, but it’s a pain in the butt. 

In any case, since you likely want to compare to replacement level, not average, you’ll have to make an adjustment anyway, so you might as well do it after-the-fact, and not in real-time.

Anyway, the reason I happen to discover THT’s WPA charts, is that Rally was talking about the Mariners fielding.  It seems that Safeco is always at the center of issues with fielding stats.  They’ve got close to the league-low in both infield and outfield fielding.  It’s hard to believe that a team with Beltre and Betancourt and Ichiro can be that dismal.  Either the other players are so dreadful as to bring them all down, or one or all of these guys aren’t as good as their reps.

At some point, I’ll be looking at Safeco’s PBP data to see if there’s something strange with their data.

(13) Comments • 2008/06/09 • SabermetricsFieldingRun_Win_Expectancy

Wednesday, May 21, 2008

Fangraphs: exciting games of the day

By Tangotiger, 03:25 PM

Fangraphs is showing the average LI and average win expectancy for each game for a particular day in its sidebar (sorted for your ease).  So, you can see which game is the most “exciting” based on these two measures (high LI means alot of swing potential, and a win expectancy that is close to .500 probably means that the game was close for a good part of the game).

(1) Comments • 2008/05/25 • SabermetricsLeverage_IndexRun_Win_Expectancy

Thursday, May 08, 2008

Distributing excess wins and losses in Win Shares

By Tangotiger, 04:14 PM

Guy has it right when he says:

Read More

(3) Comments • 2008/05/08 • SabermetricsRun_Win_Expectancy

Wednesday, April 23, 2008

Retro Fangraphs

By Tangotiger, 11:18 AM

Fangraphs is starting to add the years, and 1974 was the first one.  John Lowenstein and Tommy Hutton are your clutch hitters that year, while Yaz was the choker of the year.  The leader in Leverage Index was Tom Murphy (who?) at 2.05.  He also happened to be the best reliever that year.  So, the Brewers used him perfectly.

(4) Comments • 2008/05/12 • SabermetricsRun_Win_Expectancy

Friday, April 18, 2008

Not EXACTLY one run, but AT LEAST one run

By Tangotiger, 08:50 AM

Smart people make this mistake all the time, but friend-of-The-Book Joe Sheehan doesn’t:

...with a runner on second and no one out, you have a 62.5% chance of scoring one run. With a runner on third and no one out, you have an 82.7% chance of scoring one run. With one out and no one on, you have a 16.5% chance of scoring one run

He references BBTN, and we have it in The Book as well, but, I’ve also got it online right here.  While he did say “one run”, he really meant “at least one run” and not “exactly one run”.  And his numbers bear that out.  And he was right to compute it that way.  With one out and no one on, the chance of scoring exactly one run is 10%.  But, the chance of not scoring at all is 83%, meaning the chance of scoring at least one run is 17%.  And that is the number we care about.  Invariably, we almost always want to know chance of scoring not at all, or its complement.  (His 82.7% for runner on 3B looks mighty low.  I have it at 86.4 for 1999-2002.  Did he mean to say 87.2%?)

Good job to Joe in not falling into the trap.  Admiral Ackbar has snagged many a fine folk.

(7) Comments • 2008/04/20 • SabermetricsRun_Win_Expectancy

Thursday, April 17, 2008

Loss Shares and Win Advancements

By Tangotiger, 08:22 AM

Studes has a little program as to how to calculate WPA Above Baseline.  He actually taught himself PHP so that we have a simple calculator.  Sweet.  I think the algorithm needs modification though.  He says:

Read More

(2) Comments • 2008/04/17 • SabermetricsRun_Win_Expectancy
Page 1 of 4 pages  1 2 3 >  Last »