THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Friday, November 18, 2011

To pinch-hit or not?

By Tangotiger, 10:45 AM

MGL’s terrific guest piece at BPro:

In any case, it is easy enough to see how these starters pitched after they were allowed to bat. Remember that, on average, the starter pitched another 1.42 innings after his stint at the plate. What was the average TAv against over these 1.42 innings, again, after removing all pitcher batting? Remember, these were .258 pitchers for the season who had pitched to the tune of .221 so far in the game. They were also facing the lineup for the third time on the average; if there was no carryover effect, we would actually expect them to have pitched worse than their normal .258, all other things— like the park, weather, and opposing batters—being equal.

So how did they pitch? (We’re going to exclude the ninth inning for the reasons stated above.) They pitched to the tune of a .251 TAv against (after adjusting for opponent batter pool)—better than we expected, but quite a bit worse than their .221 prior to being allowed to hit. Coincidentally, .251 is almost exactly the same as the average reliever, who is a .250 pitcher. So leaving your “hot” starter in the game yields no advantage over replacing him with an average reliever, unless he is a considerably above-average pitcher.

What about when the starter was taken out of the game for a pinch hitter? How did the relievers pitch in the very next inning? They allowed a TAv of .243, which is around what you would expect from a late-inning reliever.

So let’s recap the last few paragraphs. When a manager allows his starter, who is an overall .258 pitcher but has thus far pitched at a .221 level, to bat in a high-leverage situation (LI > 1.5) after he has pitched at least five full innings, he pitches at a .251 level (TAv) for the next 1.42 innings, on the average. When the starter is removed in the same situation, relievers pitch at a .243 clip for at least the next inning. Clearly, there are some starters who are good enough to post a sub-.243 TAv later in the game, but remember we are asking the question, “What if we were to remove all starters when they have completed at least five innings on the mound and they are due to bat in a situation where the LI is greater than 1.5?” Whatever our answer is, we can perhaps leverage that one-size-fits-all strategy by letting some pitchers (aces) bat for themselves and continue to pitch. Of course, you are still giving up the value of using a pinch hitter, which is almost the whole point of the strategy. In other words, a pitcher would have to pitch a heck of a lot better than .243 or .250 in order to justify allowing him to bat. This brings us to our next, vitally important question:
...
Also remember that each team averages around 28 of these decisions per season, and the starter ends up batting around 12 of the 28 times. Multiplying 12 times .032 wins yields a gain of .384 wins per season per team by virtue of this simple strategy.

If one were to argue that such a strategy might tax a bullpen or hurt the confidence or ego of a team’s starting pitchers, remember that the average subsequent IP whenever a starter is not removed for a pinch hitter is only 1.42. That means that we would be transferring a total of 12 * 1.42, or 17 IP, from the starters to the bullpen, an average of around three fewer innings per starter and perhaps one or two more innings per reliever. This hardly seems like a crisis.

A gain of 0.384 wins on 17 innings?  That’s a gain of +.200 wins per 9 IP, which is like replacing an average pitcher with Roy Halladay.


#1          (see all posts) 2011/11/18 (Fri) @ 11:59

What I see as missing here is a way to evaluate the state of the bullpen when the decision has to be made. These numbers tell of a single, isolated game. What of the situation where the team played a 15-inning game last night and has a doubleheader the day after tomorrow? Then it may well be advisable to let the starter hit and get the extra 1.42 innings out of him, even if it lowers the win probably of this single game.


#2    Tangotiger      (see all posts) 2011/11/18 (Fri) @ 12:27

What percentage of those extra 17 innings that MGL is talking about do you think that would account for?

That said, in addition to the constraints that MGL added (at least 5 innings, LI of at least 1.5), I have no problem adding that the bullpen could not have been used for at least x amount of innings in the previous day, and y amount of innings two day ago.

The main point still stands though: you get a humongous benefit by transferring 17 innings over the course of a season.

Heck, you transfer those 17 innings today, you may even get a few of them back, in a more well-rested starting pitcher for his next game.


#3          (see all posts) 2011/11/18 (Fri) @ 12:34

What this makes me think is that if we could do a better job of figuring out which starting pitchers “had it” in a given game (or alternatively, if that really is just luck, why that is so), that there is a large potential payoff to the team.  If the manager really knew that his starting pitcher that day was not much better than a reliever that day, I think it would be much easier to convince him of the value of pinch hitting.  But a manager today is not likely to believe the premise.  In order to get MGL’s proposed strategy actually adopted by managers, we need more research into pitcher effectiveness.


#4    Tangotiger      (see all posts) 2011/11/18 (Fri) @ 13:46

Responding to Bill in the comments section at BPro:

I object (strenuously!, ala Demi Moore) to Bill’s contention that a team’s bullpen is so taxed to the limit that it couldn’t find 17 innings somewhere.

Let’s first start with what could be the “limit” for a team’s bullpen.  Last year, the Braves led the NL in ERA for bullpen AND were #2 in most IP (at 522).  Fine, the rookie stars on the Braves are an exception you say.  The Nationals had almost as many IP (520), with an ERA of 3.20, 4th in the league.  The Pirates led with most IP (526), and their reliever ERA 3.76) was a bit worse than the league (3.59).

If you take the top 4 in the NL in IP by relievers (average of 521 IP), the average ERA for those teams was 3.36.

It seems to me that we can say that the “limit” to a team’s bullpen is 520-530 innings, and that’s really being conservative.

Since MGL is saying that we want to add 17 innings to the bullpen, then any team that had less than 500 innings on its bullpen would be able to figure out how to get those 17 innings.  Nine of the 16 teams had under 484 innings, and therefore, would EASILY be able to add 17 innings to its bullpen.

Reds, Cubs, Rox were all on the 500 inning bubble, so they’d have to stretch a bit to get to 17 innings, but even doing so, they’d still end up with fewer innings than the league-leader in Pirates.

Only 4 of the 16 teams were close to that 520-530 limit that they couldn’t add another 17 innings.

I should also note that the league leader in IP in 2010 was the Nationals (at 546 innings) with a low ERA of 3.35.  The Padres in 2009 led with 571 innings, and also a better than league average ERA (3.75).

Therefore, you can make the case that every NL bullpen in 2011 had plenty of room to add 17 innings.

So, what Bill is saying sounds nice, and seems reasonable, but when you look into it, you see that it’s not a concern most of the time.


#5    Tangotiger      (see all posts) 2011/11/18 (Fri) @ 13:57

Responding to Mike:

Yes, the key point, in terms of why managers believe in the hot-hand, is when MGL says this:

They pitched to the tune of a .251 TAv against (after adjusting for opponent batter pool)—better than we expected,

Where the expected was .258. 

If for example 25% of those pitchers TRULY was on, and had a true talent that day of .230, while the other 75% were pitching their normal selves (.258) and simply were lucky to get the results that they did, we have an overall average of .251.

So, the manager might be thinking:

You know, I’ve got a 25% chance here that what I’m seeing, the .221 so far, is close to real, that I’ve got a pitcher pitching at a Cy-level.  There’s a 75% chance I’m seeing average, .258,, and my reliever is going to be just a bit better, at a .245-.250 level.  I’m not taking him out unless I know for sure.

This is why I can’t begrudge a manager here.  It takes enormous b-lls to simply say that since he can’t REALLY tell, then he simply must presume that ALL of the pitchers are reverting back to 7 points better than their overall self.  And at that point, he’s going to have to switch to the reliever if the talent level (minus 7 points) is worse than the .245-.250 level at his bullpen disposal.

But Mike is right.  If we can find some indicator to give the manager more confidence in which pitchers are part of the 25% that are truly on, and which pitchers are part of the 75% that are getting lucky, that would be mighty helpful.


#6    MGL      (see all posts) 2011/11/18 (Fri) @ 15:18

I wrote this on the BP comments:

Bullpen management is a legitimate concern as I state 2 times in the article.

Certainly the manager can add to the requisites for using this or a similar strategy (a quicker hook for the starter when he comes to bat in high leverage situation) that he has a reliever available who is better than the expected performance of the starter.

BTW, there are many examples where the starter who was allowed to hit was an awful starter (since the average starter in that bucket was .258 (TAv against), there are plenty of much worse ones), such that even a replacement reliever would be better. Surely in those cases, Bill’s objection doesn’t hold much water, again assuming that the bullpen in general is not overly taxed from recent overuse.

To respond to Mike and Tango above, I think that managers think that virtually ALL of the pitchers they leave in the game are “on.” That is the primary reason they leave them in the game, I would think.  In other words, that is evidence that managers (and pitching coaches) cannot tell whether a pitcher is “on” or not.

If you remember the threads about starters and relievers in the 9th inning, I found that when a manager chooses to leave a starter in for the 7th, 8th, and 9th innings, they also had been pitching brilliantly, but alas they reverted back to around their usual level subsequent to the decision (in the 9th, we had to make an “adjustment” for the offensive strategy because most of the time when a starter pitched the 9th, he had a big lead).  More evidence that managers can NOT tell when a pitcher is “on.”

Now, if they could come up with a way to actually tell whether they were on or not, then yes, they could leverage the strategy.  But that is a big IF.

I should have emphasized this in the article more or simply not had the 5 inning cutoff, but when you allow the manager to take his starter out in the bottom of the 4th or top of the 5th (after pitching 4 innings only), you more than double the gain from that one inning alone, presumably because managers hardly ever do that.

The Cubs could have gained 1.5 wins last year alone by utilizing the “4th inning or later” strategy. I find that amazing!


#7    MGL      (see all posts) 2011/11/18 (Fri) @ 15:35

The other thing I wish I had said in the article is this:

I’ve been wanting to do a study like this for a long time for the following reason:

Although it is not an exact science, as several people have pointed out with regard to bullpen issues (I also pointed out a few other considerations in the article, such as burning a pinch hitter early in the game, perhaps affecting the psyche of your starters, etc.), it is the first significant step, I think, in quantifying how many wins optimal managing could potentially add to a team’s w/l record.  Just from this one strategy (albeit it is probably the one with the biggest single advantage, by far), to be able to gain 1 win a season (using the 4+ inning strategy)?  I think that is huge!


#8    Tangotiger      (see all posts) 2011/11/18 (Fri) @ 15:42

MGL:

1. I have no doubt that the manager cannot tell who is really on. 

2. I do think that the manager THINKS he can tell.

3. And we all agree that there IS such a thing as a pitcher being on (25% in my crude illustration).

So, he doesn’t want to pull the equivalent of Doc Halladay out of the game, if there’s a 25% chance that he actually has Doc’s twin.

It’s really a risk aversion move here.


#9    Clemente      (see all posts) 2011/11/18 (Fri) @ 17:57

Actually, T#8, your conclusion means item 2 does not have to be true.  The manager could think in line with MGL and know he has no knowledge, or could be indifferent to knowing.  But once he thinks item 3, he will reach your conclusion---leave them in on the off but decent chance they will continue to pitch great.

I think this disregards the advantage of the PH in a leveraged situation, but managers do not have to (foolishly) believe they know something to reach the same decision.


#10    MGL      (see all posts) 2011/11/18 (Fri) @ 19:06

I don’t see how a significant percentage are really on. If we don’t see much of a difference overall how does the math hold up that 25% are on?  Unless “on” means pitching a little better than usual. I think that managers think “on” means pitching brilliantly, otherwise they wouldn’t be leaving pitchers like Jeff Suppan and David Bush in the game when they are pitching brilliantly. Unless u think that the spread of true talent in any one day is significantly larger than the normal spread of talent I just don’t see how a significant percentage of these pitchers are really “on”. Maybe that’s why managers cans see it - because it doesn’t exist.


#11    MGL      (see all posts) 2011/11/18 (Fri) @ 21:30

"So, he doesn’t want to pull the equivalent of Doc Halladay out of the game, if there’s a 25% chance that he actually has Doc’s twin.

It’s really a risk aversion move here.”

No, I don’t think it is risk aversion. Risk aversion has to be associated with as palpable negative consequence.  No one would ever know if a manager took a pitcher out who was actually “on.”

A similar cognitive paradigm is likely going on, which is that the manager would be ridiculed if he took a pitchers out who was pitching brilliantly, because everyone (incorrectly) assumes that he is taking out Doc.

The worst part is that even taking out Doc is marginal proposition and often completely correct, depending on the actual gain from a PH.  As you can see from my chart, there are situations where a pinch hitter can gain you .5 runs (with leverage).  Your starter would have to be 2.25 runs per 9 better than his replacement if you expected him to pitch another 2 innings.

Let’s say that Doc is coming to bat in the 6th inning and the leverage is 2.5 and the gain from a pinch hitter is .2 runs.  That is not atypical.  Taking him out is 100% the correct play, almost regardless of the reliever.  Can you imagine the flack you would get if Doc had thus far allowed one run on 3 hits?


#12    MGL      (see all posts) 2011/11/18 (Fri) @ 21:33

Can we just all agree that in order to make correct in-game decision some of the time, you have to know some of the “numbers” as well as understand how to crunch the numbers in order to get the appropriate answer, and that there is not a manager that exists on the planet (a little exaggeration) who knows or can do either (or respects the fact that this is the case), and never write another thing about “bad managers?”


#13          (see all posts) 2011/11/19 (Sat) @ 00:00

It’s eerie how closely the numbers on the manager’s decision here line up with the numbers on you’ve recently been talking about w/r/t football coach decision’s to go for it on short 4th downs. In baseball it actually seems they prefer the more risky strategy of seeing if the SP actually does ‘have it’ in this game, but the preference for the riskier move here is probably due to it being the move of inaction.

These calculations look just not-egregious enough for the manager/coach to default to conventional wisdom unless in-game reads provide evidence to the contrary. Which, actually, fits in quite nicely with popular impression of sport decision-makers: unless you’re Tony LaRussa or Bill Belichick, coaches/managers typically make a move when it’s ‘clear enough’ to them that it’s the right one, instead of just being what they think is the better call. (Although i guess contrary to my prior, LaRussa doesn’t seem to have a great record with pulling pitchers in these situations.)


#14    MGL      (see all posts) 2011/11/19 (Sat) @ 05:03

In the football case, it is a combination of risk aversion and ignorance of the numbers (and concepts).

As I said above, in baseball, I don’t think this has anything to do with risk aversion.  I think it is ignorance and faulty logic.  One, they don’t realize the cost of not putting in a pinch hitter.  Two, they incorrectly think that their pitcher is really “on” and that putting in a reliever is going to be much worse.  And three, they think that they cannot possibly utilize their bullpen any more then they have to (there seems to be this irrational fear in baseball of overtaxing your bullpen). (And of course other things, like it is “manly” for a pitcher to throw as many innings as possible, etc.)

OK, I guess there is some risk-aversion.  If they put in a reliever and he pitches poorly, the manager will be heavily second guessed.  And even if the reliever pitches well, the manager will still be criticized for taking out a “hot” starter…


#15    dave smyth      (see all posts) 2011/11/19 (Sat) @ 07:55

When looking at these moves, don’t ignore the entertainment value of the ‘storyline’ of a particular contest.  The entertainment is not just in the fact of a win itself, it’s also in watching the ‘process of trying to win’. Hitting for ‘Doc’s twin’ in the 5th might be worth a fraction of a win, but it’s also pretty dry and unemotional.

I’m not trying to state how much weight should be put on this, just that it is a real consideration, IMO…


#16    MGL      (see all posts) 2011/11/19 (Sat) @ 20:22

Dave I agree. We sometimes forget that the only value in wins for a team is in $$.  And dollars come from attendance. So if some strategy reduces wins but otherwise boosts attendance through entertainment value or whatever, then....


#17    Les      (see all posts) 2011/11/20 (Sun) @ 01:19

It might not be risk aversion, but is certainly criticism aversion. Assume you pull Doc (or even your #4 pitcher if he’s pitching like Doc) when he’s given up 1 run on a couple of hits and the PH hits into a double play and the bull pen blows the game and you don’t have a left handed pinch hitter available at the bottom of the ninth. Your fellow managers, ex-managers, bloggers, sportswriters, etc. would let the world know what a dummy you are and that you should be fired etc. If you do this to many times and it doesn’t work out, your owner will decide that he needs a new manager.
If you doubt this, read the comments about Mike Smith’s go for it decision. Check Brian Burke’s site for a good sample of what happens when the right call fails http://www.advancednflstats.com/2011/11/roundup-111811.html#more.
You’ll need to get a high fraction of the fans, sportswriters, etc. to buy into the strategy before most (any) manager goes along.
Almost everyone judges a move against the conventional thinking based on the outcome. And a fail outcome is seen as much worse than a success outcome.
Les


#18    Les      (see all posts) 2011/11/20 (Sun) @ 01:24

Clarify my last line. A criticism of a fail outcome is orders of magnitude more than the praise of a successful outcome. See Brian’s site for lots of discussion of this. http://www.advancednflstats.com.
Some of Mike Smith’s critics even say that before the play failed they thought it was a good move, but because it failed it was a bad move and Smith needs to go.
Denver’s coach Fox came in for the same treatment early in the season for a similar decision.
Les


#19    Geoff Buchan      (see all posts) 2011/11/20 (Sun) @ 07:22

Les - Criticism in the NFL is magnified because there’s only one game a week, so there’s lots of time for that to play out.

In MLB there’s a new game every day, so you don’t dwell so much on any one loss or mistake. It should be easier to go against the conventional wisdom in baseball, as this situation comes up only about 1 in 10 games for 5 innings.

Now MGL did say pulling after 4 produced bigger gain, but that would generate more scrutiny (not letting the pitcher “go for the win"), and would also happen more often (about 1 game a week).


#20    Geoff Buchan      (see all posts) 2011/11/20 (Sun) @ 07:34

MGL - Neat study. Why choose the 1.5 LI cutoff? I realize any cutoff is arbitrary, I’m just wondering what’s special about 1.5 as opposed to 2, 1.75, 1.25.

In theory one could develop a formula taking as input the LI, the pitcher’s batting ability, his pitching ability, and the likely PH’s batting ability (which could be adjusted for expected platoon splits) that would compute the WPA gain/loss from pinch hitting or keeping the starter in.


#21    Geoff Buchan      (see all posts) 2011/11/20 (Sun) @ 07:43

Another question… what’s the inning distribution of these situations? I saw the cutoff is 5 (or 4) innings), but what is the average inning that has a high leverage AB for the pitcher?

Obviously the later in the game it is, the more “reasonable” a decision to pinch hit becomes according to conventional wisdom. Pulling him after 7, or even 6, innings would likely face little criticism for being unusual.

One last comment: since it’s the starter batting, you know that the top of the batting order follows him (unless it’s Tony LaRussa experimenting with the pitcher hitting 8th). So the subsequent batters should be better than average for the team. I’m not sure how that would affect things, though.... I guess it lessens the importance of the PH with fewer than 2 out, but magnify it with 2 out.


#22    Bill Waite      (see all posts) 2011/11/20 (Sun) @ 20:24

Geoff/21

One of the charts in the article shows the inning distribution.

In the 5th inning, there are 616 PAs where the pitcher’s spot is up and the other criteria are met (if I’m reading the chart correctly). 6th is 747, 7th is 922, 8th is 339. So about 20% of the PAs happen in the 5th inning, but I imagine some of the ones in the 6th are after the pitcher has only pitched 5 innings on the road, so maybe around 35% total happen after only 5 IP?

As to your second comment, I believe having the top of the order up actually magnifies leverage index in all situations (including 0-out situations, although I’m not sure about RISP+0 outs) simply because the pitcher is effectively hitting in a slightly higher run-scoring environment. But I believe the effect size is pretty small in most cases.


#23    Bill Waite      (see all posts) 2011/11/20 (Sun) @ 20:49

"Would I want to pinch-hit for Hiroki Kuroda in the sixth inning down by a run so that I could get more work or innings for Matt Guerrier and Mike MacDougal? When the guys I’m batting are Trent Oeltjen...and others who primarily made up LA’s bench last year?”

I think dodgerken’s comment on BPro illustrates one reason why a lot of people underestimate the value of a pinch hitter.

Oftentimes, people make decisions by weighing the EMOTIONAL IMPACT of competing ideas (e.g. “pitchers can’t hit” vs. “neither can bench players"), and the emotional weighting mechanism gets it wrong because:

1) People have a visceral lack of confidence in their bench players’ ability to hit. In other words, they FEEL STRONGLY that their bench players are weak players.

2) They don’t have any significant emotions about their pitcher’s ability to hit because that’s not really their job, and because it doesn’t set them apart from other pitchers.

Objectively, I’m sure Trent Oeltjen is a much stronger hitter than Hiroki Kuroda, but emotionally, dodgerken doesn’t FEEL much of a difference between them.


#24    Bill Waite      (see all posts) 2011/11/20 (Sun) @ 21:13

Geoff/20

Actually, the FORMULA for calculating what you describe is simple, and I believe MGL did use such a formula in calculating “leveraged wins gained”, etc.

The formula is:

(starter’s expected runs/inning - replacement’s expected runs/inning)*starter’s addtl innings - (run expectancy of pinch hitter - run expectancy of pitcher hitting)

Of course, the value of the formula depends on the accuracy of the estimates you plug into it. I believe MGL plugged in run expectancy of an average pinch hitter vs. an average-hitting pitcher, and the runs/inning allowed of a league-average reliever vs. average starter left in the game.

I believe he also calculates what it would look like if we adjusted for the starter’s pitching ability (based on full-season totals, not based on how he’s pitching that night).

An individual team could certainly benefit from adjusting for the actual strength of their individual starters, relievers and pinch-hitters, but when they do plug in all the numbers, there probably won’t be very many situations where it’s correct NOT to pinch hit given MGL’s criteria (LI>1.5, etc.).


#25    MGL      (see all posts) 2011/11/20 (Sun) @ 22:55

In the numbers in the last couple of charts, for each time a pitcher came to bat, I figured the “loss” in WE by taking the difference between an average pinch hitter and an above-average hitting pitcher and the difference between THAT pitcher’s TAv for the season minus 6 points (I gave him a little “onness") and an average reliever (.250) and multiplied that by the expected number of subsequent innings pitched, which I merely set at 1, 1.5 or 2, depending on the inning of the PA, and of course I multiplied the pinch hitter gain by the exact leverage of that situation and I multiplied the starter/reliever difference by 1.5 (just assuming that those 1-2 subsequent innings would be somewhat high leverage as well, since it is likely a close game.

In calculating the difference in RE between an average PH and an above-average hitting starting pitcher, in the mini-Markov chain, I used the RE chart when the lead-off batter bats next, so I did take into consideration the fact that the top of the order follows the pitcher.

My cutoff of 1.5 LI was completely arbitrary. If I were a manager or a consultant to a manager I would simply remove the starter when the difference between him and a pinch hitter multiplied by the LI, regardless of what the LI is, is significantly (to make up for burning a pinch hitter and burning your bullpen a little) larger than the difference between the starter’s expected pitching and a reliever’s expected pitching (for 1 or 2 more innings of course).

If I had to give a manager a rule of thumb, I would probably use the 1.5 LI cutoff and I would start with the 4th inning and not the 5th, since, as you can see in the article, your more than double your gain when extending this strategy by just one inning (allowing managers to pull the starter after pitching only 4 innings).

Again, if I had to suggest a rule of thumb, I might also exempt his ace starters unless the LI is more than 2 and the starter has pitched at least 6 or 7 innings already.


Page 1 of 1 pages


Name (required)
E-Mail (optional; WILL be published)
Website (optional)

<< Back to main


Latest...

COMMENTS

May 25 15:12
Do pitcher’s reach back for velocity when needed?

May 25 15:02
Pete Palmer’s new book: Basic Ball

May 25 14:44
What sabermetrics is NOT

May 25 13:04
“Why Kickstarter works”

May 25 12:51
Chad Curtis

May 25 12:40
Largest demonstration in Canadian history?

May 25 11:32
Howard Stern

May 25 11:26
Lack of hustle during a game

May 25 10:58
Rooting for laundry

May 25 02:38
NFLPA lawsuit against collusion