THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews
If you are a media member and would like a review copy of The Book, please contact Kevin Cuddihy of Potomac Books.

Buy The Book from Amazon

MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Tuesday, November 20, 2007

Going for extra bases, Retrosheet Years

By Tangotiger, 01:51 PM

Want to know how times have changed, how often runners go for the extra base on a hit?  Here you go.

To read the first line: when a single is hit, with a runner on 1B, between 1957 and 1968, with 0 outs, the runner will go for the extra base 36% of the time, and he will be successful 95% of the time.  This compares to the 1993-present numbers of 27% and 96%.  The gap widens with 1 out (39% attempt, against 30% attempt), and shrinks alot with 2 outs (40% to 36%).  The 1957-1968 was definitely more aggressive, as they ran just as hard with 1 out as with 2 outs, which is not necessarily a good idea.

The same pattern repeats with the runner on 2B.  With 2 outs, there’s not much distinguishing a runner’s aggressiveness over the years.  But, at 0 and 1 out, runners today are far more passive.  There’s nothing necessarily wrong with that, since far more runs score today, making the breakeven point higher today (outs are costlier).

On the flip side, a double, with a runner on 1B, has not changed the aggressiveness of the baserunner.

At some point, soon I hope, I’ll break down these numbers by baserunner.

I’ll also point out the obvious:


An “opportunity” to go for it, or not, is not really an “opportunity”.  If let’s say Cince Voleman goes for the extra base 50% of the time, while his peers go for it 33% of the time, you can pretty much bet that most of the other 50% of the time no one would have gone.  And if your typical pitcher goes for it 15% of the time, you can pretty much bet that everyone would have gone.

It really breaks down like this.  For every 100 singles hit:
45: impossible to take the extra base (infield single, already in OF’s hands before runner even reaches 2B, etc)
15: impossible NOT to take the extra base (ball hit deep, you can walk from 2B to 3B, etc)
40: the discretionary plays

So, what I would do (presuming the numbers in this illustration are valid), is subtract 15% from all runners in the numerator, and subtract 60% from the denominator.

So, Cince Voleman’s 50% attempt rate becomes: (50-15)/(100-60) = 35/40 = 89%

Doesn’t this make more sense?  That, given a real opportunity to take the extra base on a single, that Cince here will try for it 89% of the time?

And your league average runner: (33-15)/(100-60)=18/40=45%.  That is, 45% of the time, an average runner will go for the extra base.

As for the success rate, we have to do the same thing.  Since everyone goes 15-15, we have to take those out.  So, a league average runner might have the following numbers:
33 attempts: 31 reaches, 2 thrown out

Since everyone goes 15-15, we take those out.  Now we see the following:
15 gimmes, 16 reaches, 2 thrown out.

Now, his “success” rate is not 31/33 (94%), but rather 16/18 (89%).  We paint this picture: 45% of the time, the runner goes for the extra base, and 89% of the time, he’s successful.

Suppose you have a bit slower runner, say he has this:
20 attempts: 18 reaches, 2 thrown out
80 stays put

We separate the 100 attempts as:
15 gimmes, 3 reaches, 2 thrown out
45 gimme stays put, 35 passively stays put

Now, we’ve got 5 attempts in 40 discretionary plays, or only 12% attempt rate.  And a 3/(3+2) = 60% success rate.

Doesn’t this make more sense in describing this runner?  Otherwise, if you leave the gimme plays in, you are saying he attempts to go 20% of the time, and is successful 90% of the time.  If this is true, he should be going alot more.

The key here is that there’s tons of noise in the data, and this is one way to remove them.

What I need to do now is figure out the actual % of plays that are really discretionary, rather than this semi-blind guess.

#1    Sky      (see all posts) 2007/11/20 (Tue) @ 14:50

How do you propose to figure out the real percentage of discretionary plays?  Isn’t it really more of a continuum than a three-bin categorization?


#2          (see all posts) 2007/11/20 (Tue) @ 16:17

Nice!

My first thought was this: one way to figure out how many plays are discretionary would be to assume that the success rate of the marginal attempt should be the same as the success rate of the marginal stolen base attempt.  (At least with nobody on.)

But, perhaps there’s some game theory here, where the SB attempt forces the pitcher to throw faster or throw some pitchouts.  So the numbers shouldn’t be the same.

And, of course, it could be that baserunners and managers aren’t as economically efficient as we think they should be.


#3    tangotiger      (see all posts) 2007/11/20 (Tue) @ 16:26

Sure, everything is a continuum.  But, I’m going to deal with what I know: pitchers will rarely try for the extra base.  So, if I see that they go for it 15% of the time, I have to figure that all those were gimme-types.  Vince Coleman, Willie Wilson, etc, will almost always go if they think they can make it.  Add 5% or 10% above that, and that gives me the discretionary plays.

If you have other ideas, feel free to post…


#4          (see all posts) 2007/11/20 (Tue) @ 16:53

It’s probably easier to start with the doubles.  You need to know what percentage of doubles are ground-rule, and then find out for non-ground rule doubles what the advancement rate is by zone, and what percentage of doubles are hit in each zone.

The problem with singles is the absence of hang-time info, which makes finding out which plays are discretionary and which are not quite difficult. On the other side of the coin, I am pretty sure that there are quite a few pitchers in the National League with at least 500 career at-bats who never went first-to-third on a single. 

The other problem with measuring advancement opportunities for singles is the hit and run; teams are more likely to hit-and-run with at least average speed baserunners.  Advancement percentage increases, but there are corresponding losses in other areas.


#5    Pizza Cutter      (see all posts) 2007/11/20 (Tue) @ 18:08

The problem with a fine-grained look at extra-bases is that we have to control for 4 factors: where the runner is when the ball is picked up (is he already halfway home or is he only halfway to third?), how far away the OF is from the target base (which will be affected by fence distances), how fast the runner actually is, and how good an arm the OF has.

I ran a study earlier in the year using sac flies (linked off my name), which controls for how far the runner has to run (90 feet), and with hit location data (which Retrosheet has from 1993-1998… which doesn’t do much good in rating the current batch of hitters/runners in MLB), a good guess as to how far away from home the ball was when the fielder caught it.  We do have good speed scores, but we don’t have a really good arm rating system yet.  If we had one, we could model the odds for the chance that a runner would make it, and match that up to the break-even point to see whether runners are optimalizing their team’s run expectancy with their decisions.  That’s the important question.  If the break-even point is 60% and the runner has an 80% of making it, then he’d better be running.  Even if he gets thrown out, he (or his third base coach) made a good decision.


#6    tangotiger      (see all posts) 2007/11/20 (Tue) @ 20:23

Looking only at starting pitchers as runners: single, runner on 1B:

From 1957-1981, they went for it 21% of the time.  Since 1982, it’s 13%.  That’s with 0 outs.

With 2 outs, it’s 25% and 22% respectively.

So, those I think are close to our benchmarks.  In today’s game, with 2 outs, there are 20% of singles that are gimmes in terms of taking the extra base.


#7    tangotiger      (see all posts) 2007/11/20 (Tue) @ 22:17

I’m looking at individual runners now, with 0 outs.  Mo Vaughn was the worst, attempting only 5% of the time.  Before you think it’s park effect, even if he went 0% at Fenway, that still give him 10% of the time on the road.  Olerud is at 9%, and that’s almost certainly because of his head condition.  There’s a bunch at 10%, including Delgado and Piazza.  Kinda strange that pitchers were able to go for it at a higher rate than these guys.  It seems therefore that 10% seems to be the gimme level.

Piazza has got to be one of the worst runners ever.  He attempted only 10% of the time, and of those times, he was successful only 75% of the time. That’s horrible.  And with 1 out, he’s at 18%, 84%.  And with 2 outs, 17%, 92%. 

There’s a runner from the 60s that attempted for the extra base 72% of the time with 0 outs!
http://www.retrosheet.org/boxesetc/B/Pbeckg101.htm

With 1 out (59%) and 2 outs (48%), he was also pretty high.  In these two situations, he was always safe.  It’s almost certain that there were tons of hit&runs with him on the bases with no outs.

When I look at the speedsters (Coleman, Wilson, Rickey, Raines), they were only 5% or 10% points higher than league average, which isn’t that much.  Willie Mays was much higher.

It’s an interesting set of numbers, and I guess we have to be very careful how far we are willing to go with them.

Hit&Runs and the batters at the plate have alot to do with how far you go.  If balls are hit to RF, it’s alot easier to make it than if they are hit to LF, so at the least, we need to break it down by handedness of the batter.

The thing is, there’s not that much sample size to begin with.  Pete Rose (very aggressive runner), was on 1B when a single was hit 1091 times.  Most of the long career guys are at 500-600 times.  And that’s across all three outs.  So, at each out level, we are down to a couple of hundred opps. If we break this down at the handedness level, it goes down that much more.

I guess I was hoping the data would tell us more than it likely is.


#8    Peter Jensen      (see all posts) 2007/11/21 (Wed) @ 16:43

I would think that the running ability of the batter would also be a factor.  A good running batter who reaches second when the runner on first advances to third may be credited with a double, while a slower running batter may not attempt to run to second and if he does may be scored as a single and a fielder’s choice.


#9    Tangotiger      (see all posts) 2007/11/21 (Wed) @ 17:05

That’s interesting.  The implication therefore is that the faster the batter-runner, and with a runner on first, then a more disproportionate % of singles+doubles that are counted as doubles, than if there was no runner on first.

So much to do…


#10    studes      (see all posts) 2007/11/24 (Sat) @ 01:07

Just catching up with this one; want to point out that I graphically reported this earlier this year…

http://www.hardballtimes.com/main/article/ten-things-i-didnt-know-last-week36/

...for those of you who like graphs.


Page 1 of 1 pages


Name (required)
E-Mail (optional)
Website (optional)

<< Back to main


Latest...

COMMENTS

Nov 20 01:43
Sabermetric Moves of the 2009 Pre-Season

Nov 20 04:02
Nate Silver: hero to interviewers

Nov 20 02:01
My 1B is better than your 1B

Nov 20 00:26
MLB logo

Nov 19 23:03
NBA’s Marcel

Nov 19 19:13
Offense by position groups by decade

Nov 19 17:32
Changes in home run rates during the Retrosheet years

Nov 19 16:40
One Year and One Million Hits Later

Nov 19 16:22
Soria as a starter?

Nov 19 13:50
Response of a fired head coach