THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews
If you are a media member and would like a review copy of The Book, please contact Kevin Cuddihy of Potomac Books.

Buy The Book from Amazon

MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Tuesday, January 30, 2007

Home Field Advantage

By Tangotiger, 10:30 AM

Phil points to the latest BEPRESS issue, where Phil says:

They start by examining run scoring: they find that in games of 2004-2005, the home team scored .093 runs per game more than the visiting team. This number looks small, and so the authors conclude this “little supports” the explanation that home teams are “more proficient at scoring runs.”

Ah, the power of picking and choosing numbers, without context.  Here’s the reality:


Home teams allowed 11650 runs in 2004, and scored 11726 runs.
That difference is a miniscule 76 runs in 2428 games (or 0.03 runs).  In effect, you are just as likely to score at home than on the road!  Not.

Home team batters hit in 21092 innings, while their pitchers pitched in 22302 innings.  On a per 9 inning basis, we get the following:
runs scored: 5.00
runs allowed: 4.70

That difference is a whopping 0.30 runs!

Here’s the base data:
http://www.retrosheet.org/boxesetc/YS_2004.htm

In 2004, the home teams actually won .535 times, slightly more than the above estimate.

How about 2005?  The home team winning percentage was .537.  The RS and RA were 11345, 10980 (difference of 0.15 runs).  On a per 9IP basis, the numbers are: 4.86 RS, 4.44 RA for a difference of 0.42 runs.  So, the home team winning percentage was a bit less than estimated.

Put it together, and the home team winning percentage is 0.536, with a run differential of 0.36 runs.

PythagenPat would estimate this winning percentage (4.93 RS, 4.57 RA) as ..... 0.536.

#1    tangotiger      (see all posts) 2007/01/30 (Tue) @ 12:40

Here’s another way to look at it.  Extra Inning games guarantees that the home team bats in the bottom of the inning.  But it still won’t guarantee that it will use up those 3 outs.

This is what happens when I run this report on baseball-reference.com:
From 1995 To 2006,
Home Games,
Exactly a 1 Run Margin of Victory,
and Extra Innings

Home team scores 7942 runs and allows 7428 runs, for a difference of 514 runs.  And, of course, the home team won exactly 514 more games than they lost (remember, I forced a 1-run margin of victory).

When the margin of victory was exactly 2 runs, the home team won only 21.5% of the time!  Why?  Because if you are in extra innings, and you are down by say 2, 3, or 4 runs, the way to win the game is to score 3, 4, or 5 runs.  But, you are likely to score 0 or 1 run, and thereby, not getting a win.

This becomes more clear in this situation:
From 1995 To 2006,
Home Games,
Greater Than or Equal to a 5 Run Margin of Victory,
and Extra Innings

What is the chance of the home team having won the game?  ZERO!  Any extra inning game that ends with a 5-run margin or more immediately implies that the home team did not win.  A home team can only win by up to 4 runs, in extra innings. 

Is there any wonder that the aggregate is very skewed in the direction that the authors are showing?

If we look at all extra inning games, regardless of margin of victory:
From 1995 To 2006,
Home Games,
and Extra Innings

The home team scored 0.28 runs LESS than the visiting team, but won 52.4% of the time.

Winning 52.4% of the time is EXACTLY what probability theory expected, if the HFA was such that the home team scored 0.4 runs more than the road team:
http://www.tangotiger.net/innwin3.html

(See top 9, score tied entry)


#2    tangotiger      (see all posts) 2007/01/30 (Tue) @ 12:58

That should be “0.4 more runs per 9 full innings”.


#3    Phil Birnbaum      (see all posts) 2007/01/30 (Tue) @ 14:41

Tango, do extra-inning games occur often enough that they provide a full explanation of why HFA disappears in 3+ run games?


#4    tangotiger      (see all posts) 2007/01/30 (Tue) @ 15:16

Here is the wonderful link:
http://www.baseball-reference.com/games/situational.shtml

For 1995-2006, the win% for 3+ margins of victory is .508 for the home team.  Take out the extra inning games (only 2%), and it balloons to .515.

A much better study would have stopped the game after 8 innings.  You would have seen far different numbers.  And numbers that actually make sense.

The issue discovered is a baseball-related issue: starting from the bottom of the 9th, the home team doesn’t have to keep playing.


#5    tangotiger      (see all posts) 2007/01/30 (Tue) @ 15:38

Let’s take Phil’s wonderful data:

http://www.philbirnbaum.com/probs2.txt

If we focus on the start of ninth inning (visitor at bat, no outs, no one on base… includes leadoff HR to start inning, but let’s go with it anyway), and score differential of at least 3 runs:
the home team is ahead 9925 times, and behind 7910 times (17,835 games).

That is, if we played only 8 full innings, the home team wins .556 times when the margin of score is at least 3 runs.

(By the way, Phil’s text shows 1979-1990, but when I compare to Forman’s data, it’s likely 1974-1990.  That’s my assumption going forward.)

If we use Sean’s tool, for complete games that ended with at least a 3-run margin of victory, in 17,340 games that ended like that, the home team won 8919 and lost 8421 (.514).

It’s pretty clear that we’ve got a bottom of 9th issue going on here.


#6    Phil Birnbaum      (see all posts) 2007/01/31 (Wed) @ 00:18

>It’s pretty clear that we’ve got a bottom of 9th issue going on here.

Yup, that’s what I said.  But you have real numbers and I used the back of an envelope ...  sure is nice to have non-lazy people around!

And thanks too for the correction, the data probably *was* 1974-90 ... that’s what Retrosheet had at the time.  It was 1979-90 for a long time before that, and I probably just put that out of habit.


#7    tangotiger      (see all posts) 2007/01/31 (Wed) @ 01:38

You can tell it’s 74-90, cause when you compare your “visitor, 1st inning, bases empty, tie game”, the number of occurences comes pretty close to the number of games in that time period.


Page 1 of 1 pages


Name (required)
E-Mail (optional)
Website (optional)

<< Back to main


Latest...

COMMENTS

Nov 20 01:43
Sabermetric Moves of the 2009 Pre-Season

Nov 20 04:02
Nate Silver: hero to interviewers

Nov 20 02:01
My 1B is better than your 1B

Nov 20 00:26
MLB logo

Nov 19 23:03
NBA’s Marcel

Nov 19 19:13
Offense by position groups by decade

Nov 19 17:32
Changes in home run rates during the Retrosheet years

Nov 19 16:40
One Year and One Million Hits Later

Nov 19 16:22
Soria as a starter?

Nov 19 13:50
Response of a fired head coach