THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews
If you are a media member and would like a review copy of The Book, please contact Kevin Cuddihy of Potomac Books.

Buy The Book from Amazon

MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Sunday, June 01, 2008

Why is the HFA so high this year?

By , 04:21 AM

I did not realize it was until I read this article on ESPN.com.  In it, Buster Olney takes on the question as to why the home teams are winning at a .577 clip so far this year (as of May 29, according to Olney).

In the article, Olney says, according to several GM’s, players, managers and scouts, it might because of the “influx of young players” who are more familiar with their home environments, or perhaps even party on the road, more so than the average player I guess.

As I said, I hadn’t even noticed that home teams are winning at such a high rate so far this year, and normally I wouldn’t think anything of it anyway.  But given the sample size, the difference between this year and what is typical (around 53-54%) is greater than 2 SD, enough to raise an eyebrow or two.  Plus, there are a lot of weird things going on in baseball so far this year (well, at least one weird thing, which is the low run scoring and especially HR rate in the AL).

Anyway, the “young players” explanation seems a little silly to me on its face.  I mean how many extra young players would it take to make such a difference?

Not one to accept anything at face value, especially that which “scouts, managers, GM’s, and players” posit, I looked at the average age and distribution of ages of pitchers and batters so far this year as compared to last year at the same time (thru May 29).  Each age is prorated by the number of PA or TBF.  The distribution of ages is percentage of total PA or TBF.  Here is what I found:


2007 NL Batters
Av age: 29.8 < 25 .11 25-30: .42 30-35: .31 >35: .16

2008 NL Batters
Av age: 29.3 < 25 .10 25-30: .47 30-35: .32 >35: .11

2007 NL Pitchers
Av age: 29.6 < 25 .11 25-30: .48 30-35: .28 >35: .13

2008 NL Pitchers
Av age: 29.4 < 25 .13 25-30: .40 30-35: .35 >35: .12

2007 AL Batters
Av age: 30.1 < 25 .10 25-30: .33 30-35: .44 >35: .14

2008 AL Batters
Av age: 30.0 < 25 .10 25-30: .38 30-35: .38 >35: .14

2007 AL Pitchers
Av age: 29.0 < 25 .14 25-30: .44 30-35: .30 >35: .12

2008 AL Pitchers
Av age: 28.9 < 25 .12 25-30: .52 30-35: .22 >35: .13

The average age among NL batters is .5 years less than last year and among NL pitchers it is .1 less.  In the AL, it is .1 for the batters and .1 for the pitchers.

However, there is no noticeable difference in the number of under-25 players.  In the NL, there are 1% fewer under-25 batters and 2% more under-25 pitchers.

In the AL, the number of under-25 batters is the same as last year, and for pitchers, there are 2% fewer under-25’s.

Even if it were true that “young players” do better at home than on the road, relative to the average player – and that is a shaky theory at best, IMO – I don’t know where all the GM’s. players, managers, and scouts got the idea that there are more “young players” in baseball this year than in past years, or at least as compared to last year.  There aren’t, even though the average age in baseball is slightly less than last year.  Do they just say the first thing that comes to their minds?  I don’t know.

Here is a nice conspiracy theory to explain the higher HFA this year: If it is true that teams are now allowed to keep their baseballs in climate controlled environments, what is to prevent them from taking out the livelier balls when they are batting or when they are behind in the game?  I have mentioned that several times over the years with regards to the Rockies and the humidor.  It seems to me that if you allow each team to “doctor” the baseballs, even if that means getting them back to their “original” condition, you are just asking some teams to cheat.  People will cheat when given the chance.

Just a theory.  Hey, I think it is a heck of a lot better than the “younger players” theory!  At least I am not making up facts.

#1          (see all posts) 2008/06/01 (Sun) @ 05:34

Well, if you want conspiracy theories, maybe MLB has decided to take a cue from the NBA, where it seems to be generally accepted that the refs favor the hometeam. Perhaps umps have started squeezing the strike zone on the road team pitchers and expanding it for home team pitchers. That should be testable in theory, though I’d be really surprised if something like this was happening.


#2          (see all posts) 2008/06/01 (Sun) @ 11:19

I have done some research on how HFA changed over time and also some research on how the age distribution has changed over time. Below are the HFAs for each decade starting with 1901-10

0.1042
0.074
0.0928
0.0976
0.0922
0.0784
0.0796
0.0752
0.08
0.07

The last decade is actually 1991-2002. The .07 Means that home teams had a .535 winning percentage.

I also found the percentage of all players with 400+ PAs who were 20-24 for each decade. Here are the percentages over the same decades as HFA (except the last is 1991-2000)

16.11
21.04
15.95
16.05
18.29
14.60
21.77
21.88
15.34
13.07

I know it is only 10 observations, but the correlation is -.18. So the greater the % of young guys, the lower the HFA, although it is not statistically significant.

My HFA article from the Baseball Research Journal is at

http://www.geocities.com/cyrilmorong@sbcglobal.net/HomeRoad.htm

My article on player aging from Beyond the Boxscore. I don’t have the data there on aging that I present here. There is a graph, but part of it is cutoff since BTB made some format changes

http://www.beyondtheboxscore.com/story/2006/9/8/144018/2113


#3          (see all posts) 2008/06/01 (Sun) @ 11:23

How much higher than 2 SD is it?  There are lots of weird things that could happen just by chance in the first two months of the season ... maybe this is just this year’s.


#4    MGL      (see all posts) 2008/06/01 (Sun) @ 12:31

Yeah, I agree that if we start out with the question of “what is weird so far this year” we are going to find some things that are weird by chance alone, by definition.

There were around 800 games as of May 29.  According to the binomial, that is around .018 for one SD (a little more, given the fluctuating wp of each game). 

Assuming that .535 is the “true” mean home wp, .577 is 2.35 SD from the mean.  Not alarming, but, as I said, I find it somewhat troubling given the changes in the way baseballs are supposedly stored this year.


#5    MGL      (see all posts) 2008/06/01 (Sun) @ 12:36

Yes, and the “theories” presented by Olney are troubling in two respects:  One, the fact that there are NOT more young players in baseball this year.  Two, the fact that these “experts” are just making up crap about young players having unusually good home/road splits, in order to fit the observation. 

If HFA were less this year, there is little doubt in my mind that the “experts” could easily have used the “reverse” argument.  “Given all the young players in baseball this year, they just don’t have the same comfort and familiarity level at home that the veterans have.”

How does ESPN (and Olney) even print this crap?


#6    Chris J.      (see all posts) 2008/06/01 (Sun) @ 12:41

Once the game begins, don’t the umps have control of all the balls?


#7    archilochusColubris      (see all posts) 2008/06/01 (Sun) @ 14:05

I wonder about the relationship between these two ‘weird things’ so far this year. As that one major league scout proposed, it does make sense that a weaker run-scoring environment would have an increased home field advantage.

So 2 questions that should be pretty simple to answer.

1. How does HWP change in response to runs scored per inning?

2. Is the increase in home field advantage this year showing up as an increased advantage in runs scored per inning or is that remaining fairly constant?


#8    MGL      (see all posts) 2008/06/01 (Sun) @ 15:44

I don’t think that home wp has much, if any, relationship to rpg.  And run scoring is barely down in both leagues combined.

I have no idea where the balls come from once the game starts.  Obviously the ball boy gets them from somewhere and gives them to the umps.  Where the ball boy gets them from, I have no idea.


#9    birtelcom      (see all posts) 2008/06/01 (Sun) @ 16:23

For whatever it’s worth, note that over the full 1978 season, the major league home team record jumped quirkishly to .573, essentially the same as it is now for 2008.  In 1977 and 1979 the HFA was perfectly normal—the ‘78 jump was presumably just an odd random variation, though if the sabermetric community had been then what it is now, there presumably would have been much speculation over explanations.


#10    MGL      (see all posts) 2008/06/01 (Sun) @ 17:07

I alwsys like to think in terms of the null hypothesis (nothing ever goes on the change the HFA), and what the distribution would look like if that were true.  We would expect the HFA to be greater than .555 or less than .515 (2 sigma) once per 20 years by chance alone.  There would even be a 50% chance or so for one season to be greater than .560 (2.5 sigma), again, by chance alone.

Anyway, barring some information that we know about, picking out the first third of a season “after the fact” is cherry picking or curve fitting, or whatever it is called.  It is worth looking into, but not much more than that.


#11    john      (see all posts) 2008/06/01 (Sun) @ 17:27

hmm maybe this is why the braves home winning pct is so good and their road so poorly lol


#12    fifth of      (see all posts) 2008/06/01 (Sun) @ 17:32

OK, here are the actual w%, pythagorean, and BaseRuns/out pythagorean (1.83 exponent) for home teams in 2008, 2007, and 2003-2007:

5-year: .542, .539, .539
2007: .542, .537, .537
2008: .577, .566, .560

The highest HFA in those 5 years was 2004: .550, .541, .541.

For the five year period, if you just used Tango’s szERA (ERA estimator using only K and BB), the home team has a .520 advantage. This year it is .515. The 5 year home has .165 K/PA and .081 BB/PA against .175 K/PA and .076 BB/PA on the road. This year, it’s .170 / .086 and .176 / .082.

There has been a slight uptick in HR HFA which has a reasonably big impact on the numbers; the difference is 19 HR (one every 44 games), and if you take them out the BsR HFA shrinks to .531. Balls in play have also had a far above average split. The split for wOBA on balls in play is .299 against .293 for the five-year, but .300 against .285 for 2008.

3B/(2B+3B), home: .099 5-yr, .098 2008
3B/(2B+3B), road: .087 5-yr, .071 2008
(1B+ROE)/(AB+SF-K-XBH), home: .2587 5-yr, .2608 2008
(1B+ROE)/(AB+SF-K-XBH), road: .2534 5-yr, .2457 2008

So, I think the argument that it is merely chance holds the most weight. If I were to try to come up with a hypothesis that would superficially match the data as Olney et al might, it would be that defenses have been extra good at home this season. I don’t really think that that is worth discussing. I don’t think the 19 HR are worth discussing whatsoever.


#13    fifth of      (see all posts) 2008/06/01 (Sun) @ 17:40

The .531 in that second paragraph was some sort of typo that took over the sentence after the error had been made. It should be .558.


#14          (see all posts) 2008/06/01 (Sun) @ 20:02

If you want to check Buster’s claim about young players partying on the road or whatnot, why not just find the OPS home/road splits for each age group, and see if they are actually the cause of the splits more than other players (and if it’s that they’re that much better than expected at home, or that much worse than expected on the road).


#15          (see all posts) 2008/06/01 (Sun) @ 21:10

Sal, that seems like a good and logical idea, but I’d bet that the difference is so tiny that it barely even shows up. Could be worthwhile to give it a shot for someone who has the data readily available.


#16          (see all posts) 2008/06/02 (Mon) @ 01:56

I’d guess that it would be a negligible difference if at all as well, but it is possible, and doing it by team (for instance) may show that some clubhouses are more apt to drunken revelry than others.

The thing is, we just don’t know unless we check. And then once we do check, we can all mock Buster Olney about it.

(If Olney persists in claiming they’re partying hard on the road, yet their performance is better, we should all calculate what the HFA would be like if they drank at home too!)


#17    Ben      (see all posts) 2008/06/02 (Mon) @ 08:15

I have a hard time believing that if teams were using a different set of balls for home and away teams that the HR rate in the AL would be down.  Up or even I could understand.  Down makes no sense to me.


#18    Bjorn      (see all posts) 2008/06/02 (Mon) @ 09:24

Can each team chose their own “climate” to keep the balls in or is there a standard MLB “climate” that all teams are suposed to use.

I imagine a baseball would behave quite differently if you set the climate chamber to emulate july in the Sahara desert or Januari in Alaska.

I also imagine that some teams would be advantaged by the former and some by the later depending om exactly how the teams are “constructed”.

Another thing that could matter is that some of the physical properties of the baseballs probably doesn’t “hold” very well once out of the climate chamber. For instance I would guess that a ball of significantly different temprature than the ambient temperature wouldn’t be able to hold that temperature very long. So I guess you need to consider when during the game balls are brought out.


#19    tangotiger      (see all posts) 2008/06/02 (Mon) @ 11:07

http://tangotiger.net/wins.html

The very first number you see is: 0.4608

Subtract that from 1, and you get .5392

That is the historical HFA, from 1957-2006, excluding 1999.

If the report is .577, that is 38 points above the norm. 

I seem to remember that there was a recent season, perhaps the AL about 5 years ago, where there was almost no HFA, and that was for the entire season.

Anyway, 2.1 SD on one side occurs 2.5% of the time, right?  There’s been 50 seasons in that sample, so we’d expect one such season to really overshoot and one to undershoot. 

It just seems to me that Buster understands numbers just enough to be dangerous.


#20    Ken      (see all posts) 2008/06/02 (Mon) @ 15:34

Could it have anything to do with better late game managing decisions from the home team with the last turn to bat?  Maybe not bunting excessively like in year’s past?


#21    MGL      (see all posts) 2008/06/02 (Mon) @ 16:00

Ken, almost all of the HFA comes from better component performance (s,d,t,hr,bb,hp,etc,) at home than on the road.  There is virtually no advantage to batting last in the first place, so any change in “late game” or home team strategy is not going to have any noticeable effect on overall HFA.

I can’t really think of anything that is going to have a significant “all of a sudden” (as opposed to gradual - over the years) effect on HFA, other than teams monkeying around with the baseballs.  I am not suggesting that that is a strong possibility.  I don’t think it is.  I don’t think it is a good idea to let teams “monkey” with the baseballs unless there are stringent controls, but still, I think it is way more likely to be a fluke than anything conspiratorial by certain teams.


Page 1 of 1 pages


Name (required)
E-Mail (optional)
Website (optional)

<< Back to main


Latest...

COMMENTS

Dec 03 13:52
Sabermetric Moves of the 2009 Pre-Season

Dec 03 16:14
What would happen if the shootout period was 10 minutes, not 5?

Dec 03 16:06
How to calculate the area of a baseball field

Dec 03 16:04
Avery being Avery

Dec 03 14:50
The Return of the Baseball Abstract?  No, the next best thing…

Dec 03 14:48
Estimating BABIP

Dec 03 13:58
NYC’s 3 1/2 year mandatory jail time sentence for carrying a loaded weapon

Dec 03 10:42
What was Pedro worth?

Dec 03 10:20
Complete Run Expectancy, Retrosheet Years

Dec 02 23:36
The Holy Writers strike again!