Wednesday, April 09, 2008
HR distribution
Victor looks at the distribution of HR. I wrote this at BallHype:
Good job, Victor. The shape you show is pretty much what I expected. After all, the mean HR rate is around .03 HR per PA. At 500 PA, 1 SD = .0076. So, if a guy gets zero HR, then he’s at -4 SD. You can’t get any lower (at 500 PA). Because your mean is starting so low, the downside is very limited, compared to the upside.
I would guess that if you were to compare the HR distribution, not to the normal distribution, but to a random distribution whose mean is .03, you’ll get something very very close to what you are showing.
I would imagine that OBP, BABIP, and XBH/H would show something much closer to a normal distribution. Probably even K/PA. (As a bonus, K/PA is not really frought with the selective sampling issues that Victor noted.)


I’m having trouble putting the numbers to words. What does this actually mean?
If you have extra players/events on the right hand tail… more than you’d expect from a random distribution… does that mean that the individual components of hitting HRs (strength, bat speed, hand/eye coordination, recognizing bad pitches, etc.) aren’t distributed randomly? In other words, is someone with good bat speed more likely to be also someone who is good at recognizing bad pitches?
Reminds me of an argument I had with a friend one time. She thought that people who were good looking were more likely to be smart, and vice-versa. I disagreed. In other words, if 1 out of 10 people were smart and 1 out of 10 people were good looking, I’d expect 1 out of 100 people to be both smart and good looking. Likewise, I’d expect bat speed and batting eye to be randomly distributed… but perhaps that’s not the case. Perhaps both are the product of hard work, and thus someone who works hard is more likely to have the many components of what makes a good HR hitter. Is this making any sense?