THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Wednesday, April 09, 2008

HR distribution

By Tangotiger, 09:13 AM

Victor looks at the distribution of HR.  I wrote this at BallHype:

Good job, Victor. The shape you show is pretty much what I expected.  After all, the mean HR rate is around .03 HR per PA.  At 500 PA, 1 SD = .0076.  So, if a guy gets zero HR, then he’s at -4 SD.  You can’t get any lower (at 500 PA).  Because your mean is starting so low, the downside is very limited, compared to the upside.

I would guess that if you were to compare the HR distribution, not to the normal distribution, but to a random distribution whose mean is .03, you’ll get something very very close to what you are showing.

I would imagine that OBP, BABIP, and XBH/H would show something much closer to a normal distribution.  Probably even K/PA.  (As a bonus, K/PA is not really frought with the selective sampling issues that Victor noted.)


#1          (see all posts) 2008/04/09 (Wed) @ 13:38

I’m having trouble putting the numbers to words.  What does this actually mean?

If you have extra players/events on the right hand tail… more than you’d expect from a random distribution… does that mean that the individual components of hitting HRs (strength, bat speed, hand/eye coordination, recognizing bad pitches, etc.) aren’t distributed randomly?  In other words, is someone with good bat speed more likely to be also someone who is good at recognizing bad pitches?

Reminds me of an argument I had with a friend one time.  She thought that people who were good looking were more likely to be smart, and vice-versa.  I disagreed.  In other words, if 1 out of 10 people were smart and 1 out of 10 people were good looking, I’d expect 1 out of 100 people to be both smart and good looking.  Likewise, I’d expect bat speed and batting eye to be randomly distributed… but perhaps that’s not the case.  Perhaps both are the product of hard work, and thus someone who works hard is more likely to have the many components of what makes a good HR hitter.  Is this making any sense?


#2    Tangotiger      (see all posts) 2008/04/09 (Wed) @ 14:03

This isn’t what he’s talking about.

He’s starting with a given true talent mean.  That’s his starting point.

His expectation is that this is a center of a random distribution (as explained by the binomial).  So, given that starting point, he expects sample performance to follow that normal distribution shape.

It doesn’t.

However, we shouldn’t expect it to, necessarily, since the starting mean is so low.


#3          (see all posts) 2008/04/09 (Wed) @ 16:50

Gotcha… actually, I had a question that came up when I was reading the lecture notes that someone linked to the other day from that Berkeley stats class, and I think Victor does it as well.

It appears that the way to account for imperfect correlations is to regress the portion of the “r” value below 1.0.  So with .8, Victor regresses 20%.  However, I was always taught that the “r^2” describes the “percentage of variance due to chance”.  If that were the case, wouldn’t that mean that with an r=.8, that .64 of the variance was due to skill, and .36 was due to chance, and therefore you’d want to regress 36% ?


#4          (see all posts) 2008/04/10 (Thu) @ 19:35

Log-normal, maybe?  Many truncated distributions are log-normal (the distribution of income, for example), so maybe it’s work with home run frequency...although home run frequency is truncated at both ends of its distribution (0 to 1).  But 1 is a lot further away from the mean than 0 is, so that probably doesn’t matter.


Page 1 of 1 pages


Name (required)
E-Mail (optional; WILL be published)
Website (optional)

<< Back to main


Latest...

COMMENTS

Feb 11 19:42
Who is Jeremy Lin?

Feb 11 19:33
Clutch analogy

Feb 11 19:12
Hero of the month: Brittney Baxter

Feb 11 17:59
MGL: Today on Clubhouse Confidential

Feb 11 16:48
Reader Mail of the Day: Why do we need X years of fielding data?  And what about outliers?

Feb 11 10:29
Dwight Evans

Feb 11 02:12
Performance through the ages

Feb 10 23:01
For Your Soul

Feb 10 18:32
Moneyball at Villanova

Feb 10 17:00
Psst… wanna intern in Canada?