THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

Filter posts by...

 

Tuesday, November 22, 2011

Selective end points AND data mining AND publishing bias rearing their ugly heads again…

By , 11:29 PM

Let’s see how many posts it takes for the geniuses on BBTF to figure this one out.  So far 9 and counting…

Anyway, here is the link:

http://www.thegoodphight.com/2011/11/21/2485197/phillies-citizens-bank-park-not-a-hitters-haven

to an article which tells us that CBP has played almost neutral for the last 4 years, therefore it is now a neutral park, as opposed to the first 4 years when it was an extreme hitters’ park (around 1.07).

Let’s forget for a second how a park can all of a sudden change its true PF’s (it can’t other than by changing other PF’s in the league and even then it won’t change much - of course the “effective” PF can change - a little - with weather and with different players).

Instead, let’s do this thought exercise:

You have 30 parks with a true PF of x, y, x, etc.  I am telling you that they never change (which is actually reasonably true, as I indicated above, barring a remodel of course).  We track the observed (sample) PF’s for 8 years.  What are the chances that in the last say, 3, 4 or 5 years (you get to choose the end points) some park will show an observed PF that is quite different than its true PF AND/OR quite different than the observed PF in its first 3, 4, or 5 years?

IOW, what can we conclude about the true PF of CBP?  Not much other than its true PF is likely the un-weighted average of the observed PF over the last 8 years, regressed toward some mean (of a similar park, dimension, weather, altitude-wise, etc.).  If you want to weight more recent years slightly more than past years, I don’t have much of a problem with that, although I don’t think that any weighting is necessarily appropriate…

(17) Comments • 2011/11/28 • SabermetricsSamplingStatistical_Theory
Page 1 of 1 pages

Latest...

COMMENTS

May 26 07:27
“Why Kickstarter works”

May 26 03:03
Pete Palmer’s new book: Basic Ball

May 26 01:11
Largest demonstration in Canadian history?

May 25 19:41
What sabermetrics is NOT

May 25 16:59
Howard Stern

May 25 15:12
Do pitcher’s reach back for velocity when needed?

May 25 12:51
Chad Curtis

May 25 11:26
Lack of hustle during a game

May 25 10:58
Rooting for laundry

May 25 02:38
NFLPA lawsuit against collusion

THREADS

November 22, 2011
Selective end points AND data mining AND publishing bias rearing their ugly heads again…