THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Wednesday, September 22, 2010

Park component numbers by adjusting for identity of hitters and pitchers

By Tangotiger, 10:10 AM

Just last week, I was explaining how to do park adjustments, you should use the component lines of the players, and not presume that each player plays an equal number of games home and away.

Analysts privately calculate their own more specific park factors.  As I said, among other things, I look at who actually played in the parks, not presuming that CC pitched exactly one time in the 13 other AL parks and 13 times at Yankee Stadium.

Well, it looks like Colin is going to apply those kinds of park adjustments:

So for every player, in every park they played in, I put together a set of component batting lines--I call them batting lines, but I did them for pitchers as well (call that batting line against, if you prefer--pitchers as hitters were excluded). For a player’s home park, I adjusted the batting line based upon observed home field advantage over a five year period.

These were combined into what I called “road weight” batting lines for each park in which a player played. If the park under consideration was not a player’s home park, the road weight batting line was simply their batting line for that season. If it was their home park, their games played there were given 1/15th the weight and averaged with their road stats. All of these component batting lines were then regressed to the mean – “noisier” components were regressed more than more stable components.

Then, for each batter-pitcher matchup in that park, I calculated an expected set of probabilities using the odds ratio method. (When the home team batted, the component home field advantage was added to the component batting lines of both hitter and pitcher.)

I don’t agree with the need to do the 1/15th.  I understand why he would do that (it makes the thing stabilize faster).  But at the cost of losing nearly half your sample.

He also says:

What these park factors are good at is telling us, with a certain amount of uncertainty, how a player might have played in a different context. Which is what we’re interested in with park factors, isn’t it?

Well, not exactly.

To MGL, that is yes, exactly.  To people trying to assign past value, then it’s no, exactly. 
- MGL wants to know what Juan Pierre will hit in a neutral park, or his expected 162 park-games. 
- Most people want to know how Pierre’s past offense impacts his past team’s chance of winning.  Put someone who can’t take much advantage of Coors, while Dante Bichette takes huge advantage of Coors, and Bichette’s value starts to soar, relative to what it would have been otherwise.  LHH gets killed at 3Com, while Barry Bonds is unaffected: this gave Bonds much greater value.

To each his own.


#1    JEH      (see all posts) 2010/09/22 (Wed) @ 12:08

TT-

I don’t understand your last multi-sentence paragraph.  Any chance of rewording it?


#2    Tangotiger      (see all posts) 2010/09/22 (Wed) @ 12:30

Suppose that you have a park that was death to LHH but a boon to RHH.  And you are a LHH.  How much value would you have in that park?  Well, almost none.

It doesn’t matter if you are as good a hitter as the RHH in the other 29 parks.  Because you played half your games in a home park that killed you, you did nothing to help your team win half the time.


#3    JEH      (see all posts) 2010/09/22 (Wed) @ 14:28

@2/TT

Thanks. I understand clearly what you wrote in #2 and I agree and would be surprised if anyone didn’t as I don’t think that it is too controversial.


#4    Tangotiger      (see all posts) 2010/09/22 (Wed) @ 14:55

But MGL would disagree with you.  I use MGL because he has long professed to not care about awards and past value, etc, and is only interested in the “true talent” level of players.

To that end, to MGL, and those who think like him, they are interested in the impact a player’s skillset has in a neutral park (i.e., 1/30th of each park), or, if need be, on a particular team’s parks (1/2 home, and 1/2 spread out based on the schedule).

I lean heavy toward MGL’s viewpoint.  I find the “past tense” viewpoint fun for its own sake, because we are presented with challenges (like how to distribute noise).

If I had to only choose one for whatever reason, the MGL viewpoint would win, because that is about the present and the future.


#5    JEH      (see all posts) 2010/09/22 (Wed) @ 15:56

TT- I thought I was agreeing with you. smile

I know there were one or two recent threads that involved Park Factors in which MGL was critical of my posts, but I still suspect that a communication gap is the most likely source of disagreement.


Page 1 of 1 pages


Name (required)
E-Mail (optional; WILL be published)
Website (optional)

<< Back to main


Latest...

COMMENTS

May 25 13:18
Do pitcher’s reach back for velocity when needed?

May 25 13:04
“Why Kickstarter works”

May 25 12:51
Chad Curtis

May 25 12:40
Largest demonstration in Canadian history?

May 25 11:32
Howard Stern

May 25 11:26
Lack of hustle during a game

May 25 11:22
What sabermetrics is NOT

May 25 10:58
Rooting for laundry

May 25 02:38
NFLPA lawsuit against collusion

May 25 01:43
Neal Huntington’s best moves