Thursday, July 27, 2006
Colorado Park Factor
Here is Dan Fox looking at park factors, for the minor league Colorado team.
The park configuration also plays a role. I once did a(n unpublished) study on the park configuration, and…
found a relationship between distance and runs. I can’t remember what it was.. something like every 10 feet added, increased run production by 1%, or .01 runs per game. I can’t remember which, so don’t quote me.
The key was to figure out the amount of surface area. Using the 5 distance markers (LF, alley, CF, alley, RF), I squared the values, averaged them, and took the square root. That gave me a smoothed out radius for the field. Pretty basic, right? Since the area of the field is pi * r-squared / 4 (if it was a slice of a perfect circle), I was on my way.
Of course, not all fields are even close to being that nicely shaped. Since we do have access to graphical representation of all the parks, we don’t even need to figure out a mathematical representation of them: just count the pixels on the screen. Until then…
I did some similar work with this here:
http://stats.mostvaluablenetwork.com/general/expected-park-factors/
I have since been working on something that does a much better job of predicting park factors. The problem is that there are variables we simply don’t understand that have a pretty large effect on how a park plays. Also, some data is tough to find. Your equation correlates pretty well (r = .69) with actual fair territory, though it consistently underpredicts (I had to add 4,868 square feet so that they would match up) the actual amount. I may use it to fill in my missing data points (nine of thirty). Still, I have my doubts that we can ever predict how a park will actually impact play until major league teams actually play games in it.