Sunday, April 22, 2007
Peak Offensive Age
Here is a blog entry about peak offensive age. In it, the author references a study by Jim Albert which looks at the issue using some advanced and complex (to me at least) statistical techniques. Here is a post I wrote on BTF which also references the above-mentioned blog entry. My post sums up my thoughts on the matter. I was wondering what the many bright minds on this blog think of this issue. As I say below, I think the answer (what is peak offensive age) really illudes us. And it makes a difference when doing projections. Age ajdustments are important.
The study is interesting. I’m not sure I had read it. Maybe I had. Whether James said so or not, I think he was referring to offense only. Even if he meant offense and defense, at the time he had no good way to measure defense and I don’t think he knew that defense peaks so early for most positions (probably all save first base).
There are other (easier) ways to estimate average peak age and to be honest I don’t fully understand Albert’s although he seems to know what he is doing. The traditional sabermetric way is to compute a weighed average of all players’ difference in offensive rate between each pair of adjacent years and to see when that number starts to turn negative. IOW, we look at all player’s difference between age 22 and age 21. That should be positive on the average. Same for 23 and 22. Etc. If we do that, we generally find that from 27 to 28, the difference (28 minus 27) is slightly minues and is then minues from then on, suggesting that 27 is the peak age.
The problem with that methodology is that if the league gets better every year, which we think it does, then that introduces a negative value into every one of those differences which has to be subtracted. For example, if the league gets better by .001 per PA per year and the difference from age 27 to age 28 is -.0005, after adjusting for quality of league, we get +.0005, suggesting that players are still getting better.
The other problem with the “delta” method above is that there is a selective sampling problem that exists in every age pair which also tends to make us understate peak age. Any time a player has a very unlucky (bad) year, he tends to get fewer PA’s or no PA’s the next year. So what does this mean? It means that the first year in ANY pair of years tends to be ppulated by players who were slightly lucky, again, tending to make the difference between year X+1 and year X more negative than it should be if we simply let every player play X number of PA from age 20 to age 40 (or whatever).
I don’t know if Albert’s method had a similar bias (the selective sampling) and I don’t know how an increase in league strength every year affects Albert’s model either.
One thing about Albert’s model that I do know is that since he limits his data to players who have accumulated at least 5000 PA, we may have a sample of players who have gotten a little lucky at some point in their careers (I am not sure if this would be early or late) and we definitely have a sample of players who probably had a somewhat later than average peak (otherwise they would not have played that long). Whether this somewhat later than average peak is inherent to their true aging curve or whether it was by luck also, I don’t know.
As a few posters above have alluded to, of course, the whole thing depends on what sample of players we want to determine true peak age for. Is it players who play a long time. These are necessarily good players. Do good players have naturally late or early peaks? Are their true peaks different from other players? If different kinds of players have different natural peaks, and we want to answer the question, “What is the peak age for the average player?” do we want to weight our findings by playing time? By the number of players in each group? Do we want to know the average peak age of players who actually end upo having some kind of career? Or do we want to include the peak age of players who fizzle out and don’t end up having a career (maybe they had natural early peak ages)? I don’t know the answers to these questions but they are important ones, and difficult to answer.
I honestly think the jury is still out as to the natural peak age of MLB players, however we want to define that group of players. I think it is probably earlier than Albert finds and later than I and others (e.g. Tango) have found in the past.


In my business, we call this sort of problem an attrition bias. Bad players don’t ever get a chance to show what they can do at 27, because they get sent to AAA/Japan/the real world. I wonder if the trajectory of the curve is related to overall ability. (If we were modeling it like a projectile trajectory, the quadratic term… x^2… has a smaller number in front of it.)
I’m thinking out loud.