Thursday, September 16, 2010
Wars of WARs
I’m going to link to Neyer’s piece mostly because I love his headline. Rob also says:
But it’s probably not going to happen soon, and I guarantee none of those guys will listen to me. They’re subject to the same prejudices as the rest of us, prone to the same egos and interests. But they’re also more amenable to reason than most of us.
I just want to say that this has nothing, nothing at all, to do with ego. But Rob is right that this is all about reason.
The WAR framework has developed over time. It started at least as far back as Pete Palmer’s Hidden Game, where he started off with WAA (wins above average). The basics were laid out: offense above average, defense above average, some sort of fielding position adjustment, some sort of relief adjustment. After that came Bill James, who had offensive and defensive wins and losses, which he then compared against “chance that these number of W/L were put up by a .400 player”. And then Bill James again in 1987 comparing Clemens/Mattingly and Rice/Guidry, where the first fresh talk of replacement level came in (specifically noting that the replacement level for pitcher, in that article, was one run above league average). After that, there was (I’ve been told) BBBA, and also Woolner’s VORP followed by Clay’s WARP, and Bill James with Win Shares. Those guys brought it up a notch.
When I became involved was at the old Fanhome boards (RIP) where we had many many many discussions of replacement level. Then, about three years ago, I had what I wanted: the positional adjustment and the relief adjustment. The positional adjustment was, I think, the last big piece in order to stabilize the WAR framework. It was one of those things that was bothering me for a long time, and with some help from the readers of this blog, I was able to crystallize that. The relief adjustment was in due in large part to Guy’s contribution, though minor in the grand scheme of things, was major as it related to the very small subset of pitchers out there.
And that’s where we are: the WAR framework. The WAR framework is about offense compared to average, fielding compared to positional average, a positional adjustment without relying on the offense for that position, a playing time value (replacement level); on the pitching side, different baselines for starters and relievers (a concept first introduced by Woolner); a league adjustment. Fangraphs and Baseball-Reference agree on this. And, I suspect that Baseball Prospectus is trending toward this framework. If we don’t have consensus, we are going to get it. I’ve been dealing with the WAR framework and chatting with the readers of this blog for so long that I think that any of the major issues have been hashed out.
So, what’s the issue then? Well, now that we have a framework, everyone wants their own implementation of that framework. Not all houses are built the same, are they? But they all have the same foundation, the same basic structure. There are two major differences in the implementation (the houses) built by Fangraphs and Baseball-Reference over the WAR framework (blueprint) that has been developed and championed by me and other readers of this blog:
1. UZR v Total Zone: the fielding systems are different; they are different because one guy has access to more data than the other.
2. DIPS v Runs Allowed: one takes a belief system that they are only going to rely on non-BIP for pitching, while the other takes the belief system that all runs are attributed to the pitcher, regardless of sequencing of events, with some generic team-based fielding adjustment
There are other minor issues, like how park factors are used, what kind of relief adjustment is made, what kind of AL v NL adjustment is made, how low or high to set the replacement level, among a few others.
But, they’ve both agreed on the WAR blueprint. Now, the discussions is on the peripherals, about whether to use a two by eight or a two by ten, about whether to use solar panels, about whether to hardwire the smoke detectors or use wireless.
In no way will further discussions lead to invalidating one implementation or the other. In no way should anyone choose a position based on ego. The implementation I favor is whichever one I can backup with evidence, one that I can verify and stand behind. I stand on the side of truth, or at least one of reason. I have no agenda other than to expose holes and make sure there are no cracks in the foundation. (e.g., If someone is going to use Runs Created instead of BaseRuns or Linear Weights, that’s a crack, and I will expose it.)
Until then, presume that all sides have something to add, and just take the midpoint of them all.


"1. UZR v Total Zone: the fielding systems are different; they are different because one guy has access to more data than the other.”
We could have access to the exact same data and they would be just as different. Compare UZR to Pinto and Dewan.
We aren’t going to get a unified WAR unless we can prove the superiority of one defensive metric over the others. The only thing I have to go on is using Dewan’s led to better team defensive projections for the 2009 season. Hardly conclusive. Even if we are able to crown a king of the defensive systems, some of the sites showing WAR have access to that stat and some do not. I don’t think not having the stat will make any of us decide to shut down.