THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Monday, February 01, 2010

Bill James turns Win Shares into Wins above replacement (WAR)

By Tangotiger, 12:15 PM

He doesn’t come right out and say it, but his explicit rankings shows this implicit acceptance.  Follow me for the proof.

The first thing to note is that he ranks 15 semi-random firstbasemen in… some order.  What order is that?  He didn’t say.  This is the Win Shares and Loss Shares of the 15 guys, in his order:
BJ WS LS
1 319 152
2 289 120
3 269 99
4 257 90
5 265 118
6 291 199
7 181 106
8 218 202
9 143 67
10 150 93
11 146 118
12 162 166
13 144 147
14 109 116
15 79 61

As you can see, it’s not in Win Shares order.  How about if I rank them in Win Shares minus Loss Shares order?
BJ WS-LS
1 167
2 169
3 170
4 167
5 147
6 92
7 75
8 16
9 76
10 57
11 28
12 -4
13 -3
14 -7
15 18

Nope, that’s not it either, though it’s getting closer.  Then I thought “Oh no he di’int.” Yes, he did.  If you do Wins above .333, here’s the list you get:

BJ wsWAR
1 54
2 51
3 49
4 47
5 46
6 43
7 28
8 26
9 24
10 23
11 19
12 18
13 16
14 11
15 11

Fref McGriff, his number one guy in this sample, has 319 win shares and 152 loss shares.  You can just do 2*WS - LS, or WS - LS/2, or WS*.667 - LS*.333, or whatever, as long as you weight the wins twice as much as the losses, and you get the same rankings.

To convert Win Shares to Wins Above Replacement, you do:
WS/3 - .333 * (WS+LS)/3

That “/3” is to convert win shares into wins.  And the “.333” is the win percentage replacement level.  So, the final simple equation is (2*WS-LS)/9.  For McGriff, that’s 319*2 - 152 or 486.  And divide by 9 to get to 54.  This is how you convert Win Shares and Loss Shares into Wins Above Replacement (WAR).  And Bill James has ranked his firstbasemen by (his version) of WAR.

Here is the comparison of Bill James’s WAR and Rally’s WAR, ranked in Bill James’ order:

rWAR bjWAR
51 54 McGriff Fred
61 51 Hernandez Keith
53 49 Cash Norm
52 47 Giambi Jason
44 46 Delgado Carlos
36 43 Garvey Steve
26 28 Vaughn Mo
23 26 May Lee
27 24 Teixeira Mark
22 23 Parker Wes
14 19 Sexson Richie
14 18 Power Vic
9 16 Pepitone Joe
6 11 Stuart Dick
6 11 Aikens Willie

Every player is within 5 wins, except for:
- Keith Hernandez, 10 more wins for Rally than Bill James
- Steve Garvey, 7 more wins for Bill James than Rally
- Joe Pepitone, 7 more wins for Bill than Rally

That’s it. 

It seems to me therefore that Bill has accepted that some sort of wins above baseline is needed.  And he is using a level that pretty much match what we’re using for wins above replacement, since the results are consistent.  I wouldn’t worry about the implication of the “.333 win percentage”, since that number is specific for the way his system is constructed.

This really brings Bill James back to where he always was 25 years ago, when he first wrote about replacment level in the context of Rice v Guidry and Clemens v Mattingly.  And when he ranked players as “chance of being better than a .400 player”.  Bill took a detour along the way in terms of ranking players by Win Shares (without considering Loss Shares).  Him bring Loss Shares into the mix really resets his position back along the Wins above replacement track that the rest of us are on.

Welcome back Bill.

Now, let me ask: if Bill is going to rank players by 2*WS-LS anyway, then why not show that single number as well?  Why show the WS and LS numbers separately (which is fine), but not show the actual single number he’s ranking the players by?  And, to give that number meaning, why not just divide it by 9, so you get the wins above replacement scale that the rest of us use? 


#1    BenJ      (see all posts) 2010/02/02 (Tue) @ 14:11

Tom, it looks like he’s using .350.  From Part 1 of Bill’s article, the Aikens comments:

With a career won-lost record of 79-61, including fielding, I estimate that he was 29.90 Win Shares better, in his career, than a .350 player…

There’s a similar quote in front of many of the win shares tables.  Using .350 gives nearly the same numbers Bill cites (like the 29.90 above), within rounding error.  Using the 79-61 WS-LS that Bill gets, I get 30.00 when using .350 but 32.38 when using .333 (too high).  He’s got to be using .350 and unrounded versions of 79-61.


#2    BenJ      (see all posts) 2010/02/02 (Tue) @ 14:17

Using .350 instead gives us:

bjWAR rWAR Diff Player
51 51 0 Fred McGriff
49 61 -12 Keith Hernandez
47 53 -6 Norm Cash
45 52 -7 Jason Giambi
44 44 0 Carlos Delgado
40 36 4 Steve Garvey
27 26 1 Mo Vaughn
24 23 1 Lee May
23 27 -4 Mark Teixeira
22 22 0 Wes Parker
18 14 4 Richie Sexson
16 14 2 Vic Power
14 9 5 Joe Pepitone
10 6 4 Dick Stuart
10 6 4 Willie Aikens


#3    Tangotiger      (see all posts) 2010/02/02 (Tue) @ 14:39

Ok, I can buy he’s using .350, not .333. 

The main point is that Bill is ranking based on some form of wins above replacement.

And, wins above replacement is nothing more than Linear Weights at its core.


#4          (see all posts) 2010/02/02 (Tue) @ 15:13

So many functions have linear approximations!

“Linear Weights at its core” means to me a weighted sum of a player’s own counting statistics “at its core”. The win shares core evidently includes team proportions such as the share of ground balls fielded by shortstops.

(I don’t know loss shares or associated revisions to win shares. Does it appear that James has completed version one?)


#5    Tangotiger      (see all posts) 2010/02/02 (Tue) @ 15:32

Actually, when I say “Linear Weights at its core”, I mean the run values with respect to average.  All the steps, except for the last step in WAR is all “relative to average”.  Everything is done in a relative to average sense.  That’s what I mean about “at its core”.

Until the last step, what you have is TWO dimensions:
- wins relative to average
- playing time (PA or IP)

WAR brings those two into a single dimension

Win Shares goes about it differently… only to end up in the same spot.  And, it is FAR easier to show a player’s performance line relative to average (so you know exactly, for example, that Jeter is great in reaching base on error) than to hide so much under the covers with Win Shares.  Especially when both systems emerge from the water, they both give you WAR.

My WAR framework is built on a docked boat, while Win Shares is in a submarine in the ocean.  And the readers are at the dock.


Page 1 of 1 pages


Name (required)
E-Mail (optional; WILL be published)
Website (optional)

<< Back to main


Latest...

COMMENTS

May 25 03:39
Lack of hustle during a game

May 25 02:54
Largest demonstration in Canadian history?

May 25 02:38
NFLPA lawsuit against collusion

May 25 01:43
Neal Huntington’s best moves

May 25 00:36
Help needed with sticky issue…

May 24 23:50
Rooting for laundry

May 24 17:04
Firefox, IE, or Chrome?

May 24 12:07
How to beat the shift

May 24 11:11
Incredible story

May 24 09:41
Racial bias in card collecting: not the collectors, but the players on the cards