THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Monday, October 17, 2011

Visualizing wOBA

By Tangotiger, 11:50 PM

This is how I like to visualize the various components of wOBA.  Here’s what to look for:

1. The green area under the 1.000 line is equal to the green area above the 1.000 line.  This implies that the average value of the events equals 1.  This forces wOBA = OBP.

2. The gap in the red area gives you the value of each event above the 0.333 baseline (or whatever the league average OBP is).  This gap is exactly proportional to the Linear Weights run values.

Because of these two points, you get the wOBA values, of roughly 0.7 for a walk, 0.9 for a single, 1.3 for a double/triple, 2.0 for a HR.  That’s all there is to it.

image

There are two great implications of using wOBA:
A. Even though wOBA IS Linear Weights, I have never ever ever had to explain what a “negative run value” is.  It just doesn’t exist in wOBA.  It’s hidden by having the baseline lowered as I have.

B. Because it is coupled with OBP (the same mean, and the same denominator of plate appearances), and OBP is perfectly suited for the binomial distribution, wOBA takes on similar characteristics.  Not exactly, but close enough for our purposes.  Whereas the binomial would say p*(1-p), in this case, we’d use p*(1.1-p).

Hence, my love for wOBA.


#1          (see all posts) 2011/10/18 (Tue) @ 09:12

Beautiful, very nice explanation.


#2    Tangotiger      (see all posts) 2011/10/18 (Tue) @ 10:21

Some people have suggested that I would have been better off scaling it to batting average.

There are good reasons I did not do so.

For starters, scaling to OBP(*) guarantees the “1.000” average is met perfectly.  If I used batting average, then I won’t get to 1.000.  Indeed, the line it would match is lgBA/lgOBP, which basically means it would be right around 0.800.

(*) By OBP I mean, my version of OBP, that includes in the numerator and denominator whatever I use in wOBA, but unweighted.

Secondly, the denominator is all wrong.  The denominator in BA is at bats, while wOBA is plate appearances.

Granted, people may be more used to the batting average scale than the OBP scale.  As for what was needed in The Book, that was irrelevant.  OBP and something akin to that was needed to get to statistical significance discussion.  I had to be able to easily go between OBP (a true binomial with the proper denominator) and something else.  Hence wOBA.

I didn’t envision wOBA then being used on a grander scale beyond The Book.  And in any case, it doesn’t matter, because I can’t stand batting average as some central measure of a player’s abilities.  It’s a peripheral metric that should stay there.


#3          (see all posts) 2011/10/18 (Tue) @ 12:51

That really is a beatiful explanation.  Where does the p * (1.1 - p) come from?  Multinomial distribution math suggests to me that the variance for wOBA should be smaller than for an equivalent binomial.


#4    Tangotiger      (see all posts) 2011/10/26 (Wed) @ 10:35

The calculation for the 1.1 is in the appendix of The Book.  It’s basically the weight of each component squared, multiplied by the frequency of each component.


#5    studes      (see all posts) 2011/10/26 (Wed) @ 11:13

Tango, this is a very neat visualization.  thanks.

As we’ve discussed before, I see people often misinterpreting wOBA weights to say things like “a home run is worth about twice as much as a single” and things like that.

I wish there were some way to squash that.  Oh well.


#6    Tangotiger      (see all posts) 2011/10/26 (Wed) @ 11:19

Right.  The correct thing to say is:

wOBA says that you can trade 1 HR (and 1.2 outs) for 2.2 singles. 

The denominators have to match.  In either case, you end up with around +1.05 runs.


#7          (see all posts) 2011/10/26 (Wed) @ 14:29

Thanks, Tango.  When I actually calculated it, instead of speculating, I got the 1.1.


#8    Colin Wyers      (see all posts) 2011/10/26 (Wed) @ 15:35

When I put together the current LWTS-based TAv, I did similar testing to what’s in the appendix of The Book, and you can do p * (1 - p) for TAv if you want to figure random variance for it. (It’s not EXACTLY that, but I’m betting 1 is as close for TAv as 1.1 is for wOBA - if you really need the extra precision for either there are better methods, as Tango mentions.)


#9    Tangotiger      (see all posts) 2011/10/26 (Wed) @ 16:54

Colin: when I did some quick testing for TAv, I had to do p * (.87-p).  In order to do p * (1-p), I had to set the league mean of p to .300.

That was just quick testing though…


Page 1 of 1 pages


Name (required)
E-Mail (optional; WILL be published)
Website (optional)

<< Back to main


Latest...

COMMENTS

May 25 15:37
What sabermetrics is NOT

May 25 15:28
Largest demonstration in Canadian history?

May 25 15:12
Do pitcher’s reach back for velocity when needed?

May 25 15:02
Pete Palmer’s new book: Basic Ball

May 25 13:04
“Why Kickstarter works”

May 25 12:51
Chad Curtis

May 25 11:32
Howard Stern

May 25 11:26
Lack of hustle during a game

May 25 10:58
Rooting for laundry

May 25 02:38
NFLPA lawsuit against collusion