THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Friday, March 20, 2009

How to split

By Tangotiger, 09:59 AM

Apparently, this isn’t clear as to how not to apportion credit:

Ok, now let me tell you why this splitting-issue is not the right way to proceed.  And I’m going to use hockey as an example.  In hockey, everyone on the ice for the scoring team gets a “plus 1”, and the players on the ice for the opposing team gets a “minus 1”.  This means that every goal has five pluses and five minuses.  If we follow the logic in this paper, this would entail taking this one goal, and somehow splitting it up among the players on the ice.  Perhaps giving the five guys on the scoring team a total of +.5 goals, and the give guys on the opposing team a total of -.5.  And then, among the scoring team, deciding who gets the share of the +.5, like maybe +.25 for the goal scorer, +.10 for the playmaker, and +.05 for the other guys on the ice.

Assume you have a player, which I’ll call Obby Borr.  He’s a +120 for the Bruins, and when he’s not on the ice, his teammates are +0.  Also assume that Borr plays with everyone on the team.  If you proceed with a “splitting” arrangement, Borr will end up being credited for something like +60 goals, if he’s lucky.  Likely, under the splitting-system, he’ll be even lower.  However, in my system, he gets +120.

You see, Borr plus his teammates is +120.  All of his teammates are zero.  Therefore, Borr plus zero is +120, making Borr equal to +120.

You have to treat each player as if he’s his own universe, and you adjust for the extra parameters.  The same logic applies to strength of schedule, or how to credit the DP between the 2B and SS, and several other concepts. 

Splitting doesn’t work.

Let’s think about it in baseball terms then.  Let’s say that you have Pedro v Pujols, and Pujols gets on base 4 times out of 10.  So, that’s +0.6 above average.  Would you split that, and give +0.3 to Pujols and +0.3 to Pedro.  (Plus is good for the hitter and bad for the pitcher)?  Let’s say we have Pedro v Chipper and Chipper gets on base 0 times out of 20.  So, that’s -6.8 relative to average.  Would you split that as -3.4 for Pedro and -3.4 for Chipper?

Suppose you say that “yes!  that IS what I want to do”.  Well, then you’ll end up with let’s say Pedro allowing 250 baserunners to 1000 batters faced.  That total is -90 relative to average.  In the 50/50 split, that means that if you add up each of the individual matchups (Pujols and Chipper and everyone else), then you end up with Pedro only being -45.  Does that make sense?  Well, no.  His competition was already an average batter.  He faced, as a group, hitters who get an OBP of .340.  There’s no reason to try to do any kind of splitting the difference here.

Strat-O-Matic, for example, realizes this.  (In Strat, they give 50% of the outcome to the batter’s card, and 50% to the pitcher’s card.) So, they give Pedro a rate of 80 baserunners per 500 batters faced, because they know in the other 500 batters faced that he has “no influence” on, he’s 170 / 500 (i.e., .340 OBP).  The total is 250 baserunners per 1000 batters faced.  If you focus on his 80/500, you realize that this is -90 relative to average.

This is why you can’t just “split the difference” IF AND ONLY IF the universe of your competition is indeed average to begin with.  This is the idea behind WOWY (With Or Without You), and why it works well with pitchers/catchers at a career level, or pitchers/SS at a career level or in hockey or basketball on a seasonal level: those players face a diverse enough context that they can be said to be the sole driver to their performance (with some minor adjustments).

It fails in the case of say Mark Howe and Brad McCrimmon when they were +85 players that year, because they were defense partners on practically every shift.  It fails for Dionne/Taylor/Simmer because they were out there as one unit for the Kings.  It fails for Trammell/Whitaker, because they are always together.

This is why you have to be very careful.  In some cases, you can do the 50/50, and in most cases, you should not.

Is this any clearer?


#1    Brian Cartwright      (see all posts) 2009/03/20 (Fri) @ 10:17

Works for me, but then I understood it the first time too.


#2          (see all posts) 2009/03/20 (Fri) @ 12:49

"(In Strat, they give 50% of the outcome to the batter’s card, and 50% to the pitcher’s card.)”

A little trivial hair-splitting:  In Strat, 28% of the pitcher’ card is his team’s defense, so they give 50% of the outcome to the batter, 36% to the pitcher, and 14% to the defense.


#3          (see all posts) 2009/04/24 (Fri) @ 01:20

I’m not clear on the Strat example, but everything else makes sense. 

It seems like you say that splitting does not work for pitcher/hitter matchups, but it works when you’re playing Strat, and that it works when you’re looking at two players who always play together.

I don’t play Strat, but it might help to know why or how that kind of splitting works.

And I think this whole discussion might be clearer if you started with a macro point you make which is that we don’t split when making adjustments on whole seasons worth of baseball data. 

or, and this one I’m not so sure about,

you could like not splitting to regression analysis.  You take the one variable that changes (the pitcher in this example), and see what happens when it’s paired with a bunch of random hitters.  The difference in your independent variable is the difference that pitcher made on that batch of hitters.  That’s another sort of end result way of convincing yourself that splitting is unnecessary.


#4          (see all posts) 2009/04/24 (Fri) @ 01:27

Wait, one more thing I don’t understand.  You shouldn’t split credit between a SS/2b for DPs (even assuming you think they play an equal role in all DPs)?

I’d have thought that you would have to split, like you would in the Trammel/Whitaker case, because they always play together.


Page 1 of 1 pages


Name (required)
E-Mail (optional; WILL be published)
Website (optional)

<< Back to main


Latest...

COMMENTS

May 25 08:11
What sabermetrics is NOT

May 25 06:43
Largest demonstration in Canadian history?

May 25 06:39
Lack of hustle during a game

May 25 02:38
NFLPA lawsuit against collusion

May 25 01:43
Neal Huntington’s best moves

May 24 23:50
Rooting for laundry

May 24 17:04
Firefox, IE, or Chrome?

May 24 12:07
How to beat the shift

May 24 11:11
Incredible story

May 24 09:41
Racial bias in card collecting: not the collectors, but the players on the cards