THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Wednesday, August 19, 2009

It’s not the data, it’s the sport

By Tangotiger, 02:52 PM

David Berri tries to make his case:

The past six years of Huff’s career demonstrates a great deal of inconsistency.  So which Huff did the Tigers add? Are they getting the player ranked in the top 10% in 2008? Or is it the player ranked in the bottom 10% in 2009? It seems likely that even Huff isn’t sure.  Huff’s job is to hit a round ball with a round stick, and that’s simply not an activity that can be predicted easily. In the Wages of Wins we noted that the stories of Ariza and Huff are not unique.  The numbers attached to players in basketball are simply more consistent than the numbers we see in baseball. And this means that decision-making should be easier in basketball.
...
If only people in baseball had the data we see in basketball. 

I agree with everything, except for the last line.  The reason that you get more consistency is simply because of the nature of the sport.  Suppose, for example, that a tennis match lasted only one set.  That is, a set is a match.  Would Federer win 88% (or whatever it is) of his matches?  No, of course not.  If he’s winning 88% of his matches because he’s winning 65% (or whatever it is) of his sets, then having a one-set match means he’d only win 65% of the time.  Similarly, if you had 7-game or 9-game matches (spread say over two days) then he’d win 95% or 99% of his matches.  He’d look unbeatable (except for when he plays Nadal).

Basketball is like that.  48-minutes is simply way too long a game compared to the 9-innings in baseball.  A 9-inning game in baseball is like say a 20-minute game in basketball.  If that’s all you had with basketball, then you’d probably have a similar uncertainty as with baseball.  Baseball and hockey are comparable in terms of how much randomness affects the performance of teams (and presumably players).

Indeed, you can manipulate how often the better team will win based on how long or short you make the confrontations.  I suspect the main reason that you don’t see 5-set matches for women (I don’t buy the endurance reason) is that if you did it, you’d rarely have any upsets.  As it is, there is probably less turnover among top women’s tennis players than top men’s tennis players.


#1    Ryan J. Parker      (see all posts) 2009/08/19 (Wed) @ 15:46

You make a very good point, but there is plenty of uncertainty abound in basketball statistics. When you live in Berri’s world of wins produced, though, you tend to overlook that sort of thing.

I’m no expert in baseball, but my sense is that the pitcher/batter matchup is easier to predict than opposing lineups in basketball.


#2    Jim A      (see all posts) 2009/08/19 (Wed) @ 18:09

I agree with the general point, but is there any objective way to determine that a 48 minute basketball game is “longer” than a 9 inning baseball game?  That is, other than measuring the inconsistency of statistics, which, as commenters to Berri’s blog suggest, may be partly caused by other factors.


#3          (see all posts) 2009/08/19 (Wed) @ 18:38

Holy cow, thank you. It drives me batty on that site when he constantly refers to basketball players as “more consistent” than football players… as though there’s some difference in the type of person who plays each sport… when instead it’s simply the nature of the sport and the sample size of the events that are being recorded and measured.  I comment this point to him every time he does it, but get no response and he continues to write it.


#4    Colin Wyers      (see all posts) 2009/08/19 (Wed) @ 19:04

I think MGL is fond of pointing this out - the more accurate a measurement is, the more variance will be observed (all else being equal). We see a smaller spread in observed talent in Range Factor than we do in UZR, not because RF is better than UZR but because it isn’t.

Berri seems to be confusing issues with his measurement system with the underlying data. It’s obvious that the play-by-play data we get from Retrosheet is vastly preferable to the box score data he deals with in basketball.

(It’s possible Berri acquires this attitude from defending his Wins Produced system; I’m no expert on basketball but everything I’ve learned about regression models in baseball tells me that they’re tools of last resort.)

And Jim, sure there is. Look at FGA versus PA - that’s our fundamental unit of observation in each sport.


#5    Tangotiger      (see all posts) 2009/08/19 (Wed) @ 22:01

Jim, yes I had a long thread on this a few years ago.  Do a search for
True Talent Sports Leagues
and you’ll find it.

IIRC, a 32-game season for basketball tells you as much as an 82-game season for hockey, 162 for baseball, and 28 for football.  Something like that.

Pretty fascinating stuff, and I’m surprised it doesn’t get more play.

***

I don’t follow Berri too much.  I get the impression he loves to tout his stuff.  I prefer that an honest pro/con be presented on anything created.  We can discuss all the shortcomings of UZR or wOBA or WOWY or FIP or Marcel here, and I’d he happy to do that.  It doesn’t seem that that’s the case in too many other places.

The creators of their metrics must realize how foolish they look when they present a one-sided view.  The discerning reader doesn’t put up with b.s., and it’s the discerning reader that should be their lifeblood.  They’re the only ones keeping them honest.  Anyway, I don’t want to sound all high and mighty about it.  A little honesty is all I expect from the other metric-creators.


#6    JB H      (see all posts) 2009/08/19 (Wed) @ 22:49

This looks like the thread:

http://www.insidethebook.com/ee/index.php/site/comments/true_talent_levels_for_sports_leagues/#comments

Too many equations for me right now, but it looks interesting.  I’ll look through it tomorrow smile


#7          (see all posts) 2009/08/19 (Wed) @ 23:39

Tango, #5:

My impression of Berri matches yours.  I used to read him more regularly, until I became a little more familiar with other basketball analysts (Pomeroy, Hollinger, etc), though I’m only about 5% as knowledgeable about advanced basketball analysis as I am about baseball and football analysis.

I can’t recall an instance of Berri ever being surprised at the results of his metrics.  Or acknowledging that it may underrate or overrate certain players or types of players.  It’s always that he’s right and that anyone whose analysis disagrees, whether it’s a fan, fellow analyst, or GM, is incorrect.  It’s caused me to only check his blog every few weeks, as opposed to a more daily basis.


#8          (see all posts) 2009/08/20 (Thu) @ 11:08

Off topic, but you just fixed NBA basketball.  Only play halves.  If a team is ahead after two quarters, they win.  You can even have doubleheaders, play one two quarter game, then take a break, then a second game.

My own idea was to determine who won by quarters, if a team won three quarters they won the game, with games lasting three, four, or five quarters depending on what was necessary.  But this idea is simpler, and better.


#9    Tangotiger      (see all posts) 2009/08/20 (Thu) @ 11:43

Ed, what a fantastic idea you have of my idea!

Every game is a double-header!  I love it!  LOVE it.

Understanding that there’s end-of-game strategies that come into play, can someone figure out the win% presuming the game restarted at each half?


#10    Tangotiger      (see all posts) 2009/08/20 (Thu) @ 11:46

You can even stretch it out to think of it like tennis.  Each half is a “set”, and first one to win two sets wins the match.

The third set is 5-minute OT.

So, if each team wins one set (they each win a half), then they play OT to see who wins the match.


#11    Aaron Delisio      (see all posts) 2009/08/20 (Thu) @ 15:19

Those are great ideas, especially the two-set variation. You could also make each quarter a set with the first team to 3 winning. I wonder if you would need to make other adjustments like the number timeouts and fouls.

One of the things that drives me nuts about basketball (and there are many) is that there is no point in watching a game until the final 2 or 5 minutes. There is just so much scoring and the action so repetitive that it becomes mind numbing. Anything that adds value to the earlier part of the game, especially the first half, would make it dramatically more appealing to me.


#12    Tangotiger      (see all posts) 2009/08/20 (Thu) @ 16:05

There would be some interesting strategy in play.  If you are losing the first quarter say by 9 points with 2 minutes to go, the losing team may sit all their starters and play their bench.  The leading team will be deathly afraid of blowing it, so they will keep their starters on the court.

Now when the second quarter starts, the losing team will be fresh.  A best 3 out of 5 periods, with the 5th period being half as long (say 6 minutes) would be pretty exciting.


Page 1 of 1 pages


Name (required)
E-Mail (optional)
Website (optional)

<< Back to main


Latest...

COMMENTS

Aug 31 15:28
Fans Scouting Report: Update

Sep 02 14:59
Roger Federer

Sep 02 14:59
It’s hard to beat the crowd (Vegas in this case) no matter how smart you think you are

Sep 02 14:57
Could Rob Dibble have been a comp for Strasburg?

Sep 02 14:49
Mail: rWAR v fWAR

Sep 02 14:15
WOWY Teachers

Sep 02 13:37
Who’s Waldo?

Sep 02 08:36
Team Elin

Sep 02 01:19
Can someone tell me why Trevor Hoffman is still allowed to pitch?

Sep 01 23:16
Strasburg II