THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews
If you are a media member and would like a review copy of The Book, please contact Kevin Cuddihy of Potomac Books.

Buy The Book from Amazon

MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Monday, November 06, 2006

Fielding Bible Awards

By Tangotiger, 04:47 PM

In the 2007 edition of the Bill James Handbook, there is a section called The Fielding Bible Awards, with a panel of 10 experts, including The Fans’ Scouting Report.  These awards are for fielding excellence, and the winners were recently announced.

Interesting to me is how did the Fans’ do.  That is…


Panelists against Peers
...how well did they hold up to the rest of the panel?  A correlation of the individual votes to the overall average of the experts is listed below:

0.73 John Dewan, BIS owner
0.73 BIS Video Scouts

0.66 Bill James
0.65 Rob Neyer
0.64 Mat Olkin
0.64 Nate Birtwell, BIS ops
0.63 Joe Posnanski

0.59 Fans’ Scouting Report
0.58 Hal Richman (Strat-O-Matic)

0.53 Mike Murphy, Chicago radio

Panelists against BIS
When the voting was conducted, a ballot containing BIS data was also distributed.  This was done as a way to provide data, but it’s not something I would do, if I’m also part of the panel.  I ran a correlation between BIS and the other 9 panelists, with the following results:

0.56 Mat Olkin
0.56 Nate Birtwell, BIS ops

0.41 Hal Richman (Strat-O-Matic)
0.40 Bill James
0.39 Joe Posnanski
0.38 Fans’ Scouting Report
0.35 Rob Neyer
0.33 Mike Murphy, Chicago radio
0.33 John Dewan, BIS owner

Since I know for a fact that the Fans’ results made zero use of the BIS data, it seems that correlations in the .30-.45 range were not influenced by BIS data.  I also know that in the past, the correlation between Fans’ and UZR was around .35.  It seems therefore fair to say that the correlation between any two systems that use completely independent sources of data will get you a correlation of .35.  The correlation between Olkin and Birtwell was 0.45.

Panelists against Fans
Next I looked to see how each of the individual panelists did against the Fans’.  If Roger Ebert best represents the average filmgoer, then who should the Fan listen to, to get like-minded analysis?  Who is the Fans’ Roger Ebert?

0.56 Hal Richman (Strat-O-Matic)

0.44 Nate Birtwell, BIS ops
0.38 BIS Video Scouts
0.36 John Dewan, BIS owner

0.28 Mike Murphy, Chicago radio
0.26 Mat Olkin

0.17 Bill James
0.17 Joe Posnanski
0.11 Rob Neyer

Hal Richman has his fingers on the pulse of the Fans, which may explain the success of Strat-O-Matic.  The Fans clearly agree the least with the James, Posnanski, and Neyer, all once, or currently, KC-based.

It’s interesting how much wider the gap is between the Fans and its peers, compared to BIS and its peers.  This may be some evidence that BIS data influenced the voters somewhat.  Or, that BIS is grounded in reality more than the Fans are.

Crazy Picks
Were there any really crazy picks?  Did any of the panelists select someone as one of the three best fielders, and yet, those fielders were not one of the TEN best, according to the panelists as a whole?  For example, the Fans chose Pujols, Teixeira, and Minky as the three best fielding 1B in the league, which matches the panelists overall.  So, great job there on the Fans.  How about for the other positions?  From this standpoint, Richman had 2 crazy picks, the Fans had one crazy pick (Mark Kotsay), as did James and Murphy also have one crazy picks, and everyone else had none.  So, it seems that there’s a strong consensus that if someone thinks of a fielder as being “great”, he’s at least “above average”.

Blind Voters
Finally, how about looking at the consensus top 3 picks, but that a panelist didn’t even list in his top 10?  Mike Murphy had 6 such players, Richman 4, and Fans’ 2.  Murphy doesn’t think that any of these players are among the 10 best at their position, while the consensus of the group has them in the top 3: Minky, Reed Johnson, Dave Roberts, Corey Patterson, Alexis Rios, JD Drew.  Richman did not think much of Corey Patterson, Alexis Rios, Chase Utley and Miguel Olivo.  Fans’ disregarded Andruw Jones and JD Drew.  For the Fans’, CF was the toughest position to nail down, as there was a huge group of players who were within a hair of each other.  Jones and Torii Hunter simply were a shade below.  JD Drew was considered above average, just not top 3.

Youngsters Fans Love
Betancourt (SS), Zimmerman (3B), Langerhans (LF), Matthews (CF), Markakis (RF) also received very high scores from the Fans, and ranked well by the panelists, just nowhere near that high.  It’ll be interesting to see if the panelists come around on these players, or if Fans are simply dazzled by style not substance.  That is, who is right?

How did the Gold Gloves compare?
Gold Glovers finished 1st and 2nd at 1B.  At 2B, the GG finished 1, 7.  At 3B, 2, 4.  At SS, 2 and 18 (! Jeter).  At C, 1, 5.  The 5 GG in CF finished 1,2,7,8,12 and the 1 GG in RF finished 1st in RF.  The two GG pitchers finished 1st and 2nd with the Fielding Bible.

Overall, Jeter was the only purely horrible pick.  Gonzalez of Boston, or Betancourt of Seattle would have been the appropriate choice.  All other choices were defensible, though not necessarily great.  That is, they deserve our respect.  Except for the Jeter pick.

All-in-all, a very interesting exercise, and of historical value.

#1    tangotiger      (see all posts) 2006/11/06 (Mon) @ 18:20

I added a blurb about the Gold Glove at the end.


#2    David Smyth      (see all posts) 2006/11/06 (Mon) @ 20:44

Well, instead of the misleading title “Fielding Bible Awards”, when it’s only a poll, I would have preferred the actual results of the Dewan plus/minus system.


#3    David Cameron      (see all posts) 2006/11/06 (Mon) @ 20:56

Gary Matthews is a “youngster”? He’s 32.


#4    tangotiger      (see all posts) 2006/11/06 (Mon) @ 23:26

Ouch, I thoughts he’s only been in the league 3 or 4 years.  And, being the Jr of Gary Matthews, who I grew up with, I was sure he was in the Time Raines Jr age class.  I didn’t realize he is an old journeyman.

As for the results, he’s got the top 10 for 2006, and 2004-2006 in the BJ Handbook, and I think studes posted somewhere that he was going to have the complete results?


#5    tangotiger      (see all posts) 2006/11/06 (Mon) @ 23:30

I meant “grew up watching”.


#6    MGL      (see all posts) 2006/11/07 (Tue) @ 17:29

When you say they were “given the BIS data,” what do you mean?  I find that odd.  I also agree that the title was misleading and I certainly expected the leaders in the plus/minus.


#7    tangotiger      (see all posts) 2006/11/07 (Tue) @ 17:49

We were given the MLB2006 version of Dewan’s Fielding Bible book (updated through mid-Sep), which included the top 10 for 2006 and for 2004-2006.


#8    MGL      (see all posts) 2006/11/07 (Tue) @ 21:24

Well that is really odd.  They told who who was the best according to his plus/minus system and then they asked you to vote on whom you “thought” was the best?  What kind of vote is that?  What was your vote supposed to be based on?  The data from the book? Your observations?  I don’t understnad the point at all.  Were you instructed to consider the plus/minus data?  Ignore it?


#9    tangotiger      (see all posts) 2006/11/07 (Tue) @ 23:24

The intent was to be helpful and provide data, and all were instructed to use the data or not, as they saw fit. 

In one sense, it’s a good thing, since Dewan provided something akin to this:
http://sportsillustrated.cnn.com/baseball/mlb/stats/2006/fielding/ml_6_byZONE_RATING.html
but better.  And for multiple years.  The booklet was something like 70+ pages, so it’s not like it was a cheat sheet.  It was comprehensive data.

As a scientist, I don’t like it, because it introduces a bias that could have been avoided.

But practically, Dewan could have just presented the link above instead, to the same effect.  So, in the end, Dewan distributing the booklet probably had more pluses than minuses.


#10    MGL      (see all posts) 2006/11/07 (Tue) @ 23:47

The point I am trying to make is that who the heck knows how much stock each person put in the data.  Some people probably didn’t even read it.  Some people probably don’t understand a PBP defensive metric.  Some people watch a lot of games.  Some people probably don’t.

I am mystified as to the point of such a vote.  If he didn’t give you any data then he could say that this is a vote based upon the opinions and observations of some so-called experts (although, what good is being an “expert” if you don’t watch or at least read about all the players in the league - and who can have an opinion on every player that means anything at all when it is impossible to closely watch more than a couple of players anyway?).

By giving you the data I have no idea what the vote is supposed to represent.  If it were me, and I watch a ton of games, I would simply rate the players according to their plus/minus ratings.  I have no confidence in my ability to figure out someone’s true defensive worth by watching them play, unless perhaps I were focusing on one or two players and taking notes throughout the season.

I completely diagree that this vote has any significance whatsoever.  I think it is a joke.


#11    tangotiger      (see all posts) 2006/11/08 (Wed) @ 00:05

I’m surprised by your conclusions, since it’s not supported by the data.

For example, I ran a correlation one panelist to another (45 such correlations), and the average was 0.34, which is pretty much what we’d expect any two intelligent people/systems to correlate, if they were independent.  The average correlation to the BIS video scouts and to Dewan was 0.41, which is a bit higher than we’d like to see.  The problem is we can’t tell if it’s because BIS and Dewan “know” more, or if the data presented influenced the vote.  However, the effect of this would be rather small. 

I mean, Dewan and the BIS video scouts themselves only correlated at 0.33, which is what we’d expect if they were independent.

One could create their own overall score based only on those panelists that didn’t correlate high with Dewan and the BIS video scouts.

An interesting case is Nate Birtwell, who works in the BIS operations.  He correlated at .56 with the BIS video scouts, but only .28 with Dewan.  Neyer correlated .35 with the BIS video scouts, but .63 with Dewan.

If you wanted the most “independent” group, you’d probably want Bill James, Dewan, BIS Video scouts, Richman, Murphy, and the Fans.  My guess is that the overall results would probably be similar to the 10 panelists results.

In fact, let me do that now.  The top 2 1B stick, but reversed.  Top 2 2B stick.  Top 2 3B stick, but reversed.  Top 7 SS stick.  Top 10 LF stick.  Top 2 CF stick.  Top 3 RF stick.  Top 2 C stick.

So, I think mgl’s complaint is more of a theoretical problem, and in all practical purposes, the voting itself is legitimate.


#12    tangotiger      (see all posts) 2006/11/08 (Wed) @ 00:12

I see that Dewan has forwarded a link to this blog entry to the panelists, so perhaps one of them will post, and describe their process.


#13    MGL      (see all posts) 2006/11/08 (Wed) @ 03:51

I am really trying (and failing) to get a handle on what this “poll” is attempting to represent.  “Experts” at what?  As I said in my last post, how would these “experts” know who was the best defensive player at each position without looking at the data, regurgitating (and processing I suppose) scouting and other information they may have heard or read, or some such thing?  Surely they have not watched in agonizing detail every player on every team, or even close to that.  As I also said, I watch as many minutes of baseball as anyone alive (who has somewhat of a life) and there are scores of players whom I barely even “know” let alone how good they are on defense.  So what “expertise” is this poll supposed to represent?  And why should we care (and believe) who James, Neyer, Tango, et al. thinks is the best defensive players at each position?

At least with the gold gloves, the coaches and managers presumably see these players a lot more frequently and have a lot more practical “expertise” then these so-called experts.

Again, I fail to see any significance to this poll.

The significance of the fan ratings is that there is tremendous observational (and other) wisdom among these fans who admittedly watch almost every game a player that they submit plays (I think).  We can combine this with UZR or some other PBP metric, especially for players with limited playing time, to get a pretty good idea of a player’s true defensive worth.  With these guys, the panel of experts, it is a completely different story.  I would not trust their opinions any more than I would trust the opinions of any 10 (or however many there were on the panel) fans, and even less since it is preposterous to think that a bunch of guys with real jobs and real other interests and families could possibly watch enough games to get even an iota of a handle on how good EVERY player in baseball is.

Even if this panel gets a lot of the votes “right” it is because they constantly read and hear about who the best defensive players are and because they have access to the PBP defensive data and metrics.

Tango, just give me some reasons for why you think this poll is so significant or interesting without all the correlation numbers (which mean almost nothing to be, BTW, and I suspect the standard error is so high that the differences you cite are meaningless as well, but I could be wrong about that) and maybe I can comment about that.  Right now, we are talking around each other.


#14    tangotiger      (see all posts) 2006/11/08 (Wed) @ 09:02

I can’t vouch for all the panelists, or even any of them.  The BIS video scouts, as I understand, do in fact watch every single play (multiple times) for every game.  That *is* their job!  So, they are probably outside your argument.  If you remember in last year’s Fielding Bible, they pulled out the top 30 and bottom 30 plays for Everett and Jeter.  Presumably, they did a UZR-style analysis on a PBP, and simply logged for each play, where these guys were +.50 or better on a particular play, or -.90 or worse on a particular play (or something like that).  I’m sure I’d be impressed with their logging system.

Dewan has something UZR-like, and James probably has his own number-crunching system.  Richman is Strat’s founder, and, again, it’s his job to watch every day.

I can’t speak for the others. 

And, like I said, even if you take just these 5 or so, you still end up with around the same answer.  Given a choice between these 10, and the 50 coaches in each league, these 10 are probably better.  Certainly, no Jeter-like pick was made.


#15    John Beamer      (see all posts) 2006/11/08 (Wed) @ 09:59

I mean, Dewan and the BIS video scouts themselves only correlated at 0.33, which is what we’d expect if they were independent.

How do you know that the 0.33 correlation is what we’d expect if they were independent?


#16    tangotiger      (see all posts) 2006/11/08 (Wed) @ 11:31

Re-read my blog entry, where I discuss that .35 is the correlation we expect of two independent systems.


#17    MGL      (see all posts) 2006/11/08 (Wed) @ 16:44

I have no doubt that any 10 sabermetric-oriented “experts” most of whom watch a lot of games, some of whom watch ALL the games, or even 10 “super-fans” will do A LOT “better” than 30 managers or coaches.  So what?

I still don’t get the significance of this particular poll or why it is called the “Fielding Bible Awards.”


#18    MGL      (see all posts) 2006/11/09 (Thu) @ 02:11

After reading Neyer’s Insider column today, I finally have a feel for what these awards are supposed to mean.  Maybe I was just being incredibly thick and obstinate for a couple of days.

Anyway, they were supposed to mimic (I guess) the GG awards except they were voted on by 10 persons who analyze baseball for a living, most with a decidedly sabermetric slant.  As well, all of the voters were provided with some objective relevant data to inform them if they so desire.

O.K., I got it.  I’m still not sure I care much about the results or about the correlations that Tango came up with.  And I think I would have preferred to see the results sans the data being available and then the results of the data alongside the data-free vote, or something like that.  But I am not so sure about that either.


Page 1 of 1 pages


Name (required)
E-Mail (optional)
Website (optional)

<< Back to main


Latest...

COMMENTS

Nov 20 01:43
Sabermetric Moves of the 2009 Pre-Season

Nov 20 14:20
Marcel 2009 is here

Nov 20 14:19
Nate Silver: hero to interviewers

Nov 20 13:42
Top Free Agent Pitchers

Nov 20 12:29
R.I.P. Tom Boswell, sabermetrician; P.A.L.L.(*) Tom Boswell, human being

Nov 20 12:27
David G. checks in again on whether experience matters in the post-season

Nov 20 10:42
Offense by position groups by decade

Nov 20 02:01
My 1B is better than your 1B

Nov 20 00:26
MLB logo

Nov 19 23:03
NBA’s Marcel