THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Tuesday, December 23, 2008

Dual Positions, using bUZR

By Tangotiger, 04:31 PM

With so much of this being discussed at Primer and Fangraphs, I figured it was time to update my process to use the latest data from MGL’s bUZR.  So, sit back, and prepared to be numbed by numbers:


As I have done in the past few years, I will compare players’ UZR of those who have played multiple positions.

My datasource this time will be bUZR (UZR using BIS data, as published on Fangraphs).  The data will is from 2002-2008, and is NOT age adjusted.  The “games” I will be citing is the UZR games, which is the number of effective games a player has played, based on his opportunities.  For all intents and purposes, it’s games played.

As always, we start in the outfield.

There are 419 players that have played both LF and RF in that time frame.  This ranges from Geoff Jenkins playing 431 in LF and 333 in RF, to Junior who played 1 in LF and 239 in RF to a whole bunch of players who played 1 game in each position, and everything in-between.  The stats of each player is weighted by the lesser of his two games.

Like I said, I have 419 players, and those players totalled 11,728 matching games.  This is one-third of all games played. This is ALOT of games.  Around 70% of players who played one corner outfield also played the other corner outfield. We have prime evidence that teams treat the corner outfield as interchangeable. 

Their UZR was -0.1 per 162 G in LF and +0.9 in RF.  Remember, this is the same group of players, playing in two different positions.  (You perform better, relative to your competition, if your competition is worse, by definition.) So, in this case, we say that the average LF is 1 run better than the average RF, in 2002-2008 (excluding arm).  If we included UZR arms, I would presume that it’s pretty much a wash.

How did each corner OF do in CF?  We have 280 players who played both CF and LF, with 8391 matching games (roughly 25% of all games played).  This too is ALOT.  Those players were -2.9 runs in CF and +4.8 in LF, for a difference of 8 runs.

Comparing CF to RF: 7741 matching games, -5.1 in CF, +3.4 in RF, for also a difference of 8 runs.

There we have it: a pretty good match.  The average CF, using this sample is 8 runs better with the glove than the corner OF, and the corner OF are equal to each other.

What kind of biases could we have?  Well, not every player plays the dual positions.  Then again, I do have some 70% of the players that did.  Because of the “minimum games of the two positions”, I only get 25% or 30% of the matching games.  Nonetheless, it would be fairly hard to argue that we’ve got too strong a bias here.

The other place is aging.  Since this sample looks at seven years in total, a guy who is a CF in 2002-04 and a corner OF in 2005-08 looks like the same player playing two different positions. In reality, he got worse because he’s older.  And since players will likely move more from CF to the corners than the other way around, then we have a bias here.  Maybe I’ll handle this later. 

I suspect that 8-run gap would turn into a 9 or 10 run gap if we handle the aging bias.  And perhaps if we handle the bias that not all players will play in dual positions (Adam Dunn in CF?), then only players who don’t get exposed too badly get to compete in the dual positions.  So, maybe we have another 1 or 2 run bias here.  This could turn our 8-run observed difference (with bias) to an unbiased estimate of say 12 runs.

***

Let’s do one more that I’ve never done in all the years I’ve done this.  Since we suspect that LF and RF are interchangeable (both in lack of bias, and in talent levels being the same), let’s compare the dual “positions” of CF to LF/RF. 

Of the 380 players who have played CF, 303 of them have played at least one game in the corners.  The total number of games is 12,386, which is over one-third of all games.  These players were +4.4 runs in the corner, and -3.4 in CF, for a difference of 8 runs.

We didn’t learn anything new here, though we did manage to increase our sample size.  Remember this, as I may use this later.

***

Now, how about SS/2B?  While in the outfield, we see that there was alot of movement, and therefore, we are not that concerned about potential talent bias, this concern exists here.  We have 211 players that played both positions (around half the middle infielders), totalling 6243 games, roughly 20% of all games.  So, yeah, we may have an issue, especially since we know that guys who play SS are not the bad 2B (for the most part).

Players at these dual positions were +1.3 at 2B and -2.5 at SS, for a gap of 3.8 runs.  Possible age-bias, and likely talent-bias probably shows this as too small.  A 5 or 6 run difference would probably be more likely in an unbiased estimate.

What about the SS/3B?  We have 170 players (one-third of all players who played SS or 3B), totalling 4518 games (15% of all games).  In this case, they were both zero!  Even if we have the talent-bias (only the really good 3B play SS), why were they a zero at 3B to begin with? 

Finally, 2B/3B: 203 players, totalling 5915 games, -0.8 at 2B, -1.1 at 3B.  Even!  I don’t think we have an age-bias to contend with, but we may have a talent-bias.  Not necessarily in terms of quality of talent, but perhaps in breadth of skills.

There is one more thing to note here, which is probably the overriding factor, and that is the “experience bias”.  A guy who is a full-time 2B, if he plays 3B every now and then, will simply not be as good.  That is, there is a certain amount of benefit in having experience at a position.  If this experience is worth say 5 or 10 runs, and if all of the movement goes from the full-time 2B playing part-time 3B, and if they end up with equal UZR, that doesn’t mean that the baseline is the same.  That’s because we have an experienc-bias to account for.

That said, we know that, while there may be a disproportionate number of players moving between 2B/3B one-way based on experience level, it’s not all of them.

We can try to account for this in a few ways.  One is to separate players into “primarily 2B”, “primarily 3B”, “primarily SS”, and see how they do at 2B and 3B.  If for example we have that the primarily 2B are a +5 at 2B and +5 at 3B, while the primarily 3B are a +5 at 3B and -5 at 2B, perhaps the best interpretation is that the +5 primarily 2B would be +10 at 3B, but because of lack of experience at +5.  And, the primarily 3B of +5 at 3B would be +0 at 2B, but because of lack of experience are -5.

I don’t know yet.  I’ll put that on my check-the-bias list.

As it stands, the SS/3B/2B issue is fairly unresolved.  I suspect we’ll end up at something like SS is 5 runs ahead of 2B/3B as a group, while there may be a gap of 1 or 2 runs gap between 2B/3B.

***

Why don’t we look at SS compared to 2B/3B as a group, similar to what we did with CF compared to LF/RF.  So, we’ll compare a player who played SS and played either (or both) of 2B, 3B.

We have 231 such players, with 8707 games, performing -1.2 at SS and +0.8 otherwise, for a 2 run difference.  As noted, we have the potential talent-, age-, and experience-bias.  I can see that 2 run difference going up to 5 or 6 runs.  I don’t think much higher than that.

***

To complete this installment, let’s take our first infield/outfield look.  How do 2B/3B compare to LF/RF?  Since we think that players in each position group are roughly the same, we get to increase our sample size by combining positions.  We have 177 players involved here, totalling 5875 games.  They were -1.3 in the corner outfield, while these same players were -5.1 in the infield.  The infield is therefore 4 runs ahead of the corners.

In this particular case, the experience-bias will be stronger, since almost all IF/OF movements occur IF to OF.  So, there is a very strong disproportionate movement here.

In addition, we have an age-bias, where likely older infielders are moving out.

We also have the talent-bias makes it so that the lesser infielders are the ones going to the OF.  It’s possible that much better infielders would perform much much better in the outfield.

Finally, we have a handedness-bias: lefty throwing corner outfielders would make for very terrible infielders.  How bad?  We estimate there is a 30-run bias for such players.

All told, we can see how a 4-run gap can balloon to a 10 or even higher (15?) run gap between 2B/3B and LF/RF.

How much higher needs to be studied.

***

So, if we are reasonably sure that we have SS being around 5 runs ahead of 2B/3B, and CF being 10 runs ahead of LF/RF, the question is how to align the infielders and outfielders.  We think that the 2B/3B need to be at least 10 runs ahead of the corner outfielders. In that case, we’d get this:
7.5 SS
2.5 2B, 3B, CF
-7.5 LF, RF

These are consistent with the expectations of the above paragraph.

But, what if the gap between 2B/3B and the corner outfielders is closer to 15 runs?  Then we get this:
10 SS
5 2B, 3B
0 CF
-10 LF, RF

As you can see, the question is how to center 2B/3B/CF.  The other positions will be dependent on how those three are treated.

Until we can get more questions answered, I prefer this chart:
7.5 SS
2.5 2B, 3B, CF
-7.5 LF, RF

I’d rather say that the 2B, 3B, CF are sorta-kinda around the same value, than to create a chart that puts the CF as 5 runs below the other two infielders.  I think it would be more justifiable for me, having enough uncertainty in the process, and enough biases to account for, to keep the three in the same group, than to split them out at this moment.  I’ll find it easier to break them up, if I have to, then to end up in a situation where the CF might end up being 5 runs behind the 3B to 3 runs ahead of him.  By keeping CF=3B for now, I’m hedging my bets.

And seeing how both positions have both around league average hitters since 1980, there’s a certain amount of natural selection that centers me to that position.

#1    Sky      (see all posts) 2008/12/23 (Tue) @ 18:46

Wouldn’t adjusting for the aging bias actually move CF closer to LR/RF in talent?  Start with CF at +4.5 runs and LF/RF at -3.5 runs.  We’re positing that the players who moved between positions mostly started at CF and moved to LF/RF when they got old and their skills got worse.  So if we make them better as LF/RF, say -1.5 runs in their heyday, the gap shrinks to six runs.

Or no?  My head tends to flip-flop when thinking about things like this.


#2    Sky      (see all posts) 2008/12/23 (Tue) @ 19:01

Never mind.

The +4.5 is in LF/RF and gets bumped up to +6.5 when accounting for aging.  Ignore my last post.


#3    MGL      (see all posts) 2008/12/23 (Tue) @ 20:02

Tango, there were a few issues in the bUZR that we are correcting as well as some new things I added.  I have corrected most of them already, but the corrections are not reflected in the numbers of Fangraphs yet.  I’ll send you the new results. One of the new things I did, that is not in the Fangraph numbers, is that I now treat bunts separately.  Before they were treated as any ground ball, although since I use distance as one of the ground ball parameters, all of the bunts tended to be grouped together of course (although they got grouped with slow rollers and even hard hit balls that were caught at a short distance).  I also was previously treating a fielder’s choice where everyone was safe as an out.  I now treat it is an error on the fielder who fields the ball.

Tango, do you think that zeroing everything out for each year for both leagues combined is going to screw anything up?  It should all even out in the long run, but that could be an additional source of noise.  I think that for a study like you are doing, Tango, it might be nice NOT to zero every position out for each year.  What do you think?  I can send you the non-zeroed out data if you want.


#4    Colin Wyers      (see all posts) 2008/12/23 (Tue) @ 20:04

I would really love to see this further broken down, but I understand that with the presentation of UZR that could be difficult. But the compontents I’d like to see is:

* Plays made
* Opps
* Run value


#5    terpsfan101      (see all posts) 2008/12/23 (Tue) @ 21:11

Yes, it would be nice to see the components that Colin suggested, if this is at all possible. According to Appelman on Fangraphs, MGL is also going to incorporate DP’s for infielders and Arm statistics for outfielders in the new UZR ratings. Great news!


#6    MGL      (see all posts) 2008/12/23 (Tue) @ 21:53

#5, yes, that is also the plan.  I am tired of people wondering whether UZR includes arms for OF.  It will be interesting to see the relative differences in average arm value in the OF.  My guess is zero in CF (as the arbitrary baseline), +1 in RF and -1 in LF.  It could also be that CF and LF are the same, (or even LF better than CF) and then RF +1 to +2.  Somewhere in that range though.


#7    terpsfan101      (see all posts) 2008/12/23 (Tue) @ 23:30

Looking at your UZR data from 1999-2002, I took the weighted average of Arm Runs for each of the outfield positions. Of course this was a stupid thing for me to check, since I just read that you zero everything out by position. So you will need to tell us the difference in average arm value for each outfield position. I see your point about making everything sum to zero by position. It is then difficult to make comparisons by position.


#8          (see all posts) 2008/12/23 (Tue) @ 23:34

To remove age bias couldn’t you just look at players who play two positions in one year.  This could reduce your sample size too much.

If you want a bigger sample size then Just do it two years at a time.  Look at 2002 and 2003.  Then look at 2003 and 2004 and so on.

Takes a little more time but in theory shouldn’t be too hard and should take most of the age bias out.

The UZR for RF might be a little less than LF by a few runs because I bet many RF have less range than LF because more emphasis is put on arm strength for that position.  I’ll be interested to see what happens with RF/LF arms are compared.

The IF is going to be much more difficult to decipher I feel.  The OF positions have similar skill sets of running after the ball and throwing the ball to the IF.  2B, 3B, and SS all have different skills based on their position around the diamond which as you mentioned makes experience and a variety of other bias effects much more important.

Any reason (besides the time it takes) that you didn’t look at 1B?  I know the sample size of catchers would be too small but this would be interesting too.


#9    MGL      (see all posts) 2008/12/24 (Wed) @ 03:59

terps, you cannot compare average arms at each position, with or without “zeroing out,” any more than you can compare average UZR’s at different positions.  That is because each OF position throws to each base from a different location in the OF.  An OF arm rating for any player only makes sense in relation to the average arm rating for that position.  In order to compare arm ratings between positions, you have to do what Tango is doing with UZR - look at players who played different positions.


#10    terpsfan101      (see all posts) 2008/12/24 (Wed) @ 05:13

Thanks for the explanation. You can’t directly compare any of the ratings for one position to the ratings of another position. I was assuming that the raw cumulative UZR numbers would vary by position. A pretty stupid thing for me to think.


#11    Tangotiger      (see all posts) 2008/12/24 (Wed) @ 11:19

* Plays made
* Opps
* Run value

If
a) you treat the run value as .75 for middle infielders, .80 for corner infielders, and .85 for outfielders, and
b) since MGL has given us “expected outs”, and
c) presume the average out rate per ball in play is .70
then all three of the above can be estimated.


#12    Tangotiger      (see all posts) 2008/12/24 (Wed) @ 11:28

MGL: I think you need to force the things to add up to zero at the year level.  You have no idea about the quality of data year-to-year, and I’d hate to see the average 3B be a +3 in 2008 while he is -2 in 2007, all because line drives were treated differently.

If the averages were all within +/- 1, then, fine, I wouldn’t bother with the zeroing out.  Something tells me the consistency of the data is nowhere close to that.

(If this was 1945/46, where there was a huge turnover in talent, it would be different.  We’re not in anything close to that.)


#13    Tangotiger      (see all posts) 2008/12/24 (Wed) @ 11:33

MGL: no need to provide me with a separate file, as David’s layout makes it very easy to grab (plus he has extra stuff like PO, A, E, Inn, etc).


#14    Tangotiger      (see all posts) 2008/12/24 (Wed) @ 11:38

Bryan: right, been there, done that.  To me, the sample size is key, and I would really need to have say Craig Biggio in my sample, if he were to go from exclusive 2B to CF to 2B year to year.  So, in-year doesn’t work.  And, if I only look at back-to-back years, then I’m throwing out good data in the surrounding years.

I will be able to handle the age-bias.  I’ve done it in the past.  Just a bit of time needed, ‘is all.

As for 1B, all the biases I’ve mentioned, and more, applies here.  Very little of the skillsets overlap with 1B and the other 6 positions.

As for C, that one is even more problematic.  This is why I’m in favor of having 4 position groups.


#15    MGL      (see all posts) 2008/12/24 (Wed) @ 14:18

Tango, I was also suggesting that I send you the updated files after I made some changes in the methodology, which I can do right away.  I doubt that will change anything overall, but it may.


#16    Bryan      (see all posts) 2008/12/24 (Wed) @ 14:34

Another question I’ve been playing around with is why we compare all positions vs a yearly average for the position.

Maybe in 2008 teams value CF defense more than they did in 2002.  Maybe the effect of steroids made COF slower in 1999 than 2008.  If we want to compare a player year to year is there any way to do this?  Is there actually a difference year to year?


#17    Chris Dial      (see all posts) 2008/12/29 (Mon) @ 02:02

Maybe I missed it, but you say “games”.  How many innings are these being compared?  Are these players playing 300 innings in each or 600 and 50?


#18    Tangotiger      (see all posts) 2008/12/29 (Mon) @ 08:51

For all intents and purposes, a game is 9 innings.

The way MGL calculates a game is basically figures how many outs an average guy would have made given that player’s opportunity, and compares that to the average outs per 9 innings of the league.

So, if an average player given Andruw’s BIP distribution would have made 375 outs, and if the league average for CF is 2.5 outs per 9 innings, then Andruw played an effective number of 150 games.  (All numbers for illustration purposes only.)


#19    Chris Dial      (see all posts) 2009/01/03 (Sat) @ 18:40

Is there some reason you don’t use innings?  I mean, if you play one inning behind a FB pitcher, you can get 2 POs. 

Also, isn’t there significant selection bias?  That is, some players are considered interchangeable, and skew the results.


#20    MGL      (see all posts) 2009/01/03 (Sat) @ 20:29

Chris, what do you mean by those 2 sentences and to whom are you referring ("you don’t use innings")?


#21    Tangotiger      (see all posts) 2009/01/03 (Sat) @ 22:04

Personally, I use “BIP”.

The reason not to use innings is that including strikeouts does nothing for your opportunity. 

***

Yes, there is selection bias, which is why you have to try to account for it (just as you would in a telephone poll).  I haven’t yet, but that’s the next step.


#22    Chris Dial      (see all posts) 2009/01/04 (Sun) @ 20:32

Personally, I use “BIP”.

The reason not to use innings is that including strikeouts does nothing for your opportunity.But the number of innings played, not “zr chances” is a better description of how often a team will put a player in the field.


#23    Tangotiger      (see all posts) 2009/01/04 (Sun) @ 22:25

If Scott Rolen is say +20 runs, and Beltre is +18 runs, then the comparison point, the opportunity factor, can’t also include strikeouts.  You want to know how many “chances” of some sort they had, be it BIP, or Zone plays, or whatnot.

This is just like using PA for a hitter, and not games or innings.


#24    Chris Dial      (see all posts) 2009/01/06 (Tue) @ 00:04

I am not disagreeing with that, Tango.  But when you want to claim “Teams will play players at either position”, it has to be time spent, because they don’t put them out there for a prescribed number of chances.  So, for what you want to measure here, I don’t think chances is the appropriate unit.


#25    Tangotiger      (see all posts) 2009/01/06 (Tue) @ 11:04

Certainly, if you are looking at a selection criteria to determine how often, from the team’s perspective a player is playing, then using innings is correct as you are saying. 

However, in terms of his actual performance, the denominator can’t include strikeouts.  The denominator has to be linked to his actual opportunities.  A 20K perfect game means that there were 7 balls in play (of which maybe zero went to CF).  So, the opp factor has to recognize, in some way, the actual or perceived opps.


#26    Chris Dial      (see all posts) 2009/01/10 (Sat) @ 01:15

It seems to me, that you are taking ops and then saying what you wrote in teh first paragraph.  When your claim is “Teams consider the LF/RF interchangeable”, you aren’t using IP, and I think that’s inaccurate.


#27          (see all posts) 2009/01/10 (Sat) @ 12:08

Yes, you should use innings as a measure of playing time, but it cannot be used as a measure of fielding opportunities. This goes back to the original problems with Range Factor 25 years ago, when all we had was games or innings.


Page 1 of 1 pages


Name (required)
E-Mail (optional)
Website (optional)

<< Back to main


Latest...

COMMENTS

Aug 31 15:28
Fans Scouting Report: Update

Sep 02 15:21
The two uncertainties of UZR

Sep 02 15:17
Mail: rWAR v fWAR

Sep 02 14:59
Roger Federer

Sep 02 14:59
It’s hard to beat the crowd (Vegas in this case) no matter how smart you think you are

Sep 02 14:57
Could Rob Dibble have been a comp for Strasburg?

Sep 02 14:15
WOWY Teachers

Sep 02 13:37
Who’s Waldo?

Sep 02 08:36
Team Elin

Sep 02 01:19
Can someone tell me why Trevor Hoffman is still allowed to pitch?