THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Friday, August 29, 2008

RARP v VORP

By Tangotiger, 03:32 PM

BP has two measures that do similar things.  One is called VORP, by Keith Woolner, and another is called RARP, by Clay Davenport. They both exist because BP was allowing its analysts to develop their own metrics, which is a good thing.  According to BP’s glossary, VORP is “The number of runs contributed beyond what a replacement-level player at the same position would contribute if given the same percentage of team plate appearances” and RARP is “A statistic that compares a hitter’s Equivalent Run total to that of a replacement-level player who makes the same number of outs and plays the same position”.

RARP data. VORP data

First, a big thanks to BP for making the data so accessible.  And, they are pretty good on name matches, making my life alot easier.  There were only some 40 or so names out of 912 that I had to match manually. 

The total VORP, as of today, is 4436 runs.  And for RARP, it is 4735 runs.  This is based on around 154650 PA, which means that per 700 PA, we have this:

VORP: +20.0 runs
RARP: +21.4 runs

As you can see, both have a very similar replacement baseline.  Indeed, this is a very common baseline.  MGL uses +19.4 runs, and I use (roughly) +19.8 runs.

The question is if either measure shows much difference.  This is the 10 guys that VORP likes more than RARP:


diff RARP VORP Name
11.2 42.7 53.9 AUBREY HUFF
10.3 62.7 73.0 LANCE BERKMAN
10.0 51.6 61.6 MATT HOLLIDAY
9.0 39.3 48.3 KEVIN YOUKILIS
8.4 36.0 44.4 DUSTIN PEDROIA
8.4 20.9 29.3 DERREK LEE
7.3 48.8 56.1 ALEX RODRIGUEZ
7.0 18.2 25.2 CRISTIAN GUZMAN
6.7 48.5 55.2 MARK TEIXEIRA
6.7 11.2 17.9 CARLOS ZAMBRANO

And these are the guys that RARP likes more than VORP.
diff RARP VORP Name
-9.0 37.3 28.3 BRIAN GILES
-8.9 25.3 16.4 JACK CUST
-8.0 -3.6 -11.6 BRAD WILKERSON
-7.5 -0.8 -8.3 JACK HANNAHAN
-7.2 3.1 -4.1 JOHN BUCK
-7.1 13.5 6.4 MARK ELLIS
-6.9 -2.0 -8.9 MARK TEAHEN
-6.4 2.2 -4.2 JEFF MATHIS
-6.4 47.4 41.0 JOE MAUER
-5.9 6.7 0.8 JASON VARITEK

Alex Rodriguez is either +1 run ahead of Mauer according to RARP or he is +16 runs ahead of him according to VORP.

Here are the positional totals:
Pos RARP VORP n
1B 660.3 700 53
LF 655.7 625.1 75
CF 585.8 571.6 51
3B 583.4 527.8 55
RF 581.6 483.7 50
2B 548.9 550.6 60
SS 476 477.7 66
C 459 302.1 88
Ot 171.9 189.3 88
P 12.3 8.0 326

We see an enormous gap among catchers and RF.  RARP likes catchers by an extra 157 runs, and RF by an almost extra 100 runs.

You will also notice that Carlos Zambrano (the hitter) is on the list.  Strangely, all the pitchers as a group are close to zero.  The range of VORP goes from +18 Zambrano and +10 Backe to -6 Lohse and Jimenez.  For RARP, it’s +7 Zambrano and +4 Owings to -2 Cueto and Lohse.  Indeed, when you look at standard deviations, VORP is 50% wider than RARP among pitchers-as-hitters.

I have no editorial comment.

#1    Colin Wyers      (see all posts) 2008/08/29 (Fri) @ 16:55

VORP uses a different replacement level for catchers than other positions - I think it’s 85%. It doesn’t seem like RARP does. My gut feeling is that RARP is correct on this.


#2          (see all posts) 2008/08/29 (Fri) @ 17:34

Based solely on your lists above, it appears that RARP may like walks more than VORP, with Giles, Cust & Wilkerson topping the delta list.


#3          (see all posts) 2008/08/29 (Fri) @ 17:49

I notice that 3 A’s are on the second list and, in fact, every Oakland regular is rated higher by RARP than by VORP. The difference is an average of 1.1 runs/100 PA. That has to be something systemic like they’re using very different park factors.


#4    tangotiger      (see all posts) 2008/08/29 (Fri) @ 19:28

I have no doubt that there are many nuances, some major, like the low walks for VORP, and some minor, like a park factor for a few parks.  Add in a differing replacement level by position, and, boom, it’s hard to tell why one is doing what it is doing, without alot of work.


#5    Colin Wyers      (see all posts) 2008/08/29 (Fri) @ 21:46

That’s why I did a reimplementation of MLV and BRAA to do my comparison - that way I avoided the question of whether the differences I was seeing were due to different datasets, park factors, replacement levels, and just focused on the run estimation.

Also, what are you doing about players who played multiple positions? I believe VORP counts each player’s production by position seperately, while RARP rolls them all up into one value.


#6    tangotiger      (see all posts) 2008/08/30 (Sat) @ 01:10

First off, I added up exactly the RARP and VORP numbers as BP has.

I assigned the primary position as Clay had to each player.


#7    Hugh      (see all posts) 2008/08/30 (Sat) @ 04:51

From a quick look at the 20 players with differences, I’d guess VORP to be the more realistic of the two metrics.


#8          (see all posts) 2008/08/30 (Sat) @ 12:54

VORP also uses 75% of positional average for 1B (and DH), which is lower than the 80% of most other positions.  That seems dumb and overrates 1B, and means I disagree with Hugh.


#9    Colin Wyers      (see all posts) 2008/08/30 (Sat) @ 13:05

Carlos Zambrano is an interesting test case, since he doesn’t walk at all.

Hugh, what makes you think VORP is “more realistic”? Because I don’t see it.


#10    tangotiger      (see all posts) 2008/08/30 (Sat) @ 13:36

The gap between Zambrano and Lohse is 24 runs in VORP and 9 runs in RARP.  Those are the leader and trailer by both measures.

There should not be any kind of difference like that, especially since the overall pitchers as a group are both close to zero.

B-R.com, using Palmer’s LWTS, has a 17 run gap between the two.  If you adjust for the slight PA difference, you’ll get close to 20 runs, which is “close enough” to VORP and “far away” from RARP to give you pause on RARP for pitchers-as-hitters.

I do know that I have a big issue with Equivalent Runs at the low-end, right around where the pitchers hitting levels is, actually.  This may be a manifestation of that.


#11    Colin Wyers      (see all posts) 2008/08/30 (Sat) @ 14:13

I presume this is what you’re talking about:

http://www.insidethebook.com/ee/index.php/site/comments/why_is_eqa_so_complicated/

If so, that’s a problem with EqA, not EqR. I don’t know if Clay uses EqA to figure out RARP or not. (You could do it either way, I think.)

But all linear run estimators should have some issues at the extremes, not just EqR.


#12    tangotiger      (see all posts) 2008/08/30 (Sat) @ 22:05

It may be an EqR issue:

“4. EQR = (2*RAW/LgRAW - 1)*(LgR/LgPA)*PA “

If RAW is half the lgRAW, EqR = 0.  It might even get in the negative.

I’ve never looked into pitchers-as-hitters, but I would bet that there is an issue there, be it EqR or EqA.

At the very least, we can see that the sympton is there: having Zambrano just be 9 runs better than Lohse as a hitter is inconsistent with the Woolner and Palmer approach.  Seeing that I have had misgivings with EqA at the bottom-end, it’s almost certainly an EqR issue.


#13          (see all posts) 2008/08/31 (Sun) @ 18:07

My comment at #8 is wrong.  VORP uses 85% of average for 1B/DH.  And #1 is wrong—it’s 75% for catchers.

Based on this, I’m not sure why DHs tend to appear overrated by VORP (see Aubrey Huff this year) or why the total catcher value always seems really low.


#14          (see all posts) 2008/08/31 (Sun) @ 18:10

Got it.  DHs are overrated by VORP because they are compared to other DHs—which tends to be a rather uninspiring position—instead of being compared to a level of performance at least equal in quality to 1Bs.  This is a good launching point for explaining why positional adjustments based on fielding or player pool are better.

I feel like I’m in a two year time warp thinking this much about VORP.


#15    Colin Wyers      (see all posts) 2008/11/15 (Sat) @ 13:51

Using the appearances table in the BDB and some linear weights, I managed to cobble together values that look a lot like VORP, but without some of the extra baggage:

http://www.editgrid.com/user/cwyers/Linear_Weights_VORP

I’m not exactly proud of the park factors - it’s just the Batting Park Factors from the teams table with a little regression thrown in. I am somewhat more pleased with how I figured replacement level - it’s close enough to how BP does it in aggregate, but on a position-by-position basis it’s based on Tango’s adjustments.


#16          (see all posts) 2008/11/15 (Sat) @ 14:09

I just posted projections for all of the current minor league rfee agents, including the hightest level they played at this past season.

I hope this may be helpful in empirically deriving values for replacement level.

http://statspeak.net/2008/11/freely-available-talent.html


#17    MGL      (see all posts) 2008/11/15 (Sat) @ 18:27

In the second sheet of your file, those numbers (like .121) are runs per PA?  If yes, your replacement level is almost exactly the same as mine, which is around -18 runs (below average) per 630 PA. How do you get these replacement numbers from your minor league FA?


#18    Colin Wyers      (see all posts) 2008/11/15 (Sat) @ 18:42

Unless I’m missing something, I think MGL is reading what Brian wrote and looking at my spreadsheet. I didn’t use minor league free agents at all.

Those are R/PA. I set replacement level at -20 runs per 700 PA, and then adjusted by position from there, using the figures Tango came up with from your UZR data.

That said, a quick look at Brian’s spreadsheet gives me -19 BRAA for the players listed per 700 PA. (Just did a weighted average of wOBA, using AB as the weight - not perfect but workable. .306 wOBA as a group.) So pretty close.


Page 1 of 1 pages


Name (required)
E-Mail (optional)
Website (optional)

<< Back to main


Latest...

COMMENTS

Aug 31 15:28
Fans Scouting Report: Update

Sep 02 14:49
Mail: rWAR v fWAR

Sep 02 14:15
WOWY Teachers

Sep 02 13:37
Who’s Waldo?

Sep 02 13:00
It’s hard to beat the crowd (Vegas in this case) no matter how smart you think you are

Sep 02 12:05
Could Rob Dibble have been a comp for Strasburg?

Sep 02 08:36
Team Elin

Sep 02 01:19
Can someone tell me why Trevor Hoffman is still allowed to pitch?

Sep 01 23:16
Strasburg II

Sep 01 22:11
PITCHf/x Summit 2010 - Recaps