THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews
If you are a media member and would like a review copy of The Book, please contact Kevin Cuddihy of Potomac Books.

Buy The Book from Amazon

MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Tuesday, December 11, 2007

Marcel, 2008

By Tangotiger, 11:18 AM

Marcels.


#1    Anthony      (see all posts) 2007/12/11 (Tue) @ 13:46

Thanks for these.

I’d like to calculate linear weights. What did you use as the 2008 league batting line? Can I just use Chris Basak’s projection to figure out the league outs multiplier?


#2    Tangotiger      (see all posts) 2007/12/11 (Tue) @ 14:07

I don’t remember.  It may have been 2007, or a weighted of the last 3 years.

I’d suggest either:
a) take everyone with a reliability figure of .05 and below, and figure out the average

b) sort all players by PA, and take the top n number of players, such that the sum of their PA equals around 180,000 (total PA of nonpitchers in a typical year), and figure out the average

Note that I am showing the wOBA in the last column, which, with the league average wOBA you are seeking (get from either a or b above) and the PA will give you LWTS.

***

Doing what I said in step b (396 players), I get a wOBA of .338.  So Pujols, with a .421 wOBA and 603 PA gives us:
runs = (.421-.338)/1.15*603= +44 runs

Step a gives us an average of .334, which makes Pujols +46 runs.


#3    Tangotiger      (see all posts) 2007/12/11 (Tue) @ 15:52

When I repeated with the pitchers, I get Runs/G of 4.80.  That should be the environment.


#4    studes      (see all posts) 2007/12/11 (Tue) @ 17:31

Thanks, Tango.  Mind if we post them on THT again?


#5    Tangotiger      (see all posts) 2007/12/11 (Tue) @ 17:38

Sure, go ahead!


#6    Tangotiger      (see all posts) 2008/01/08 (Tue) @ 11:53

Hardball Times has posted the Marcels on its site:
http://www.hardballtimes.com/main/blog_article/2008/01/05/


#7    Tangotiger      (see all posts) 2008/01/14 (Mon) @ 15:01

Fangraphs has the forecasts of Bill James, Chone, and Marcel:
http://www.fangraphs.com/statss.aspx?playerid=1070&position=OF

Obviously, none of us have incorporated known league suspensions.


#8    gr      (see all posts) 2008/02/14 (Thu) @ 17:17

I’m curious as to how the ages are calculated.  The Marcel google doc shows Ryan Howard as age 29.  His birthdate is 11/19/79 - so wouldn’t he turn 29 after the 2008 season is over?

Thanks in advance.


#9    Tangotiger      (see all posts) 2008/02/14 (Thu) @ 17:32

Ah, my chance to showcase the wiki, so I don’t have to repeat myself:
http://www.tangotiger.net/wiki/index.php?title=Seasonal_Age

Cool, thanks for the opportunity.


#10    gr      (see all posts) 2008/02/14 (Thu) @ 18:47

Thanks!

Question #2 - has anything changed in the calculation since the 2004 introduction?

I ran the exact Beltran example and got the same numbers but when I tried it on 2008, it did not exactly equal for everyone.


#11    Tangotiger      (see all posts) 2008/02/14 (Thu) @ 19:10

No change.


#12    rg      (see all posts) 2008/02/14 (Thu) @ 19:38

OK - one more question:

What does step #6 “Rebaseline the results against an assumed league average of 2003.” mean?

If you don’t mind, would you be able to run through the 2008 calc for David Wright?


#13    Tangotiger      (see all posts) 2008/02/14 (Thu) @ 21:20

I think that’s the step where I have to convert the performance line that was based on 2001-2003 data into a 2003 context.  That is, I assume that the run environment context in 2004 will equal to 2003.  I could have assumed that 2004 = average of 2001 to 2003 at the league level I suppose.


#14    gr      (see all posts) 2008/02/15 (Fri) @ 00:09

Are you saying this?

04yrAdj = [r/g03] / [(3*r/g01+4*r/g02+5*r/g03)/12]

Do you know what your factor ended up being for 2008?

Thanks


#15    birdo      (see all posts) 2008/02/21 (Thu) @ 16:40

For the PA regression you use 5/4/3 and 1200.

For the pitching IP you use 3/2/1 but what league amount do you use?

Thanks.


#16    Tangotiger      (see all posts) 2008/02/21 (Thu) @ 17:12

Hmmm… good question.  I wrote this 4 years ago:
http://www.tangotiger.net/archives/stud0346.shtml#1028

But I didn’t mention what it was.  Weird.  Anyway, it’s 800/3 IP.

So, a guy with 200 IP in 2007, 150 in 2006, and 100 in 2005 would be regressed as follows:

totalIP = 200*3 + 150*2 + 100*1 = 1000
leagueIP = 800/3 = 266.67
regression toward league mean = 266.67 / (1000+266.67) = 21%
r = 1 - .21 = .79

Basically, the 1200 PA for a batter, if he had a .333 OBP, would be on base 400 times, and out 800 times.  So, that’s where the 800 comes from.

That is, the 266.67 IP for a pitcher roughly corresponds to 1200 PA for a batter.

So, the batter’s performance (the 5/4/3 = 12 weights) is more indicative than the pitcher’s performance (3/2/1 = 6 weights). 

A pitcher would have to face twice as many batters as a hitter would to get the same regression amount.

A pitcher with say 266.67 IP each year for 3 years would have an r=.857.

A hitter to have that level of r would need 600 PA each year for 3 years.


Page 1 of 1 pages


Name (required)
E-Mail (optional)
Website (optional)

<< Back to main


Latest...

COMMENTS

Jan 08 04:25
Sabermetric Moves of the 2009 Pre-Season

Jan 09 02:33
Cheers

Jan 08 23:45
The first Hardball Times Annual available for download!

Jan 08 21:16
Line Drives

Jan 08 20:23
(recent) Historical WAR on Fangraphs

Jan 08 16:07
Clint Eastwood is Archie Bunker

Jan 08 16:06
Hardball Times Annual 2008, starring…

Jan 08 15:58
Madoff’s Ponzi

Jan 08 03:41
Valuing relievers

Jan 07 17:41
The latest in park factors