THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Friday, December 17, 2010

Fantasy valuation

By Tangotiger, 02:48 PM

David asks, in part:

How do you choose the player pool for averages and standard deviations? Do you use last year’s stats? Do you use projected stats? Do you use iterations? Do you use empirical data from similar fantasy leagues?

What you cannot do is just use the projected stats on its own to figure out the standard deviation.  Imagine, for example, that every pitcher is forecasted for between 8 and 12 wins.  That would set one standard deviation to be pretty low, say 0.5 wins, and so the top guy will be say 3 or 4 SD from the mean.  But, he’s only 2 wins above average.  That guy forecasted with 12 wins will win say between 6 and 18 games.  Now. imagine all base stealers are forecasted for between 8 and 12 steals.  And, if we presume stealing is far easier to forecast (for this illustration… if you can’t get past that, then call it quatlus), then the guy with 12 steals will also be 3 or 4 SD from the mean.  And in reality, he will end up with between 11 and 13 steals.

So, that’s why you can’t use the standard deviation of the forecasted stats.  You need to include the standard deviation from the uncertainty of the forecast and the random variation in the stat. 

The easiest thing to do is just look at the empirical data from prior seasons, and take the standard deviation of those observed data.  Then calculate your z-scores.  It’s going to be pretty stable each year.  For example, I pretty much stick with a 3/3/1/1/ model for RBI/R/HR/SB in terms of weighting.  Things change every year of course, and you can feel free to create a model that uses the forecasted data to determine the expected standard deviation.  That would be fun to do.  Until then, take the easy way out, day tripper.


#1    BrianK      (see all posts) 2010/12/17 (Fri) @ 17:12

I believe the correct spelling is quatloos.

Sometimes what you know is more embarrassing than what you don’t know. This feels like one of those times.


#2    philosofool      (see all posts) 2011/01/25 (Tue) @ 01:46

Is the thought that you should take the player pool of 150 (or whatever number) of batters and of pitchers from actual seasons to generate your standard deviations by a method of iteration?


Page 1 of 1 pages


Name (required)
E-Mail (optional; WILL be published)
Website (optional)

<< Back to main


Latest...

COMMENTS

May 25 06:39
Lack of hustle during a game

May 25 05:00
Help needed with sticky issue…

May 25 02:54
Largest demonstration in Canadian history?

May 25 02:38
NFLPA lawsuit against collusion

May 25 01:43
Neal Huntington’s best moves

May 24 23:50
Rooting for laundry

May 24 17:04
Firefox, IE, or Chrome?

May 24 12:07
How to beat the shift

May 24 11:11
Incredible story

May 24 09:41
Racial bias in card collecting: not the collectors, but the players on the cards