Tuesday, October 06, 2009
Incorporating guts into a forecast
David says:
My expectation was that a computer would be much better at assimilating a lot of statistical information into one final prediction than the human brain, and while I still do believe that to be the case, it does appear that we humans can see something computers do not.
But then he goes on to say:
Looking at the hitters I thought would beat their projections, I saw a lot of special skills, most of them young, but all very talented.... The hitters I thought our projections overrated were mostly some combination of old, fat and strikeout-prone. ... The only thing that really jumps out at me is that I liked a lot of high-strikeout guys, while a lot of the pitchers I didn’t like are below-average at whiffing hitters.
The “computer” that David references is the algorithm he designed to create the forecasts, and the computer simply speeded up the process. It’s the only thing the computer did. Speed. The algorithm was designed by a human. Furthermore, that human chose to ignore the impact that would have helped his algorithm. So, he “knew” (or suspected anyway) that high-K pitchers have an extra oomph (something more real, or a better ERA to regress toward). But, he didn’t put that in his algorithm. This is the kind of thing that PECOTA would implicitly accept. For example, it would look at the high-K pitchers, find the comparable pitchers, and use that as an extra regression point.
Anyway, all David has to do is create additional parameters for his algorithm. He can set a “1” for anyone that satisfies his baseball guts for improvement, he can set a “1” for anyone that doesn’t. That gives us an extra parameter for the regression equation. If his baseball guts are worth 50 points of OPS and 0.50 ERA, then he can include that in his equation. Basically, if he has a reason to suspect that a player’s 2008 or 2007 stats are not representative of that player, he can fudge that data by introducing a Guts parameter.
I think MGL has said that he manually makes park factor changes, as he thinks appropriate. It’s the same deal here.
Kudos for David to showing that he’s got baseball guts. Now, just include that in his algorithm, so that next year, he can’t beat his own algorithm.


Recent comments
Older comments
Page 1 of 344 pages 1 2 3 > Last »Complete Archive – By Category
Complete Archive – By Date