THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Friday, November 21, 2008

Run-based similarity scores

Great work… to which I disagree.  Pizza Cutter did similar work based on rate stats, to which I have lots of comments on his thread.  My key point is this:

If you are interested in looking for similar players to Vince Coleman, you may insist that the speed components (3b per 2b+3b and sb per sbOpp) be weighted much more than you otherwise would, because you are really interested in the speed players mostly.

So, in a run-based system, the speed components simply won’t have much differentiation.  However, since we know the speed is strongly tied to SB, and speed is such a huge component of a player’s skillset, I would heavily overweight that in terms of trying to find similar-style players.  Same deal for HR.  Perhaps this is best exemplified with the K, which is very close in run value to the typical out, but clearly, there’s a huge difference in a hitter with 180 K and 40K.  Basically, the more the component tells you about the player (rather than how much runs it’s worth), the more you should weight it.


(5) Comments • 2008/11/25 • SabermetricsForecasting
Page 1 of 1 pages

<< Back to main