THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

Filter posts by...

 

Friday, November 21, 2008

Run-based similarity scores

By Tangotiger, 11:02 AM

Great work… to which I disagree.  Pizza Cutter did similar work based on rate stats, to which I have lots of comments on his thread.  My key point is this:

If you are interested in looking for similar players to Vince Coleman, you may insist that the speed components (3b per 2b+3b and sb per sbOpp) be weighted much more than you otherwise would, because you are really interested in the speed players mostly.

So, in a run-based system, the speed components simply won’t have much differentiation.  However, since we know the speed is strongly tied to SB, and speed is such a huge component of a player’s skillset, I would heavily overweight that in terms of trying to find similar-style players.  Same deal for HR.  Perhaps this is best exemplified with the K, which is very close in run value to the typical out, but clearly, there’s a huge difference in a hitter with 180 K and 40K.  Basically, the more the component tells you about the player (rather than how much runs it’s worth), the more you should weight it.

(5) Comments • 2008/11/25 • SabermetricsForecasting
Page 1 of 1 pages

Latest...

COMMENTS

Feb 11 20:29
Who is Jeremy Lin?

Feb 11 20:11
Clutch analogy

Feb 11 20:11
Fighting leads to goals?

Feb 11 19:55
Why do players get crappy caps?

Feb 11 19:12
Hero of the month: Brittney Baxter

Feb 11 17:59
MGL: Today on Clubhouse Confidential

Feb 11 16:48
Reader Mail of the Day: Why do we need X years of fielding data?  And what about outliers?

Feb 11 10:29
Dwight Evans

Feb 11 02:12
Performance through the ages

Feb 10 23:01
For Your Soul

THREADS

November 21, 2008
Run-based similarity scores