THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Tuesday, April 15, 2008

Hockey Databases

By Tangotiger, 02:16 PM

For a long time, there were two major online hockey databases that I was using: HockeyDB.com, by Ralph Slate, and EuroHockey.net by lots of people.  Now, we have Hockey-Reference.com by Justin Kubatko, with data provided by NHL-sanctioned Dan Diamond


#1    Colin Wyers      (see all posts) 2008/04/15 (Tue) @ 15:02

This is one area where MLBAM (and a lot of other people) are really behind the curve on. It seems pretty obvious to me that there’s money to be made in providing fans with this kind of information in the form of advertising revenue. But the people who have the most access - the leagues themselves - don’t seem to provide nearly the depth or quality of information that places like Sports Reference do.

(Nobody misses the boat, in my estimation, more than the Elias Sports Bureau. It’s no stretch to say that they have records of pretty much everything. And yet, look at their official website:

http://www.esb.com/

One book for sale! You’re kidding me, right?)


#2    Tangotiger      (see all posts) 2008/04/15 (Tue) @ 15:03

I think they position themselves like Warren Buffett: Just one Mutual Fund?


#3    Darren      (see all posts) 2008/04/15 (Tue) @ 21:19

Tango, is there any merit to this Goals Created stat they are using. To me it would value assists to low, but I am not sure what it is trying to determine.


#4    tangotiger      (see all posts) 2008/04/15 (Tue) @ 21:31

I seem to remember it having merit.  From what I remember, it normalizes all the goals to a common environment, and also aligns the assists down toward the goals.  Do you have a current reference for me to look at?


#5    Darren      (see all posts) 2008/04/15 (Tue) @ 22:10

They have a definition in the Glossary.


#6    tangotiger      (see all posts) 2008/04/15 (Tue) @ 23:04

Ok, it’s even simpler than I remembered.  It’s an ok definition.


#7    Ryan JL      (see all posts) 2008/04/15 (Tue) @ 23:27

Outstanding.  I have been waiting for this for a very long time.


#8          (see all posts) 2008/04/16 (Wed) @ 14:24

For the Crosby example, his 84 assists come out to the equivalent of 6 goals scored.  Does that mean that a line that he is on will be expected to score, in addition to his goals he puts in the net himself, an extra 6 goals that can be credited to his inherent skill?


#9    Tangotiger      (see all posts) 2008/04/16 (Wed) @ 15:39

Right, that’s the implication.  I don’t believe this to be true however.  It must be more.

The problem with these “partitions” that plague Goals Created and Win Shares is trying to split things into absolutes, when really you are trying to split the value-added.

Take Crosby: he had 36 goals and 84 assists.  Let’s say that someone with his ice time (ev, PP, sh) would have scored 30 goals and 50 assists (just a reasonable guess).  Crosby is +6 in goals scored, and +34 in assists. 

(Because you get about 1.7 assists awarded per goals, I would turn that +34 into +20.)

His “Goals Created” was 42, based on what’s on b-r.com

Presuming that Crosby played with average players, the “absolute” method would credit him as being +12 goals above average (42 minus 30).

In my case, I would credit him with being +26 goals above average.

In short, the difference between my method and their method is that theirs reports numbers as half as much as mine.  This is the general complaint I have with all these Win Shares type methods.  I’ve gone through all this stuff when I was developing metrics as a teenager.  It simply doesn’t work.

One method takes half the goals and half the goal-adjusted-assists, and adds them up.  In my case, I take the full goals above the average and the full goal-adjusted-assists above the average, and add that to the average.  To be more specific, I do:

Tango = (G-avg) + (gA-avg) + avg = G + gA - avg

B-r.com = 0.5G + 0.5gA

As you can see, in my method, you can get into negative goals.

I can prove that my way works out better if I spend more time on it with the data I need.  I’ve done enough research to know that I’m right, and I simply haven’t done it completely enough so that others know it.


Page 1 of 1 pages


Name (required)
E-Mail (optional; WILL be published)
Website (optional)

<< Back to main


Latest...

COMMENTS

Feb 11 19:33
Clutch analogy

Feb 11 19:12
Hero of the month: Brittney Baxter

Feb 11 17:59
MGL: Today on Clubhouse Confidential

Feb 11 16:48
Reader Mail of the Day: Why do we need X years of fielding data?  And what about outliers?

Feb 11 11:54
Who is Jeremy Lin?

Feb 11 10:29
Dwight Evans

Feb 11 02:12
Performance through the ages

Feb 10 23:01
For Your Soul

Feb 10 18:32
Moneyball at Villanova

Feb 10 17:00
Psst… wanna intern in Canada?