Monday, April 11, 2011
Baseball ID Mapping File: updated daily
Great stuff from Ted and friends. For all those who are already wasting your valuable time matching IDs and the like: stop doing that sh!t. I can’t count the number of hours I’ve wasted doing that. Must be in the dozens, if not past 100 already. And if you’ve got a new dataset that you can contribute (that’s not already mapped by Ted already… check to see if he has it), see if Ted can incorporate it. This is a beautiful thing, and I’ve been transitioning all my mappings to his file.
I am pleased to announce that the Baseball ID Working Group is now publishing a register of professional players, which is updated daily. The Baseball ID Working Group is a consortium of data providers and analysts who are working together to publish a definitive register of basic identifying information on players, managers, and umpires throughout the history of professional baseball internationally, including a cross-reference table of major person identification systems.
The current (provisional) download site for this is:
http://balco.sabr.org/data/baseballid/
Each day’s release is dated in the filename; for convenience, the file baseballid-latest.zip always points to the most recent update.
Updates should happen approximately every morning. However, there is a manual approval process, so there may be an occasional day in which an update does not occur if I am not available to complete it.
These data are available under a Creative Commons non-commercial license for the benefit of the community.
There is a README.txt available which contains full details of what is present, and how to interpret/use it. Importantly, it also contains the names of several key contributors who have been instrumental in helping get this together.
Please direct enquiries to me offlist at dataczar (at) sabr (dot) org.
Ted


Yes, this is fantastic!