Thursday, February 10, 2011
Yet another article that “samples” is not “true”
Carson does something that I’ve been hoping to see someone do for the longest time. What he does is figure out the surplus value of a player drafted, and link that surplus to the scout who signed him. So, if you signed Tulowitzki, you look like a genius. But this is a sample. It still is not something true, something real.
After all, I presume all thirty teams had Tulo ranked somewhere between #1 and #10. How much does it help us to give the 50MM$ surplus (or whatever it was) to the Rockies’ scout who drafted him #7, and 0$ surplus to the other 29 scouts who also ranked him quite high? In that same draft, Alex Gordon was drafted #2. I don’t follow the college scene, but presumably all the teams had him ranked pretty high as well. Do we slap down only the Royals for drafting him, because they are the ones that bought the Alex Gordon lottery ticket, even though the other 29 teams ALSO wanted to buy that ticket?
If you look at it as a sample, then you can say, yeah, the scout won the lottery ticket, that’s the money he made. But, you have to look at it from a true talent perspective. Perhaps you need to regress 99% of what you see from Carson’s process. Scout Bill Buck is credited with 74MM$ in surplus. Perhaps his true value is 740,000$, and the rest was his good fortune for having exclusive dibs on players.
The problem with the process is the exclusivity of it, the binary outcome: did he, or did he not draft that player. In many respects, it’s like looking at a player’s single game, where he puts the ball in play 4 times, and we’re trying to figure out what the true talent of the player is based on the observations of these binary outcome based on whether the batter was safe or out. Because, in this case, you would also regress what the batter did in 4 contacted PA 99% toward the league mean. On the other hand, if you had say his launch parameters (how hard he hit the ball, the spray and vertical angles, the spin imparted), then those 4 PA might regress 95% toward the league mean.
Today is sample-is-not-true day!


I’ve done similar research in the past - only using actual production from older drafts instead of projected valuations. As far as the meaningfulness of the data, of course, I ran into all of the same issues that have been mentioned at FG, of course. It is mostly a process of giving a name and some “credit” to mostly anonymous scouts. If you know the scouts well enough to drill down into their stories, though there are some neat nuggets.
I posted these results in the comment section if you’re interested:
Here are the 11 most productive area scouts from the 1987-1996 drafts.
Data is name, pre-FA slot WARP, actual pre-FA WARP, Actual – Slot WARP, some key players.
1. Tim Kelly; 6.0, 90.7, 84.7 WARP; Tim Salmon, Troy Percival, Chad Curtis
2. John Ramey; 10.9, 86.7, 75.8; Marcus Giles, Joe Mays, Jim Mecir
3. Joe Delucca; 22.1, 95.5, 73.4; Manny Ramirez, Charlses Nagy, Pete Harnisch
4. Guy Hansen; 11.0, 83.6, 72.6; Kevin Appier, Jeff Conine, Mike Magnante
5. Luke Wrenn; 26.8, 95.9, 69.1; Tino Martinez, Mike Hampton, Nomar Garciaparra
6. Roy Clark; 17.7, 86.5, 68.8; Kevin Millwood, Greg McMichael, Jerry Dipoto
7. Scott Trcka; 18.8, 81.4, 62.6; Scott Rolen, Jeff Jackson
8. Matt Sczezny; 20.3, 82.0, 61.7; John Valentin, Mo Vaughn
9. Marty Esposito; 32.3, 92.7, 60.4; KNoblauch, Mark Guthrie, Todd Walker
10. Ken Madeja; 4.5, 64.1, 59.6; Derek Lowe, Matt Mantei
11. Erwin Bryant; 2.1, 60.8, 58.7; Jeff Bagwell
I went to 11 so you can see signing one great player from a low slot (4th rd) is going to shoot a scout way up a list like this. In actuality all of these guys signed 2-3 good players tops.
And that’s over a 10 year period when some of these guys turned in a thousand reports or something like that.
My conclusion is that this is a great way to draw some attention to area scouts, but for all the reasons mentioned it’s just not a good way to rank scouts.
You need those preference lists for that.,
I’ve always been interested in asking someone this, and now that’s it’s come up I will. Tango, do you have any insights into how teams actually grade their scouts?
Just for my own curiousity, I’d kill to have access to some old pref lists to compare to the draft slot values that I have derived.
Just like you say forcasters should release old forcasts to the public to further research in general, it would be a great boon to understanding the scouting process to have access to old pref lists that are completely useless to organizations.