THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Tuesday, September 13, 2011

Draft Order and Major League Pitching Performance…

By , 03:22 AM

I know there has been similar research published on the web, but I am too lazy to look it up right now.  I took the 1998-2010 draft list from BA and looked at how the various pitchers did in the major leagues, breaking the rounds down into various buckets.  It was a quick study and I just matched names from the draft lists with my major league databases.  I probably missed 3-5% of the players because the names, especially first names, did not match up exactly.

First I looked at the rookie years for all pitchers in each draft round 1-3+, for a total of 4 buckets. Each round included the supplemental rounds, so, the first round actually has 60 picks in most years.  The rest of the rounds pretty much have 30 picks each.  Perhaps I should consider the first 30 picks of the 1st round as the 1st round and then the next 30 picks in the 1st round supplement as the 2nd round, etc.

Does anyone know if there is anything special about the supplemental round after the first 30 picks or is it just the next 30 best players and then the second round is 61-90 best players?

For each pitcher, I looked at whether their primary role in their rookie major league season was as a starter (S), a reliever (R), or mixed (N).

The currency I used was my normalized, component ERA (nERC), which is an “ERA” based on a pitcher’s raw stats (s, d, t, hr, nibb, hp, outs, and wp) adjusted for park, opponent, and defense, and normalized to his own league, where the average pitcher in each league, weighted by TBF, is 4.00.  I weighted the aggregate nERC in each bucket by each pitcher’s IP.  If I used the simple average of all the pitchers (weighting each pitcher exactly the same regardless of how many IP they threw), the numbers would be much higher, as the pitchers who had the worst true talent generally had the fewest IP.


“PS” is pitcher seasons.

Rookie year only:


Round      Role     PS     IP     nERC
1    R    41    1062    3.87    
1    N    11    376    5.64    
1    S    79    5485    4.39    

2    R    24    507    4.41    
2    N    6    101    6.86    
2    S    31    2167    4.66    

3    R    64    1375    3.83    
3    N    19    575    4.81    
3    S    42    2531    4.37    

Here are the top 10 picks in the first round
:

Top 10    R    8    228    3.79    
Top 10    N    2    26    3.36    
Top 10    S    30    2357    4.26

Here is the same list but with the 1st round supplement considered the second round, the second the third, etc., so that each round has around 30 picks:

Round      Role     PS     IP     nERC
1    R    34    871    4.09    
1    N    6    236    4.71    
1    S    56    4145    4.24    

2    R    7    192    2.85    
2    N    5    140    7.21    
2    S    23    1340    4.84    

3    R    24    507    4.41    
3    N    6    101    6.86    
3    S    31    2167    4.66    

4
+    R    64    1375    3.83    
4
+    N    19    575    4.81    
4
+    S    42    2531    4.37

This time it is each pitcher’s entire ML career (from 2000 to 2010), not just his rookie year:

Round      Role     N     IP     nERC
1    R    193    7744    3.70    
1    N    42    2421    4.77    
1    S    337    45468    3.81    

2    R    111    4604    3.70    
2    N    21    991    3.78    
2    S    91    9327    5.15    

3
+    R    191    6753    3.90    
3
+    N    46    2688    5.00    
3
+    S    165    18738    4.13    

Top 10    R    19    582    3.99    
Top 10    N    14    670    5.05    
Top 10    S    131    18579    3.62

Same list (entire ML career from 1998-2010) but with the 1st round supplement considered the second round, the second the third, etc., so that each round has around 30 picks.

Round      Role     N     IP     nERC
1    R    151    5818    3.72    
1    N    26    1704    4.66    
1    S    272    39515    3.73    

2    R    42    1926    3.64    
2    N    16    716    5.04    
2    S    65    5953    4.33    

3    R    111    4604    3.78    
3    N    21    991    5.15    
3    S    91    9327    4.15    

4
+    R    191    6753    3.90    
4
+    N    46    2688    5.00    
4
+    S    165    18738    4.13

I’ll present one more list.  First, an explanation.  If you have the above lists and I tell you that a 1st round (top 30 picks) pitcher is starting tomorrow for team X, and I ask you what you expect his nERA to be at the end of the season, would the correct answer be 3.73?

No.  That is because the better pitchers have less attrition after their first start such that when we weight each pitcher’s season nERC by his IP, we get an aggregate nERC for each bucket which is less than “the average seasonal nERC for every rookie pitcher in each draft bucket.”

The only way to answer the question above (“What will this pitcher’s, who is making his ML debut tomorrow, nERC be at the end of the season?”) is to do the same analysis, but weight each pitcher’s seasonal nERC equally.  So now the bad pitchers will get just as much weight as the good ones who threw lots more innings.  If we do that, and only look at each pitcher’s rookie year, here is what we get.  You’ll notice that the numbers are higher.  These numbers should be close to the true talent of any pitcher who gets a ML debut.

Round      Role     N     IP     nERC
1    R    34    871    4.58    
1    N    6    236    4.76    
1    S    56    4145    4.74    

2    R    7    192    5.15    
2    N    5    140    7.01    
2    S    23    1340    5.18    

3    R    24    507    5.51    
3    N    6    101    7.59    
3    S    31    2167    5.76    

4
+    R    64    1375    4.70    
4
+    N    19    575    5.01    
4
+    S    42    2531    5.55

Top 10    R    8    228    3.73    
Top 10    N    2    26    2.62    
Top 10    S    30    2357    4.47

For comparison purposes, here are all pitchers in the major leagues (from 00-10), with each pitcher’s nERC weighted by their IP:

R 3892 158,677 3.74
N 463 29
,895 4.76
S 2267 294
,051 3.99

The same 
list with each pitcher season weighted equally:

R 3892 158,677 4.84
N 463 29
,895 5.42
S 2267 294
,051 4.57

#1    JD Sussman      (see all posts) 2011/09/13 (Tue) @ 08:43

"Does anyone know if there is anything special about the supplemental round after the first 30 picks or is it just the next 30 best players and then the second round is 61-90 best players?”

The supplemental round is draft compensation for teams. The players picked are, as you said, the next best players available. Though, due to various slotting and signability issues, the best players are not typically chosen one after the other. (See, Josh Bell 3rd Rd, 2011).


#2    Rally      (see all posts) 2011/09/13 (Tue) @ 09:18

If you have signing bonus data that would probably be a better proxy for perceived talent than draft spot.  But if you don’t, draft spot is an OK proxy.

That Josh Bell is supposed to be good, a first round talent who everyone thought was heading to college.  Pittsburgh took a chance on him in the third round and signed him.  Not to be confused with Josh Bell, catcher who couldn’t advance past A ball for Toronto, or Josh Bell, Oriole 3B with 73 strikeouts and 6 walks in 215 career PA.

Announcers forget the circumstances and sometimes declare Jordan Walden a 12th round genius find.  Not really accurate, he was one of the elite high school arms going into that season, then had some arm issues his senior year.  Everyone figured he’d just go to college, but the Angels drafted him and signed him well over slot money.  I’m not sure if they paid him first round money but his bonus really stood out among 12th rounders.


#3    Tangotiger      (see all posts) 2011/09/13 (Tue) @ 09:29

I think a better way to do this instead of “rounds” is to look at the pitchers as their own universe.  So, take the top 50 pitchers selected, and assign them a value 1 through 50.  (Or, take the 50 highest signing bonuses, and assign 1 through 50, with the draft order as the tiebreaker.)

I don’t think we need to worry about whether you had 50 pitchers selected in the top 100 or top 90 or top 110 picks.  We just want the top 50 pitchers.

Furthermore, you won’t have a gap in the sequence, so it would be easier to come up with a smoothed function of value for each draft pick.

Also, I’d set a floor for talent level for any pitcher, whether he pitched in MLB or not.  Something like 140% of league average (in MGL’s case, that means a component ERA of 5.60).

Finally, you have to adjust between reliever and starter performance.  It’s a huge difference and will impact anything that is done.


#4    Tangotiger      (see all posts) 2011/09/13 (Tue) @ 09:30

This study by Keith Scherer was pretty good, for those who never saw it:

http://sports.espn.go.com/mlb/columns/story?columnist=neyer_rob&id=1811682


#5    MGL      (see all posts) 2011/09/13 (Tue) @ 11:05

"I think a better way to do this instead of “rounds” is to look at the pitchers as their own universe.  So, take the top 50 pitchers selected, and assign them a value 1 through 50.”

Yes, of course I should I have done it that way.  Will redo.

“Finally, you have to adjust between reliever and starter performance.  It’s a huge difference and will impact anything that is done.”

That is why I pretty much broke it down by reliever, starter, and both.

Where can I get a draft database that includes signing bonus?  Anyone have a draft database (with or without bonuses) that includes retrosheet or MLB id’s?


#6    Tangotiger      (see all posts) 2011/09/13 (Tue) @ 11:30

http://www.baseball-reference.com/draft/?query_type=year_round&year_ID=2001&draft_round=1&draft_type=junreg&

If you hover over the name, or do “view source”, you can see the ID.

***

I wasn’t sure how you split the relief/starter, as I presume there’s not much 100%/0% kind of usage being done.  I guess what you can add is % of innings as starter in your 3 subgroups.  This way, it’s easier to see how much of an adjustment we can make.


#7    JD Sussman      (see all posts) 2011/09/13 (Tue) @ 13:24

Rally, I’m not sure using bonus size is totally going to fix things. The size of the bonus is directly correlates to the amount of leverage a player has, not his ability. If MGL was to do this, he would find many less HS pitchers placed above more talented college arms.


#8    MGL      (see all posts) 2011/09/13 (Tue) @ 20:15

On second thought, the actual round might be better than just ordering the pitchers.  The reason is this:

Say, in one year, there are no pitchers in the first round.  That means, to some extent, that the pitchers who were chosen in the second round are not that good.  So they should not be grouped in with pitchers in the first round from another year.

It’s probably six of one and half a dozen of the other.

I’m going to do the same analysis for BA top 100 prospects list from each year.  Of course, those are quite different from the draft since they include minor league performance and in some cases, a little major league performance…


#9    Tangotiger      (see all posts) 2011/09/13 (Tue) @ 20:30

Or that the batters were really top heavy (Braun, Zimm, Tulo all drafted together).

As I said, just so much cleaner to not have gaps, as you’ll see when you use the data for other purposes.


Page 1 of 1 pages


Name (required)
E-Mail (optional; WILL be published)
Website (optional)

<< Back to main


Latest...

COMMENTS

May 25 05:00
Help needed with sticky issue…

May 25 04:38
The first time a pitcher has ever intentionally thrown at a batter….

May 25 03:39
Lack of hustle during a game

May 25 02:54
Largest demonstration in Canadian history?

May 25 02:38
NFLPA lawsuit against collusion

May 25 01:43
Neal Huntington’s best moves

May 24 23:50
Rooting for laundry

May 24 17:04
Firefox, IE, or Chrome?

May 24 12:07
How to beat the shift

May 24 11:11
Incredible story