THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews
If you are a media member and would like a review copy of The Book, please contact Kevin Cuddihy of Potomac Books.

Buy The Book from Amazon

MOST RECENT ARTICLES
Mailbag:You ask:We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Friday, July 28, 2006

Spread in Talent

By Tangotiger, 11:38 AM

How much has the spread in talent changed over the years, among the league’s regular players?

Here’s a step-by-step process as to what I did, and what it shows:


1 - Start with the Lahman database, and exclude all pitchers.  That leaves me with 70,760 player seasons.

2 - Sort them by PA, descending order, and select a number of players equal to the number of regulars for each season.  (That’s 8 players per season, with 9 for the AL-DH seasons.) This leaves me with 20,494 player-seasons.

3 - For each season, calculate the standard deviation of the OBP, if all players in the pool were equals.  This is your random variation.

4 - For each player season, calculate his z-score, as the number of standard deviations he is from the mean.  The career leader is Bonds, 2004, with a z-score of 13.5.

5 - For each season, calculate the standard deviation of all the players’ z-scores.

If a league has a z-score of 1.00, that means that the distribution of the observed OBP exactly equals what one expected if all players were equals.  This has never happened.

6 - Finally, you multiply the z-score by the random standard deviation for the OBP to get the true distribution of OBP.

The tightest distribution in OBP talent was 1978, where one standard deviation = .025 OBP.  That is, we expect that in 1978, 95% of all the regular players had a true talent rate within .050 points of the mean.  The widest distribution, ignoring the early years, was 1923, with 1 SD = .042.

The 1978 may have been an anomoly, or perhaps it was at the peak of speed demons.  That is, if the true OBP spread was that tight, perhaps the true speed spread was very wide, so that, overall, the speed+OBP component was normal.

Then came 2005.  The true OBP spread is .027, the tightest it’s been since 1978.  This is a huge dropoff of the .033 of the previous season.  It is in fact the largest dropoff in OBP talent in modern baseball history.

This is not 1978, and therefore, we can’t say that the shift has gone to speed.  I doubt that the shift has gone to power.  When I remove Barry Bonds, the dropoff disappears.  The talent level is still the tightest its been since 1978, but now the tightening has been more gradual, from .035 in 2000 to .027 in 2005.

The problem now, with the talent level of the players being so tight, is that it’s hard to figure out who is better than whom.  In the past we just needed 200 PA from a hitter, to regress his OBP 50% towards the league mean.  We now need 300 PA to be as certain.

Next time, I’ll look at other components, like speed and power, to see what kind of shift there’s been in spread in talent.  And then pitchers.

#1    Tangotiger      (see all posts) 2006/07/28 (Fri) @ 12:52

Another way to look at this is the spread in talent for a fixed number of players.  Now remember, what I did earlier was based on the number of teams, so in effect what I was giving you was the spread in talent per team.

Now, I’m going to take a fixed number of players, 200, and tell you how much the spread in talent has changed in that group.

From the early 1900s through until WWII, the spread in true OBP among the top 200 players in PA, was in the .040-.045 range.  From that point onwards, until 1958, it’s been pretty flat at .035-.040.  Since then, until around 1980 it’s been in the .030-.035 range.

Afte that point, the spread in talent among the 200 regular players has been increasing a bit every year, with a sustained peak from 1993-2002.

It is important to note that I did not control for park factors, and introducing Coors into a study certainly requires one to do so.  Coincidentally, since the PED years happens to coincide with this time period as well, it certainly bears further investigation.

That is, while we expected a tightening in talent levels, as the years go by, since we are drawing from more players, and more people want to be ballplayers, and we are always looking at 200 players, we instead find a reversal in the last 10-15 years.  (A reversal that seems to have unreversed itself suddenly in the last couple of years).  Whether that has to do with PED, Coors, all the new parks, or a different selection process for players, more work needs to be done…


#2    John Beamer      (see all posts) 2006/07/28 (Fri) @ 21:43

It would also be interesting to correlate the impact of league expansion with changing spread in talent ...


#3    MGL      (see all posts) 2006/07/28 (Fri) @ 21:54

Do you have a standard deviation for the “spread in talent” per year (the standard deviation by chance of the estimated standard deviation of skill)?  We certainly want to know the uncertainty of your measure.  For example, how signficant is the difference between the .027 and the .033?  Inquiring minds need to know that.


#4    tangotiger      (see all posts) 2006/07/29 (Sat) @ 05:06

That’s next for me to do.  I gotta re-re-re-re-read our Appendix again… easily the most re-read piece in The Book for me.

One thing though is the Bonds-Williams-Ruth effect.  These guys are such outliers, even among the outliers that are MLB players, that it significantly skews the distribution.  If you look at their career z-scores, I think they are at 6 or 7 or so.  I’m thinking that at the least, I should artifically max out the z-scores of the players, since we are looking at the spread in talent, and having such a point that is really out there, really stretches the distribution to make it (try to) fit that point.  I am including the IBB, so that may be another issue to consider.


#5          (see all posts) 2006/07/30 (Sun) @ 08:26

I am not sure that you are really measuring a spread in talent but rather a change in philosophy of fielding.  The standard deviation of regular players in any given year would decrease if teams were making a decision to sacrifice defensive ability at the traditionaly defensive positions (SS, 2B, and CF) for offensive ability.


#6    John Beamer      (see all posts) 2006/07/31 (Mon) @ 01:19

Tango

This is a very interesting post and certainly got me thinking about standard deviation, luck and skill.

Just one quick question. By pre-selecting the best players (ie, point 2 in your initial recipe), you could be introducing selection bias into your estimates as you’d expect the better players to garner more player time. So the question is does the sample that you have selected approximate to a normal distribution? My guess, without replicating your analysis is that it probably does but I was wondering if you had checked?

John


#7    tangotiger      (see all posts) 2006/07/31 (Mon) @ 08:07

Whatever bias I would introduce, would apply to all the years, right?  I’ll check anyway.

Peter has a valid point that there are other reasons that the spread in OBP might change.  I was thinking of it only in terms of other hitting elements, but it could be other player components as well.


#8    tangotiger      (see all posts) 2006/07/31 (Mon) @ 10:08

I took all the teams since 1962, with a minimum of 158 GP (n=1022), and calculated the standard deviation of their actual (observed) winning percentage.  That’s .072.

I then figured the random variation, expected, given 161.8 trials (the average number of games per team in my sample).  That’s .039.

Since…
variance(observed) = variance(true) + variance(random)

...we can calculate the standard deviation as .060.

That is, the true spread in talent, at the team level, over the last 40 years or so, is .060 wins = one standard deviation.  (If all players were randomly placed on teams, we would have expected the variance of the true talent to be .060.)

***

The observed standard deviation for runs scored per game is .58, and runs allowed is .59, which is around .015 runs per PA, or .013 wOBA. 

That is, 50% of baseball is offense, and 50% is defense.

***

With each team having around 6000 PA, our random standard deviation is .0065, making the true wOBA .011.

That is, the variance of the players’ hitting, running, the parks, gives us true wOBA of .011.  That’s true on the offense and defensive side.

.011 ^ 2 = offense ^ 2 + parks ^ 2

Assuming that parks = .004 (that is, the standard deviation of the parks is .004), then players = .010

What does this mean?  That the true spread in talent, offense, is one standard deviation = .010 of wOBA.

For the defense, you have two forces, the pitchers and fielders.  For the moment, I’m going to assume that the spread in pitching talent is a bit larger than the spread in fielding talent.  In that case, we get:
.011 ^ 2 = pitching ^ 2 + fielding ^ 2 + parks ^ 2

That gives us pitching = .008, fielding = .006.

Anyway, going back to the hitters.  If the true spread at the team level is .010, then what’s the spread at the player level?  Assuming nine full-time hitters:
.010 ^ 2 = (h1/9 ^ 2 + h2/9 ^ 2 + ... + h9/9 ^ 2)

h1=h2=...=h9=.030

That is, the spread in individual hitting talent is .030 wOBA.  Remember what my original blog entry said?

The tightest distribution in OBP talent was 1978, where one standard deviation = .025 OBP.  That is, we expect that in 1978, 95% of all the regular players had a true talent rate within .050 points of the mean.  The widest distribution, ignoring the early years, was 1923, with 1 SD = .042.

Well, the average for 1962-2005 was .031 of OBP, which is .033 of wOBA.  Since I didn’t account for parks, the .033 of wOBA with no park adjustment is equivalent to .030 of wOBA with park adjustment.

So, there you go.  Two completely different approaches, to tell you that the spread in offensive talent is .030 of wOBA (or .028 of OBP).

Repeating the calculations for the pitchers (assuming 9 equal pitchers of 162 IP each), and we get a spread in pitching talent of .024 wOBA.  Fielding talent is .015 wOBA.

However, if we separate the pitcher as fielder from the other eight nonpitchers as fielders, the pitcher-fielding talent is probably .010, and the 8 nonpitcher-fielding talent is probably .019.

So, the final spread in nonpitcher talent is:
sqrt(.030^2+.019^2) = .036

And for pitchers, it’s:
sqrt(.024^2+.010^2) = .026

That is, 58% of baseball is nonpitchers, and 42% is pitchers.

(Given all my assumptions, regarding parks and the initial pitching/fielding split.)


#9    Guy      (see all posts) 2006/07/31 (Mon) @ 11:58

Very interesting, Tango.  A few questions/observations:

What is the basis of the pitching fielding split of .008/.006?  Just a guestimate, or based on research? 

How can you use team level offense variance of .01 to calculate individual player variance of .03, when the quality of player1 is not independent from quality of player2, etc.?  If teams were assembled by letting team1 pick the 25 best players, then team2...with team30 getting the 25 worst players, you would have the same underlying player talent spread but a much larger team spread.  Presumably, teams that are richer, smarter, etc. will have more good players than other teams. And this may not affect hitting and pitching in the same way:  one could imagine that hitters’ greater predictability could mean that offensive ability is more “efficiently” concentrated in certain teams. 

Similarly, when you say “So, the final spread in nonpitcher talent is: sqrt(.030^2+.019^2) = .036,” doesn’t this assume that hitting talent and fielding talent are independent?  In fact, I would think they are negatively correlated (good fielders = weak hitters), so the spread of total nonpitcher talent should narrow.


#10    tangotiger      (see all posts) 2006/07/31 (Mon) @ 12:28

Guy,

To your first point, since the results of assuming that hitting talent is independent produces a spread consistent with another method that assumes independence, we have a pretty good check here that hitting talent does not cluster.

As to your second point, you are right, that I should have made that clear, that I treated the two as being independent.  If they were totally dependent, then the spread would be much narrower (.023).  My guess, as yours, that there is some level of interdependence.  However, I seem to remember doing this work, and finding that it wasn’t that strong.


#11    tangotiger      (see all posts) 2006/07/31 (Mon) @ 12:31

For the moment, I just left it as a guesstimate, with limited research.

Pitchers are involved in 25% non-BIP, and 75% BIP.  But, the true variance in the 25% is much larger than in the 75%.  For fielders, they get 0% in the non-BIP, and a portion of the 75%.  Luck gets a little portion in the 25% and a large portion of the 75%.  The 8/6 split just seems reasonable, though you could easily make the case for a 9/5 split.


#12          (see all posts) 2008/02/07 (Thu) @ 10:28

I thought the reason Win Shares undervalued starting pitchers is because it places identical value on runs prevented (defensive marginal runs) and extra runs scored (offensive marginal runs). A run prevented is considerably more valuable than an extra run scored.


#13    Tangotiger      (see all posts) 2008/02/07 (Thu) @ 10:56

No, Win Shares undervalues pitching because it doesn’t understand the following three truths:

1. When given two variables, offense and defense, offense is 50% of the game and defense is 50%.

2. When given three variables, offense, pitching, and fielding, offense is 42%, pitching is 33%, and fielding is 25%.

3. When given two variables, nonpitchers and pitchers, the nonpitchers are worth 58% and the pitchers are worth 42%.

I have explained these two seemingly contradictory statements elsewhere on my blog (do a search).  Nonetheless, the summary is that you cannot try to split “absolute” numbers.  It simply won’t work.  The basis must be on the spread in talent (standard deviations or SD), and everthing does add up (the variances, which is SD squared).  That’s the whole problem.  The SD squared add up, but the allocation is based on SD.

Please, do a search, so I don’t repeat myself.  If you can’t find it, I’ll look it up.


#14    Tangotiger      (see all posts) 2008/02/07 (Thu) @ 11:32

Here’s one relevant post:
http://www.insidethebook.com/ee/index.php/site/comments/spread_in_talent/#8


#15          (see all posts) 2008/02/07 (Thu) @ 12:29

Obviously, the statements aren’t contradictory, and I think they actually support my point, which is that in MLB saving runs is more valuable than scoring runs. Or, as you say, pitching and fielding add up to 58 percent. But in Win Shares defensive and offensive marginal runs have almost the same value. If that system increased the value of defensive marginal runs to account for 58 percent of wins, pitchers would have many more Win Shares—20 to 25 percent more.

I also think Win Shares sets the zero-level too low, but I’m not sure how that affects the distribution of shares.


#16    weskelton      (see all posts) 2008/02/08 (Fri) @ 00:31

I’m a little confused on the breakdowns and I’m not sure what I should be searching for in the blog to clear this up.  If when given 2 variables, offense = 50% and defense = 50%, then why when given 3 variables, offense = 42%, pitching = 33% and fielding = 25%.  Why doesn’t pitching and defense sum to 50%?  What am I missing here?


#17    Rally      (see all posts) 2008/02/08 (Fri) @ 09:57

I don’t quite understand how fielding + pitching = 58% and defense = 50%, but I’m not going to argue with you on it.  I think I’ve been through the explanations before and it makes sense for a little while, then I forget about it.

Does it depend on position players not being selected solely for offense or defense, but a combination?

In other words, if we had a lineup of 9 DH’s, like football, and 8 separate defensive players + the pitcher, would fielding + pitching = 50% ?


#18    Tangotiger      (see all posts) 2008/02/08 (Fri) @ 10:09

Rally/33: perfect example.  If you had a football-like setup in baseball, then the spread in runs scored will almost certainly be lower than the spread in runs allowed.  It may not work out to 42/58, but it’s probably be close to that.

And yes, in that case, pitching+fielding would = 58, because all three components (pitching, fielding, hitting) would be independent.  And off=42 and def=58.

In real MLB, it’s because that nonpitchers have a hitting spread and a fielding spread that you get the issue.  You can add the variances, but the spread in talent is based on the standard deviation.


#19    Sky      (see all posts) 2008/02/08 (Fri) @ 10:22

Tango/32—You’re right, but the glory of investing in pitching is that you’re also investing in lowering the run scoring environment at the same time.  You can’t allow fewer/more runs without changing the RPG.  Since pitchers determine their own environment, I don’t have a problem with crediting them for more wins per run saved the better they are.


#20    Tangotiger      (see all posts) 2008/02/08 (Fri) @ 11:04

sky/36: Sure, and I do that, since I put the pitcher through PythagenPat to get his win%.

But, at the team level, you can’t imply that off/def is anything other than being very close to 50/50.


#21    weskelton      (see all posts) 2008/02/10 (Sun) @ 22:24

tango 32: “wes/30 : read posts 19,20, then let’s talk.”

****

Ok, been there, done that.  Now I’m ready to talk again.

I see how non-pitchers contribute more than pitchers, since non-pitchers also field.  I get where a run saved becomes worth more than run scored when you add enough to change your run environment.  I also understood Rally’s football example to a certain extent.  An even more extreme verion of that might be home-run derby with a pitching machine???

But I still don’t see how we can observe that the SD of runs scored is the same as runs allowed, then say that the offense/defense split is 50/50 and then finally conclude that pitching+fielding> defense.  Sorry if I’m being thick here, but I really don’t get it.


#22    Tangotiger      (see all posts) 2008/02/11 (Mon) @ 10:46

Elsewhere, I wrote the following:

1. When given two variables, offense and defense, offense is 50% of the game and defense is 50%.

2. When given three variables, offense, pitching, and fielding, offense is 42%, pitching is 33%, and fielding is 25%.

3. When given two variables, nonpitchers and pitchers, the nonpitchers are worth 58% and the pitchers are worth 42%.

I have explained these two seemingly contradictory statements elsewhere on my blog (do a search).  Nonetheless, the summary is that you cannot try to split “absolute” numbers.  It simply won’t work.  The basis must be on the spread in talent (standard deviations or SD), and everthing does add up (the variances, which is SD squared).  That’s the whole problem.  The SD squared add up, but the allocation is based on SD.

I will try to explain more here.  In post 8, I said this:

The observed standard deviation for runs scored per game is .58, and runs allowed is .59.

That is, 50% of baseball is offense, and 50% is defense.

This is what we talk about when we try to split up “percentages”.  We are in fact talking about the spread in true talent.  For example, imagine a game where you are playing against pitching machines, and you either hit a HR or are out.  Each pitching machine has its own serial number, but are all manufactured by the same company.  You’ve got 30 pitching machines, and 30 MLB hitters.  Each plays 162 games of 40 at bats.

The observed standard deviation of the runs allowed by the pitching machines will be EXACTLY equal to what you’d expect from random.  While some pitching machine will be the “leader”, it will be so purely by luck. 

The observed standard deviation of the runs scored by the 30 players will be a combination of luck and their true talent.

You can infer the true talent spread by this equation:
variance(observed) = variance(true) + variance(luck)

For the pitching machines, variance(observed) = variance(luck), leaving variance(true) = 0.

And that makes this form of baseball 100% hitting.  We on the same page so far?


#23    weskelton      (see all posts) 2008/02/11 (Mon) @ 11:31

Thanks for extending the HR-Derby w/ pitching machines example.  You’ve painted the picture exactly the way I imagined it would be.  So yes, we are on the same page, so far.


#24    Tangotiger      (see all posts) 2008/02/11 (Mon) @ 11:40

You can now extend the same example, but now you have robots on the field of play as well (or cutboard cutouts).  If a ball hits the cutout (which will be 10 feet high, 10 feet wide, and 10 feet long in the IF, and 30/30/30 in the OF), the batter is out.  It lands anywhere, and the batter gets a one-base hit.  He’s still doing this against a pitching machine.

Once again, we are still at 100% offense, 0% defense.


#25    weskelton      (see all posts) 2008/02/11 (Mon) @ 12:12

Cool so far.  I see the cutouts now.  Carew, Boggs and Gwynn are seemingly hitting line drives just barely to the left or right of them.


#26    Tangotiger      (see all posts) 2008/02/11 (Mon) @ 12:58

Now, let’s replace the pitching machines with real pitchers.  But, these pitchers are only allowed to throw fastballs, and are only allowed to throw them at between 75mph and 80mph.

So, we are removing from a pitcher his skillset in mixing up pitches, and even having more than 1 pitch, and resorting his skill to purely be one of location.

We expect that the true talent standard deviation of our 30 pitchers to be a bit greater than zero, but still nowhere close to the true talent standard deviation of our 30 hitters.

Let’s use our measure as outs per PA.  The standard deviation of outs per PA will be fairly tight for pitchers, say 1 SD = .001, while it will be wider for our hitters, say 1 SD = .010.  That is, the spread in hitting true talent is 10 times wider than that of pitching true talent.

The variance is SD squared, meaning the variance for pitchers will be .000001 and the variance for hitters will be .0001 (i.e., the hitting variance is 100 times bigger than the pitching variance).

What we care about here is the SD not the variance.  And, in our case, the spread in talent is 10 times bigger in hitting, which makes this game of baseball 90.9% hitting and 9.1% pitching.


#27    David Gassko      (see all posts) 2008/02/11 (Mon) @ 14:24

Tango, why do we care about standard deviations and not variance? I’ve always been on-board with this idea, but I actually can’t remember why we want to use standard deviation instead of variance.


#28          (see all posts) 2008/02/11 (Mon) @ 14:36

I have no opinion on David’s question of SD vs variance, but either way, I’m still with you.  We haven’t gotten to the point where this breaks down for me yet.


#29    Tangotiger      (see all posts) 2008/02/11 (Mon) @ 14:56

What is standard deviation?  It is, essentially, the average difference of points from its mean.

For example, if you take a population of numbers, from 0 to 100, each unique, the mean is obviously 50.

If you take each one, then 0 is 50 from the mean, 1 is 49 from the mean, etc.  The average of the absolute differences of these 100 points is 25.

The standard deviation is 29, which is a number very close to the average of the absolute differences.

(In this case, we had a uniform distribution of data points.  If we had a normal distribution of data points, then our SD would have matched our average absolute differences.)

But, what’s the variance of our data?  It’s 850 (the SD squared).  What does that tell us?  All it is is the average of the squared differences.

More to the point, if the unit of your data point is say “salary” or “on base percentage”, then the standard deviation or the average of the absolute differences is in units of salary or OBP.  The unit of variance is salarySquared or OBPsquared.

And that really doesn’t represent the unit you are trying to measure.


#30    Anthony      (see all posts) 2008/02/11 (Mon) @ 16:10

I had assumed that true var(all) = var(hitting) + var(baserunning) + var(pitching) + var(fielding), and so we use variance figure the percentages, not SD.


#31    tangotiger      (see all posts) 2008/02/11 (Mon) @ 17:38

You use the variance, because that is the equation.  The equation IS:
all^2 = hit^2 + run^2 + pit^2 + fld^2

Just because the equation allows you to add the variances does NOT imply that when you split the credit that you must stick to the square of the standard deviation.  After all, the units of those things would be runsSquared.  And that’s not what interests us.


#32    David Gassko      (see all posts) 2008/02/11 (Mon) @ 18:27

I’m not sure I follow or agree, Tom. I think you need a more rigorous mathematical proof here.


#33    tangotiger      (see all posts) 2008/02/11 (Mon) @ 18:57

Why not do SD^3 or SD^10?  It all gets back to the units.  Standard deviation is the spread in the exact unit of what you are looking at. In our case, that’s runs.  Why would you care about runs squared or runs to the tenth power?


#34    David Gassko      (see all posts) 2008/02/11 (Mon) @ 19:47

Right, but mathematically, it does not necessarily make sense to say that a facet’s share of credit should be it’s SD divided by the sum of SD for all facets. Intuitively, it makes more sense that it would be variance (since all variances must add up to the total variance, though frankly the results we get from adding up SDs seem more right). The right answer shouldn’t be too difficult to derive mathematically.


#35    weskelton      (see all posts) 2008/02/19 (Tue) @ 21:11

So I was hoping that this thread was going to go a little further.  Is there a plan to follow-up.  Has there been any re-evaluation of the process in light of David’s comments?


#36    tangotiger      (see all posts) 2008/02/20 (Wed) @ 09:15

There’s really no point for me to go on, until everyone else gets a satisfactory answer to the SD v Variance question.  To me it’s obvious that you need to keep everything in the unit you are measuring, and that means SD.  The property of variance (SD squared) is such that the variances add up.  But, that by itself doesn’t mean that *all* your calculations must be based on the variance.

The spread is measured in units, not units squared, and I contend that your share must be based on the units, not units squared.


#37    Anthony      (see all posts) 2008/02/20 (Wed) @ 09:52

But doesn’t it make more sense to say “pitching accounts for 40% of the variance, hitting accounts for 30% of the variance, etc.” If what we’re interested in is percentages…and the individual variances add up to 100%...and the individual SDs do not add up to 100%...then variance sounds like the right thing to use.


#38    tangotiger      (see all posts) 2008/02/20 (Wed) @ 11:33

It does sound like it, except it’s not.

Say that the average hitter is 1 SD = 10 runs, the average pitcher is 1 SD = 8 runs and the average fielder is 1 SD = 6 runs (made up numbers).

The variance would say:
100 + 64 + 36 = 200, so that hitting is 50%, pitching 32% and fielding is 18%.

First you have to explain to me how the pitching SD can be 80% as wide as the hitting, yet account for 64% of the value.  (You squared it, but that’s meaningless to me.)

Next, combine the SD for nonpitchers (hitting + fielding) as 1 SD = 11.7 runs and pitching remains at 1 SD = 8 runs.

Once again, I look at that, and I think that pitching should be 68% of nonpitching.  But, a squaring of the results makes it 47%.

But, the units we care about is runs, not runsSquared.  We pay by runs, not runs squared.  Every relationship we have is based on runs.

Just because a PROPERTY of standard deviation is that by squaring its values (AND its units) lets you add things up, doesn’t mean that you want to base anything beyond that on the square of the units.

After all, what does “one variance = 100 runs squared” MEAN?  I mean, to me, it means nothing at all.


#39    Guy      (see all posts) 2008/02/20 (Wed) @ 12:27

I agree with Tango that SDs matter here, not variance.  I think the best way to think about this is value above replacement, denominated in runs/wins (the units we care about, as Tango says).  If we say that replacement level is -1 SD, then the pitcher share of value will be SD(p)/(SD(p) + SD(non-p)).  I assume that gives us something close to Tango’s 42%.

Now, the shares for hitting and fielding MUST add to 58%.  If non-pitchers contribute 58% of the value above replacement, fielding and hitting cannot add to more than (or less than) 58%—this is a mathematical truism.  So a 43/32/25 split based on three SDs cannot be correct. 

How do we square this circle?  By recognizing that non-pitchers will sometimes have negative fielding value or negative hitting value, because they come as packages.  A great hitter may provide sub-replacement hitting, or vice-versa.  In other words, you can’t look at the SD for fielding and use that in isolation to calculate the total amount of fielding “value.” Doing that makes the assumption that there is something called replacement-level fielding which is -1 SD.  But in reality, you can basically get league-average fielding for free!  The key point is that fielding and hitting are not independent.  A -1 SD fielder will invariably be a decent hitter, and a -1SD hitter a decent fielder.  We have to take their combined contribution to measure the value of the player, and it’s THAT SD which gives us the amount of value above replacement contributed by non-pitchers. 

If baseball used football rules, THEN using the 3 separate SDs would be correct.  But then, the hitting and fielding SDs would both be noticeably smaller (a lot of minor lg 1Bman would have jobs as MLB hitters; a lot of minor lg SSs would have jobs as MLB 3Bmen and 1Bmen; minor lg CFs would play RF and LF).  The total value from hitters and fielders might be more than 58%, but certainly less than what you get using today’s 3 SDs. 

Now, dividing the 58% into hitting and fielding is a challenge, and I’m not sure there’s a “correct” answer.  You could use the ratio of the two SDs, and determine that it’s 36% hitting and 22% fielding.  Personally, I think it’s significant that the average fielder is only slightly better than a replacement player in the field, but much worse at the plate.  So I would use the run gap between average and replacement (maybe 18 runs for hitting, 3 runs for fielding), which gives you a division of something like 49% hitting and 9% fielding. That sounds like a low % for fielding, but much of the variation we see in fielding only exists because players are selected mainly for their hitting skill.  Average fielding doesn’t have a lot of value just because teams use worse fielders; the fact is it’s freely available.  In fact, we should say that below-average fielding has negative value—it’s a price that teams agree to pay in exchange for the offense these players provide.  By allowing for negative fielding value, you can have a system that allows for some fielders to be worth far more than others, while still having a total fielding value that’s quite small, as it should be.


#40    weskelton      (see all posts) 2008/02/20 (Wed) @ 12:49

I guess I’m still bothered by the fact that we’ve demonstrated that the SD in runs (and/or variance)by the offense and defense are essentially the same, which would suggest that the shares for offense and defense should be equal.  Since hitters don’t prevent runs and pithcers and fielders don’t score runs (when they on defense), it seems to me that picthcing and fielding have to add up to a max of 50%.


#41    Anthony      (see all posts) 2008/02/20 (Wed) @ 12:51

Tango/38:

It’s not that I necessarily disagree as much as I’m just trying to get my doubts out there and figure them out. What I’m wondering is that since variance is always expressed as the square of a unit, it’s never in the same unit as what you’re measuring. So then when is variance meaningful, if the unit it’s expressed in is, by definition, meaningless?


#42    Tangotiger      (see all posts) 2008/02/20 (Wed) @ 14:08

I find variance meaningless except when adding them. 

It’s also required to measure correlation (r-squared, coefficient of determination), but even then, I stick with r.


#43    david smyth      (see all posts) 2008/02/20 (Wed) @ 19:32

Guy #39, good post!


#44    Guy      (see all posts) 2008/02/22 (Fri) @ 09:47

Thanks, David. 

I notice now that the 3rd sentence in last graf is garbled.  It should read:  “Personally, I think it’s significant that the average player at any given position is only a slightly better fielder than the average replacement player, but is a much better hitter than a replacement player.”


#45    weskelton      (see all posts) 2008/02/22 (Fri) @ 10:56

I have a rather fundamental baseline question.  When we’re split up the percentages here, (e.g. offense 42% : pitching 33% : fielding 25%), what exactly are these percentages of?  In post 13, Tango is drawing a compariosn to Win Shares, but I don’t think that’s what we’re talking about here.

Last night I was reading the chapter “44 Percent of Baseball” in the Hidden Game.  Palmer actually gives about a half dozen possibilities of what this could mean and actually refers to it as a confusing and silly question.

I have a guess as to what we’re talking about in this case, but I’m interseted to see how others are defining this.


#46    Tangotiger      (see all posts) 2008/02/22 (Fri) @ 11:42

The percentages refer to the standard deviation, the actual spread, the absolute differences.

If you stick to runs (numbers for illustration only), and you have:
hitting: 1sd = 10 runs per 162G
pitching: 1sd = 8 runs per 162IP
fielding: 1sd = 6 runs per 162G

Then, the conversion to percentages is simply 10/24, 8/24, and 6/24.  That is the thing it does.


#47    weskelton      (see all posts) 2008/02/22 (Fri) @ 13:28

I guess I see what you are doing.  I’m just not sure I understand why it makes sense to do this?  What does the 24 represent?  The standard deviation of something??  I’m not convinced that our SD’s, while measured on the same scale R/162G, actually mean anything when added together.  This might make more sense to me if we were actually measuring the contributions of the various facets in wins, which is something that they actually do contribute to in a joint fashion.


#48    Tangotiger      (see all posts) 2008/02/22 (Fri) @ 13:36

10+8+6 = 24.

***

And you are right, we *shouldn’t* care!  This whole “percentage” thing is ridiculous, because we don’t need to use it for anything.

The allocation of payroll is driven by WAR, and as it turns out, the total number of WAR of your pitchers is roughly 43% of all WAR.  It is no surprise that pitchers in MLB also happen to get close to that percentage split in payroll.

Win Shares allocates roughly 35% of its win shares to pitchers, exactly because it STARTS with the percentage split.  And, as you are surmising, why should we even care about the split.


#49    weskelton      (see all posts) 2008/02/22 (Fri) @ 14:21

Well, I knew where the 24 came from, but I still don’t think we’ve defined what it represents.

As for WAR and payroll allocation, I see that as one use for having an accurate split percentage-wise.  Of course I think you could rather quickly come up with a list of reasons why this shouldn’t necessarily hold true. 

Curious, what does WAR say for the split between hitting and fielding for non-pitchers.  Do they also get to a point where offense < pitching+fielding?


#50    Tangotiger      (see all posts) 2008/02/22 (Fri) @ 14:42

"Curious, what does WAR say for the split between hitting and fielding for non-pitchers.”

That’s the crux.  There is NO replacement level hitters and replacement level fielders.  There are replacement level PLAYERS.

The calculation is:
hittingWAA
fldposWAA
WAR = hittingWAA + fldposWAA + replLevel

So, there is no split between hitting and fielding, anymore than there is a split between hitting and stealing, or drawing a walk and hitting a HR. 

That’s why we should never use the split as a starting or intermediate point.  The split will simply become a result of the equation.


Page 1 of 1 pages


Name (required)
E-Mail (optional)
Website (optional)

<< Back to main