THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Tuesday, November 29, 2011

The existence of outliers doesn’t preclude luck as the cause

By Tangotiger, 03:11 PM

Given enough trials, and given a large enough coins, and we can always find outliers… and the same applies to pitchers.

Using this fact, it follows that in our first year, if we have 100 pitchers, we expect half to outperform their FIP. This means that there are 50 players that outperformed their FIP in year one. Of those 50 players that outperformed their FIP in year one, we would expect 25 (.5*50) of them to outperform their FIP in year 2 by pure chance. Of those 25 players that outperformed their FIP in year two, we would expect 12.5 of them to outperform their FIP in year 3 by pure chance. Similarly, we can continue down this path halving the number from the year before. In year four, we would expect about 6 pitchers to have continued to outperform their FIP, and by year 5 we would expect just over 3 pitchers to have consistently outperformed their FIP by pure luck.

Because we started with 100 pitchers, we expect that about three of the pitchers would outperform their FIP in 5 consecutive years, by randomness alone. Many people point to those three pitchers and say, “Clearly, FIP is not accounting for something those three pitchers do.” We can now completely discount that argument for the “simulation”, because we have assumed FIP to be perfect. Thought experiments are nice because they easily allow you to comprehend and visualize a phenomenon, but there is not a lot to glean, if the experiment is completely incongruent with reality.

To put it simply, to “win” something 5 times in a row, just by pure luck, and you have a 50/50 chance of winning each time, then you will win five and lose zero a total of 0.5^5 = 3.1% of the time.  Start with 100 coins or 100 pitchers, and you will flip heads, or beat your FIP, by flipping five times in a row, a total of 3 times.

And in reality?

giving us a not so unexpected final total of 3.6%
...
and finally 4% of the original starting pitcher group

This is not to say that FIP is perfect.  But, just relying on the fact that you’ve been able to identify 3 or 4 extreme cases, when that’s exactly what you would have expected to find if it was all luck, doesn’t prove your point. 

You need to find MORE than the expected extreme points, and NOT just “some” extreme points.  Some extreme points means nothing, unless you know how many you expected to find by luck.


#1    Wexler      (see all posts) 2011/11/30 (Wed) @ 03:29

What about the magnitude of the outliers’ distance from tje mean? Say you find the expected number of extreme cases, vut some of them are more extreme outliers than you’d expect to see by chance alone.

I remember someone at THT saying Mike Scioscia seemed to have a skill at beating his team’s pythag because their xw-l wad better than their w-l 6 years in a row or whatever. Then Greg R commented that if you do the binomial distribution with 30 teams you’d expect to get at least one team doing that. I was happy to hear that criticism because i think mike scioscia is the most overrated human being on the planet. But then i thought: surely you have to also a count for the fact that he beat his pythag not jusr by a couple games each year, but often by quite a lot.

So am i correct to think that magnitude is an important factor in considering whether chance is agood enough explanation? And if so, how do you go about accounting for that mathematically?


#2    Tangotiger      (see all posts) 2011/11/30 (Wed) @ 10:34

Yes, sure, the degree of extremeness matters.

The point here is just a simple example that you have to dig deeper.


#3    MGL      (see all posts) 2011/11/30 (Wed) @ 13:18

I love this article. Unfortunately the quality of the comments is terrible.

The article addresses the typical, “Yeah, but what about player x (Cain for example)?  Surely HE disproves your theory (DIPS for example).”

Half the comments on the site say that no one thinks or says that,which is ridiculous. People say that all the time.

In general I love the idea of working backwards. Create a scenario where the thing you think exists (like a certain number of outliers in this case) does in fact exist but it exists by sheer chance. And show how common that is. Thus its existence in the real world is not evidence of anything systematic (not occurring by chance). 

I’ve wanted to do that many times.


#4          (see all posts) 2011/11/30 (Wed) @ 14:43

Just because a result is likely to occur if the null hypothesis is true doesn’t mean the null hypothesis is true.  You still need a prior to establish whether the null hypothesis or some other hypothesis is more likely given the result.  The commenters are just applying a prior with a very high likelihood that the null hypothesis is false.

I’m hoping maybe this clarifies some of the “anti-Bayesian” sentiment and that someone can explain to me how to handle this sort of thing properly.  I imagine you can, but I haven’t studied it.


#5    MGL      (see all posts) 2011/11/30 (Wed) @ 17:01

What the author did and did quite well was: one, illustrate an important principle which is that if the null hypothesis is true or close to being true, because the distribution of baseball performance is usually normal, there will be by definition outliers at all sigma levels such that the existence of an outlier and nothing else is very weak evidence that the null hypothesis is false.  Two, if the number of outliers in an actual sample is close to that expected by chance alone given that the null hypothesis is forced to be true by the experimentor, then there is a very good chance that the null hypothesis is true, barring any further inquiry.

He probably could have explained those things better but the article was excellent and many of the comments were ignorant and unwarranted.


#6          (see all posts) 2011/11/30 (Wed) @ 18:53

I agree what the author did was excellent and well presented.  I love it as well.  My comment was tongue-in-cheek and made in light of the Bayesian conclusions discussion that was taking place.  Perhaps I should have made it there, as it seems to have gone unnoticed here.


#7    MGL      (see all posts) 2011/11/30 (Wed) @ 20:43

We’re kind of stodgy around here. Smart but humorless…

smile


Page 1 of 1 pages


Name (required)
E-Mail (optional; WILL be published)
Website (optional)

<< Back to main


Latest...

COMMENTS

May 25 14:44
What sabermetrics is NOT

May 25 14:14
Pete Palmer’s new book: Basic Ball

May 25 13:18
Do pitcher’s reach back for velocity when needed?

May 25 13:04
“Why Kickstarter works”

May 25 12:51
Chad Curtis

May 25 12:40
Largest demonstration in Canadian history?

May 25 11:32
Howard Stern

May 25 11:26
Lack of hustle during a game

May 25 10:58
Rooting for laundry

May 25 02:38
NFLPA lawsuit against collusion