Monday, January 04, 2010
Non-stathead Chronicles - Bleed Cubbie Blue
(Note: I quite enjoy these. I encourage all non-statheads to send me tough questions. Give me at least five questions, and preferably at least 10. The tougher the better. Whatever it is that drives you batty. I’ll try to make some sense so we can move forward. These questions from Cubbie-Tim seem pretty tame to me, compared to the Silva questions. We also have 217, and counting, Q&A in our mailbag: http://www.tangotiger.net/wiki/index.php?title=Mailbags . If you haven’t checked those out, then please do so.)
A background for the readers, I saw a 232-post thread that started with a link to the Mike Silva Chronicles at http://www.bleedcubbieblue.com . I asked one of the commenters who seems to be on the anti- side to send me his questions, and I would answer them. Here it goes from Cubbie-Tim:
1. Who decides what data is “minimal effect” or “levels out” so to speak? Is that different data each time based on who the SABR is or a standard piece of data?
Just a correction based on all your other questions that will follow: SABR is the organization for baseball research. It has little to do with sabermetric research. The word you intend to be using is sabermetrician or saberist.
As to your question, there is no one person that decides anything. The only thing researchers do is interpret the data. That’s our job. It is not to advance a theory and look for the data to support it. Saberists follow the evidence. If Joe Carter has 115 RBIs, a piece of data that is factual, our job is to try to explain how someone with a SLG average below .400 could have done that.
2. If different SABR’s use different pieces of data (for one example I linked to a SABR says to use CERA and another says not to), what does this do to change how it is viewed by “nonstatheads” since there is not a constant?
Ideally, saberists will rally around some principles, so we can move forward to finding other principles. When that doesn’t happen, then you have to look at the merits of the arguments and decide who makes his case the best. And if that’s still not enough, then you have to hedge your bets and take a bit of everything.
As for catcher ERA specifically, you certainly can’t use it for a single season. Like most things, the more data you have, the more the signal sticks out from the noise. The person to listen to is the one who tells you how much signal is in the noise.
3. Does a SABR believe leaving any data out truly gives the best picture, since even small data amounts can skew a stat? I know that in most any industry a variance is attached, just curious more than anything.
Leaving data out gives the best picture? Well, if there’s a reason for leaving data out (say Coors data) because it’s a way to handle the bias, that’s ok. You could try to correct for the bias by keeping the data as well. That’s ok too. You just have to tell the readers what you are doing and why you are doing it. You make your case, and let the market judge you on the merits of the argument (not on their prejudices).
4. Why it is that “statheads” will say that intangibles are of little to no importance, and “nonstatheads” are to just accept it blindly? When I asked that I never can get a straight answer
You say it as if there’s a 100% position for each “group”. First off, each person is an individual, and no one follows the group blindly. If you want to say there is a predisposition to lean toward one side, that’s ok. That’s a far cry from the extreme position your statement suggests.
To the specific claim of intangibles, of course they are important. The question is not if they are important, but how do you identify it BEFORE the fact? Say for example your star goalie is mired in a messy divorce, with his spouse calling him minutes before game time. You are in the playoffs, and are making a run for the Stanley Cup. So, you know, for a fact, that this goalie, a human being, has his emotions in play. They are going to surface, in some form or other, at some point during the day. But, how do you know how he’s going to handle it during game time? Is it going to cost his team? Is it going to make him more resilient? Are you willing to bet, with money, on your opinion? Martin Brodeur’s Devils won the Stanley Cup that year. AFTER the fact, we can say that his powers of concentration are superb. Would you have bet on it before the playoffs started?
The question is not if something exists. No one denies the existence. The question is one of identification. How do you identify it, how do you find it, how do you measure it. And once you do, what do you do with this? And if you can’t identify it, what do you do with it?
5. Why does a “nonstathead” accept the importance of stats and projections, but many “statheads” will not agree on that?
A non-stathead accepts the importance of stats? And projections? If your question is what you intended, then I can’t agree this is the case. If your question is meant to be flipped, then I can’t speak for the position of non-statheads. The only thing I would say is that a forecast is more like an over/under bet. It’s something that you are about equally sure is just as likely to happen more than that line as it is to happen below that line.
6. As the SABR stat evolves almost annual (URZ for example), why do some SABR continue to use the old manner while some use the newer one, which results in SABRs having different results to the same question (so to speak, could not think of a better way to phrase this)
Some saberists are stubborn. Any half-assed metric based on play-by-play (PBP) stats is better than one that is not. A PBP metric will say it’s important to know one, or all of: where a ball was hit, how hard it was hit, which park it was hit in, who the pitcher was, who the batter was, what the base/out situation was. A non-PBP metric will ignore all that information, and instead try to fill-in-the-blanks in some other way. So, I would reject a non-PBP metric if a PBP metric exists.
Some of the parmeters being used are not objective, but subjective. So, there could be a good reason for not using some of that data. Even the location of a batted ball is not so objective. Two different people can look at the data, and be off by 10 or 20 feet. This is not that uncommon
As for the various competing PBP metrics (UZR, PMR, Dewan, among others), they each try to make decisions as to how much each of the parameters means to a particular ball that was hit. They try to establish a baseline, with as many parameters as they can, the chance that a particular batted ball would have been caught by an average fielder. It’s a tough job in terms of trying to find a common ground, as you can imagine.
Ideally, you would also have a human observer give his opinion when a ball was hit, so that we don’t have to infer so much, and we could rely on stringers to tell us how tough a ball actually was to field. But, we’ll get to that point soon enough.
7. Where would you recommend newbies or novice stat people to go for insight to the SABR world and how the various stats and projections have evolved?
I suppose my wiki is one such place:
http://www.tangotiger.net/wiki/
I’d recommend the annual Hardball Times:
http://www.hardballtimes.com
I would not recommend my book until you actually have some saber-leanings.
"The only thing I would say is that a forecast is more like an over/under bet. It’s something that you are about equally sure is just as likely to happen more than that line as it is to happen below that line.”
A little off-topic, but does anyone have any data on distribution of error terms for the major projection systems? Are median or mean errors zero for any? Which ones?