Friday, June 13, 2008
Aramis Ramirez: a new approach to hitting?
Eric takes a look. The most stable of the output stats is strikeouts and walks. If you see a big change there, then you know that something is going on. Now however, we don’t have to rely on the output stats. We can look at the quasi-input stats: how much is he swinging at pitches out of the zone (making sure to compare by count)? How often is he taking? All the things that lead to Ks and BBs would be based on his swing/take approach at each count. And while you might need say 100-200 PA to achieve 50% reliability, if you look at his Ks and BBs numbers, we’d need to have fewer than 100 PA if we know his swing/take approach. That’s why scouts are so important: their uncertainty level is much lower than the output stats at under 100 PA.
I am not, in general, a big fan of these, “Let’s look at his BABIP, line drive rate, swings and misses, swing rate at pitches in or out of the zone, etc., in order to see where a batter REALLY is.” Not a fan at all.
It is OK to dig deeper into a player’s stats in order to filter out some of the noise, but there is absolutely nothing you can do to “filter out” sample error. And a big part of what makes players’ performance fluctuate is pure sample error. And there is nothing you can look at that will help you filter out the “noise” in pure sample error. It is not like looking at a batter’s triples rate in order to estimate his speed (lots of noise) and then timing him around the bases (almost no noise).
For example, let’s say that a batter’s walk and/or K rate has changes significantly from his expected rate. And let’s say that we look at his swings at pitches out of the zone and find out that it is or it is not the same as it was before? So what? That really does not help us ALL that much. It helps us a little (we want to know, for example, whether he has received a lot of pitches in or out of the zone, or whether he has had a worse or better “eye” than in the past. But the problem is that there is plenty of sample error in how often a batter swings at pitches in or out of the zone (or anything else you want to look at). In fact, the uncertainty around that swing rate is a binomial, distribution, and there ain’t nuttin’ you can do about that (I suppose you can try and get even more granular than that by testing his eyesight, looking more closely at what counts he swings and misses, what counts he gets into, etc., but there is lots of uncertainty in almost anything a batter (or pitcher) does, no matter how granular you get in the data and there ain’t nothing you can do about it.
Anyway, to recap, while it definitely helps to look at more granular data when analyzing player performance, we have to be careful about assuming that we have more certainty than we actually have in that data, as far as an estimate of a player’s true talent is concerned.
And I would like to see more “hard numbers” provided by these guys who do the analysis. For example, let’s say we have a player who has an OPS of .900 and his projection was .800. So he has overperformed for some period of time. Now we hear all the time how since he has a BABIP that is high, that his .900 is not “sustainable” but that if his BABIP is normal, it is sustainable. I’m afraid that is not good enough for me. I want to know what that high or low BABIP does in terms of our new projection. “Sustainable,” “not sustainable” means nothing to me.
And BTW, I don’t know what to make of the data in this article. Ramirez is walking more and striking out more. OK. He is swinging at fewer pitches out of the strike zone. Well, no shizit. I would expect that from almost everyone who is walking more. He is striking our more even though he is walking more too and swinging at fewer pitches out of the zone. OK, that is fairly interesting. Maybe he is swinging harder when he does swing. Maybe he has just gotten a lot more tough pitches to hit so far this year. Maybe he has just swing and missed a lot this year for no apparent reason. I just don’t know what to make of the data in the article, and I am not sure what we are supposed to make of it.