Thursday, June 10, 2010
Figuring out command
I’’m glad there are others also thinking about this:
With fastballs, you either go high heat or throw at the knees. With sliders, there’s back foot or back door. Curves are intended to be thrown either anywhere in the dirt or anywhere in the zone. Anyway, those are the assumptions you need to make if you believe clustering makes sense. Furthermore, if you’re limited to k-means clustering, you might as well assume that all pitchers have two intended locations for their fastballs. That’s what I did, anyway. So I gave each pitcher his own two separate cluster centers, and found each pitch’s standard deviation from those centers, grouping by pitcher. Here were the leaders:
I’m not sure about a fixed “2”, but it’s a good start.


I have never, ever understood the difference between command and control. And since a lot of smart people that I respect use those two words a lot, I’m guessing it’s my issue and not an issue with bad terminology.
So if anyone wouldn’t mind clarifying the difference or pointing me to a resource that talks about it, I’d be really happy.