THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

Filter posts by...

 

Hit_Tracking

Tuesday, November 22, 2011

HITf/x: (vertical) launch angle

By Tangotiger, 10:31 AM

A followup to Mike’s terrific piece of the horizontal speed off the bat, this time, with the added focus of the vertical launch angle

There’s actually plenty of info here, and I can’t comment properly yet until I do a second re-read.  There’s also something that seems inconsistent, and I’m hoping Mike can set me straight on whether one of the charts published needs to be updated, or my reading skills need to be improved.  I’m hoping it’s the latter.

(41) Comments • 2011/11/28 • SabermetricsBall_TrackingHit_Tracking

Wednesday, November 16, 2011

HITf/x: (horizontal) batted ball speed

By Tangotiger, 10:31 AM

Great stuff from Mike:

Batters have a good deal of correlation between halves of the sample, with a correlation coefficient of r=0.76 with an average of 201 batted balls in each half. That means that we would add 63 batted balls (or about one month’s worth) at league average to the observed average speed for each batter in order to estimate his true skill.
...
Pitchers have fairly good correlation between halves of the sample, though not as good as batters. The correlation coefficient is r=0.48 with an average of 251 batted balls in each half. That means that we would add 269 batted balls (or about three months’ worth for a starter) at league average to the observed average speed for each pitcher in order to estimate his true skill.

Just fantastic stuff, and I’m glad Mike did it, as well as showing the key points, which is the point at which r=.50.

***

I’m not really surprised by the results.  The closer you get to someone’s base physical and mental skills, the less observations you need.  This is why scouts are so important.  And the F/X and Trackman systems are, at their heart, scouting tools. 

What we’ve had until recently are outcomes, results, things like OBP and K/PA, etc.  What drives OBP and the like are the players’ base skills AND luck.  That’s why we infer a players’ base skills by stripping out as much luck as we can figure out.  We do this through a Bayesian process (or its equivalent in regression toward the mean).  We need a few hundred contacted balls for a hitter, and in the thousands for a pitcher, in order for us to be able to strip out that luck to infer the base skill.

Inside a player’s contacted ball skill is not only the horizontal speed off the bat, but placement as well.

Unseen in Mike’s data is what the horizontal speed off the bat really means.  Let’s take a pitcher’s fastball speed.  We presume that there’s a high degree of correlation in a pitcher’s fastball speed.  I have no doubt that if you do a split-half correlation, you’ll get something ridiculous like r=.99 (really, it’s a question of how many nines) for pitchers who throw 1000 fastballs.  So, we can ascertain a scouting observation: we can readily and easily ascertain a pitcher’s underlying true fastball speed.

But, what does THAT give us?  He throws really hard or really soft.  But, that by itself, still doesn’t tell us how EFFECTIVE he is.

The next step is to correlate that particular base skill, that scouting-level observation, into results.  And Mike has given us that:

We see that a player who hits the ball at close to 80mph has a BACON of close to .300, while those who hit the ball at close to the league average (70mph) has a BACON of close to .200, and those at the league low (60mph) is just above .150. 

I have to say, all those numbers look pretty low.  I guess that’s what happens when you have non-linearity.  For example, suppose you hit one-third of your balls at under 60mph, another third at 60-80, and the last third at over 80mph.  (Numbers for illustration purposes only.) If it’s under 60mph, you get a batting average of .050 to .150, or say around an average of .120.  If you hit it between 60-80, it’s .150 to .300, or an average of .220.  And above 80mph, it’s from .300 all the way up to .650, for an average of say .500.  That gives you an average of .280, for an average of 70mph.  As you can see, the overall average for a distribution around 70mph is way above the batting average at the 70mph point.

Anyway, so what I’d like to see is this: create a DISTRIBUTION for each player, centered around his true talent horizontal speed off the bat, and apply the rates from the above chart (or a more smoothed version actually).  This way, we can end up with a player’s true talent BACON, if all we know is his horizontal speed off the bat.

THAT will tell us how valuable knowing his horizontal speed off the bat is.

(21) Comments • 2011/12/15 • SabermetricsBall_TrackingHit_Tracking

Thursday, May 19, 2011

Hit Where Pitched

By Tangotiger, 05:08 PM

Max’s article from a few weeks ago.

(I’m leaving the office in 2 minutes, so I will comment tomorrow.)

Glove-slap: Peter.

(1) Comments • 2011/05/20 • SabermetricsBall_TrackingHit_Tracking

Wednesday, April 13, 2011

Batted Balls and Home Runs

By Tangotiger, 10:17 AM

Great job by Studes.  Love all the sleeve-rolling.

Wednesday, February 09, 2011

HITf/x

By Tangotiger, 10:52 AM

A pre-cursor to HITf/x by using Greg’s HitTracker to try to estimate the probability of a batted ball being a HR.  Love this stuff.

(46) Comments • 2011/02/10 • SabermetricsBall_TrackingHit_Tracking

Monday, November 29, 2010

Gameday slice bias

By Tangotiger, 02:22 PM

How much is the hit location biased based on where a fielder is normally positioned?  The following is a starting point, and as a result, will use a crude estimate.

From 1B to 3B line is 90 degrees.  The circumference of a circle is 2PI*r, or PI*r/2 for a quarter circle.  If we treat the radius as around 115 feet (meaning a spot somewhere between the 1B/3B bags and the 2B bag), then the distance from 1B to 3B bags, along a circular path is 180 feet.  Or, 1 degree = 2 feet.  I know it’s not a circle, but, we just need crude approximations.

Also remember that a SS / 2B are positioned around -16 / +16 degrees, where 0 degrees is 2B. 

Peter provided us with this data from HITf/x (FX) and GameDay (GD), based on spray angles in 4 degree slices:

Spray    FX    GD    rate
-44    107    61    175%
-
40    244    134    182%
-
36    253    297    85%
-
32    309    389    79%
-
28    306    458    67%
-
24    316    347    91%
-
20    369    382    97%
-
16    312    429    73%
-
12    339    300    113%
-
8    328    218    150%
-
4    329    204    161%
0    314    246    128%
4    291    180    162%
8    322    156    206%
12    305    214    143%
16    280    377    74%
20    293    455    64%
24    260    276    94%
28    234    314    75%
32    208    293    71%
36    205    248    83%
40    149    130    115%
44    85    55    155%

rate is FX / GD.

We see that at -16 (meaning -14 to -18 degrees), groundballs are recorded by Gameday far more frequently than HITf/x is recording.  We see an enormous bias in the holes as well.

Now, let’s try an experiment.  Let’s say that the Gameday scorers agree with HITf/x perfectly on half of the batted ball locations, and are off by 4 degrees (8 feet) on the other half?  Let’s start with the 314 balls up the middle (-2 to +2 degrees).  Gameday marks 157 of those up the middle, and for the other 157 record half to the left (-6 to -2) and half to the right (+2 to +6).

At the -4 degrees (-6 to -2), HitF/x had 329 balls, of which half Gameday agrees with, and the other half are all more toward the SS side (at -8 degrees). 

So, this is what we have at -4 degrees as per Gameday:
164.5 balls that HITf/x marked at -4 degrees
78.5 balls that HITf/x marked at 0 degrees

That’s a total of 243 balls marked by Gameday at -4 degrees under this illustration, compared to the 329 actally recorded by HITf/x… but still a far away from the 204 actually recorded by Gameday.

The same thing happens at -8 degrees: of the 328, 164 are properly marked by Gameday, the rest are marked toward the SS side (at -12 degrees).  And we go on until we get to -20 degrees, where the shift happens toward the 2B bag.  This is the result of all that:

Spray    FX    GD    rate    Tango    rate
-44    107    61    175%     53.5     88%
-
40    244    134    182%     175.5     131%
-
36    253    297    85%     248.5     84%
-
32    309    389    79%     281.0     72%
-
28    306    458    67%     307.5     67%
-
24    316    347    91%     311.0     90%
-
20    369    382    97%     498.5     130%
-
16    312    429    73%     510.0     119%
-
12    339    300    113%     333.5     111%
-
8    328    218    150%     328.5     151%
-
4    329    204    161%     243.0     119%
0    314    246    128%     157.0     64%
4    291    180    162%     224.0     124%
8    322    156    206%     322.0     206%
12    305    214    143%     305.0     143%
16    280    377    74%     426.5     113%
20    293    455    64%     423.0     93%
24    260    276    94%     247.0     89%
28    234    314    75%     221.0     70%
32    208    293    71%     206.5     70%
36    205    248    83%     177.0     71%
40    149    130    115%     117.0     90%
44    85    55    155%     42.5     77%

My illustration here shows that my model bridges some of the gap.  The standard deviation of the original FX/GD is 42%, while the Tango/GD is 34%.

And trying different inputs didn’t make much better difference.  If I treat anything between 50% and 75% of the HITf/x data as being perfectly recorded by Gameday, the remain balls in play are about 4 degrees (about 8 feet) biased toward where the fielder is positioned.

I think we can try to construct a more elaborate model, and we’ll probably end up at the following: about half the data from Gameday will match HITf/x, and the other half will be off by 2 to 8 degrees (4 to 16 feet).  The amount it will be off will be biased by either where a fielder is normally positioned, or whether a play was made or not, or how much space between fielders (the holes).

This is the framework I’m proposing.  Implementations will vary.

(25) Comments • 2010/11/30 • SabermetricsBall_TrackingField_TrackingHit_Tracking

Wednesday, September 22, 2010

Optimal Bat Speed/Hand-Eye Contact Point and Andre Ethier

By Tangotiger, 12:32 PM

I can’t see any of this at the office, but it looks interesting.

Saturday, August 28, 2010

PITCHf/x Summit 2010 - Recaps

By Tangotiger, 08:12 PM

I’ll update links as they come in.  First up is Colin:
http://www.baseballprospectus.com/article.php?articleid=11868

Ben:
http://www.baseballprospectus.com/article.php?articleid=11869

Dave:
http://baseballanalysts.com/archives/2010/09/pitchfx_2010_su.php

(22) Comments • 2010/09/13 • SabermetricsBall_TrackingField_TrackingHit_Tracking

Tuesday, August 10, 2010

Club Head Speed

By Tangotiger, 02:46 PM

I guess this golf data is from Trackman.

(1) Comments • 2010/08/10 • SabermetricsBall_TrackingHit_TrackingOther SportsGolf

Saturday, August 07, 2010

Alan Nathan at SABR40

By Tangotiger, 12:17 AM

Cecilia’s blog on Alan’s Mantle’s 565 foot HR.

(1) Comments • 2010/08/08 • SabermetricsBall_TrackingHit_Tracking

Wednesday, June 02, 2010

Launch angle + Speed off the bat = trajectory

By Tangotiger, 02:04 PM

Courtesy of Greg (click to make bigger):

(36) Comments • 2011/08/11 • SabermetricsBall_TrackingHit_Tracking
Page 1 of 1 pages

Latest...

COMMENTS

Feb 11 11:54
Who is Jeremy Lin?

Feb 11 11:27
Reader Mail of the Day: Why do we need X years of fielding data?  And what about outliers?

Feb 11 10:29
Dwight Evans

Feb 11 08:56
MGL: Today on Clubhouse Confidential

Feb 11 02:12
Performance through the ages

Feb 10 23:01
For Your Soul

Feb 10 21:07
Hero of the month: Brittney Baxter

Feb 10 18:32
Moneyball at Villanova

Feb 10 17:00
Psst… wanna intern in Canada?

Feb 10 15:01
New PECOTA

THREADS

February 11, 2012
Clutch analogy

February 11, 2012
Who is Jeremy Lin?

February 10, 2012
Jose Molina

February 10, 2012
Reader Mail of the Day: Why do we need X years of fielding data?  And what about outliers?

February 10, 2012
Performance through the ages

February 10, 2012
Hero of the month: Brittney Baxter

February 10, 2012
Win expectancy charts used in football… in 1983!

February 10, 2012
Dwight Evans

February 09, 2012
Psst… wanna intern in Canada?

February 08, 2012
Moneyball at Villanova