Friday, February 18, 2011
Ratios to rates
Carson has an equation to convert out ratios into contact rates:
rate = .18ln(ratio) + .38
I’ll repeat my comments over there:
==================================
Carson, a groundout to airout ratio means:
g/a
A groundball percentage means:
g/(g+a)
So, in order to convert a ratio into a percentage, you do:
ratio/(ratio+1)
A g/a of .5 means a gb% of 0.33. A g/a of 2 means a gb% of 0.67, and so on.
However, in MLB, they exclude lineouts from the numerator and denomiator in the g/o ratio. But, they are included in the gb%. So, a gb% is actually:
g / (g + a + l)
Furthermore, in g/a refers only to outs, while gb% refers to all contacted balls. So, you’d have to convert the go to a gb by saying doing go/.75 = gb. And so on.
***
All to say: I don’t doubt the best-fit of the equation you found.
I do think that we can come up with a different equation that is grounded (no pun intended) in logic. And you can then do a best-fit against that equation.
***
Right.
If you have a g/a ratio of .500, 1, 2 the ln of that is going to give you: -.69, 0, +.69. So, perfectly symmetrical. Which matches what the g/(g+a) would give you of .333, .500, .667, respectively.
But, the actual equation for gb% is g/(g+a+l). Would the ln(g/a) still necessarily hold as a core part of the conversion?
I don’t know, I’m asking.
***
Following up:
To convert the ratio to a rate, if we had the exact same parameters in both, we’d do:
g% = g/(g+a) = .x*ln(g/a) + .5
That x would approach 0.25 as g/a approaches 1. And in MLB, x would range from .24 to .25.
So, if we used all contacted balls, then a best-fit equation would come in at something like .25*ln(g/a) + .5.
But, as noted, the ratio actually uses only outs, and excludes lineouts. The rate uses all contacted balls.
Carson’s best-fit, using observed data, changes that .25 coefficient to .18. It changes the intercept from .5 to .38.
My question is if someone here would like to try to come up with an equation without relying on individual data, and simply use some logic to the process. To presume that 20% of batted balls are line drives, that 25% of those are lineouts, and so on.


Recent comments
Older comments
Page 1 of 344 pages 1 2 3 > Last »Complete Archive – By Category
Complete Archive – By Date