THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

Filter posts by...

 

Thursday, July 16, 2009

A glaring example of how NOT to use a regression analysis

By , 12:47 PM

A few disclaimers and qualifiers:  Dan is a good guy and a good researcher.  I am by no means an expert in regression analysis.  I don’t even play one on TV.  Dan fully admits that there are problems with his methodology.

I am not sure why he even published this though.  The results are clearly bad - partially at least because of the methodology (the variables he used in the regression).

I don’t think you need to do a regression analysis for something as simple and obvious as GDP rate.  And if you do, I think it is pretty obvious that the only significant variables to use are handedness, speed, percentage of GB per PA, and how hard you hit the ball which you can proxy with any number of variables I would think.

The only other possible variables I can think of would be pull ratio independent of handedness (for example, Ichiro may hit into more DP because he may hit more balls to the SS side than the typical speedy left-hander), an ability to deliberately hit the ball to some beneficial location in a DP situation, if that even exists, and whether the runners on first base tend to me in motion more than is typical when you are at bat.

(2) Comments • 2009/07/16 • SabermetricsBatted_Ball
Page 1 of 1 pages

Latest...

COMMENTS

May 21 05:05
Cory speaks!

May 21 04:48
Extra, extra, read all about it: MLB has inter-conference play this weekend!

May 21 04:21
Is the Shift actually working?

May 21 02:57
Are bullpen sessions predictive?

May 21 01:01
Lincecum the catcher

May 20 21:02
Poll: I would have suspended Lawrie/Alomar for ___ part of the season

May 20 20:59
How do you incentivize a power hitter to bunt?

May 20 14:22
Combining Rock-Paper-Scissors (RPS) with “What Number Am I Thinking Of”

May 20 11:51
When to buy Facebook?

May 19 23:47
Sponsoring MLB jerseys

THREADS

July 16, 2009
A glaring example of how NOT to use a regression analysis