THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews
If you are a media member and would like a review copy of The Book, please contact Kevin Cuddihy of Potomac Books.

Buy The Book from Amazon

MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Tuesday, December 04, 2007

Giving the finger to correlation coefficient

By Tangotiger, 11:06 AM

I get mighty p.o.ed when talk comes to correlation coefficient (r), and how the r-squared “explains” something.  What it’s really saying is that, given that size of sample, the variance of the parameter you are studying explains a certain percentage of the total variance (including the variance from luck which is completely based on the size of the sample).  You can, in effect, control your r-squared to your liking.  Wanna prove clutch exists?  Use a sample size of 5000 PA.  Wanna prove clutch doesn’t exist?  Use a sample size of 50.

Anyway, while I get mighty p.o.ed, Phil takes it to another level.  Here are his archives on the matter.

If you see a researcher (baseball or otherwise) give a correlation coefficient, without giving you a regression equation (like Phil does), or a correlation coefficient equation (like r=PA/[PA+200]), give him the finger.


Page 1 of 1 pages

<< Back to main


Latest...

COMMENTS

Dec 05 04:40
Sabermetric Moves of the 2009 Pre-Season

Dec 05 05:33
Avery being Avery

Dec 05 05:06
NYC’s 3 1/2 year mandatory jail time sentence for carrying a loaded weapon

Dec 04 23:42
Poll: Would you vote Raines for the Hall?

Dec 04 23:07
How to calculate the area of a baseball field

Dec 04 22:48
Complete Run Expectancy, Retrosheet Years

Dec 04 22:03
Raines for the Hall

Dec 04 15:55
Mailbags on Parade

Dec 04 14:01
What would happen if the shootout period was 10 minutes, not 5?

Dec 04 11:49
Estimating BABIP