THE BOOK--Playing The Percentages In Baseball

Sampling

Wednesday, October 29, 2008

Introduction to the landscape of sabermetrics

By Tangotiger, 11:17 AM

Derek Carty gives us his take. 

The quickest explanation of sabermetrics was given by Theo Epstein early in his GM career, when he said that he sees statistics and scouting as the two lenses of a pair of glasses.  Unsaid by him is that the glasses are sabermetrics.  So, anyone who thinks of the choice as beer or nuts ignores the reality that the choice is beer and nuts.

Also, the more performance statistics you have, the less value you can place in scouting, because at some point, under certain conditions, the reality of what actually transpired is more important in determining what will happen than what the scout thinks that player will be doing.  And vice-versa, of course, if the performance numbers were based on small samples.

(0) Comments • Sabermetrics, Sampling

Monday, May 05, 2008

How can the inputs remain the same, but the outputs change?

By Tangotiger, 10:27 AM

Joe Sheehan points out that the “slash” data is the same, but run scoring is down:

AL          AVG   OBP   SLG   ISO   R/G
April 2008  .260  .334  .398  .138  9.04
April 2007  .255  .327  .404  .149  9.36

NL          AVG   OBP   SLG   ISO   R/G
April 2008  .256  .331  .404  .148  9.11
April 2007  .258  .332  .400  .142  9.31

Is that random variation, or is something else going on?  Taking a quick crack at it:

We have in Mar/Apr 2007 in MLB: .256 .330 .402
And this year: .258 .332 .401

That’s remarkably close.  The runs scored per 27 outs in each year: 3785 runs in 7490.1 innings, or 4.55 runs per 27 outs, in 2008; 3360 runs in 6670.1 innings, or 4.53 runs per 27 outs, in 2007.
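For anyone who wants to check the arithmetic, here is a minimal Python sketch of the runs-per-27-outs calculation.  It assumes the usual box-score convention that the digit after the decimal point in an innings total counts outs (so 7490.1 means 7490 and one-third innings); the function names are mine, purely for illustration.

```python
def innings_to_outs(innings: str) -> int:
    """Convert a box-score innings string such as '7490.1' to total outs.

    Assumes the digit after the point is a count of outs (0, 1 or 2),
    not a decimal fraction of an inning.
    """
    whole, _, extra_outs = innings.partition(".")
    return int(whole) * 3 + int(extra_outs or 0)


def runs_per_27_outs(runs: int, innings: str) -> float:
    """Runs scored per 27 outs, given total runs and an innings string."""
    return runs * 27 / innings_to_outs(innings)


# Totals quoted in the post for March/April of each season.
rate_2008 = runs_per_27_outs(3785, "7490.1")
rate_2007 = runs_per_27_outs(3360, "6670.1")

print(f"2008: {rate_2008:.2f} runs per 27 outs")
print(f"2007: {rate_2007:.2f} runs per 27 outs")
```

The innings totals are passed as strings so that the ".1" is read as one out rather than as a tenth of an inning.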

Huh?  What’s Joe talking about?

Here’s the data I’m using:
http://www.baseball-reference.com/pi/psplit.cgi?team=TOT&lg=ML&year=2007#dates-month
http://www.baseball-reference.com/pi/psplit.cgi?team=TOT&lg=ML&year=2008#dates-month

Either Joe misstated his facts, or Sean has a bug, or I’m reading something wrong.

I’ll let the Wisdom of the Crowd make the decision.

(45) Comments • 2008/08/19 • Sabermetrics, Sampling

Monday, February 11, 2008

How you can support just about any argument using silly statistics and logic…

By , 09:12 PM

This is from Chris Jaffe, no less, a baseball analyst.  While I have read plenty of his stuff and I recognize the name, I admittedly know little about him (and get him mixed up with the other Jaffe). This is also an example of how when you start writing for a (somewhat) mainstream web site or publication, you invariably develop a case of “I can write crap too, just like the rest of the guys (mainstream writers)...” (See my past comments about Keith Law.)

Read More

(26) Comments • 2008/02/16 • Sabermetrics, Sampling

Sunday, December 02, 2007

Do Baseball Insiders Really Understand Baseball (and Statistics)?

By , 01:39 AM

I did not really know how to title this entry, but…

Here is an article that, in my opinion, is a good example of how baseball “insiders” are woefully inadequate in understanding the confluence of baseball and statistics, such that it can and will lead to bad decision-making.

Read More

(6) Comments • 2007/12/03 • Sabermetrics, Sampling

Thursday, October 11, 2007

Why the Phillies, Cubs, Yanks, and Angels lost the DS

By , 12:44 AM

Actually, I’ll generalize to all teams that have lost any game or series throughout the history of baseball (and most other sports).  Their opponents probably outhit and/or outpitched them, likely outscored them, and definitely won more games than they did.  Oh, and several players on the losing teams had a bad game/series, worse than their regular season stats.  And the winning teams probably played with more heart, guts, guile, and confidence, and some of them were even teams of destiny.  Did I leave anything out?

(6) Comments • 2007/10/13 • Sabermetrics, Sampling

Friday, September 21, 2007

Eric Gagne

By Tangotiger, 04:08 PM

If we look at his 2002-2004 data, we see the following totals: 202 GB, 187 FB, 108 LD.  This year, he’s 52/54/32, which is almost exactly in line with his 2002-2004 performance.  Nate Silver points out the enormous flip in Eric Gagne’s GB to FB ratio between Texas and Boston this year.  Excluding bunts, in Boston he’s at 14/20/13.  If you divide his 2002-2004 data by 10, you get this expectation: 20/18/11, which means he’s given up a couple more FB, a couple more liners, and several fewer ground balls.  When your sample size is 50, that really means nothing.

Of his 14 groundballs, batters are 6 for 14.  But again, that’s 14 PA.  Of the 20 FB, batters are 6 for 20 (all extra base hits).  Of the 13 liners, batters are 11 for 13.  He’s given up 2 more groundball hits than he should have, a few more extra base hits than he should have, and one more line drive hit than he should have.  In high-leverage situations though (LI of 1.8 or higher), opposing batters reached base 13 of 21 times, which is horrible.  But still, it’s only 21 PA.

All this to say that with some 70 PA, Gagne needs to be evaluated on his mechanics and pitch effectiveness, and not on the resulting batter performance.  Rereading Nate’s piece, he says exactly this, and he’s right:

There may be scouting evidence that Eric Gagne is not the same pitcher in September that he was in June. But there is little or no statistical evidence based on an informed reading of his numbers.
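To put a rough number on "that really means nothing", here is a small Python sketch of a chi-square goodness-of-fit check of the Boston GB/FB/LD split (14/20/13) against the 2002-2004 mix (202/187/108).  This is my own illustration, not something from Nate's piece, and it makes a simplifying assumption: the 2002-2004 proportions are treated as his true rates, ignoring their own sampling error.  The function name is hypothetical.

```python
import math


def chi_square_vs_baseline(observed, baseline):
    """Chi-square statistic for observed counts against baseline proportions.

    The baseline counts are only used to set the expected proportions;
    they are treated as exact (a simplifying assumption).
    """
    n = sum(observed)
    base_total = sum(baseline)
    expected = [n * b / base_total for b in baseline]
    stat = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    return stat, expected


baseline_2002_2004 = [202, 187, 108]  # GB, FB, LD
boston_2007 = [14, 20, 13]            # GB, FB, LD, excluding bunts

stat, expected = chi_square_vs_baseline(boston_2007, baseline_2002_2004)

# Three categories leave 2 degrees of freedom, and for 2 df the upper-tail
# probability of the chi-square distribution is exactly exp(-stat / 2).
p_value = math.exp(-stat / 2)

print("expected GB/FB/LD:", [round(e, 1) for e in expected])
print(f"chi-square = {stat:.2f}, p = {p_value:.2f}")
```

Under those assumptions the p-value comes out around 0.3, meaning a split at least this lopsided would turn up by chance in roughly three of every ten 47-ball samples from the same pitcher.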

(13) Comments • 2007/09/24 • Sabermetrics, Sampling

Tuesday, August 14, 2007

A fascinating study, worthy of some discussion I think…

By , 01:33 AM

Here is the Study

There is a discussion of said article, where I made some comments, on BTF.

(95) Comments • 2008/04/07 • Sabermetrics, Sampling, Statistical_Theory

Monday, July 02, 2007

Another article I have a problem with…

By , 07:12 PM

This time from a sabermetric web site.  Where are their editors?

Read More

(12) Comments • 2007/07/10 • Sabermetrics, Sampling

Wednesday, January 24, 2007

What does 17 at bats mean?

By Tangotiger, 03:31 PM

Abbott Katz, in the November 2006 issue of By The Numbers, shows us that players who had exactly 17 at bats in a season hit .171 from 1959-2005.

Does it mean anything?  Obviously, if you only have 17 seasonal at bats, it means a lot.  It means you are a September callup, it means that you are on your last legs, it means you got hurt, it means that you did so badly that the manager doesn’t want to look at you.  It could mean a whole lot of things.  It might even mean that you suck.

In order to figure out more about what it means, you need to look at data outside the sample you selected.  And that means looking at the data in the seasons before and after that selected season.  Which I will do right now:

Read More

(10) Comments • 2007/01/25 • Sabermetrics, Sampling

Wednesday, August 23, 2006

Selective Sampling - How NOT to Choose Players

By Tangotiger, 08:21 AM

Cy Morong takes a look at establishing the replacement level.  He says:

Read More

(17) Comments • 2006/08/24 • Sabermetrics, Sampling