THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

Filter posts by...

 

Streaks

Saturday, May 12, 2012

Performance after an error

By Tangotiger, 08:34 AM

Great idea to check for performance of pitchers following an error.  Obviously, this method is a bit crude, but it gets us started.  Somebody want to expand on this, and simply look at wOBA of the pitcher following an error, and compare to their overall wOBA?

It should also be noted that GB pitchers get more errors behind them simply because they are GB pitchers.  Errors per GB will far outstrip errors per FB.  (That’s a big reason that Webb has a huge number of unearned runs and Johan Santana has a tiny number. )

(0) Comments • • SabermetricsStreaks

Thursday, May 03, 2012

HR do not come in bunches

By Tangotiger, 02:58 PM

So shows Ben over at BPro.

First chart is for the population of players, and the second one is Albert Pujols.  Of course, I think most people would mean “bunches” as in many HR spread over a short period of time.  But, naturally, if you are going to have that kind of distribution, you are going to get more multi-HR game than expected.  So, while you can ask Ben to followup with a part 2 (and that’s fair of the reader), the likelihood is we’re going to find nothing.

There isn’t a streak alive that a person hasn’t tried to build a narrative around.  Random variation is such a real concept that it makes everything you see actually not real!

hrbunches.png

(8) Comments • 2012/05/04 • SabermetricsStreaks

Thursday, October 06, 2011

Hot hand free throws

By Tangotiger, 01:09 PM

Study:

Over all 5 seasons examined there is a improvement in the second attempt success rate conditioned that the first attempt was good. This data is for the mean of the individual level, eliminating any skew of the results due to aggregation.

And data is made available.

Glove-slap: mettle.

(2) Comments • 2011/10/06 • SabermetricsStreaksOther SportsBasketball

Tuesday, September 13, 2011

Momentum is this inning’s reliever

By Tangotiger, 12:55 PM

Great stuff by Ben Blatt.

All games between 1980 and 2010 in which the game was tied in the ninth inning, a total of over 2300 games, were included in the data set.
- When the home team blew the lead in the ninth, it won in extra innings 53.1% of the time.
- When the home team came back in the ninth, it won in extra innings 53.8% of the time.

(12) Comments • 2011/09/14 • SabermetricsStreaks

Wednesday, September 07, 2011

Consistency value

By Tangotiger, 01:39 PM

Will asked me about the value of consistency.

He gave me an extreme situation, and I responded that from a pitching standpoint, you’d rather have the guy who gives up 0, 20, 0, 20, 0, 20 runs rather than 10, 10, 10, 10, 10, 10.  The inconsistent pitcher will win .500, while the consistent one will be close to .000, even though both gave up 10 runs per game.  The hitting is the exact opposite, so the preference would be on a consistent hitter.

So, technically, yes, consistency matters: you want it in hitters, and don’t want it in pitchers (all else being equal).  Practically though, players are not like that.  You’re not going to get anything close to an extreme situation that it’ll matter.  That is, first you have to be able to identify a consistent/inconsistent player, and then you have to establish the degree to which he’s like that.  To then put a value on that, and I’d be shocked if we’re talking about more than 0.1 wins for a player.  Basically, I called it a rounding error.

(14) Comments • 2011/09/08 • SabermetricsStreaks

Friday, May 06, 2011

Re-ordering batting streaks

By Tangotiger, 12:20 AM

It’s an interesting exercise, and well-presented.  I think the most likely explanation is that the PH and non-PH games were intermingled.  What do you guys think?

(10) Comments • 2011/05/06 • SabermetricsStreaks

Thursday, January 20, 2011

Streakiness

By Tangotiger, 04:26 PM

I’m just waiting for part 2.  So far, all we’ve seen is that David Wright’s 2010 season was streaky, but we still don’t know if he just happens to be one of the extreme point (i.e., SOMEONE has to be streaky, just by luck), or if the number of streaky players is larger than expected by random.

I’ll bet we’ll only see a bit (not much, but some) of streakiness, for the following reasons:
1. you see the same pitcher in 3 or 4 PA
2. same park
3. same weather
4. same health

By re-ordering the PA as the author is doing, he is removing all this.  So, just by virtue that a player is going to be the same guy in the same day facing the same pitcher on 3 or 4 occasions, we expect SOME persistence, over and above any “streaky” factor innate to him as a player.  Once you factor all that out, I would say the streakiness factor will be close to zero.

(10) Comments • 2011/01/21 • SabermetricsStreaks

Wednesday, July 21, 2010

Hot Pitchers better than Cold Pitchers, of equal talent

By Tangotiger, 03:13 PM

Bill James study.  The degree to which we observe a difference was pretty big for me:

From the years 2000 to 2009, I identified 504 “matched sets” like this in which two starting pitchers had nearly-identical records, but one was hot and the other was not.  Details:
...
For example:
Randy Johnson as of September 5, 2000, had made 30 starts with a won-lost record of 17-6, 2.45 ERA, 299 strikeouts.
Randy Johnson as of September 7, 2001, had also made 30 starts with the same won-lost record (17-6), same ERA (2.45), but 320 strikeouts.  But he was ten degrees hotter at that time in 2001 than he was in 2000.
...
The “hot” pitchers, in their 504 “next starts”, had a won-lost record of 199-175, an ERA of 4.28, and an average Game Score of 50.62. The “cold” pitchers, in their 504 next starts, had a won-lost record of 177-177, an ERA of 4.74, and an average Game Score of 47.94.

I like the overall idea.  The results seem pretty large.

I think we need better controls.  Pitchers with a 177-177 record (i.e., .500 on 354 decisions) don’t have a 4.74 ERA.  If they did, they come from hitter’s parks in the higher run seasons in the 2000-2009 decade.  In order to do a study like this, you have to eliminate potential sources of bias.  So, runs allowed per league average, adjusted for park, would have been preferable.  I don’t know that we even need to look at the W/L record.  There’s a few more tweaks that can be done.

Overall, I like it, I like the idea, I like the execution.  It just needs different eyes looking at it from a similar angle.

(38) Comments • 2010/07/27 • SabermetricsStreaks

Friday, July 16, 2010

And yet more HR derby hangover effects

By Tangotiger, 12:03 PM

Matt takes a look, and he does it along the lines of what MGL did for the hot/cold streaks in The Book: focus on a very narrow time frame, on the idea that if something were to happen, it would be most noticeable immediately.

As you’d expect, little to no change.  The article’s value is in its process, not in its findings (or lack of).  People who read the article and say they learn nothing, well, you missed learning something.

***

I also want to point out, as I always do, that just because something is statistically significant, it does NOT mean that that particular observed difference is the estimated true difference.  All we know is that we estimate that we have a true non-zero difference. 

In short, it tells you that there is a difference, but it says nothing about how much the observed difference is the true difference (other than the higher the t-test, the more the difference is real).

(11) Comments • 2010/07/17 • SabermetricsStatistical_TheoryStreaks

Wednesday, June 16, 2010

Past history v Future story

By Tangotiger, 03:01 PM

Fack Youk captures it for me:

[Halladay v Yankees is] a valid storyline. He’d pitched more than a full season’s worth against the Yankees in his career and had great results. How do you not bring that up?… Essentially, what you are looking at when you try to analyze Halladay’s career stats against the Yankees (or any pitcher’s line against a certain team) are a bunch of very small samples, recorded over a very long time, and smushed together to look like one big one....But there’s two parts to this - the validity of the stats and how they are being used.… So why do we fans have the desire read about games before they happen? Why do we bother to write a preview for all 162 of them here?… More specifically, when I write a preview or read one that someone else has written, it’s because I want to have an understanding of any trends and storylines coming in and try to develop some sort of a framework that will make what’s unfolding on the field a little more coherent and interesting to me. Some of those things might be statistical, but the personalities and rivalries and the who-owns-who are compelling in their own way, even if they don’t pass the statistical smell test.

Basically, don’t conflate what has happened with what may happen, no matter how much you try to dissect each player’s career.

(2) Comments • 2010/06/17 • SabermetricsStreaks

Tuesday, June 08, 2010

The Individual Perfect Game

By Tangotiger, 04:16 PM

A perfect game is when a team’s batters gets 27 straight outs.  Well, how about a single batter who gets 27 straight outs?

Andy Fox went 38 plate appearances in 2004 without reaching base safely.  Joe McEwing had 34 PA in 2002.  Wil Nieves had 31 over a span of THREE YEARS!

Those are the only three players with at least 27 straight PA of not reaching base, since 1993.

(Note: actually, baseball-reference counts streaks on a game-basis rather than PA-basis.  I’ll leave it to someone else to see if in the surrounding games they extended their streaks.)

(2) Comments • 2010/06/09 • SabermetricsStreaks

I thought this was a good enough quote to merit its own thread

By , 12:23 AM

All players need rest. Benching a player for one day here or there because he has been cold or because you didn’t like the color of his shoes, or for whatever reason, is fine.  Same with dropping a player in the batting order.  It’s just not that big a deal.  Just don’t pretend that somehow that is going to make your team better for that day. It doesn’t. It makes it worse.  You can sometimes do the right things (rest a player occasionally) for the wrong reasons (because he has been “cold").

Monday, April 12, 2010

Hot goalies will win you the Stanley Cup

By Tangotiger, 10:07 AM

So yes, you need a ‘hot’ goalie to win the playoffs.  It’s just not possible to figure out who’s going to be ‘hot’ until after the fact.

(0) Comments • • SabermetricsStreaksOther SportsHockey

Thursday, March 18, 2010

Trends in spray charts?

By Tangotiger, 12:47 PM

Jeff takes a look, and this at least has ALOT more potential than looking at OBP and SLG “trends”.  Love the idea.

Wednesday, June 03, 2009

Joe Mauer

By Tangotiger, 11:28 AM

Walt Davis says:

Still, it’s over a 5-sigma variation which does suggest that this is a “process out of control” but all that tells us is that the “failure” rate (where each HR is a “failure") is higher than it used to be. The question then becomes what’s the new rate. It will be several hundred PA before we’ll have a good estimate of that but I’ll just point out that Adrian Gonzalez had 11 HR in May in 5 fewer PA so there’s no reason to think Mauer’s new level of talent is any higher than Gonzalez’s (who, to this point, has been a 25-35 HR guy though obviously he’s on a torrid pace right now).

And as always, I point folks towards Steve Dillard, Aug 2-17 1979—this sort of thing can happen “randomly.”

My response:

Read More

(3) Comments • 2009/06/04 • SabermetricsStatistical_TheoryStreaks

Monday, May 18, 2009

Another nail in the Hot Hand coffin

By Tangotiger, 03:54 PM

The Baseball Analysts has now become my 1A to the 1 that is Hardball Times.  This time, it’s Sky that looks at how hitters perform after they’ve been on a 30-game hitting streak:

The “during streak” line is actually “performance from game 31 until his last hitless game” (as best as I can figure). 

(10) Comments • 2009/05/20 • SabermetricsStreaks

Tuesday, April 21, 2009

The Myth of the President’s Trophy Curse

By Tangotiger, 01:00 PM

Carl Bialik:

But in the 22 prior NHL seasons since the trophy was first handed out to the Edmonton Oilers in 1986, President’s Trophy winners have won the Stanley Cup seven times, including the Detroit Red Wings last year. Considering that 16 teams make the playoffs each year, that’s a remarkable run of success.
...
In Major League Baseball during that span, by contrast, just three teams — the 2007 Boston Red Sox, 1998 New York Yankees and 1986 Mets — have won the World Series after finishing with at least a share of first place for regular-season wins.
...
Since the ABA-NBA merger, 15 of 33 champs have at least tied for the best record, compared to nine titles for teams that finished in second or a tie for second. (Good news for the Cleveland Cavaliers, who won their opener against Detroit by 18 points.) And 20 of 39 Super Bowl champs since the AFL-NFL merger held the league’s best record, or finished in a tie — compared to 11 for second-place finishers.

(0) Comments • • SabermetricsStreaks

Monday, October 27, 2008

Do first_half/second_half splits mean anything?

By , 08:11 PM

I thought the results were important enough to warrant its own thread, rather than continuing the last one on first half, second half splits.  If you didn’t follow the discussion in the thread I was talking about, here is the link:

http://www.insidethebook.com/ee/index.php/site/article/first_half_second_half_splits/

To summarize the discussion, a few people pointed out some unusually large first half, second half splits in performance for some players.  The discussion is whether for players in general, those splits “mean” anything, which is the same thing as asking whether they have any predictive value, which is the same thing as asking whether they correlate to any degree from one year to another.  For example, we find that platoon splits for RHB have very little predictive value.  No matter what a RHB platoon splits are in any given time period, they will tend to revert to near league average for all RHB in any other time period.  For LHB, there is some predictive value - the larger the sample size of data we have, the more predictive those sample results are.

For RHB (since there is some predictive value), we might need 10 years of split data to “tell us anything” about that player’s true talent platoon ratio or difference.  For LHB, it might be 2 or 3 years of data.

Anyway, I was skeptical that any sample of first half and second half splits means anything, i.e., has any predictive value.  Of course, even if there is a tiny amount of predictive value in any sample data, if the sample is large enough we eventually get tremendous predictive value. But, in baseball, we really only get to use one year at the least and maybe 5 or 10 years at the most, worth of data to have any practical significance, of course.  If we have to wait until we get 15 or 20 years of data for it to have much predictive value, that is not particularly interesting, to me at least.

Anyway, one way to see how much predictive value there is in a certain amount of data, we can run a regression from one time period to another.  If the correlation is really low, then there is little or no predictive value to that particular stat for that amount of data (the number of opportunities underlying each element in the regression).  Hopefully we have enough data (both data points in the regression and a decent sample size for the underlying number of opportunities for each data point), such that the uncertainty in the resultant “r” is fairly low (a small standard error).

I did such a regression on first half, second half splits. Here is the methodology and the results: 

Read More

(15) Comments • 2008/11/04 • SabermetricsStreaks

Monday, October 06, 2008

Last time each team won a playoff series

By Tangotiger, 11:33 PM

Nice list.

(0) Comments • • SabermetricsStreaks

Monday, June 23, 2008

Time Zone change and effect on performance

By Tangotiger, 02:19 PM

Phil looks at it on the team level.

Phil: do you count the Expos games as “home” or “away” when in P.R.?

(9) Comments • 2008/06/23 • SabermetricsStreaks
Page 1 of 2 pages  1 2 >

Latest...

COMMENTS

May 26 13:18
What makes for a successful GM?

May 26 12:47
Do pitcher’s reach back for velocity when needed?

May 26 07:27
“Why Kickstarter works”

May 26 03:03
Pete Palmer’s new book: Basic Ball

May 26 01:11
Largest demonstration in Canadian history?

May 25 19:41
What sabermetrics is NOT

May 25 16:59
Howard Stern

May 25 12:51
Chad Curtis

May 25 11:26
Lack of hustle during a game

May 25 10:58
Rooting for laundry

THREADS

May 26, 2012
What makes for a successful GM?

May 25, 2012
Pete Palmer’s new book: Basic Ball

May 25, 2012
“Why Kickstarter works”

May 25, 2012
Chad Curtis

May 25, 2012
Which pitchers are the forecasters betting on a good rest-of-season?

May 25, 2012
What sabermetrics is NOT

May 25, 2012
Sports pic of the year?

May 25, 2012
Do pitcher’s reach back for velocity when needed?

May 24, 2012
Largest demonstration in Canadian history?

May 24, 2012
Rooting for laundry