THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Friday, October 24, 2008

First half, second half splits

By Tangotiger, 10:04 AM

Dewey was one of my favorite players.  He’s also played in only three all-star games, a shockingly low total for a player who is at least a borderline hall of famer.  Jim Rice has been in 8, and Fred Lynn in 9.  Dave Parker in 7.  Dave Winfield 12.  Andre Dawson 8.  Those are his peers, more or less.  I could go on, but I’d guess all his peers were in at least 5.  The reason, back when I was a kid and didn’t really study the issue, was that Dwight Evans would get hot in the second half, and so, lost out on the half-year/popularity All-Star game.  His career stats show him with about a 13 point improvement in wOBA in the second half based on around 5000 PA.  One SD is around 7 wOBA points, so, there may be something there, not only in my memory (13 points) but even in the significance of his second-half performance.  Then again, I cherry-picked him and so, we expect some players to be at the 2 SD level, just by chance.

We have a similar player in our midst: Johan Santana.  (Hat tip: Joe Poz reader.) On around 3000 PA, his wOBA difference is around 28 points or so comparing 1st and 2nd half.  One SD is 9 points, so he’s at THREE SD.  You expect to see 99.8% of all data at between -3 and +3 SD.  It’s certainly possible that Johan is among the handful of players that simply is at the extreme range by pure luck.  But, why does it have to be the best pitcher of this decade?

I’d love to see someone tackle the issue of half-splits (preferably using the All-Star game as the split) by this method and see who are the true extremes, and if the standard deviation of all the z-scores is more than 1, or equal to 1.


#1    MGL      (see all posts) 2008/10/24 (Fri) @ 12:14

Or just run a correlation for all players.  My guess is that it is as close to zero as possible, especially if you remove catchers.  In fact, I will lay 2-1 that the correlation is between .1 and -.1.  Once you do that (run the correlation), and it is near zero, you’ve pretty much ended the story.  The essentially means that everyone’s splits gets regressed nearly 100%, so who cares what any individual’s splits are, whether it is 1 SD or 3 SD.

With all due respect to Tango (in this case, I actually mean that), I hate looking at a split and pointing out one or more players that have an extreme one, without first looking at correlation or a similar measure (such as what Tango is suggesting, which is to look at the SD of the Z scores, to see if the distribution looks random, I guess, or is more or less spread out - less is unlikely of course).

I mean, we can play this game forever.  Make a list of the players who have extreme splits for anything you can possibly imagine (1st half.2nd half, day/night, home/road, half the opposing teams/the other half, odd days/even days, ad infinitum).  My guess is that virtually every single one of them will have a regression of near 100% other than the obvious ones that we know have a spread in “talent”, like some platoon differentials.


#2    Tangotiger      (see all posts) 2008/10/24 (Fri) @ 12:20

The correlation won’t help, because it’s only 300 PA in each sample.  So, I don’t like year-to-year correlations.  If the r=.50 at PA=3000, then obviously, the r will plummet at PA=300.  In this illustration, that would make r=.09.

That’s why I suggested someone run the SD of all the z-scores.  I wouldn’t be surprised that you would get something like an SD of 1.10.

Like I said, Santana is a 3 SD, and I never come across stuff like that.  2 SD, sure, every now and then.  But never 3 SD. And that it’s the best pitcher in the league makes it stand out a bit more.


#3    Tangotiger      (see all posts) 2008/10/24 (Fri) @ 12:21

"year-to-year” in this case would be “in-year”


#4    terpsfan101      (see all posts) 2008/10/24 (Fri) @ 16:15

The difference between Delgado’s 1st and 2nd Half splits are pretty extreme as well.

Half PA OBP SLG
1st 4812 .373 .526
2nd 3733 .397 .573

Beltran as well

Half PA OBP SLG
1st 3643 .346 .474
2nd 2877 .371 .525

I’d love to see somebody research 1st and 2nd half splits as well.


#5    Tangotiger      (see all posts) 2008/10/24 (Fri) @ 16:25

30 wOBA difference for Delgado, and 32 for Beltran.

Can you show us the league difference?  This could be a heat issue.

Each are over 3 SD in difference between their two splits.  Clearly we’ve got something going on here…


#6    terpsfan101      (see all posts) 2008/10/24 (Fri) @ 16:52

To get the league difference, I would have to copy and paste the 1st and 2nd half stats for each league from Baseball Reference for the years Delgado and Beltran were in the big leagues. Delgado and Beltran were just the first 2 names that came to mind when I read your post.

It wouldn’t be that difficult to compile the splits with a PBP database. You would need to know the dates that the all-star game occurred for each season. What about the seasons where there were 2 all star games?


#7    dq      (see all posts) 2008/10/24 (Fri) @ 18:00

Being an old guy, first names that came to my mind were Barry Zito, Rick Monday, Manny Trillo and Bill Madlock. Using OPS +

Santana is 89/109 + 20

and Evans 95/105 + 10

So..

Zito 90/109 + 19
Trillo 107/92 - 15
Madlock 92/110 + 18
Monday 106/91 - 15

First 4 guys I think of are more extreme than
Evans

I used to call Monday Mister April before anyone ever heard of OPS.

I asked BBR for the ability to show splits pre post All-Star game awhile back.


#8    tangotiger      (see all posts) 2008/10/24 (Fri) @ 18:28

wOBA is ROUGHLY:

(OBP*1.75+SLG)/3

1 SD is ROUGHLY .5/sqrt(PA)

So, if you guys can report the number of SD, that’d be a bit more helpful.

But, clearly, since we can easily find such players shows there’s something there.

***

I looked at the 2005-08 splits, and in 07 and 08, the split was about +9 wOBA points in the second half.  In 06, it was even.  In 05 it was -3 in the second half.

So, we need to be careful.  These 4 suggest around a +4 average in wOBA in the second-half, almost certainly due to hot weather in Jul/Aug.


#9    MGL      (see all posts) 2008/10/24 (Fri) @ 18:32

I am afraid you guys can up with players left and right, and I can only yawn.  Tango, if in fact for 3000 PA the r (and thus the regression) is .5, then all of those one-year splits would get regressed about 90%.  So why should we care about them, unless you want to get excited about 10% of those observed numbers.  First, let’s see about that r.


#10    terpsfan101      (see all posts) 2008/10/24 (Fri) @ 18:42

To prove anything here, we are going to need complete first-half and second-half statistics for all players. Sean Forman has already compiled these. I’ll send him an email asking him if he can send the data to Tango.


#11    terpsfan101      (see all posts) 2008/10/24 (Fri) @ 18:54

I sent Sean an email. So be on the lookout Tom for an email from Sean or Baseball Reference.


#12    tangotiger      (see all posts) 2008/10/24 (Fri) @ 19:11

That .5 was purely for illustration.

I don’t see how you can just yawn here.  We’ve come up with a handful of players already who are at 3 SD and higher.  Since 99.8% of all players will have splits at less than 3 SD, that we’ve already been able to come up players outside that range already tell us something.

You won’t find that with career clutch splits.  Not even close.

We also don’t want to do in-season correlation because the r will be very low, since you’ve only got 300 PA to work with.

The best way to do it is to do what we’re doing, for all players, and then finding the SD of the z-scores.  I said we’d get the SD at 1.1, and now I think it’ll be more like 1.2.


#13    tangotiger      (see all posts) 2008/10/24 (Fri) @ 19:17

Anyone with the Retro DB can compile the split stats.  It’s a matter of who actually wants to spend the time to do it.


#14    dq      (see all posts) 2008/10/24 (Fri) @ 22:52

Mickey Mantle

.437/.602 = woba .456 by #8 above
.420/.535 = woba .423

difference is .033, which based on what I think you said in 8 above is 4 sd.


#15    MGL      (see all posts) 2008/10/24 (Fri) @ 22:56

We also don’t want to do in-season correlation because the r will be very low, since you’ve only got 300 PA to work with.

First of all, you can’t do “in-season correlations.” You have to do one year the next, or something like that.  You are correlating each player’s 1st half, 2nd half differential from one year to another year.  And yes, you are only going to have 200-300 PA for each half. And rather than the correlations, we can of course, look at the average distribution of differentials over several years, and see how close it is to 1.0 (standardized SD).  I assume that is what you are talking about.

The point I was trying to make, and you actually made it for me with your hypothetical example, is that if we find a large correlation, say .2 or so (I would consider that large for such few PA), then we probably have something.  But if our underlying samples (the 300/300) are so small that we don’t find anything (say, less than .1), which is what you are suggesting might be the case, then that automatically tells us that one year splits are almost meaningless.  Isn’t that what we are discussing here - whether one year splits are meaningful or not?  Now, whether 5 or 10 year splits are meaningful (maybe get regressed 50 or 60%) is another story.  I would not be surprised if, as with clutch hitting, they are. (Of course the other problem we could have is that between few PA in each data point in the correlation (300/300 or so) AND a limited amount of data points, we might have a large uncertainty in the resultant r.  For example, if we come up with .1, but with a standard error of .1, that .1 is not very meaningful one way or another, sort of like your “momentum” results wink.)

Anyway, if we find that there is some correlation, but that it takes 2000 or 3000 PA for there to be anything less than a heavy regression, well, having to wait until a player is basically done with his career before a split becomes meaningful is pretty uninteresting to me. Basically the case with clutch hitting, right?

I’ll see if I can run some numbers.  First think obviously is to establish the 1st half/2nd half baselines.  I will try and remove catchers if it is not that difficult (it may be too much work), as it is generally thought that as a group they do worse in the second half.  Although I am not sure that one group having a (presumably) fixed different differential is going to affect the correlations much at all.


#16    terpsfan101      (see all posts) 2008/10/25 (Sat) @ 04:47

Tom,

I compiled 1st and 2nd half splits for hitters from 1974-2007. I sent an e-mail to your yahoo address that contains a zip file with the stats. I didn’t include years prior to 1974 because the PBP data isn’t always complete. If somebody could tell me how to parse the box-score files than I could generate splits dating back to 1954.

If anyone else is interested in 1st and 2nd half splits, I can e-mail you the zip file.


#17    terpsfan101      (see all posts) 2008/10/25 (Sat) @ 05:06

I forgot to mention that player’s who had multiple stints with the same team during the same season have a 0 in the stint column.


#18    dq      (see all posts) 2008/10/25 (Sat) @ 09:53

The random odds of a player having the strong half and weak half 6 times in a row would be 1 ouf of 32.

So, if we determine there is significance to 1st and 2nd half splits, wouldnt we be fairly sure after 6 years


#19    dq      (see all posts) 2008/10/25 (Sat) @ 09:55

terpsfan101

Can you email me the zip file?

DQuinn1575 at wowway.com

(As Im surre you know, with the @ replacing the at)

Thanks a lot


#20    Colin Wyers      (see all posts) 2008/10/25 (Sat) @ 10:24

This is just spitballing hre, but for my money the most logical effect to find would be that players tend to perform better in the first half than the second half.

Why? Because playing time is allocated based upon performance. So if you have two players, both with a true talent wOBA of .338, one of whom hits .308 in the first half and the other who hits .368 in the second half, the second player is the one who will be playing in the second half.


#21    MGL      (see all posts) 2008/10/25 (Sat) @ 12:09

So, if we determine there is significance to 1st and 2nd half splits, wouldnt we be fairly sure after 6 years

No, that is not the way it works.  Would that it were, life as a baseball researcher would be easier.

Colin, good point about the selective sampling effect.  I’d have to think about how that would effect things.

terps (what is a “terp”?), can you send me the file, please?


#22    terpsfan101      (see all posts) 2008/10/25 (Sat) @ 13:26

Terps is short for Terrapin. A terrapin is a turtle. It’s the University of Maryland’s mascot.

I just sent you the file MGL.

I sent DQ the file as well.


#23    terpsfan101      (see all posts) 2008/10/25 (Sat) @ 15:46

Some people will probably be disappointed that I didn’t include Games, Runs, and RBI in the splits.

This was mainly just a preliminary excericse to see if I could generate split data. When Retrosheet releases new data, I will compile Home/Away, 1st/2nd Half, and Bases Empty/Runners On splits for batters and pitchers. These are the only splits that have practical value in my opinion.

And if somebody could tell me how to compile the data in the Box Score Event Files, then this process will go a lot smoother.


#24    dq      (see all posts) 2008/10/25 (Sat) @ 18:32

21
If you have 10% of the players who have extreme half splits and 90% that don’t, then if we have 8 good 1st halves, that is a 1 out of 128 chance - 0.7%

So we now have a 0% chance he is a good second half player.

But, do we have a 5% chance he is a good 1st half player, versus a 0.7% chance * 90% that he is a fluke?

Do we take 5% versus .63% (0.7 * 90%)?

I think that makes it 88% that the player has the trait, and 12% it is a fluke.


#25    MGL      (see all posts) 2008/10/25 (Sat) @ 20:55

#24, I don’t follow.

You originally said:

So, if we determine there is significance to 1st and 2nd half splits, wouldn’t we be fairly sure after 6 years?

That can’t be true, because it depends on the spread of “splits talent” (the “significance” of the splits, as you say).

If the “significance” is small, then it may take 60 years for us to be “fairly sure.” If the significance is large, then it make 2 years or 3 years, or whatever.


#26    terpsfan101      (see all posts) 2010/08/16 (Mon) @ 15:38

Adam Laroche has increased his second half production for the 5th season in a row and 6th season out of 7 for his career:

Career 1st Half: .341 wOBA, .252/.327/.449
Career 2nd Half: .393 wOBA, .302/.364/.549

2006-2010 1st Half: .341 wOBA .249/.329/.450
2006-2010 2nd Half: .405 wOBA .315/.375/.569

Year: wOBA 1H, wOBA 2H
2004: .313, .405
2005: .359, .323
2006: .355, .437
2007: .338, .376
2008: .332, .416
2009: .340, .395
2010: .342, .409

I used Tango’s quick wOBA weights:

0.7: NIBB, HBP
0.9: 1B, RBOE
1.3: 2B, 3B
2.0: HR


Page 1 of 1 pages


Name (required)
E-Mail (optional; WILL be published)
Website (optional)

<< Back to main


Latest...

COMMENTS

May 25 06:43
Largest demonstration in Canadian history?

May 25 06:39
Lack of hustle during a game

May 25 05:00
Help needed with sticky issue…

May 25 02:38
NFLPA lawsuit against collusion

May 25 01:43
Neal Huntington’s best moves

May 24 23:50
Rooting for laundry

May 24 17:04
Firefox, IE, or Chrome?

May 24 12:07
How to beat the shift

May 24 11:11
Incredible story

May 24 09:41
Racial bias in card collecting: not the collectors, but the players on the cards