THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Friday, October 21, 2011

Performance in tied games

By Tangotiger, 09:38 AM

Note: You may feel you are walking in the middle of a conversation.  You can take a step back and go here.

***

Ok, given that we accept that there are pecularities with the 9th inning, bottom half, tied game scenario (unintentional-intentional walks; fielders de-aligned from norm to either defend against the bunt, to guard the line, or to play the OF shallow), I’m now going to focus only on the tied-game scenarios for all innings.

So, here’s the A_empty group (as a reminder, all bases empty situations, regardless of outs), for tied games, broken down by inning (1st, 2nd thru 8th, 9th, extra innings):

topBot inn     obp      babip      wBABIP     diff
0    1     0.333      0.297      0.297      0.000 
0    2.8     0.325      0.289      0.289      0.000 
0    9     0.326      0.295      0.289      0.006 
0    10     0.336      0.302      0.298      0.004 

1    1     0.360      0.310      0.309      0.001 
1    2.8     0.337      0.298      0.298      
(0.001)
1    9     0.349      0.313      0.307      0.006 
1    10     0.361      0.315      0.310      0.005

First 4 lines is top of inning, and next 4 is bottom. 

Our “control” group is basically innings 2 thru 8.  As we can see, at home, you get a 12 point advantage in OBP, 10 point advantage in BABIP, and since wBABIP=BABIP that means the rate of extra base hits per in-play hit remains constant (i.e., excludes HR). 

wBABIP is BABIP but with extra weight placed on extra base hits and less weight placed on singles.

As we saw in the other thread, the massive difference in OBP in the 9th inning is explained by the walk rate and singles rate.  The extra innings carries this issue as well.  So, any analysis of tied games in the 9th inning has to be able to handle these parameters, otherwise you are going to introduce a bias.

The interesting one though is the 1st inning.  We ALSO see a massive gap in the 1st inning, with a 27 OBP point gap at home than on the road.  As we said, 12 points of that is the home field advantage.  The K and BB rates are simply hugely different, as it plummets to 14.5% K rate in the bottom half (compared to 17.3% in the top half) and jumps to 10.0% in BB+HB rate in the bottom (compared to 8.7% in the top).  It might be clearer to see it as a K to BB+HB ratio: 2.00 in the top half and only 1.45 in the bottom half.

My original speculation was that the road pitcher, being the second pitcher to take the mound, simply hasn’t found his mound groove.  I wish I would have included one other parameter in my dataset (which I can do on the weekend): handedness of pitcher.  If both pitchers are right-handed, they may “compete” for the mound groove.  If they were opposite-handed, then it might make it easier for each to take their own side.  Just a thought, one that I will also leave with the PITCHf/x-ers.  Anyway, someone asked about day/night splits maybe.  So, let’s see.

Here are the splits by day:

topBot inn     obp      babip      wBABIP     diff
0    1     0.327      0.292      0.292      0.000 
0    2.8     0.321      0.286      0.286      
(0.000)
0    9     0.325      0.300      0.294      0.006 
0    10     0.338      0.306      0.302      0.004 

1    1     0.356      0.308      0.307      0.001 
1    2.8     0.335      0.294      0.295      
(0.001)
1    9     0.349      0.309      0.302      0.007 
1    10     0.362      0.315      0.311      0.003

The control group (innings 2 thru 8) shows a 14 point home advantage for OBP and a 8 point advantage in BABIP.  There’s no extra base hit advantage.

Innings 9 and extra innings shows the same massive gap, as explained earlier. 

Inning 1 shows a 29 point OBP advantage at home, and 16 point advantage on BABIP, numbers that are a bit bigger than overall.

Here it is at night:

topBot inn     obp      babip      wBABIP     diff
0    1     0.335      0.299      0.299      0.000 
0    2.8     0.327      0.291      0.291      0.000 
0    9     0.326      0.292      0.287      0.005 
0    10     0.335      0.299      0.296      0.004 

1    1     0.361      0.312      0.310      0.001 
1    2.8     0.338      0.299      0.300      
(0.000)
1    9     0.349      0.314      0.309      0.005 
1    10     0.360      0.315      0.309      0.005

The gap is somewhat smaller at night in terms of home advantage.  The control group shows a 11 point OBP advantage and 8 point BABIP advantage.  The first inning shows a 26 point OBP advantage and 13 point BABIP advantage.

Across the board, the day time advantage is an extra 3 points in OBP.  I would not be surprised that if you look at historical home/away splits, that the day splits show a slightly higher home advantage.

Anyway, let me now focus on the B_2out grouping (all PA with runners on base and 2 outs).  Maybe by this point, the mound familiarity starts to come into focus (and maybe not using the windup helps too in combating the mound effect).

topBot inn     obp      babip      wBABIP     diff
0    1     0.354      0.292      0.294      
(0.002)
0    2.8     0.336      0.290      0.290      (0.000)
0    9     0.327      0.281      0.277      0.004 
0    10     0.333      0.294      0.290      0.003 

1    1     0.372      0.305      0.307      
(0.002)
1    2.8     0.350      0.293      0.293      (0.000)
1    9     0.337      0.289      0.269      0.019 
1    10     0.361      0.307      0.286      0.020

Control group shows a 14 point OBP advantage and 3 point BABIP advantage at home.  The first inning effect shows only an 18 point OBP advantage at home, but 13 point BABIP advantage.  The OBP gap between 1st inning and control group is no longer very wide.

I think the next thing to look at is simply by 1st, 2nd, 3rd batter in the game.  I’d bet we’ll see a huge gap in OBP when comparing the 1st batter’s first plate appearance in games at home and on the road.  And then a progressively smaller gap for the 2nd batter faced in the game, and 3rd batter.  And that by some point, maybe the 2nd inning, the gap remains static.

If it’s not due to the mound, then perhaps this effect is that the road pitcher has to sit around waiting to pitch in the 1st inning.  The home pitcher gets to pitch right away after his mound session.  Maybe the solution is that the road pitcher should warm up during the top of the 1st inning even?


#1    MGL      (see all posts) 2011/10/21 (Fri) @ 12:15

Good stuff. I assume that none of these numbers is adjusted for the pool of batter/pitchers?  Obviously this makes a big difference in comparing rows, and maybe a small difference in comparing home and away.

I would love for you to at least have a column for the seasonal (or career) wOBA of the pool of batters and pitchers for each group (row).

Also would love a HR, BB, and K% column, or at the least HR%…


#2    Tangotiger      (see all posts) 2011/10/21 (Fri) @ 14:25

I didn’t track the identity of the players for this go-round.

Here are all the elements I pulled out.  Let me know how you want the breakdown (groupings), and I’ll generate it:

daynight_park_cd
resp_pit_start_fl
bat_home_id
inn_ct
home_diff_score_ct
start_bases_cd
outs_ct
event_cd
att_sacbunt_fl
event_outs_ct
event_runs_ct
n


#3    Myron S.      (see all posts) 2011/10/21 (Fri) @ 14:36

In addition to “n” as cited in #2, what’s the variance in wOBA and wBABIP within each row?


#4    Tangotiger      (see all posts) 2011/10/21 (Fri) @ 14:40

How would I calculate variance (in this case)? 

What are the entities to which I am calculating wOBA, and of which I will then calculate the standard deviation of those wOBA?


#5          (see all posts) 2011/10/21 (Fri) @ 15:19

You have n pulls from a multinomial distribution (BB, 1B, 2B, 3B, HR, Out, etc..).  You are then getting an estimate for the mean of a linear combination (using the wOBA weights or wBABIP weights) of the number of each event.  The variance(or sd) on that linear combination is what Myron is asking for.  I believe you’ve quoted this before as:

sd(wOBA) = sqrt(wOBA * (1.1 wOBA) / N)

which is very close to the theoretical value for using multinomial theory.  Not sure for wBABIP, but it is probably close.

The real issue is are the differences in the “diff” column significant, which means you should calculate the SD for diff, which is a different linear combination of the event counts.


#6    Tangotiger      (see all posts) 2011/10/21 (Fri) @ 15:37

There’s really no point in quoting the standard deviation of wOBA then in this case.  They’re all going to be so very close from the standard equation, that all you need is the number of PA.


#7    MGL      (see all posts) 2011/10/21 (Fri) @ 19:40

"Let me know how you want the breakdown (groupings), and I’ll generate it:”

Not sure what you mean.  I’m fine with the groupings.  I was just hoping for some categories and the batter and pitcher pools for sure.  In fact, I’m not sure how you can do any kind of analysis without knowing at least the pitcher pools.

And you would surely need the batter pools, for example, to compare the first inning with any other inning, since the first inning should have the best batter pool…


#8    Tangotiger      (see all posts) 2011/10/21 (Fri) @ 22:52

For the first inning, the pool for home and away should be pretty identical.  After all, Jimmy Rollins is going to bat in the first inning whether he’s at home or on the road.

I’ll email you what I have, and see if you can work with that.


#9    MGL      (see all posts) 2011/10/21 (Fri) @ 23:20

Sure. I was referring to comparing certain innings to other innings.  For that, you obviously need to adjust for batter and pitcher pool.

Anyway, I can’t do any research for a while, as I am packing up my stuff for my annual journey west…


#10    Tangotiger      (see all posts) 2011/10/24 (Mon) @ 16:52

MGL: still working on getting you everything.

***

I’m wondering: can we figure out if we have alot of unintentional-intentional walks based on the number of pitches thrown in that walk?  The typical walk gets 5.5 pitches (maybe 5.6 these days).  The IBB is obviously very close to 4.0 (I think it was 4.1).

So, if in the situation where the walk rate jumps substantially (tied game, bottom of 9th, bases empty), if we see that the average number of pitches per walk is much less, maybe that’ll explain it.

For example, if the walk rate should be 8%, but we observe 10%, maybe it’s actually 8% as regular walks and 2% of the more intentional variety.  So, if we have 5.5 pitches per regular walk, but 4.5 pitches per “pitch around walk”, we’ll see 5.3 pitches per walk, instead of the neutral 5.5 pitcher per walk.

At this point, I don’t have the capacity to look into this, but I’d love it if someone else did.


Page 1 of 1 pages


Name (required)
E-Mail (optional; WILL be published)
Website (optional)

<< Back to main


Latest...

COMMENTS

May 25 13:18
Do pitcher’s reach back for velocity when needed?

May 25 13:04
“Why Kickstarter works”

May 25 12:51
Chad Curtis

May 25 12:40
Largest demonstration in Canadian history?

May 25 11:32
Howard Stern

May 25 11:26
Lack of hustle during a game

May 25 11:22
What sabermetrics is NOT

May 25 10:58
Rooting for laundry

May 25 02:38
NFLPA lawsuit against collusion

May 25 01:43
Neal Huntington’s best moves