THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews
If you are a media member and would like a review copy of The Book, please contact Kevin Cuddihy of Potomac Books.

Buy The Book from Amazon

MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Wednesday, January 31, 2007

Is Vladimir Streaky?

By Tangotiger, 01:38 PM

Buster Olney quoted a scout:

“He’s on the slide,” said one scout. “He’s turned into much more of a streak hitter than he used to be. It used to be that if you tried to pitch him inside and you didn’t bury the ball inside, he’d hit it good. Now you can get away with a little more, and I think it’s because there are days when his back doesn’t feel so good. He seems to go through periods where any tweak in his back affects his swing.”

Is this true?  Here’s what I did:


I start with his gamelogs, and did two calculations: one was his linear weights, and another was simply marking the game as “good, typical, bad” (22% of the games were good and 21% were bad).  I then looked at the average score of his next 5 games.  What did I find?

This is his best 5-game streak and this was his worst.  (Thank you Sean Forman!)

The standard deviation of all his next-5-game LWTS was 0.44 runs.  Is that typical?  Darned if I know.  His next-5 game “good, typical, bad” was 0.296.  Is that typical?  If he was not streaky, if all his performances were random, we would have expected a standard deviation of 0.293.  So, we have some evidence here that Vlad is not streaky any more than any other hitter.

Another way to test it is to simply throw all his games into a bucket, and pick them out one at a time.  For example, the first six games of this random season are: 86, 91, 155, 151, 85.... .  The standard deviation of the next-5-game LWTS was 0.43 runs, and the next 5-game “good, typical, bad” was 0.29 units.  If I redo the exercise, I get: 0.44 runs, 0.28 units.  Another random ordering gives me: 0.40, 0.30.  And another gives me: 0.37, 0.28.

You bored too?  Yeah, me too.

Let me try a 10-game streak instead.  Our expected random standard deviation of “good, typical, bad” is .207 units.  Vlad’s actual is .227 units, a bit streakier than expected.

What if I repeat the random game exercise?  I ran the exercise 100 times, and he exceeded the .227 level 18 times.  That is, 18% of the time, we expect Vlad’s streakiness to have a level of .227 or higher, just by luck.

Seems to me that Vlad is as streaky as any other player.

#1    Los Angeles Waterloo of Black Hawk      (see all posts) 2007/01/31 (Wed) @ 15:25

Where the notion of Vlad’s streakiness comes from is his propensity in the last couple of years to just fritz out for a month or two.

Looking at last year:

Season OPS:  934
Apr:  820
May:  976
Jun:  665
Jul:  989
Aug:  1176
Sep:  946

His worst month was 269 points below his seasonal average, and 511 points worse than his best month.

2005:

Season OPS:  959
Apr:  996
May:  697
Jun:  1274
Jul:  640
Aug:  1047
Sep:  1064

His worst month was 319 points worse than his season, and 634 points worse than his best month.

2004:

Season OPS:  989
Apr:  972
May:  1021
Jun:  1006
Jul:  837
Aug:  916
Sep:  1150

Here, his worst month is only 152 points worse than his season and 313 points worse than his best month.

So there’s a bigger gap between his best and worst now than there was when he first came to Los Angeles of Anaheim.  I’m not saying that means anything (or that it doesn’t, I don’t know)—but that’s where the perception of his new-found “streakiness” is coming from.


#2    tangotiger      (see all posts) 2007/01/31 (Wed) @ 16:04

Ok, let’s extend it to 20 games then.  His standard deviation is .174 units, compared to a random of .147.

In that case, only 8 times in 100 random sequencing did I get a standard deviation above .174.

So, maybe there is something there.

If one of the math-heavies can recommend a more thorough test, let me know.


#3          (see all posts) 2007/02/06 (Tue) @ 10:22

Couldn’t you just borrow the methodology found in “Curveball”?  Pick a player and calculate his rolling game-by-game average of whatever metric you choose (they used BA but linear weights would obviously better, IMO).  Then compare that to a randomly distributed game-by-game performance of a fictional player with the same rate stats.  I did something similar below, when I was looking at the perennially streaky Geoff Jenkins:

http://photos1.blogger.com/blogger/7391/569/400/jenkins.0.jpg

I ran “robot Jenkins” 10,000 times and then compared the results to the real Jenkins.


#4    tangotiger      (see all posts) 2007/02/06 (Tue) @ 11:29

The rolling average is what I did, and I did 5, 10, and 20 games.

The random game-by-game fictional with the same rate stats is similar to what I did.  I chose the same player, and just randomized his games. 

With the 5 and 10 game sequences, it doesn’t look like Vlad is streaky, while the 20 game shows signs of streakiness.  But, that may have simply been luck of the draw.  After all, why 20 and not 12 or 32?  If I keep trying different numbers for different players, and select after-the-fact, I’ll get non-random results.


#5    rluzinski      (see all posts) 2007/02/06 (Tue) @ 12:14

"I chose the same player, and just randomized his games.”

Well, if you do that with a .300 hitter that batted .400 for half a season and .200 for the second half, you are just mixing up already curious numbers.  It just seems better to me to use a theoretically non-streaky player as a base.

Of course, since we are trying to quantify steakiness but don’t exactly know what it is, this whole exercise is troublesome.


#6    Tangotiger      (see all posts) 2007/02/06 (Tue) @ 12:46

I’m not mixing up “.400” and “.200”.  I mixing up game-by-game.  So, 5-5, and 0-4, and whatnot.  What I’m doing is, using as a given, each day’s performances.  If he’s more likely to go 5-5 than some other great hitter, that’s fine.  I’m only counting games as “plus, average, minus”.  So, I’m not differentiating between a 5-5 or 3-5 game.  They are both pluses.

The “streak” is: does he piece together great games together more than random would dictate.

Even if I do it by LWTS, I’d be surprised if Vlad’s plus games have a different spread than other great hitters’ plus games.


#7          (see all posts) 2007/02/06 (Tue) @ 13:10

"So, I’m not differentiating between a 5-5 or 3-5 game.  They are both pluses.”

Got you.  I still like the “Curveball” method better but personal preference, I guess.

“Even if I do it by LWTS, I’d be surprised if Vlad’s plus games have a different spread than other great hitters’ plus games.”

So would I, but I was also suprised that the Jenkins has been so consistently inconsistent the last few years.  I looked at him to try and dispel the fan “myth” that he was inconsistent and failed miserably. smile His 2005 season was absurd and he atcually did better as the nagging injuries mounted (had a broken bone near his hip, IIRC).  His 2006 season proved to be no less inconsistent.


Page 1 of 1 pages


Name (required)
E-Mail (optional)
Website (optional)

<< Back to main


Latest...

COMMENTS

Nov 20 01:43
Sabermetric Moves of the 2009 Pre-Season

Nov 20 14:38
Nate Silver: hero to interviewers

Nov 20 14:20
Marcel 2009 is here

Nov 20 13:42
Top Free Agent Pitchers

Nov 20 12:29
R.I.P. Tom Boswell, sabermetrician; P.A.L.L.(*) Tom Boswell, human being

Nov 20 12:27
David G. checks in again on whether experience matters in the post-season

Nov 20 10:42
Offense by position groups by decade

Nov 20 02:01
My 1B is better than your 1B

Nov 20 00:26
MLB logo

Nov 19 23:03
NBA’s Marcel