Thursday, April 26, 2007
MGL Archives
For those new around here, you’ll enjoy some fantastic old articles by MGL:
DIPS Revisited
UZR Part 1, Part 2 (includes extensive methodology)
Super LWTS Part 1, Part 2, Part 3
Buy The Book from Amazon
For those new around here, you’ll enjoy some fantastic old articles by MGL:
DIPS Revisited
UZR Part 1, Part 2 (includes extensive methodology)
Super LWTS Part 1, Part 2, Part 3
Someone is asking for more feedback on his work.
Is this cool or what? It’s by Joe Sheehan, but not THAT Joe Sheehan. If he keeps doing work like this, the BP Sheehan will be introduced as “not THAT Joe Sheehan”.
Wouldn’t we need to look at the quality of the pitchers in both groups? The cutoff seems to ensure we are at least only looking at starting pitchers. I also don’t know that the “ratio” is what we want: I would rather have seen the before and after totals, instead. The actual list of pitchers might have been interesting as well, as maybe some were clumped in the 70s, a time period when pitchers threw alot more than before or after.
We find that non-breaking ball pitchers throw, on average, 382% more pitches after turning 23 than before, while breaking ball pitchers throw 311% more.
Schmanzy-fancy acronym aside, this is the way to look at it. And, if you include the count, this is exactly how I look at it, which alllows you to generate a Markov for each pitcher and hitter. That is, rather than relying on the 2000 historical PA for a pitcher or batter, you rely on their 7500 pitches to create a much stronger profile as to exactly how they hit and pitch. You remove tons of luck this way. You can further add parameters to your Markov model by also considering base/out and inning/score, as well as two-pitch or three-pitch sequence state, rather than the current count. And if you want to get even more creative, you include the profile, or identity, of the opponent.
The only thing missing from the article was the actual data itself. All we see was one example.
Yet another in the category of insightful, easy-to-digest, but heavily-researched articles. Lots of great researchers out there who roll up their sleeves to bring great ideas to the masses. This is what sabermetrics needs.
When you get data into the hands of smart, resourceful people, you get answers. And boy, did we get answers:
Yet another in my favorite category (straightforward, well-researched, insightful) is how often does a team take the platoon advantage. Andy showed us that teams will take the platoon advantage 78% of the time with the pinch hitter. DanAgonistes adds to that by showing relievers coming in with the platoon advantage 62% of the time. He then further breaks it down by when a manager is consciously trying to take a platoon advantage with the reliever (close game, non-ace reliever) to get a total of 66%. Which got me thinking:
John Walsh wrote a great piece about OF arms in the Hardball Times Annual. It’s one of those great “well-presented, intensely-researched, easy-to-grasp” articles that I like. You don’t have to get the math-mumbo-jumbo to understand what’s going on. John is one of my favorite sabermetricians, and for that reason, I hold him to a higher standard. In his article:
I don’t have anything to say at the moment, but I did notice something interesting with him. In the last three years, the umpire has called “ball” on him 1125, 1117, and 1122 last year. How many called, swinging or in-play strikes? 2509, then 2309, then 2137. He’s still got good numbers, going from a Schilling-like 69.0% to 67.4% to 65.6%. I don’t know if it means anything, but I thought that was interesting.
Rich Lederer gives us a great graphical presentation of pitchers’ GB and K rates. If he could have figured out how to do a three-dimensional chart to include the walk rate, that would have been perfect. Otherwise, I would have suggested K minus BB rate instead (I think Guy first suggested it, and I’m on board with it). It is almost as good as FIP and nice and clean.
I meant to criticize Jeff Sackmann’s recent article, which selects the top 5 starters on each team, after the fact, but Fifth Outfielder did it clearer and better than I would have. You *must* select before the fact.
Eric Van says:
I calculated the total CluRat for 2002-2005 and compared it to CluRat for 2006, for the 27 guys who pitched 20 or more innings in 2006. Would you expect CluRat to have any predictive value? The correlation of 2002-5 CluRat to 2006 CluRat is .47, hugely significant at p = .01. ... The worse CluRat for 2002-5? Rudy Seanez. The worse for 2006? Rudy Seanez.
His long equation is really this:
(IP * 5.00 / 9 - WPA / pLI * 10.81) / R
where the first part, IP*5/9, simply gives you the league average runs, and the second part, the WPA*11/LI tells you how many runs better than average he was. It’s possible that what we’re seeing though is a bias in the equation. He uses 10.81 as a constant, when in fact it should differ. However, he is likely saved in that a relief pitcher only pitches 1 or 2 innings, and therefore, doesn’t perform long enough to effect his run environment enough to change the runs per win value of 10.81. But, maybe I’m wrong.
What’s great about the web is seeing instant feedback by people who follow in-depth the workings of their own team. Take for example, Horacio Ramirez and Rafael Soriano.
On one side…
The replacement level winning percentages for nonpitchers, and pitchers is .390 and .420. This makes the average nonpitcher +.110 above replacement, and the average pitcher is +.080 above replacement. This means that 58% of the value is in nonpitchers, and 42% in pitchers. In 2005, teams spent 59% of their payroll on nonpitchers. So, teams get it. They know how to split their money between nonpitchers and pitchers.
But, what about between starters and relievers?
Someone asked me if I would run a pitcher scouting report. A little background first…
Sabermetrics is the convergence of performance analysis and scouting observations. David scratches the surface. Tracking pitch-by-pitch, including count, pitch type, velocity, location, famigli of batters, pitchers, as well as tracking hit-location, millisecond-by-millisecond, including fielder positioning, is the holy grail. There is a mountain of haystack for us to find that needle. All we need is more researchers.
One interesting question is…
What happens when you take a guy who spends alot of time collecting and presenting data, and another guy who spends alot of time analyzing said data? You get great research, and potential for even more:
http://sonsofsamhorn.net/index.php?showtopic=12439
I will reiterate here what I said there:
From the beginning, and the gap has been widening. Here are some numbers to consider:
May 16 23:57
Now you frame it, now you don’t
May 16 23:47
Dodgers’ win reversed because Mattingly did not attest to proper score!
May 16 20:44
How to beat the shift
May 16 20:02
Sponsoring MLB jerseys
May 16 16:56
Did Manny Pacquaio actually quote Leviticus?
May 16 16:06
Does changing your pitch frequency lead to substantial change in results?
May 16 14:18
Extra Innings: One-minute review
May 16 14:16
This particular criticism of UZR is unfounded
May 16 13:21
Psst… wanna intern for the Astros?
May 16 12:23
Arena wars
THREADS
May 16, 2012
Now you frame it, now you don’t
May 16, 2012
Dodgers’ win reversed because Mattingly did not attest to proper score!
May 16, 2012
Does changing your pitch frequency lead to substantial change in results?
May 16, 2012
Sponsoring MLB jerseys
May 15, 2012
Andre The Hawk Dawson speaks
May 15, 2012
Euro 2012 Preview
May 15, 2012
How to beat the shift
May 15, 2012
Will Pujols end the season with at least 30 HR and .500 SLG?
May 15, 2012
Kershaw v Strasburg, part 2
May 15, 2012
Did Manny Pacquaio actually quote Leviticus?
Recent comments
Older comments
Page 18 of 342 pages « First < 16 17 18 19 20 > Last »Complete Archive – By Category
Complete Archive – By Date