THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

Filter posts by...

 

Scouting

Friday, December 09, 2011

Velocity as a scouting variable

By Tangotiger, 11:23 AM

Matt follows up John Mayne’s article from almost two years ago, with his article.

I’m not really surprised by the results.  However, I’d like to correct Matt’s interpretation when he says things like:

Not only do pitchers who throw faster succeed more often, but they improve more as well.

Let’s go back to what ERA and component ERA (like FIP) tells us: it is an INFERENCE of their true skill, based on the OBSERVATIONS available.  That’s all these things are.  If all I know is his K/PA, then I can infer how good he is, with a certain confidence interval.  If I know his BB and HR and BABIP, then I can infer better.

So, if you have two guys who have an identical K, BB, HR, BABIP rates, but one also happens to throw 95mph and the other throws 90mph, then we can INFER that the harder thrower guy has less good luck in his ERA than the 90mph guy.  It’s more real for the harder-thrower than the softer-thrower.

The job of the saberist is simply to figure out: how much more real.  That’s the only job of the saberist, to infer as best he can, given the observations at hand.

Now, if the softer-thrower actually locates his pitches better, then that’s another variable to consider.

But, the important point is this: ERA is simply an observation, and observation that has a certain amount of good luck and bad luck.

It’s more obvious with things like steals per SB attempt.  If one guy can run from 1B to 3B in 6.2 seconds, and another guy can run that in 8.2 seconds, and both guys have an 80% SB success rate, guess which guy got luckier?  Now, obviously, you want to measure things like jump and reading-the-pitcher, etc.  But, given a large enough group of players, say 20 in each, those kinds of things will likely cancel out.  So, you’d bet on the faster guys to have a better success rate (since this will presume “all other things equal").

And, hard-throwers are more likely to perform better than soft-throwers, all other things equal.

(6) Comments • 2011/12/09 • SabermetricsPitchersScouting

Tuesday, November 01, 2011

Fans Scouting Report: players on multiple teams

By Tangotiger, 10:03 AM

Below you will find the results of the 26 players who played on more than one team in 2011, and who received at least 5 votes from each of their teams’ fans.  Wilson Betemit for example received an identical (overall) score from both the Royals fans and the Tigers fans.  You will see broad agreement for most players. 

Those who didn’t get much agreement either points to a gap in the methodology, or perhaps captures some dissatisfaction with that player.  Colby Rasmus for example probably embodies this the best, with a low score with the Cards fans (in 2011) of 44, while he got a 63 with Jays fans.  Last year with the Cards, his fans gave him a 57, and in 2009 he was a 71.

Read More

(2) Comments • 2011/11/01 • SabermetricsScouting

Fans Scouting Report 2011: final week

By Tangotiger, 09:28 AM

This is the last week to submit your ballots for the Fans Scouting Report.

http://tangotiger.net/scout/

(1) Comments • 2011/12/01 • SabermetricsScouting

Saturday, October 08, 2011

Moneyscout

By Tangotiger, 08:25 AM

Looks more like quantitative analysis at the scouting level.  Good stuff.

Helping determine who those players are is a major part of Weisbrod’s job as assistant GM of player personnel. A Harvard University English grad with a fondness for the classics, Weisbrod played hockey, but a shoulder injury ended his minor pro career. He found administrative work in the IHL, where he ran the Orlando Solar Bears. The Bears owner, Amway co-founder Richard DeVos, liked Weisbrod so much he named him president of the NBA’s Orlando Magic.

Unfortunately for Weisbrod, he got off to a bad start with Magic fans. Instead of drafting Emeka Okafor, he went for Dwight Howard fresh out of high school. The result was garbage bags filled with hate mail, a sampling of the hostility that followed when Weisbrod traded Tracy McGrady to the Houston Rockets. For that, Weisbrod received hand-delivered death threats at his home.

“[McGrady] was one of the most talented players in the league, very popular, but I came to the conclusion he didn’t have the internal fortitude to win a championship,” Weisbrod said. “I went to the ownership and said, ‘He can be Robin, not Batman.’ The FBI moved me out of my house to a hotel under an alias [because of the public’s anger].”

Weisbrod resigned in 2005, returned to the NHL and later joined the Bruins, where he was hired as a scout by former Harvard teammate Peter Chiarelli. Impressed by Weisbrod’s résumé, Feaster got permission from the Bruins to talk to him. That was before the 2011 playoffs began. Once the Stanley Cup was secured, Weisbrod listened to Feaster and saw another chance to redefine a franchise.

(2) Comments • 2011/10/08 • SabermetricsScoutingOther SportsBasketballHockey

Thursday, September 29, 2011

Do teams put the better fielder at 2B or 3B?

By Tangotiger, 03:29 PM

A few years ago, MLB was filled with great fielding 3B.  Lately, however, things have started to change back.  I looked at the Fans Scouting Report, to see how the fans evaluated their team’s 2B and 3B.  With only three teams did the fans strongly prefer the fielding talents of their 3B over their 2B:
Cleveland Indians
St. Louis Cardinals
Atlanta Braves

On the flip side however, there were 9 teams that strongly preferred their 2B to their 3B.

And how about on offense?  Well, the shift has been so dramatic that the average 2B is a slightly better hitter than the average 3B!  Just a few years back, MLB not only had the better fielder at 3B, but also the better hitter at 3B.  Now, in 2011?  A better fielder at 2B and just a shade of a better hitter at 2B, too!

***

How about LF/RF?  I noted in the past that the better fielder was by far in RF than LF.  Is this still true in 2011?  Only 1 team had the clearly better fielder in LF than RF (Yankees).  On the flip side, there were 13 teams with the better fielder in RF than LF (with 8 of them being in the NL, a league that doesn’t have a DH, and so, may use LF as a DH-like spot).

How about hitting?  Not only is the RF a far far far better hitter than the LF (average RF is a slightly worse hitter than the 1B), but EVEN THE CF is a (slightly) better hitter than the LF.  The LF in 2011 is clearly the spot where teams “hide” or otherwise “play with” their players.

Never do what others have done in the past, and that is to “zero out” the stats such that the average LF (offense + defense) is considered equal to the average CF or average RF.  This is clearly the wrong thing to do.

(10) Comments • 2011/09/30 • SabermetricsFieldingScouting

Globe Glove winner 2011: Troy Tulowitzki

By Tangotiger, 10:24 AM

Congratulations to Troy Tulowitzki, on being voted the best fielder in baseball, by the hardcore fans who visited Tangotiger.net.

Voting will remain in effect until shortly after the World Series.

(0) Comments • • SabermetricsScouting

Tuesday, September 27, 2011

Fans Scouting Report 2011: hotseat: Dodgers

By Tangotiger, 09:24 AM

I’m going to make one post like this a day, focusing on a team that requires more participation.

Today, it’s the Dodgers.  If you follow the Dodgers, have a Dodgers forum you frequent, know where the Dodgers fans congregate, then send them my way.  Or post a link below, and I’ll post in that forum.

http://www.tangotiger.net/scout/

(0) Comments • • SabermetricsScouting

Monday, September 26, 2011

Fans Scouting Report 2011: hotseat: Nationals

By Tangotiger, 04:11 PM

I’m going to make one post like this a day, focusing on a team that requires more participation.

Today, it’s the Nationals.  If you follow the Nationals, have a Nationals forum you frequent, know where the Nationals fans congregate, then send them my way.  Or post a link below, and I’ll post in that forum.

http://www.tangotiger.net/scout/

(1) Comments • 2011/09/26 • SabermetricsScouting

Monday, September 19, 2011

Fans Scouting Report and the position-neutral aspect

By Tangotiger, 03:38 PM

While I have no qualms with his basic point, his conclusion misses the larger point.  If it is impossible for Jason to evaluate Mark Ellis and Luke Scott’s throwing, then this only invalidates the results of the data if you end up comparing Mark Ellis to Luke Scott.

HOWEVER, and this is important, this does NOT invalidate comparing Ellis to Brandon Phillips.  If let’s say everyone is having a hard time following the instructions, then this bias applies to all secondbasemen, to some similar degree.  Which is why getting 20 or 30 evaluators for each player is important: if my instructions to insist on position neutral is too complicated, then those instructions are NOISE.  (Random noise, within each position.) And how do your counter random noise?  With sample size.

So, it works on two levels:
1. If you believe that fans can go the job on a position-neutral sense, then you get great results.
2. If you don’t believe they can do that job on a position-neutral sense, then random noise of the instructions is reduced by sample size, and you get great results (if you stay within position).

Jason ignores the FSR for whatever reason.  But, if you actually look at the results, are you left scratching your head thinking “that’s totally off”?  No, you don’t.  Well, you shouldn’t in most cases.  Indeed, I’ve asked a few teams in the past to evaluate their players (and they do follow a position-neutral aspect to it, as they should).  And guess what?  They always say the same thing: most of it looks really good.

(1) Comments • 2011/09/19 • SabermetricsFieldingScouting

Tuesday, September 13, 2011

Draft Order and Major League Pitching Performance…

By , 03:22 AM

I know there has been similar research published on the web, but I am too lazy to look it up right now.  I took the 1998-2010 draft list from BA and looked at how the various pitchers did in the major leagues, breaking the rounds down into various buckets.  It was a quick study and I just matched names from the draft lists with my major league databases.  I probably missed 3-5% of the players because the names, especially first names, did not match up exactly.

First I looked at the rookie years for all pitchers in each draft round 1-3+, for a total of 4 buckets. Each round included the supplemental rounds, so, the first round actually has 60 picks in most years.  The rest of the rounds pretty much have 30 picks each.  Perhaps I should consider the first 30 picks of the 1st round as the 1st round and then the next 30 picks in the 1st round supplement as the 2nd round, etc.

Does anyone know if there is anything special about the supplemental round after the first 30 picks or is it just the next 30 best players and then the second round is 61-90 best players?

For each pitcher, I looked at whether their primary role in their rookie major league season was as a starter (S), a reliever (R), or mixed (N).

The currency I used was my normalized, component ERA (nERC), which is an “ERA” based on a pitcher’s raw stats (s, d, t, hr, nibb, hp, outs, and wp) adjusted for park, opponent, and defense, and normalized to his own league, where the average pitcher in each league, weighted by TBF, is 4.00.  I weighted the aggregate nERC in each bucket by each pitcher’s IP.  If I used the simple average of all the pitchers (weighting each pitcher exactly the same regardless of how many IP they threw), the numbers would be much higher, as the pitchers who had the worst true talent generally had the fewest IP.

Read More

(9) Comments • 2011/09/13 • SabermetricsMLB_ManagementPitchersScoutingTalent_Distribution

Monday, September 12, 2011

On Defense: Subjective Data, Objectively Considered

By Tangotiger, 04:10 PM

Great piece by Keith Isley.

Sabermetrics evaluates a player’s “true” abilities as statistical regularities found in large samples. Defensive evaluation is the one aspect of the game that has been resistant to the statistical approach, at least so far. The customary sabermetric solution to this problem is to get more data and treat it with stronger mathematics, which is exactly what the advanced PBP-based defensive metrics do. Sometimes, though, progress requires not simply more data and better processing, but new methods that produce new observations that lead to new abstractions.

Fortunately, there is an established (if not well known) framework for the objective and rigorous analysis of subjective data. It’s called Q methodology, and it’s one of my favorite tools for understanding a phenomenon from a different perspective. Q provides instrumentation (the “Q sort”) for the quantification of subjectivity and a technology (factor analysis) for data reduction and interpretation.

A Q sort ranks orders subjects by a series of relatively subjective variables (such as those typically provided by baseball scouts) and the factor analysis uncovers the commonalities among those subjective elements. The result is a better understanding of what it is that makes a fielder great. Let me run through an example, and I’ll put the more technical details in a footnote.

Note: that article was from Feb, 2006.  I took a two year blogging hiatus between Apr, 2004 (my old blog at Primer) and Jun 2006 (when I started blogging here), between which, I worked on The Book.  I confined my comments to threads at the old Fanhome/Scout otherwise (for all intents and purposes, inaccessible now). 

So, I happen to run into the above article, and, since there’s no active thread for it, I figured it would be a good one to link to.  So, if you happen to see me link to more old pieces between those two dates, will, just treat it as something new.

(0) Comments • • SabermetricsFieldingScouting

Tuesday, August 23, 2011

Scouting Report 2011 - By the Fans, For the Fans

By Tangotiger, 08:24 PM

The ninth annual ballot is now ready!

This is my single favorite project that I’m involved in, and its success is completely dependent on your participation.  Your help benefits everyone out there.

http://www.tangotiger.net/scout/

And help spread the word on your blog!

(15) Comments • 2011/08/29 • SabermetricsScouting

Wednesday, May 11, 2011

Video scouts: saberists that roll up their sleeves and watch a game… and a speadsheet

By Tangotiger, 10:36 AM

Perfect, just perfect.

Glove-slap: Neil.

Saturday, March 12, 2011

Greg Maddux scouting report - when he was in high school

By Tangotiger, 12:59 PM

This is why we need scouts

Glove-slap: Mike Fast.

(16) Comments • 2011/03/16 • SabermetricsHistoryMinors_CollegeScouting

Monday, March 07, 2011

Sabermetric see-saw

By Tangotiger, 09:06 AM

Works for me:

“Jerry DiPoto probably made the best analogy I’ve heard as far as how to describe the balance between traditional scouting and statistics. He said that it’s almost like you want to imagine a seesaw. On one end of the seesaw is your pure statistics, your pure analytics, and on the other end is your pure traditional scouting where you’re using your gut and what you see to make your call. As evaluators, we decide where we want to stand on that seesaw.

(0) Comments • • SabermetricsScouting

Thursday, February 24, 2011

Forecasting the past

By Tangotiger, 03:59 PM

Buster, wanting to forecast the past, must have talked to a dozen scouts, until a minority of two said what Buster was hypothesizing, and then presented the view of those 2 (and to the reader it’s 2 out of 2, as Buster is obviously not going to report on the 10 that disputes him ):

A couple of scouts say this: they saw wainwright’s arm angle dropping down the stretch last year, a sign of trouble.

This is called confirmation bias.  But, this is 2011.  And last year is 2010.  The data is so easily available.  And Jeff looked at it.  And Mike looked at it.  And they report that the data shows nothing of the sort.

Why not hypothesize that his speed was down, his break was less, his pitch selection less varied, and the time between pitches is longer (all of which are legitimate hypothesis), and then… ask a couple of scouts?  Why would you need to ask scouts about counting numbers?  You ask the number counters instead.  You test your hypothesis. You don’t assert it with selectively sampling the scouts who happen to agree with you.

I love scouts.  I’d hire at least 10 scouts for every one saber person.  It’s guys like Buster Olney that gives scouts a bad name. 

(19) Comments • 2011/02/27 • SabermetricsScouting

Thursday, February 10, 2011

Yet another article that “samples” is not “true”

By Tangotiger, 10:46 AM

Carson does something that I’ve been hoping to see someone do for the longest time.  What he does is figure out the surplus value of a player drafted, and link that surplus to the scout who signed him.  So, if you signed Tulowitzki, you look like a genius.  But this is a sample.  It still is not something true, something real.

After all, I presume all thirty teams had Tulo ranked somewhere between #1 and #10.  How much does it help us to give the 50MM$ surplus (or whatever it was) to the Rockies’ scout who drafted him #7, and 0$ surplus to the other 29 scouts who also ranked him quite high?  In that same draft, Alex Gordon was drafted #2.  I don’t follow the college scene, but presumably all the teams had him ranked pretty high as well.  Do we slap down only the Royals for drafting him, because they are the ones that bought the Alex Gordon lottery ticket, even though the other 29 teams ALSO wanted to buy that ticket?

If you look at it as a sample, then you can say, yeah, the scout won the lottery ticket, that’s the money he made.  But, you have to look at it from a true talent perspective.  Perhaps you need to regress 99% of what you see from Carson’s process.  Scout Bill Buck is credited with 74MM$ in surplus.  Perhaps his true value is 740,000$, and the rest was his good fortune for having exclusive dibs on players. 

The problem with the process is the exclusivity of it, the binary outcome: did he, or did he not draft that player.  In many respects, it’s like looking at a player’s single game, where he puts the ball in play 4 times, and we’re trying to figure out what the true talent of the player is based on the observations of these binary outcome based on whether the batter was safe or out.  Because, in this case, you would also regress what the batter did in 4 contacted PA 99% toward the league mean.  On the other hand, if you had say his launch parameters (how hard he hit the ball, the spray and vertical angles, the spin imparted), then those 4 PA might regress 95% toward the league mean.

Today is sample-is-not-true day!

(9) Comments • 2011/02/11 • SabermetricsScoutingStatistical_Theory

Wednesday, February 09, 2011

Paralysis by Analysis

By Tangotiger, 04:41 PM

Ken speaks.

(7) Comments • 2011/02/10 • SabermetricsScouting

Wednesday, February 02, 2011

Baseball America on 17yr old Felix

By Tangotiger, 11:44 AM

This is why scouting is critical:

Strengths: Hernandez has scary upside. He’ll open this season as a 17-year-old and he doesn’t need to develop any more stuff. The only guy in the organization with a comparable arm is big leaguer Rafael Soriano. Hernandez has the best fastball in the system and commands his mid-90s heat well. He regularly touches 97 and could reach triple digits as his skinny frame fills out. Hernandez’ curveball is also unparalleled among Mariners farmhands and gives him the possibility for two 70 pitches on the 20-80 scouting scale. Though he’s young and can easily overpower hitters at the lower levels, he understands the value of a changeup and is developing a good one. He can pitch down in the strike zone or blow the ball by hitters upstairs. He has poise and mound presence beyond his years.

Weaknesses: Hernandez just has to learn how to pitch. He needs to tweak his command and refine his pitches. Typical of a teenager with a lightning arm, he’ll overthrow at times but should grow out of that. Arm problems would appear to be the only thing that could derail him from stardom, and Hernandez has been perfectly healthy so far. The Mariners will go to great lengths to make sure he isn’t overworked in the minors.

The Future: Seattle wants to move Hernandez slowly, but he may not let that happen. He’s not going to need to spend a full season at each level and might need just two more years in the minors. He’ll probably start 2004 back at low Class A Wisconsin—the Mariners concede he could have spent all of last season there—and could be bucking for a promotion to high Class A Inland Empire by midseason. It’s easy to get overexcited about young pitchers, but Hernandez has the legitimate potential to become the best pitcher ever developed by the Mariners.

Put simply: if you only look at performance outcome results, you can’t get a report to read like that.  That’s because with limited performance data and against competition that is unlike MLB, you are going to heavily regress.  That’s why PITCHf/x is the goldmine, as it’s the one thing that can merge scouting observations with performance results.

At the same time, I can’t just look after-the-fact of this fantastically glowing report, see that Felix exceeed expectations, and determine that scouting “is all that”.  You have to look at ALL the glowing reports of 17yr old pitchers and see how often they hit and how often they missed.  Then we can say how much impact scouting can have.

This is similar to a statistical-only analysis of saying “hey look, I nailed [whoever… Latos, Strasburg, Weaver, Gooden, etc]”, but then ignoring all the others you ranked highly but didn’t nail.  You can get great forecasts for these pitchers by regressing only a little, but then that simply means you are going to include alot of pitchers that shouldn’t be there to begin with (and you’ll be saved by having the MLB managers not pitch those guys and thereby removing them from the sample!).

That’s why Marcel does so well by simply saying: “all minor leaguers will perform at league average”.  Because historically, this is close to true (90% of league average or so).  But, that’s an after-the-fact view, because managers have selected which players they think will perform the best based on in part, you guessed it, scouting reports.

PITCHf/x, FIELDf/x, HITf/x will eventually remove all this doubt, and finally put us in a position where scouting and statistics can converge to a single common point.  Just a matter of time until MLB will create its own academy league so that they can put in the f/x system in parks all across the country.

(8) Comments • 2011/02/02 • SabermetricsScouting

Wednesday, November 17, 2010

Stats v Observation

By Tangotiger, 05:55 PM

Excellent article, especially for PSYC 101 readers.  However, I think this can be worded better:

The human brain just doesn’t do this naturally, especially over a large number of events and/or a long period of time. So even if he could observe a prospects performance through a completely rational framework, he’d be unable to properly couch the prospect’s level of performance in the context of his peers without the use of quantitative methods.

I don’t know what “quantitative methods” actually means.  Is it simply to assign a numeric value to a subjective impression?  Because all scouts do that, as evidenced by their 20-80 scale.  Words accompanies those numbers, but those numbers are a “shortcut” for what they want to say.

Or is he trying to say something else?

(5) Comments • 2010/11/18 • SabermetricsScouting
Page 1 of 5 pages  1 2 3 >  Last »

Latest...

COMMENTS

Feb 11 04:03
MGL: Today on Clubhouse Confidential

Feb 11 04:02
Reader Mail of the Day: Why do we need X years of fielding data?  And what about outliers?

Feb 11 02:12
Performance through the ages

Feb 11 02:10
Dwight Evans

Feb 10 23:01
For Your Soul

Feb 10 21:07
Hero of the month: Brittney Baxter

Feb 10 18:32
Moneyball at Villanova

Feb 10 17:00
Psst… wanna intern in Canada?

Feb 10 15:01
New PECOTA

Feb 10 14:28
Win expectancy charts used in football… in 1983!

THREADS

February 10, 2012
Jose Molina

February 10, 2012
Reader Mail of the Day: Why do we need X years of fielding data?  And what about outliers?

February 10, 2012
Performance through the ages

February 10, 2012
Hero of the month: Brittney Baxter

February 10, 2012
Win expectancy charts used in football… in 1983!

February 10, 2012
Dwight Evans

February 09, 2012
Psst… wanna intern in Canada?

February 08, 2012
Moneyball at Villanova

February 08, 2012
MGL: Today on Clubhouse Confidential

February 08, 2012
New PECOTA