THE BOOK cover
The Unwritten Book is Finally Written!
An in-depth analysis of: The sacrifice bunt, batter/pitcher matchups, the intentional base on balls, optimizing a batting lineup, hot and cold streaks, clutch performance, platooning strategies, and much more.
Read Excerpts & Customer Reviews

Buy The Book from Amazon


SABR101 required reading if you enter this site. Check out the Sabermetric Wiki. And interesting baseball books.
MOST RECENT ARTICLES
MAIL : You ask | We say

Advanced


THE BOOK--Playing The Percentages In Baseball

<< Back to main

Wednesday, January 20, 2010

A saberist reviewing a paper of economists reviewing saberists

By Tangotiger, 11:17 AM

Dave Berri and JC Bradbury wrote a paper called “Working in the Land of the Metricians”, whose purpose is to discuss “how economists can benefit from sabermetric advances while avoiding its pitfalls.”

I will give you my thoughts on the paper, as I read it.


Dave Berri and JC Bradbury wrote a paper called “Working in the Land of the Metricians”, whose purpose is to discuss “how economists can benefit from sabermetric advances while avoiding its pitfalls.”

I will give you my thoughts on the paper, as I read it.

Part 1

Specifically, how should economists react to the work and ideas of the nonacademic analysts who are often seeking to answer the same questions?

Excellent question.  Indeed, it’s probably the only question I have. 

Fundamental to the rest of the article,we observe that these communities of nonacademic sports analysts evolved outside the typical parameters of an expert peer-review system familiar to most academics.

Yes, good setup.  This is exactly what happened.

Some metricians accuse the academic group of intellectual snobbery—that is, deliberately snubbing valid research only because the authors lack credentials. Phil Birnbaum, a frequent critic of both of the authors of this article, summarizes this complaint…

That should read “Some metricians accuse SOME IN the academic group”.  Otherwise, yes.

Birnbaum (2006) considers sabermetricians to be ‘‘no less intelligent than academic economists’’ and superior to economists in their understanding of baseball.

True.  Baseball analysis is what we do.  If you have two groups of people, equally intelligent, and one spends far more time being subject matter experts, and the other spending more time learning technical skills, well, by definition, the subject matter experts are superior in their understanding of baseball.  And the techies are superior in their application of technical skills.

This statement reveals a curious worldview. On one hand, the aspect that is universal across both groups—members of both communities have been devoted sports fans since an early age—is considered unique to the nonacademic sports analysts.

No, this is not considered unique to saberists.  It seems that the authors think that just being a hardcore fan is what we think makes us superior.  That’s not correct.

On the other hand, when it comes to the aspect that is unique to academics—academia normally involves many years of advanced training and requires its participants to be judged competent by their peers in a ‘‘publish or perish’’ environment—metricians demand equal recognition. In our view, this mentality begets misplaced confidence.

Actually, we don’t demand equal recognition.  We don’t even demand recognition frankly.  What we would like is for some economists to realize they are still dancing disco, while the rest of us have grown up from that.  Perfect the electric slide all you want, and that won’t mean that we want to be your equals there.

...researchers seeking to study sports phenomena should proceed with caution when using informally vetted metrician findings

Agreed.

Often the merit of ideas offered by this community is judged by a consensus of pseudonymous avatars,

Ah, yes, the necessary jab at aliases.  In that one statement, the authors have insulted half the internet population, if not more.  Is it really relevant that “Tangotiger” gives feedback on sabermetric topic, as opposed to someone with two names, like “Tom Tango”.  How do we know if “Gabriel Desjardins” appears on his birth certificate?  Is this really relevant?  So this is a statement of pure bias on the authors part: “judged by a consensus of pseudonymous avatars”.  They could have said “judged by a consensus of extremely dedicated students of sports analysis”.  The authors deserve scorn for such a blatant editorial slant.

many of whom appear to lack training and experience with advanced research methods.

Guilty. 

But, the presumption being made here is that “advanced research methods” is somehow beneficial.  You can make an argument that such methods see the trees as noise to the forest.  In my opinion, being a subject matter expert is a far larger prerequisite for being a quality sports analyst than being a technical expert.  Indeed, if you were to list all the great sports analysis in the past decade, my view would be supported.

And look at the fertile ground that is PITCHf/x.  What is a non-subject matter expert to do with such data?  Purely a fish out of water.  The subject matter experts are the ones who will sort out how the data should be aggregated, interpreted, etc.  The technical expertise will rely on data mining and cluster analysis.  The regression jackhammer will have little use to those who need archeological tools.

Consequently, much of what passes for research among metricians would not pass through the academic peer-review process of any academic discipline.

True. At the same time, much of what passes for research among some academicians would not ass through the non-academic review process. 

While a specific academician might agree that the worst hitting outfielder in baseball could have a performance worth 12 million dollars, even the most casual baseball fan would laugh at such a result.  Indeed, while the baseball fan would say that your toaster is broken, this academician will reply that the toast is supposed to be burnt black on one side and stay white on the other.

That being said, researchers who validate metricians’ findings should acknowledge the origins of their approach.

Good.

Although we are critical of the hobbyist community’s informality

And to me, I PREFER the informality.  Things get done, things move fast.  Things get corrected quickly.

we must acknowledge that metricians are responsible for many important discoveries that should inform sports economists in their research. In fact, we have observed instances where economists have made elementary mistakes in understanding sports that would not have been made if they had paid attention to metricians’ findings. It is our hope that academic researchers using sports as their laboratory will review the analysis from all relevant outlets, including the metrician community.

Good.  If only I can believe it.  JC shows an enormous level of disdain and scorn for our work.  For example, replacement level.  Also, he only uses Dewan’s plus/minus, and proclaims it as the superior metric, when a) he’s made no effort to show why Dewan is better than UZR, and b) UZR is far more publicly accessible than Dewan.  Indeed, the rest of us see UZR and Dewan as pretty close to each other.

With these thoughts in mind, we now proceed to three specific questions:
1. What can the sports economist learn from the metrician research?
2. What issues arise when sports economists review the metrician research?
3. How should sports economists handle the less savory interaction aspects when we journey into the land of the metricians?

I haven’t read the rest of the paper yet.  I’m just cut/paste/comment as I read.  I’m only at the bottom of page 3.  Let me give you my answers, and let’s see what the authors say:

1. That the saberists have provided the bulk of the research, so that means that the sports economist needs to spend the bulk of their time looking at saberist work.

2. That there is no one single placeholder, that things evolve, that sometimes there are competing views.  It’s a bit of a mess.

3. Some sports economists go into the deep end of the pool where the subject matter experts know how to swim and dive, so respect that they know how to swim and dive without having actually been taught how to swim and dive.  Maybe we should be wearing tighter swimsuits and shaving our chest hairs and wearing a cap to improve the aerodynamics of swimming and diving.  We can accept that.  But, we’re already 95% of the way there.  You can help by adding the other 5%.  But, don’t tell us that we should go back to the wading pool and buying textbooks on how to swim and shaving our hair.  Or, in a word, don’t be a dick. 

Are saberists dicks too?  I’m sure some are.  But being a dick who can swim is better than being a dick who tells an experienced swimmer how to swim.

Speaking as a saberist that has posted on, literally, hundreds of sports sites, blogs, and forums, I can say that there is nothing unsavory in my dealings with any reader.  There are those readers who refuse to be educated, and so, to those readers, you bid adieu.  Others are sincerely interested in learning or teaching or interacting.  This is by far the largest segment of my interactions.  JC, on the other hand, by his own hand, has created the unsavory landscape that he finds himself in.  How is it that the rest of us can treat each other with so much respect, and yet JC finds himself, on site after site, as the bad guy?  Stop being the swimming coach telling Michael Phelps that he’s not swimming technically correct, even if he’s the best swimmer in the world.

That’s all I have time for now.  When I review the rest of the paper, I’ll put up another post.

Part 2

According to Turocy, though the added explanatory power of a regression is superior to some simple metrics, the coefficient weights suffer from omitted variable bias due to the correlation of included metrics with other unmeasured important aspects of the game. ‘‘Linear weights’’ is therefore a superior estimator of run production, because it does not suffer from omitted variable bias… This metric was originally developed by operations research analyst Lindsey (1963) and updated by sabermetricians Thorn and Palmer (1984). Linear weights estimates contributions to baseball events using play-by-play data to weight the run-generating probabilities for individual events; thus, its estimates avoid the omitted variable bias problem.

Yes, perfect, and exactly what I’ve been saying.  So, here, JC agrees with me that simply relying on the jackhammer regression, the one where the run value of a double is only +.15 above the single, is the wrong thing to do.  Good.  But he adds:

Nevertheless, the story does highlight how academics can learn from metricians.

Yes, perfect.  This should be the rule, not the exception.

Scully (1974, 1989) used strikeout-to-walk ratio to measure pitcher performance; however, Zimbalist (1992b)6 and Krautmann (1999) argue that ERA is a better measure of pitcher quality because ERA has greater explanatory power. ....(then Voros DIPS talk)… Consequently, we see Scully’s general approach is confirmed by a metrician, demonstrating what the nonacademic sports research community can contribute to sports economics research.

Yes, good.  DIPS is the kind of thing that only a subject matter expert would see.  And it’s exactly the kind of thing that should be used by the sports economist.  It’s not a good idea for them to try to create a metric when we’re the ones who have so much experience in doing just that.

Part 3
Berri now takes over:

Looking at team data in our same sample from 1987-88 to 2007-08—or 591 team observations—we see that a team’s NBA Efficiency per game explains 32% of the variation in team winning percentage.

I follow the basketball metrics debate from a high level view.  But, from what I can gather, it’s similar to what you’ll find in hockey: the contributions of individual players are not recorded as well as we’d like.  And when we try to create inferences at the player level, it kind of breaks down at the team level.  And, since the only validation we have is at the team level, then, in order to get the best correlation, it’s best to throw out anything you might learn at the individual level that decreases that correlation. 

Really, the argument is very similar to various run created metrics: we really don’t care how runs are created at the team level, as the individual hitter is what interests us.  But, to increase correlation, throw logic and some useful individual data, to best serve the jackhammer regression.

And after I wrote that, here’s a third-person statement that comes right after all that:

To put these results in further perspective, consider Wins Produced, a measure reported by Berri (2008).14 Wins Produced explains 94% of team wins.

But, that’s NOT the objective.  If that was the objective, why not simply take points scored by the team, divide by 48*5, and multiply by minutes played for each player.  And do the same for points allowed.  There, r=.999.

Consequently, these models are not well connected to winning and, therefore, are not useful in determining the economic value of players.

I can see why Berri is so not-liked by the basketball analyst community.  He engages them even less than JC does, and he positions his metric much higher than JC would dare (though JC does go there when he says things like Brandon Lyon, per inning pitched, is worth as much as Doc Halladay).  JC has the good sense to shut up about it, while Berri just keeps compounding his assertions.

So, Berri should actually take JC’s advice from the page before and learn from the analyst community as to what makes a good metric.  The irony is of course completely lost on Berri.

The purpose of tracking statistics is to separate a player from his teammates. The plus–minus measure, though, provides a player evaluation that fails to accomplish this objective.

I’m glad JC is my antogonist, because Berri is starting to get on my nerve.  Pure unadjusted plus/minus is crap, yes.  Bobby Orr for example was +124 in 1970-71, for the highest such mark since it’s been recorded.  Bobby Orr is also one of the three greatest hockey players of all time.  There is no doubt that the +124, while perhaps not in its degree, at least points to something real: Orr was good. That Bruins team was great, as they were probably around +160 to +170 as a team.  So, we can see that when Bobby Orr was not on the ice, the rest of the team was around +40.  So, Orr was surrounded by some good players, but certainly not a fantastic great group of players.  Clearly though, Orr was the man on that team.  It’s also not hard to see who his defense partner was: Dallas Smith who was +94.

So, in order to make sense of plus/minus, you really need to know who was on the ice when Bobby Orr was on the ice.  Bobby Orr, basically, pollutes the data.  If Orr was used equally with the other five defensemen, then that would have been fantastic.  And really, that’s what we really want: to try to isolate a player from the rest of his team. To deride plus/minus because it is simply a counting number is short-sighted.  Indeed, it shows the reason why an economist should let the saberist figure out how to create the individual metric.

Regressing a player’s plus–minus this season on his plus–minus last season reveals that the latter explains only 9% of the variation in the former.

Ack!  Again, it completely misses the point, that Berri would do this.  I can’t believe what I’m seeing here.  We should be thankful that JC is our baseball antogonist, because Berri here simply is not fighting fair.

When we complete a similar exercise for other statistics tracked for hockey players—such as shooting percentage, assists, goals, points, penalty minutes, and shots on goal—we see a level of explanatory power that ranges from 39% to 80%. In sum, every other hockey statistic tracked for skaters exhibits a far higher level of consistency across time. And this suggests that relative to plus–minus, every other statistic captures more of the player’s individual skill and less the happenstance of his teammates’ identity.

Really?  Big deal!  So, the number of shots a player takes is more indicative of him as a shooter than the plus/minus he shares with his teammates is of his plus/minus he will share with his other teammates next year?  Again, completely misses the point.

So, it appears that plus–minus is not a particularly powerful metric in hockey.

Enough.  You’re killing me here.

Again, the problem with plus–minus is that a player’s teammates can affect his measure. Consequently, as a player’s teammates change, a player will see his plus-minus value fluctuate.

Yes, this is a GOOD thing.  That shows that you can’t use plus/minus in isolation, that you need to treat the individual as part of a team.  There are 10 guys on the court when things happen, when he scores, when a teammate scores, and when an opponent scores.  And not just scoring, but turnovers, shots, fouls, etc.  The field goals and assists simply made you want to hope that things can be so easily separated, individual from team, that plus/minus really shakes you back to reality: you need to tweak out the individual contribution from the team.

Plus/minus forms the basis for everything you want.  Granted, it comes at a price: an uncertainty level that is tied to how non-random your teammates on the ice or court are with you.  Ideally, every player plays with every other player an equal amount of time.  This is very similar to the WOWY (with or without you) that I use in baseball as well.  You simply need to tie an uncertainty level around the metric.

To combat this problem, people have introduced adjusted plus–minus.

Good.  Go on.

An even greater gain is seen if 5 years of player data are examined. Estimating a coefficient for 373 players who played for five seasons, we see that 38.9% of coefficients are at least twice the value of the standard error. And 50.4% surpass the 1.5 threshold. Although more data do increase the level of statistical significance, it is still the case that most players—even when 5 years of data are used—are not found by this method to have a statistically significant impact on outcomes.

That’s good to know for basketball.  That’s the kind of good stuff I’d like to see.

Turning to plus-minus, how many points are scored and surrendered when a player is on the court should be linked to current wins. However, efforts to separate the player from his teammates have not proven successful.

In the instance he is citing.  That doesn’t mean that it doesn’t exist in some other way of adjusting for plus/minus.

Consequently, despite the popularity of these approaches among the metricians, neither is an improvement over what has been published in academic journals.

Meaning “Wins Produced”?  Which is Berri’s metric.  Again, it’s a simplistic way to look at the problem.  Say we go back to hockey, where we see someone with 50 goals and +30 and on the same team, someone with 10 goals and +30.  And we know that they don’t play on the same line.  Now, it’s likely that the guy who scored 50 goals contributed more to the +30 than the guy with 10 goals.  So, the best thing to do is think of plus/minus as adding an extra component or layer to the understanding of the player.

The same would apply to basketball.  The combination of the other stats with plus/minus will give you a “sum greater than the whole”.

The story of Linear Weight and DIPS highlights how metricians can help sports economists. Our examination of PER and adjusted plus–minus, though, suggests economists should be careful.

The second sentence should be more like “economists should wait out to see what the analysts are discovering”.  I object to the idea that the economist will swoop in to save the day, as Berri seems to do with his “Wins Produced” metric.

The metricians do not provide the same peer review that academic journals do. Consequently, just because a work is accepted among the metricians, it does not necessarily mean this work would pass muster in the academic community.

No, not the same peer review.  But, neither do we provide necessarily inferior review.  In many respects, we provide superior review.  And just because it passes muster with the academic community doesn’t mean it will pass muster with the subject matter experts in the blogosphere.

Part 4

In addition to disseminating ideas more widely, blogs offer the opportunity to discuss and debate with the audience.... However, no matter the pedigree of the participants or the overall quality of ideas, these forums are no substitute for the formal peer-review process that typically governs economics research.

Sure they are.

Regardless of the usefulness of commentary, blogs should be considered a source of ideas and inspiration for further research rather than a serious research outlet.

Again, just more summary assertions without evidence.  Say it with me: bullllll-sh!!!!!t.  I consider my blog an extremely serious research outlet.  Now, I would be delusional to say that, if nothing also came of my blog.  But, that’s hardly the case.

Why can’t my blog be BOTH: inspiration for further research AND a serious research outlet?

Good ideas developed on blogs should be written up, tested rigorously, and then published through normal channels.

Why?  Why?  Why why why why why?  If I did that, if we did that, the sabermetric movement would slow down to a crawl.  Again, more asserting of their will, with no backing at all.  Indeed, JC went out of his way to discuss Linear Weights and DIPS, and those were published in non-normal channels.  Again, another bullsh!t assertion.

And I’ve seen the results of JC’s “rigorous” tests: by rigorous, he means through whatever standards his academic teaching tells him is rigorous.  Sorry, it makes no sense.

Kruger and Dunning (1999) find that many nonexperts tend to overestimate their abilities. ‘‘Not only do these people reach erroneous conclusions and make unfortunate choices, but their incompetence robs them of the metacognitive ability to realize it.’

I agree with him.... but, I think it’s JC and Berri who are the non-experts!  We who study baseball are the subject-matter experts.  When JC and Berri swoop in to tell us that they are experts, and that Francoeur’s abysmal performance is worth 12 million dollars because he happened to play 160 games, then that means it’s JC that reached erroneous conclusions and made unfortunate choices.

Dude, c’mon.

The knowledge disparity between academics and interested laymen sometimes leads to unpleasant discourse.

Knowledge disparity?  “Interested laymen”?  How about calling us “subject matter experts” or “highly motivated students”?

...some participants often confuse ease of access in the same forum as equal expertise.

Really?  You aren’t talking about subject matter experts are you?  If you mean the public in general, well, that’s the internet, the greatest form of communication yet invented.

When nonacademics discover their ideas are not being accepted, unpleasant behavior can result.

Can you differentiate between “nonacademics” and “subject matter experts” (SME) please?  SMEs make up a tiny percentage of these people.  So, I’m not going to defend some internet flamers.

Examples of ‘‘unpleasant behavior’’ include commentators leaving essentially the same comment over and over again, often under different aliases;

No SME would do this.  So, again, you are indicting the internet.

As a consequence, some blogowners have gone so far as to remove the comments option to avoid the unpleasantness.

As far as I know, only sabernomics.com has done so.

Really, this whole section so far is just a philosphical point of view that really has no basis to what we’re talking about.  Lumping SMEs with outright flamers is disingenuous to say the least.

Despite the behavior of commentators frequently referred to as ‘‘trolls,’’ the online interaction has proven to be net beneficial. Most commentators are pleasant and offer useful insight. Consequently, very few (if any) academics ever walk away from the blogging experience once they have developed an audience. Apparently, by their own behavior, sports economists who are aware of the costs of blogging can certainly reap many benefits; and thus, blogging can ultimately improve the research generated by sports economists.

That’s a good enough conclusion.  One must therefore ask why JC refuses to engage with other saberists.

#1          (see all posts) 2010/01/20 (Wed) @ 12:03

A lot of your comments are reminding me of the old open source debate.  The online baseball community is the Google to the academic’s Microsoft. 

Imagine that one day a critical analyst notes that a top hitting statistic can be improved making a simple adjustment to include park factors.  And a few days later that top statistic has been virtually replaced with a new and improved metric.  This is the kind of progress that simply doesn’t happen in academics and why the comparison to open source seems so poignant to me.  It can take over a year before the community at large gets to comment on an academic piece, saberists can make vast improvements on each other’s work in a matter of days.  One saberist can suggest an idea and another can do the grunt work. 

I think the most common defense of the peer-review system is the most disingenuous.  I’ve often heard the argument that the peer-review system prevents bad research from being presented.  It would seem academics take us (and themselves) as fools.  Most intelligent people can recognize the difference between good work and bad work and pick the best measures.  And when there’s no clear consensus, we just quote UZR, +/-, and total zone to cover all our bases.


#2          (see all posts) 2010/01/20 (Wed) @ 12:27

Tango,

Since I’m often seen as someone with an academic bias (and there is, of course, some there as I can admit), I think you present both sides of the paper well here.  Thank you for that view. 

I think you make valid points about the efficiency of online work and open access, and it’s one that many academics push away, while others realistically understand how things are changing.  Obviously, if peer review were the end all way of getting things done, academics would not blog.  I think they make a great end route in terms of perceived legitimacy and expert opinion, but I also think using an open forum to perfect things is a useful tool, and may not be utilized to its fullest.

I know one of the reasons economists and saberists could collaborate well is the economists understand things like labor theory and its applications at a high level, while sabermetricians are great at measuring these things.  I think this is what you get at when you say ‘subject matter expert’ vs. ‘technical expert’.  And I appreciate the distinction between the two here.  I think some economists are true students of the evolution of baseball analysis, while others are not.  I also think it’s changing toward the latter for those truly interested in economics with sport as a specialty.

I’m sure there are things in the paper that some will take in a less than savory way, but that’s expected.  I think that keeping the perspective (as you do above) is the best way to lead into that criticism.


#3    Sky      (see all posts) 2010/01/20 (Wed) @ 12:42

I find myself wondering about the difference in modeling the subject by economists (ok, JC and a couple others) and sabers.  Economists seems to use pre-set mathematical techniques, like regression and distribution models, while sabers, like Tango, create new models with goal of mimicking actual processes.

An example would be extrapolated runs (economists) vs. baseruns (sabers).  Personally, I find the latter approach better, because it’s more intuitive.  Both approaches may end up at the same endpoint, but to me, the saber model approach provides more value and understanding along the way.  And may provide more jumping off points for further analysis.

Saber-style modeling, where a strong understanding of the subject matter is necessary, almost seems more difficult, or at least time consuming.  Although the effort put into an economics degree is certainly time consuming.  Maybe the economist-style approach is like using a swiss army knife on any problem, while the saber-style modeling approach is like using a tool with only use, but creating a new one for each problem?

I’m curious how others, especially those in the economics field, view my above simplification.


#4    Rodney King      (see all posts) 2010/01/20 (Wed) @ 12:53

Very nice response by Tango, and agree with the comparison to open source.  I find it interesting how JC and Berri discuss “sports economists” as if there were many in existence.  Perhaps there are many who do not weigh in on this debate, but it seems like those two are the only sports economists specified as such.  Maybe the debate would look different on the other side, but I cannot imagine any reason why it would unless one is trying to protect their tenuous credentials as a knowledgeable insider.


#5    Tangotiger      (see all posts) 2010/01/20 (Wed) @ 12:54

Sky: great example.  Yes, those who use the jackhammer regression will simply accept that the difference between a single and a double is .15 runs and move on.

The saberist will say “now that is impossible!”, and will create a model with logical underpinnings.  He’ll use pliers instead of a wrench.

And saberists are not immune to the lure of the jackhammer regression.  Not only JC and John Jarvis, but you have Eric Walker in this very blog do the same thing. BPro does something more jackhammer-like than archeological as well.  Bill James stubborn stance on the matter as well.

So, yeah, the objection I have is simply that nothing close to the true model is being attempted at being created.


#6    SM      (see all posts) 2010/01/20 (Wed) @ 13:20

"Often the merit of ideas offered by this community is judged by a consensus of pseudonymous avatars”

This is very confusing and appears to be a jab in the form of mothers basement bloggers. If they believe it to be a serious idea, they should back it up. Do they back the point up at all, by saying that the wisdom of the crowds is wrong?
I always had thought that Francis Galton himself believed this to be true somewhat.

If the point is that research and critique from anonymous sources is terrible I wonder what they would have thought of the Federalist papers.


#7    Nick Steiner      (see all posts) 2010/01/20 (Wed) @ 13:24

And look at the fertile ground that is PITCHf/x.  What is a non-subject matter expert to do with such data?  Purely a fish out of water.  The subject matter experts are the ones who will sort out how the data should be aggregated, interpreted, etc.  The technical expertise will rely on data mining and cluster analysis.  The regression jackhammer will have little use to those who need archeological tools.

I think this is a very good example.  I work with a lot of Pitch f/x data, and have little technical skills.  Everything I’ve learned has been from guys like Mike Fast and Colin Wyers.

Furthermore, if you are looking to apply the regression hammer to Pitch f/x data, you will get nowhere.  How the hell are you supposed to use regression to figure out the value of a 95 MPH fastball up and in, with -10 inches of horizontal spin deflection, 0 inches of vertical spin deflection, thrown in a 2 strike count against a lefty batter?  Regression won’t work there - there’s only been like 20 such pitches thrown over the past 3 years (if that!). 

None of the values in Pitch f/x are linear (except for velocity) or polynomial.  The ONLY regression technique that works with Pitch f/x is LOESS, and that breaks down with too many variables, isn’t particularly accurate from what I can tell you (Dave Allen or Jeremy Greenhouse would know much better than I though), is very computationally intensive and doesn’t give a closed form equation.  A neural net might also work, but that’s very hard to implement and get correct.  Josh Kalk was able to do so in a couple articles though.

The point is that a classic econometrician would get lots with Pitch f/x data.  Technical skills are important, but really only in terms of data-mining from SQL.  The only place where formal technical training is applicable really is with pitch classification, and as Mike Fast argued with cdm a while back, a mega-advanced system like cdm had was using wasn’t as accurate as Mike’s system because Mike spent time with the data, look at pitchers game by game, and tried to get the best results. 

Being a good Pitch f/x guy requires you to look under the hood, so to speak, and that can’t be done with regression.  You need some technical skills dealing with Perl and SQL, and I’ve just mooched off of others in that regard, having no previous training with any of those things.  But for the most part, Pitch f/x work requires a lot of care and diligence.


#8          (see all posts) 2010/01/20 (Wed) @ 13:28

I was thinking of this a little differently, as befits a dabbler rather than a true contributor… Saberists do have a peer review, it’s just a lot faster.  A lot of what goes on in blogs and comboxes may seem to an academic more like “shop talk,” only out in the open where anyone can wander past and spill some pixels on the topic.  It’s messy and makes the saberists look a little like bumpkins at the country club.

Who knows how much shop talk spurs great ideas that lead to experiment and good theories?  Ultimately, only the academics.  It’s more obvious online.  A stray comment can spark an advance in a metric that opens up a new understanding of the sports we love.  That comment can come from a fellow sabermetric pioneer or a stray snark from someone who hates all sabermetrics and swears by the holy trinity of wins, rbi, and clutchiness.  (Clutchiness would definitely be the Holy Spirit, and his disciples are known by their dirty uniforms.)

This ability to eavesdrop and ask questions (and shoot off my mouth) is part of the attraction for me.  When I was a kid, I pulled the back off my digital watch and yanked out the guts to see how the buttons actually worked.  I preferred baseball cards from older players rather than prize rookies - the rookie cards were boring!  They had no stats on the back.

Saber blogs are both of those things combined: peeking under the cover of the sport and seeing how it produces its results.  And because the process is so open a relative beginner finds it easier to follow; it’s more educational.  The peer-reviewed and published research paper is not meant for me.  I could no more follow that than I could pole-vault the English Channel.


#9          (see all posts) 2010/01/20 (Wed) @ 13:31

Their best defense of that statement would be that they were referring to the incentive structure that an anonymous person faces.  As a peer reviewed academic, the onus is on you to prevent your name from being sullied, presumably by triple checking your work and self-editing unceasingly. Further, it should prevent you from doing unsavory things like misrepresenting the data.  As a “pseudonymous avatar” the incentive structure doesn’t prevent anyone from going in and tinkering with the data to help support misleading results.

Of course such an argument is baseless because the scientific method still applies.  If somebody does some kind of groundbreaking work, others will want to know the process so they can replicate the results.  We’ve seen this in the comparison of WAR/WARP.  Because the components of WARP are at least partially shrouded in mystery, most people not employed by BPro simply refer to WAR.


#10    Tangotiger      (see all posts) 2010/01/20 (Wed) @ 13:41

"wasn’t as accurate as Mike’s system because Mike spent time with the data, look at pitchers game by game, and tried to get the best results. “

Right, only a subject matter expert would even question that perhaps there is a bias in the way the data was collected game-by-game and make the necessary adjustment.

Sakes/Hauer, for example, did not even think to mark Tejada’s first year of his 6/72MM$ contract at 12MM$, instead relying on the actual payout of 5MM$ in the first year of a back-loaded deal.  Indeed, when I presented them with this fact, they responded that they were true to their source of the data.

***

As for the incentive to protect one’s name, I can sit here and say that “Tangotiger” has not been sullied in the least.  Perhaps there is a “Tangotiger” out in Denmark that hasn’t protected his name, but just because, ON AVERAGE, a person doesn’t protect the reputation of his online identity (just as easy to create as discard) doesn’t mean you get to INFER that to specific people, like me.

On a case-by-case basis, “Pizza Cutter” and “Tangotiger” have shown that we protect our reputation more than those with normal sounding names.


#11          (see all posts) 2010/01/20 (Wed) @ 14:02

">academia normally involves many years of advanced training and requires its participants to be judged competent by their peers in a ‘‘publish or perish’’ environment—metricians demand equal recognition. In our view, this mentality begets misplaced confidence.”

To Tango’s response, I would add: we don’t give a crap about academia.  We don’t demand equal recognition to academics on academic grounds.  It’s totally irrelevant.

It’s like, if you prefer to do sabermetrics during your circus trapeze act, a laptop in your hands while your feet hug the trapeze bar and you swing back and forth.  If you can do that, it’s very impressive, and good for you!  But you then say, “hey, I went to trapeze school for years, and do you know how hard it is to swing from a trapeze, and how dare you compare yourself to us acrobats when you’re even afraid of heights!”

Well, our argument is: you don’t need a trapeze to do good sabermetrics.  You don’t need years of formal education.  You don’t need formal peer review. 

In terms economists would understand: it’s the outputs that matter, not the inputs.  We’re saying our outputs are no worse than theirs.  They’re saying, our outputs must be better, because look how good our inputs are!

That’s something that wouldn’t be tolerated in ECON101.


#12          (see all posts) 2010/01/20 (Wed) @ 14:13

Honestly, as someone who doesn’t participate/contribute to this conversation on either side...I find this little “war” or battle or whatever you guys have going on kind of...petty?  I’m a baseball fan that keeps up with a number of the sabermetric blogs/principles, but happen to work in econ...and I find both sides kind of repulsive.  Technical skills are good - things like regression tell us stuff, a lot of stuff, if they’re done right...but it can get complicated and may not always be done right.  It certainly adds value and contributes to our understanding to know the technical/math stuff conceptually and be able to apply it.  At the same time, it may be incorrectly used and lead to false conclusions.

Peer review also has it’s benefits - one thing I see with blogs is to someone like myself who keeps up with it, but doesn’t get into it at nearly the level of most of you - a lot of misinformation gets passed around.  Someone does a poor study and reaches wrong conclusions, or even does a good study that needs followup work to come to sound conclusions - that kind of stuff gets linked to and used as evidence in arguments and it’s just hard to keep track of the good research and what’s right and wrong (and any new research that comes out), because it all stays up on the internet.  So peer review, theoretically, should help keep track of the quality research and give us legitimate studies to point to instead of less thought out posts that may have serious inaccuracies and distract us from the good research.  Of course, anyone pretending the peer review process is perfect is lying to themselves - it has issues, as well.  It’s also a good point that it slows everything down and really limits how quickly research gets done and ideas get built on and checked by others in the “saber” community.

Overall I just don’t understand what the point of these petty little battles is.  It appears to me both sides stoop pretty low, and have a hard time recognizing the merits of the other, and as I’ve briefly laid out, there are merits to both.  We all have our limitations [except for me, of course wink ], and if you try to make a better effort to see where others are coming from and incorporate their skills to complement yours, we might just all end up better off…

One last note - there are times when different ways of looking at things are just “different”, rather than one being right and one being wrong…


#13    Tangotiger      (see all posts) 2010/01/20 (Wed) @ 14:29

B: I often say: summary opinion without evidence is bullsh!t.

You say that we (I?) are engaged in some petty battles, and that we (I?) stoop pretty low.  Since you provide no evidence, that’s called bullsh!t.

You are free to make your point, and feel free to say that I stoop low to make my point.  I don’t moderate here. But, give me the courtesy of at least seeing your evidence so that I can at least confirm or refute it.

In the same way that you, for whatever reason, see this as a “battle” and alot of “stooping” with no reason for posting, I find your summary opinion completely without basis and therefore, no reason for posting.

So, go ahead.  Post your evidence.  You may be right, but right now, we only have your word.


#14    Tangotiger      (see all posts) 2010/01/20 (Wed) @ 14:29

I have added a “Part 2” at the top of this thread.


#15    Tangotiger      (see all posts) 2010/01/20 (Wed) @ 14:46

I have added a Part 3.  This part focuses solely on Berri’s words.  What I say here is not pretty and I can see why Berri is not well-received, even less so than JC, by the analyst community.


#16          (see all posts) 2010/01/20 (Wed) @ 14:49

B,

If you followed the other thread (http://www.insidethebook.com/ee/index.php/site/comments/peer_review_v_internet_debate/), you’ll see that on the whole, the interactions between “hobbyists” and “academic economists” have been constructive.

Really, people are scratching their heads to figure out why JC Bradbury and, to a lesser extent, Dave Berri are such a-holes.  That’s it - just those two guys.  And we’re talking about a-holes in the sense of flaunting credentials, doubling down when they make a mistake, and projecting their own insecurities onto other people’s responses. 

Much as peer-reviewed journals are an improvement over non peer-reviewed journals, the peer review process somehow gave Bradbury and Berri a journal paper about “the dangers of believing what you read on the internet.” It certainly fails the test for original research, and the writing style is embarrassing and betrays an attitude of academic superiority. 

“there are times when different ways of looking at things are just “different”, rather than one being right and one being wrong”

Yes, like UZR and Dewan’s +/-?

But what about Francoeur’s performance being worth $12M above replacement?  What about a cherry-picked sample giving an incorrect peak age?

Are there not times when something is demonstrably wrong?  Do you think anybody would get their hackles up if Tango or MGL put out something vastly incorrect and they got trashed for it by people with the right methodology? 

“Someone does a poor study and reaches wrong conclusions, or even does a good study that needs followup work to come to sound conclusions - that kind of stuff gets linked to and used as evidence in arguments”

Give an example, please!


#17    Tangotiger      (see all posts) 2010/01/20 (Wed) @ 15:14

I have added a Part 4.

In my opinion, they should have stopped the paper after Part 2.

From part 3 through 4, this paper was just an outlet for their personal opinions.  Nothing will come of it in any sense.  No one will be able to use it to learn something better, it doesn’t advance the discussion, it does nothing really, other than reinforce the image we have of JC and Berri.


#18          (see all posts) 2010/01/20 (Wed) @ 15:30

Post 3 clearly indicates that Berri is what my Economics advisor refers to as a terrible economist.  Regression is a very useful tool so long as you ask yourself the right questions while setting it up.  Berri is running regressions all over the place without ever asking himself a single meaningful question.  Common sense must be used when running a regression, there’s nothing to be gained by running a year to year correlation on plus-minus because the statistic is heavily polluted.  On the other side of the coin, of course shots on goal are heavily correlated year to year, there’s no benefit to noting such a correlation.

The economists I worked with in undergrad would scold Berri unceasingly.


#19          (see all posts) 2010/01/20 (Wed) @ 16:32

@Tango:

I’m referring to the constant back and forth that seems to me like it attacks the other “side” rather than attempts to be constructive.

“What we would like is for economists to realize they are still dancing disco, while the rest of us have grown up from that.  Perfect the electric slide all you want, and that won’t mean that we want to be your equals there.”

“Indeed, while the baseball fan would say that your toaster is broken, the academician will reply that the toast is supposed to be burnt black on one side and stay white on the other.”

I’m seeing general statements like this, and just not seeing the point.  Anyways, the real substance of my post was all the stuff that you didn’t discuss, so we can move on.  Whatever.

@Hawerchuck:

That makes more sense that it seems to be more of a personal thing with a couple of bad economists.  Fair enough.  Calling Jeff Francoeur a $12M player is certainly wrong, and as you said, I didn’t mean to imply that some ideas can’t be wrong.  They certainly can be.  My intention is to point out that even when certain ideas are established with merit - like rWar/fWar, for instance, certainly useful tools with a lot of good work done behind them to make them work - it doesn’t mean there aren’t alternative ways to value players that might be “different”, but not necessarily incorrect, though, again, a method that comes up with ridiculous results (like maybe what Moore is using with the Royals?) can be wrong.

“Someone does a poor study and reaches wrong conclusions, or even does a good study that needs followup work to come to sound conclusions - that kind of stuff gets linked to and used as evidence in arguments”

The first one that comes to mind for me is the concept of a pitchers influence on BABIP.  It’s limited, of course, that’s been shown.  My impression is a lot of theory (at one point) was constructed on the notion that pitchers basically don’t influence balls put in play.  I’ve read a bunch of the research on this, and for a long time I strongly felt that the research presented didn’t support the conclusions reached, which (at least for a time), was that pitchers basically didn’t control BABIP.  Simply put, the research didn’t back that up.  The data being used (pitcher level data) just wasn’t substantial enough to make that conclusion - it was really necessary to have pitch level data at the minimum to look into it.  Of course the research has come a long way since then (Voros McCracken’s article came out in 2001 after all), incorporating pitch f/x data, batted ball profiles, and what not - so don’t take this the wrong way, it’s just meant to be a past example to back up that statement.

“this paper was just an outlet for their personal opinions.  Nothing will come of it in any sense.  No one will be able to use it to learn something better, it doesn’t advance the discussion, it does nothing really, other than reinforce the image we have of JC and Berri.”

And that was the type of thing I was complaining about in the first place, and I suppose that makes it one of those examples you were looking for…


#20    Tangotiger      (see all posts) 2010/01/20 (Wed) @ 16:44

Interesting.  I was actually framing the other side’s perspective, and then dismissing it as irrelevant.  You see that as stooping or being petty?  I was seeing it as stopping the other side.

***

“And that was the type of thing I was complaining about in the first place, and I suppose that makes it one of those examples you were looking for… “

Well, in this case, I positioned myself to do a review of their paper.  I laid out all my opinions, and my conclusion naturally followed.

So, again, no stooping, no pettiness.

If you want to argue there’s alot of useless back-and-forth, yes, I agree, completely.  If you want to argue that it’s unpleasant, then I agree.  I basically take exception to your characterization, since you really have not provided any basis for stooping and pettiness.


#21    Mike Fast      (see all posts) 2010/01/20 (Wed) @ 16:50

I imagine that people who step into the middle of this argument feel much the same as I often do when I step into the middle of an argument between my 6-year-old and my 3.5-year-old.  Unless somebody is bleeding, I don’t care who started it, I don’t care who did what, I don’t care who’s more at fault.  I just want them both to stop and play nicely together again.  They, on the other hand, care about who did what.

Such is life.


#22    Tangotiger      (see all posts) 2010/01/20 (Wed) @ 17:11

Excellent analogy.


#23    colintj      (see all posts) 2010/01/20 (Wed) @ 18:11

I really doubt this debate would exist if we had better fortune with the academics that decided to crash the party.  Imagine if Tyler Cowen got a sudden research interest in sports. 

Relatedly, I’m pretty sure JC isn’t going to work in a team front office anytime soon.


#24          (see all posts) 2010/01/20 (Wed) @ 19:12

The thing that strikes me first about this post is that you sound a lot like what an old scout must have sounded like to Paul DePodesta—“we’ve been here, we put in the time, how can you know what you’re talking about when you haven’t done x?” where x is either play the game or be a saberist.  “It’s not a good idea for them to try to create a metric when we’re the ones who have so much experience in doing just that” is exactly what a scout would have said about player evaluation to anyone thinking about using numbers to project performance.

I don’t think that a lot of what you say is invalid, Tango, because it’s certainly a good idea for economists to read saberists and become updated on interesting research being done by subject matter experts.  Subject matter experts, however, may only be experts in baseball, and not in measurement or analytical techniques which are necessary to create accurate statistics.

The A Team/1: Peer reviewed journal publication is considered (by economists, at least) the final hurdle to acceptance of an idea.  Most people, especially those on the cutting edge of research, approximate the openness of an online community by releasing working papers that aren’t quite ready for primetime, but are pretty well developed.

Rodney King/4: You would be surprised by the number of people doing sports economics.  The difference between them and Berri/Bradbury is that, while they may read saberists and utilize their results, they don’t engage with the community.  See if you can find a conference schedule from the Western Economics Association meetings (I can’t at the moment, though)—there are usually 3-4 days full of sports economics papers being presented.

SM/6: The wisdom of the crowds is often wrong - many otherwise reasonable people think Joe Morgan is an adequate baseball commentator.

Nick/7: I wonder if you might be able to analyze that kind of data by matching on propensity scores; regardless, I think we can both agree that the best kind of analyst is someone who is both a saberist and has been trained with a high level of formal techniques.

B/12&19: Largely agree.  One example of an erroneous stat that could make its way into the public discourse: any BPro stat that Tango likes to take shots at.  If BPro was forced to submit to peer review in the past, I don’t think you (Tango) would have recommended to a saber journal to publish a paper claiming WARP’s value as a metric.  However, there are now a significant number of people believing in its value because there is no quality control before something is released for public consumption.

Hawerchuk/16: Also agree that JC Bradbury and Dave Berri antagonize with no reason.  However, I’m glad that you (and others, connected with the other thread,) realize that it’s really only two guys that piss off saberists.  That’s why Tango’s repeated comments about “academic economists” and “academicians” in this post are frustrating and out of line.  JC Bradbury and Dave Berri =/= the whole group of academics.

The A Team/18: Of course your undergrad economists would scold Berri endlessly.  He’s not exactly a cream of the crop sports economist, much less a cream of the crop economist in general.

Out of curiosity, did anyone bother to think about the kind of jobs Dave Berri and JC Bradbury have?  They’re professors at Southern Utah and Kennesaw State, neither of which are particularly known as bastions of economic talent.  Tango is arguing with bottom of the barrel economists and using it to indict the entire profession on the one hand, then when I disagree in the other thread about the value of academic contributions to sabermetrics, he references the huge amount of academics working in saber.  It’s not a question of “the academician will reply that the toast is supposed to be burnt black on one side and stay white on the other,” but probably more of a question of why JC Bradbury in particular tends to be wrong about lots of stuff.  For someone who I respect a lot as an analyst, you sure seem willing to take a sample size of <5 and use it to make a lot of very general statements, a technique you would certainly question if it were applied to baseball, Tango.


#25    Nick Steiner      (see all posts) 2010/01/20 (Wed) @ 19:30

Tango is arguing with bottom of the barrel economists and using it to indict the entire profession on the one hand, then when I disagree in the other thread about the value of academic contributions to sabermetrics, he references the huge amount of academics working in saber.

Ben you keep mentioning this and it’s strange.  Tango has NEVER extrapolated his interactions with JC or Berri to the rest of the economists.  If someone here has, then they do not speak for everyone - only for themselves.  Tango has gone out of his way not to blanket all sports economists under the JC/Berri mold.

Furthermore, nobody has every disagreed that academics can’t have value in Sabermetrics.  I’m not sure where you are getting this from.  People are arguing that subject matter experts are always going to be better than technical experts (or academics) on their subject (assuming equal level of expertise).


#26          (see all posts) 2010/01/20 (Wed) @ 19:35

Nick/25: I used two direct quotes of Tango’s post above where he refers to “academic economists” as a blanket group and “academicians” as a blanket group.  I don’t know where you’re getting it from that he’s NOT indicting an entire profession.  If he only wants to talk about JC and Dave Berri, he should say “JC and Dave Berri” rather than “academic economists.”


#27    Nick Steiner      (see all posts) 2010/01/20 (Wed) @ 19:39

Ben - I have not seen one instance on this thread where Tango referred to all “academics” as such and such. 

In fact “academic economists” was a line quoted from JC and Berri’s article!


#28          (see all posts) 2010/01/20 (Wed) @ 19:50

Nick/27: Then you’re either blind or not looking.  From the original post:

1. “What we would like is for economists to realize they are still dancing disco, while the rest of us have grown up from that.”

2. “Indeed, while the baseball fan would say that your toaster is broken, the academician will reply that the toast is supposed to be burnt black on one side and stay white on the other.”

3. “It’s not a good idea for them [referring to “sports economist” from the previous sentence] to try to create a metric when we’re the ones who have so much experience in doing just that.”

4. “Indeed, it shows the reason why an economist should let the saberist figure out how to create the individual metric.”

5. “The second sentence should be more like “economists should wait out to see what the analysts are discovering”.”

All of those are comments that pertain to sports economists as a blanket group, ignoring the fact that there is some overlap between what Tango things of as a valuable saberist and the list of people who are sports economists.  Matt Swartz would be one of those people.  I get it that JC Bradbury sucks at being a baseball analyst - but restrict the commentary to him rather than making blanket statements about sports economists.  Just because Bradbury can’t differentiate between subject matter experts and laypeople doesn’t mean that Tango isn’t able to differentiate between Bradbury and other economists.


#29          (see all posts) 2010/01/20 (Wed) @ 20:07

Ben,

Here is what I wrote about Berri and Bradbury a few months ago:

“It is ridiculous to appeal to the authority of a PhD in an unrelated field...The irony is that within academic circles, you’d get laughed at for flaunting your credentials as an academic at a third-rate institution like Southern Utah University (Berri) or Kennesaw State University (Bradbury).”

Even though academics don’t see them as serious researchers (and they even admit that they’re viewed as unserious), people on this site take them seriously because there’s no bias in favor of credentials in analyzing baseball.  You could be the night-watchman at a cannery (Bill James) or really be living in your mom’s basement (McCracken) or be a frustrated academic who wants to feel superior to someone - anyone - else (Bradbury/Berri) - you can analyze your idea and run with it.


#30          (see all posts) 2010/01/20 (Wed) @ 20:13

Hawerchuk/29:  Agreed completely; I don’t have a PhD yet, and am certainly not as smart as others here.  I just think that what you’re saying is a beautiful example of why I have such an issue with what Tango wrote in the 5 examples I cite above - generalizing like that is inaccurate and uncalled for.


#31    Tangotiger      (see all posts) 2010/01/20 (Wed) @ 20:15

That’s why Tango’s repeated comments about “academic economists” and “academicians” in this post are frustrating and out of line.  JC Bradbury and Dave Berri =/= the whole group of academics.

While I have gone to pains to say “SOME” in some instances, obviously I have not qualified this enough in all instances.

As I have said in the other thread, this is JC (and Berri) v the world.  My comments are limited and should be construed as limited to those two.


#32    Tangotiger      (see all posts) 2010/01/20 (Wed) @ 20:18

If BPro was forced to submit to peer review in the past, I don’t think you (Tango) would have recommended to a saber journal to publish a paper claiming WARP’s value as a metric.

EXCELLENT example (as well as LEV among others).  Yes, perfect. 

The problem is that an academic peer review process would not have stopped the super-low replacement level from existing either. 

This could only be stopped by SMEs putting their foot down, or at the very least, putting a rider on the use of this stat.  Something like “Hey, we respect Tango generally, and he can’t stand the superlow replacement level, but we think it’s valid, so in this case, don’t listen to him.” Something like that.


#33    tangotiger      (see all posts) 2010/01/20 (Wed) @ 20:32

Tango is arguing with bottom of the barrel economists and using it to indict the entire profession on the one hand, then when I disagree in the other thread about the value of academic contributions to sabermetrics, he references the huge amount of academics working in saber.

Right, as noted, I am disagreeing with JC and Berri specifically.  They are the ones who are using “academics” as if these two guys are representative.

To be clear, I should have started by saying that I disagree that these two guys are representative of who they are purporting to represent, and they are in the minority.  And my comments are directed solely at the guys actually make the comments (and whoever it is that supports their position).

Again, you are right to have thought otherwise, because I was not explicit enough.


#34    tangotiger      (see all posts) 2010/01/20 (Wed) @ 20:55

Ben, I agree that I didn’t qualify when I needed to, as I’m re-reading.  However, you are taking at least one item out of context:

While an academician might agree that the worst hitting outfielder in baseball could have a performance worth 12 million dollars, even the most casual baseball fan would laugh at such a result.  Indeed, while the baseball fan would say that your toaster is broken, the academician will reply that the toast is supposed to be burnt black on one side and stay white on the other.

You only quoted the bolded part.  The statement preceding that was an obvious reference to JC/Frenchy.  So, the toaster comment had to be taken in that context.  If you want to argue that I should not have said “an academician” but rather “this academician”, that’s fine.

I’ll ask for just a little slack, not a lot though, for this review.  In light of the way I deal with everyone else and the inconsistency of the comments as you noted, obviously I’m not silly enough to paint broad brushstrokes against a class of people.


#35          (see all posts) 2010/01/21 (Thu) @ 00:47

I’m consistently amazed at the whole argument.  I participate in a listserv in my work that is populated both by academics and practitioners in my field.  There is nothing but mutual respect (a little teasing, but all in good fun).  The academics sometimes quote the practitioners in scholarly papers, and are consistently open to correction about the difference between their expectations of how things work and how things work in the real world.  The practitioners are more than willing to learn about tools and resources they were unaware of, and to be corrected about assumptions made in the field that are not supported by the literature. 

In fact, it’s a great deal like the tone of this blog.  And both are “must reads” for me in much the same way, because it’s a pleasure to come knowing that everyone has the same goal of getting a little bit smarter and making the field a little bit better. 

What these particular economists are doing reminds me more of when my wife was in a hard science field and some people with huge main frame computers were doing calculations that she disagreed with based on a basic knowledge of chemistry.  Essentially, she was arguing that the computer data must be missing something because at that temperature and pressure those elements shouldn’t react like that.  She was essentially drummed out of the field by people telling her the computers had to be right, they were computers after all, goddammit.  And of course it was later shown that she was absolutely right.  To me, this is exactly like the saberists who look at their conclusions and test it against whether they make sense in a baseball context.  A number that makes the worst hitting outfielder worth 12 million is an indication that your numbers are wrong, not that the worst hitting outfielder in baseball is more valuable than everyone else says.


#36    dlf      (see all posts) 2010/01/21 (Thu) @ 10:55

Not to go completely astray, but this morning’s NPR had an interesting story about the hacked climate science emails in which they quoted several academicians who acknowledged that peer review is subject to the same petty personality conflicts of any human interaction.


#37    Tangotiger      (see all posts) 2010/01/21 (Thu) @ 11:15

Cool, good to know that they are self-aware enough to realize the kind of bias any group brings to the table.

***

FWIW, I made 3 or 4 changes to my original post, by adding “some” and other qualifiers like that.  Such is life as a blogger, but such is it that I can make a change like that in a day.


#38          (see all posts) 2010/01/21 (Thu) @ 11:33

Tango,

Your updates/changes I think would be appreciated by many.  While I think the article does push toward an overall conclusion of finding value in the blogging interaction, I do understand your frustration with certain parts.  Though, I think it is reasonable to replicate and show value of metrics when writing about them to a new audience, as long as further credit is given.

I think it must also be kept in mind that the article is written toward an academic audience, so the lower-level description of the medium and its outputs is expected.


#39          (see all posts) 2010/01/21 (Thu) @ 12:00

I think the additional comments have been very helpful in clarifying your view, Tango.  I guess I was in a similar boat as Ben initially - where I read it as criticism against academics, economists, and their methods like regression (and other more technical math related abilities) and peer review, it seems you’re much more in line with my thinking.  A couple of guys doing bad economics hardly reflects what “academics” are capable of bringing to the baseball world.  I remember a blog post you did a while back about why you hate regression (in this context with regards to linear weights)...and the first thing I thought of when I saw these guys methods (that you were questioning) was potential omitted variable bias and multicollinearity, the latter of which could really mess up a linear weights model.

Having a PhD in this stuff can be helpful - a good grasp of the technical aspects of these things really can help bring additional ways of analyzing data and modeling that others without the technical knowledge are incapable of.  Simply having a PhD, however, doesn’t mean you’re doing things right.  Unfortunately, there are more mistakes and incorrect methodology and just “bad economics” (as I would call it) going on in economics than I would like to akcnolwedge, but still, when people do it right, it brings a lot of additional knowledge.

Also, while I do like the blog format for the saber stuff that goes on, it seems people here are a little too dismissive of a more formal process like peer review.  As I initially acknowledged, and someone else mentioned, the peer review process, aside from being slow, is hardly perfect.  It’s subject to many human biases, and at times is not nearly as good at determining the worthwhile work from work with sketchy methodology/purposes.  I do think something like that would be very useful for the saber community, though.  Not necessarily an academic peer review (though maybe trying to incorporate more technical knowledge is a good thing?), but some sort of formal process where ideas that have become accepted really are tested and published in a formal place, reviewed by knowledgeable people, and essentially established as higher quality than just a blog post.  It lends more credibility to the idea, and helps establish the ideas that have strong research backing them, that hold up to substantial testing, from the rest of what gets posted on the blogosphere.  I really think something like that could have a very useful place among saberists.  Not every idea has to go into it, heck, most can stay out on the blogs to be worked on and advanced by others, but the important ones can be established in a more credible way.

So anyways, going back to the original point, I’m glad this is more a case of being dismissive of bad economics, rather than of academic research/economics and their tools, processes, and knowledge as a whole.  I see a lot of ways both of these “sides” can contribute to further our knowledge of baseball.


#40    Tangotiger      (see all posts) 2010/01/21 (Thu) @ 12:30

Not necessarily an academic peer review (though maybe trying to incorporate more technical knowledge is a good thing?), but some sort of formal process where ideas that have become accepted really are tested and published in a formal place, reviewed by knowledgeable people, and essentially established as higher quality than just a blog post.

I’m all for that. 

You are suggesting some sort of “association” that the subject matter experts would belong to, much as other professions belong to these kinds of associations.  IEEE for one (though I presume that it creates splinter groups).

SABR could ostensibly play that role, but their focus is far more toward historical research than technical vetting.

This blog here plays that role in a very informal, sporadic setting.  Kinda like a watchdog or ombudsman.  Phil plays kind of that role in his blog as well.

To have something more formal, that has more teeth, would require buy-in from some of the leading saberists.  In one respect, The Forecasters Challenge is one method: you have many of the top forecasters agreeing to expose the results of their work, and test it in competition.  If The Forecasters Challenge were to gain more prominence, then the forecasters who choose NOT to participate would be exposed to some extent.

The problem is that the rigidness and formality of an association will eventually become a barrier, and if the buy-in is not there, it simply becomes another bloated thing.

The academic peer review process is even more restrictive, and therefore, is doomed to fail at the saberist level.  It’s a non-starter.

As it stands, whatever kind of association were to develop, would have to be something just a bit stronger and a bit more rigid than what we currently have with blogs.  Anything stronger, and it can’t succeed.


#41    Mike Fast      (see all posts) 2010/01/21 (Thu) @ 13:07

I really expected Dave Barry’s section of the paper to be funnier given what I’ve read of his work before.  :|


#42    Tangotiger      (see all posts) 2010/01/21 (Thu) @ 17:19

Phil’s take:
http://sabermetricresearch.blogspot.com/2010/01/chopped-liver-ii.html

And if you guys think I was tough, my review is extremely light.

My review was simply a point-by-point rebuttal to what Berri and JC were saying.

Phil however ties it in to something larger, like a real reviewer would do.  Phil is the Ebert of sabermetrics. 

I’m glad he’s on the same side I’m on…


#43          (see all posts) 2010/01/21 (Thu) @ 17:35

"but some sort of formal process where ideas that have become accepted really are tested and published in a formal place, reviewed by knowledgeable people, and essentially established as higher quality than just a blog post.  It lends more credibility to the idea, and helps establish the ideas that have strong research backing them, that hold up to substantial testing, from the rest of what gets posted on the blogosphere.”

really, this is just about incorporating status and formality into what already happens to make our signals more obvious.  personally, i think it’s useful that we don’t have anything like this.  usually those methods involve some sort of paywall and increased publicity.  if you want to lower the quality of the commentary, that’d be the way to do it.


#44    Nick Steiner      (see all posts) 2010/01/21 (Thu) @ 17:41

+1 on Phil’s piece.

I’m now convinced that JC is basically just a parody so wrapped in disillusionment and bias that there is little point to reading anything he writes anymore.


#45    anon      (see all posts) 2010/01/21 (Thu) @ 18:08

Why are these questions (regarding studies done by people who work for universities compared to studies done by those who don’t) relevant to the paper on econimics and/or baseball?  Seems like a much broader question beyond the scope of an economist or a saberist ... or the two of the combined.


#46    Rally      (see all posts) 2010/01/22 (Fri) @ 10:47

The thing about JC is he seems 100% convinced he’s right and we are the idiots.  Did anyone read, shortly after the Francoeur debates, his analogy of him being the pilot of a plane being told how to fly by passengers?  The arrogance is stunning, and combined with wrongheadedness is downright comical.

But of course, he can’t see that, he thinks the same of us.  If only there were a way to test and get people to put up or shut up.  You know, real science.  Maybe he’s right, maybe we are the idiots.  How can we test this?

Here’s what I propose:  Take the season stats of a league and set up a simulation.  Diamond Mind or APBA could work.  Leave projections out of the discussion, we only want to replay the actual season stats, and derive our player values from those.  Have JC’s system compete with salary based on rWAR, fangraphs WAR, BPro Vorp, Win shares to $, and maybe a few other variations of player valuing.  No human interaction is needed, just start with Pujols, the system that values him the highest gets him.  All systems would have to be calibrated so we’re dealing with the same total value for players.  Ranking of players (to see what order they go in the auction) determined by average composite rating (say the order for NL goes Pujols-Hanley-Linc-Utley or something like that).  A few other details would have to be ironed out.  I think playing time would have to be enforced, so if you draft Furcal 2008

We would not need JC’s involvement, as I doubt he’d want to interact with the unlearned heathens we are anyway.  But I’m pretty sure his team would get Frenchy, and probably finish in last place.


#47    Hizouse      (see all posts) 2010/01/22 (Fri) @ 11:06

I don’t remember the pilot-airplane comment; I thought he just said “read my book” a few times before shutting off comments....

But that reminds me: does JC mention Francoeur at all in this article?  It could serve as an example of how his interaction with saberists caused him to improve his (as yet unreleased new) model.  Of course, telling that tale would require him to rehash his end of that discussion.


#48    WTE      (see all posts) 2010/01/22 (Fri) @ 14:47

A few comments, which may or may not be on point:

1. Maybe I’m biased because I work in a research library, but part of the problem may be curatorial. When JC makes his bogus citations to almost irrelevant academic research, there is at least a pretty good chance those resources will be available to readers, even 40 years after the fact. No matter how irrelevant they are. If Tango falls into a ditch tomorrow, how long will this blog last? Continuity of knowledge is a pretty serious advantage for the academic side here, no matter how misapplied by JC.

Maybe we need some big publishing house, library or some larger institution to take ownership of good baseball research, and curate it properly, with appropriate index, classification, etc.

2. Whoever made the point on Phil’s blog that economists in particular have a certain departmental arrogance kind of nailed it. And that’s curious, since the track record of economics as a discipline is pretty spotty. But it’s a fashionable hammer these days, and there are plenty of people swinging away.


#49          (see all posts) 2010/01/22 (Fri) @ 15:43

WTE,

Doug Pappas died tragically a few years ago.  His work is not lost.

JC really doesn’t bring anything to the table.  It doesn’t matter how hard we tried to find his contributions.


#50    Colin Wyers      (see all posts) 2010/01/22 (Fri) @ 15:52

Yeah, I can see why “so thanks to some random baseball hobbyists on the Internet I remembered the difference between absolute and marginal revenue products” isn’t in Bradbury’s submission to the Journal of Sports Economics.


#51    WTE      (see all posts) 2010/01/22 (Fri) @ 16:17

So 40 years from now if someone wants to retrieve a specific argument Pappas made or a dataset he used, you’re pretty confident that would be somewhere out there? Where?

Where is the equivalent of a library that classifies and stores the work of saberists?

Maybe Wikipedia will actually end up being a critical resource here. Maybe saberists should be dropping as much content as possible into Wikipedia, or something like it. The infrastructure and long-term archival support needs to be there.


#52          (see all posts) 2010/01/22 (Fri) @ 16:33

"really, this is just about incorporating status and formality into what already happens to make our signals more obvious.  personally, i think it’s useful that we don’t have anything like this.  usually those methods involve some sort of paywall and increased publicity.  if you want to lower the quality of the commentary, that’d be the way to do it. “

No, it’s not, it’s about establishing the best practices and knowledge into something more credible, trustworthy, and easier to find and reference.  It’s about making sure these ideas really are tested by other knowledgeable peers to make sure they hold up to various alternative ideas and analysis.  It’s not going to lower the commentary - because it wouldn’t be the driving force for research, the blogs seem to do that just fine, it would be something sent in after the “high quality commentary” or whatever you want to call it has been done on the blogs.  The whole point is when it gets to the point of being published in a “saber journal” (term I just made up for this purpose, you can call it whatever you want), it’s not about commentary anymore, it’s about publishing the idea as credible, tested, and approved.

There’s real value to that.  Tango calls people out for not giving examples - what if I try to go looking through the blogosphere for some example of research in some hypothetical situation.  I find some on the subject I want...well, how do I know what followup research has been done on it?  How do I know it hasn’t been eventually been discredited?  It’s probably still in the form it was in when it was published on a blog- so it’s missing authenticity that I think good ideas/research need.

One last random thought I just had - it would also provide kind of a one stop destination for established ideas.  That’s useful in a lot of ways, and if you’re looking for research to cite or build upon, it can always be done from a source like that and it gives your research more credibility/merit.


#53    Tangotiger      (see all posts) 2010/01/22 (Fri) @ 17:15

In the main thread, I quoted the authors:

2. What issues arise when sports economists review the metrician research?

And I responded:

2. That there is no one single placeholder, that things evolve, that sometimes there are competing views.  It’s a bit of a mess.

So, I agree that there should be more centralization happening.  I’ve offered my wiki as such a thing:
http://www.tangotiger.net/wiki/

It’s sputtering along, with a few people putting in most of the work.  But yes, when someone comes to me asking me for a reference for something, it would be so much easier for me to say “wiki”.

But, most of the saberists don’t see the value enough here to actually document their work.  Plus, since this is a hobby, we don’t WANT to do work.  Why do we care about posterity?  We do what we want, we put out our research and we move on.

Yes, this makes it very hard for others to cite us.  But, again, we have little incentive to do otherwise.

I presume academicians do what they do because their livelihood depends on it in some respect. 

Even SABR itself sees no interest to do this. 

To most saberists, this is like a game of pickup.  We don’t want to keep detailed scores.  Just enough to see who won the game we’re playing.  Tomorrow, there’s another game.

I agree, it’s a mess we’re leaving.


#54          (see all posts) 2010/01/25 (Mon) @ 16:57

I’ve followed a lot of the WoW wars at the apbr metrics forum.  Berri completely dismisses anything done by anyone online.  Berri’s 94% is based on essentially doing a regression with a previous residual as a ‘team adjustment’.  If other metrics are allowed a similar ‘team adjustment’ they come out as good or better than WoW.  This doesn’t stop Berri from using the 94% number to dupe people.


#55    Eli      (see all posts) 2010/08/05 (Thu) @ 21:08

Not sports related, but interesting:

“In order to distinguish between well founded criticism and unsubstantiated claims made for political purposes, all future debate should take place in peer-reviewed publications.”

http://www.spiked-online.com/index.php/site/article/9383/


#56          (see all posts) 2010/08/05 (Thu) @ 21:14

Thanks, Eli, that was definitely worth reading.


#57    Tangotiger      (see all posts) 2010/08/05 (Thu) @ 22:14

These guys make me sick.

How about this: if something HAS been peer-reviewed, I don’t want to discuss it.  It’s as stupid and idiotic as saying that you’ll only argue in a peer-reviewed setting.

These guys are nothing but control freaks whose arguments are hanging by a thread.

Galileo, Newton, Edison, Da Vinci.... f-ck them all, according to these snooty a$$hats.


#58    Richard Bergstrom      (see all posts) 2010/08/06 (Fri) @ 03:05

I’ll just reuse something I said over at Baseball Prospectus that one difference between “Working in the Land of the Metricians” is a tone of voice used for publishing an academic paper at a conference, versus a site like Tango where it’s more like a chat between experts in a faculty lounge. Of course, a lot of people fall asleep at those conferences. wink

I’ve studied education all my life, have a Masters Degree in Education and worked in college admissions, financial aid, the business office and business intelligence. But, I’m considered a layman or nonacademic or support staff as if I don’t know what makes a student likely to graduate based on their admissions/attendance/financial aid status when a teacher in a class of 300 students does. Not to dismiss the teacher’s view, but I got one too smile


#59    EvanZ      (see all posts) 2011/02/24 (Thu) @ 00:28

Sorry, way late to this party.

As someone who is a) an academic b) developing basketball metrics..ONLINE (oh, the horror) and c) been directly questioned by Berri about my “credentials”, I’d like to add my two cents on peer review.

Peer review is very important in academics. It is important, as far as I can tell, for one sole reason: papers are the currency of academia. Papers are what get you grants and tenure. If there were no need to beg the government for money and no need to convince your dean and provost and other faculty members that you are worth hiring for life, there would really be no need for peer review.

The thing is, aside from Berri and a handful of others who actually publish academic papers on sports economics, why do any of us need a formal peer review process? None of us are getting tenure for this work. None of us are getting grant money to do our research in saber or apbr metrics. Our peer review is in the blogosphere and the forums, not in the ivory towers of academia (and remember, I say this as someone who also relies on formal peer review for my day job in academia). Ironically, Berri’s metric has been stalled in place for 10 years, while the rest of us continue to work on improvements, and dare I say it, actual new research into the subject. That’s what academia is all about, after all.


#60    MGL      (see all posts) 2011/02/24 (Thu) @ 02:40

Haha.  I thought this was a new thread and I was amazed at the number of responses.  If I were an academician I probably would have realized that the thread was over a year old.

My quick take:

This bifurcation of academicians v. metricians (saberists) is artificial, pointless, and silly.  Some of the best saberists have great academic credentials and other do not.  Some do great work and others do not, and everything in between.  If you ran a regression with various x variables and the y variable was quality and quantity of work, my guess would be that the only significant x variable would be expertise and knowledge of the sport.  IOW, there are many ways to skin a cat, but you have to know HOW to skin a cat, first and foremost.  Great sports knowledge is hardly a guarantee for quality research and great academic tools is hardly a guarantee either.  And having both, while an advantage, is no guarantee either. I have limited expertise in statistics and econometrics, and I would like to think that I have contributed greattly to the field of sabermetrics.  Why (assuming that is true)?  Because I have a vast storage of knowledge and experience in baseball related matters and I have a good ability to problem solve. Tango has more statistical acumen than I do, but a little less experience, but nowhere near the math/stats tools of other academicians and sabermeticians.  Yet he too has contributed vast amounts of research to the field.  And we both know where our weaknesses lie.  If we run up against something because of those weaknesses, we either ask for help or we go in a different direction.

There is also little doubt in my mind that in general (and I hate to make generalizations like this, but I’ll make an exception), when it comes to real world, practical issues, like solving sports-related problems, being immersed in technical know-how, while often useful, can be an impediment (depending upon use and presentation) to solving such problems.

When an issue is complex with many variables going in all different directions, reducing that issue to one or two questions is ludicrous.  That is the way I see this debate and is why I have no posts above…


Page 1 of 1 pages


Name (required)
E-Mail (optional; WILL be published)
Website (optional)

<< Back to main


Latest...

COMMENTS

Feb 23 01:15
How much should minor leaguers make?

Feb 22 22:31
Not everything you learn in college is true (duh)…

Feb 22 17:27
Would you cut to a regularly scheduled show, if the main event ran long?

Feb 22 17:02
This week in chart failure

Feb 22 16:26
Who’s evaluating the 2011 forecasts this year?

Feb 22 12:21
MLB 2012 Odds: BetOnline

Feb 22 07:11
K minus BB differential or ratio?

Feb 22 01:18
Two players have the same stats: one is much younger.  Which one will be better next year?

Feb 21 14:49
Knuckleball pitchers: all of them

Feb 21 13:57
Proper compensation for Epstein?