Bucs Dugout: An SB Nation Community

Navigation: Jump to content areas:


Sports blogs for fans, by fans.
New Blog: RSL Soapbox for Real Salt Lake Fans!

The Next Moneyball Frontier: Freakonomics and Pitching

Turns out that Steven Levitt, the economist half of the Freakonomics duo, recently co-produced a paper examining whether or not pitchers (and football teams) follow game theory. The bottom line? “Pitchers appear to throw too many fastballs; football teams pass less than they should.”

"Game theory," in this case, refers to the economics principle that "[i]n the perfect world of game theory, two players locked in a zero-sum contest always make rational choices. They opt for the “minimax” solution — the set of plays that minimizes their maximum possible loss – and their play selection does not follow a predictable pattern that might give their opponent an edge." The researchers examined every MLB pitch from 2002 to 2006, grouped them by pitch type, then looked at OPS per pitch type. According to them, fastballs are overused (and pitchers are too predictable in general) to the tune of about 2 losses per team per year (of course, if every team corrected, you'd simply come to a new equilibrium with fewer runs scored).

The linked piece is just an abstract, so go ahead and read through it - it's interesting stuff, and definitely points to a possible Moneyball-like arbitrage situation, in which whatever team learns to exploit these findings first will gain a significant - and essentially cost-free - advantage.

0 recs  |  Comment 47 comments

Story-email Email Printer Print

Comments

Display:

fastballs are overused (and pitchers are too predictable in general)

Also known as the Papelbon principle.

by wickethewok on Oct 20, 2009 2:43 PM EDT reply actions   0 recs

There are significant flaws with his methodology...

…on both the baseball and the football halves of the paper. There’s a fairly good enumeration of the flaws in the baseball part here and here, and of the football part here.

An interesting study, with its knees broken by methodological flaws. I really wish that outside academics would run their work past baseball experts before publishing, because the problems were avoidable, and the work would be much more useful without them.

by Vlad on Oct 20, 2009 3:36 PM EDT reply actions   0 recs

Should've checked

From reading the abstract, it seemed like there might be methodological flaws, but I wasn’t going to guess at them.

Oh well.

by JRoth95 on Oct 20, 2009 3:52 PM EDT up reply actions   0 recs

It's still an interesting question...

…and the abstract’s still worth a read.

It’s just unfortunate that Leavitt didn’t do his due dilligence this time around. Look at all the problems he and Dubner got into on the global warming chapter of their new book…

by Vlad on Oct 20, 2009 4:32 PM EDT up reply actions   0 recs

Oh yes...

How terrible for Leavitt and Dubner that people are talking about their book. I’ll bet they just hate that.

by maguro on Oct 20, 2009 5:24 PM EDT up reply actions   0 recs

No such thing as bad publicity, I guess...

…but in an ideal world they probably would’ve preferred to have at least a few of the people talking about it saying nice things. When you write a book about economics, and then a Nobel-winning economist like Paul Krugman devotes multiple pieces to tearing your methodology apart, that’s got to sting a little.

by Vlad on Oct 21, 2009 10:15 AM EDT up reply actions   0 recs

They've been down this road before...

The most famous assertion of their first book – that abortion reduced crime – was based on faulty data and analysis, as explained here. But it hardly mattered since the idea sounded cool and kind of counter-intuitive, so they sold lots of books and became celebrity intellectuals anyway.

The only reason that Krugman hates their methodology is that they don’t fully adhere to the liberal conventional wisdom on AGW. If Levitt had reached the “correct” conclusions, Paul wouldn’t give a toss about their methodology.

In any case, the controversy is just more money in the bank for Levitt/Dubner whether their methodology is sound or not.

by maguro on Oct 21, 2009 10:33 AM EDT up reply actions   0 recs

I think you understate Krugman's commitment to accuracy.

He’s quite vocal in criticizing liberals when they make decisions that he doesn’t support – just look at his scathing criticism of Obama for trusting ex-Goldman personnel to sort out the financial mess, for example. He’s a pretty honest broker.

It’s also kind of strange to describe global warming as a “liberal” position. It’s the position of nearly every mainstream scientist worldwide – are they ALL “liberals”?

by Vlad on Oct 21, 2009 11:00 AM EDT up reply actions   0 recs

Not to get too far off-topic...

But “the liberal position” is that not only is global warming real, but that the only way to deal with it is to radically reduce CO2 output in indutrialized countries right now. Levitt suggests that this may not be the optimal method of dealing with global warming and that other mitigation strategies may be more effective.That’s what all the fuss is about.

Your statement that Krugman is an honest broker made me chuckle, but this is hardly the place to discuss that.

by maguro on Oct 21, 2009 11:12 AM EDT up reply actions   0 recs

I think that it would be more accurate to say...

…“best way”, rather than “only way”.

I think the “fuss” has more to do with them misunderstanding/misstating the views of others, particularly Ken Caldeira. They could have made similar points without stirring up the same level of controversy, if they’d focused less on contrarianism and more on accuracy.

by Vlad on Oct 21, 2009 11:31 AM EDT up reply actions   0 recs

This piece...

…does a good job of articulating the problem w/their approach: Link.

by Vlad on Oct 21, 2009 11:35 AM EDT up reply actions   0 recs

Well, this just brings us back to my original point, which is that controversy is feature and not a bug for Levitt/Dubner. They’re loving every minute of this.

by maguro on Oct 21, 2009 11:38 AM EDT up reply actions   0 recs

I think that in the abstract they love controversy...

…but in this particular case a bit more came out of the bottle than they were prepared to handle.

I know that I probably won’t bother reading the book as a result of it, where I probably would’ve if I hadn’t heard.

by Vlad on Oct 21, 2009 11:59 AM EDT up reply actions   0 recs

Levitt’s global warming guy discusses his ideas here if you’re interested.

Why not evaluate their argument directly instead of through an establishment-approved intermediary like Krugman?

by maguro on Oct 21, 2009 12:12 PM EDT up reply actions   0 recs

It's tough to evaluate it directly...

…since they disabled in-book searching on the Amazon link and served the guy with the excerpt on his blog with a takedown notice. As such, I have to look at analysis from people who DID see it, in order to tell what they were saying.

Your link is broken.

by Vlad on Oct 21, 2009 2:12 PM EDT up reply actions   0 recs

That one works.

From the linked piece by Myhrvold:

SuperFreakonomics is not a technical book on the science of global warming; it is a popular book that treats these details at a high level. And besides, the three little paragraphs on solar isn’t the main point of the chapter — it is a small side-show that illustrated a point: that I feel many people are too optimistic about plans to solve global warming.

That is, to me, the problem with the book in a nutshell. If you care enough about something to write about it, you should care enough to double- and triple-check all the little details, even if they’re tangential to the main point that you’re trying to make.

When you start saying that mistakes are OK because it’s just a popular book rather than a scholarly one, or that the mistakes don’t change the thrust of your point, you’re already badly off-track, and you’ve forfeited any claim of being anything more than an entertaining fiction.

by Vlad on Oct 21, 2009 4:44 PM EDT up reply actions   0 recs

Superfreakonomics
  1. on Amazon.com today. Levitt and Dubner are still heartbroken that Paul Krugman gave them all that free publicity.

by maguro on Oct 27, 2009 9:53 AM EDT up reply actions   0 recs

Well, it's a sequel to a bestseller.

I’m betting it would’ve sold a fair number of copies purely on the strength of that… so did the controversy help or hurt?

by Vlad on Oct 27, 2009 10:08 AM EDT up reply actions   0 recs

Yawn...

more conventional wisdom from the legacy media.

by maguro on Oct 27, 2009 10:20 AM EDT up reply actions   0 recs

Sure, if by "legacy media"...

…you mean “independent professional statisticians” analyzing their practices:

In a blind test, the AP gave temperature data to four independent statisticians and asked them to look for trends, without telling them what the numbers represented. The experts found no true temperature declines over time.

“If you look at the data and sort of cherry-pick a micro-trend within a bigger trend, that technique is particularly suspect,” said John Grego, a professor of statistics at the University of South Carolina.
But they’re probably all liberal academics, or something.

by Vlad on Oct 27, 2009 10:55 AM EDT up reply actions   0 recs

Well...

I guess if you’re going to use that logic and expand the data set, you could say house prices aren’t really declining because they’re still higher than than they were in 1970. Did you know that housing prices this decade are higher than any decade on record?

And unemployment? Not really rising. If you start measuring in 1980, see, there’s a clear downward trend.

by maguro on Oct 27, 2009 11:14 AM EDT up reply actions   0 recs

It's easy to know what they'd say...

…if you substitute conjecture about what they’d say for an actual response.

Personally, I’d tend to think that those two examples would produce different results, since the data sets are totally different.

by Vlad on Oct 27, 2009 1:59 PM EDT up reply actions   0 recs

But why did the first one sell?

Because it was controversial!

by maguro on Oct 27, 2009 10:18 AM EDT up reply actions   0 recs

I bought and read the first one...

…and I don’t remember nearly as much controversy there. Controversy certainly wasn’t why I picked it up – I just thought it looked interesting when I flipped through a copy.

by Vlad on Oct 27, 2009 10:57 AM EDT up reply actions   0 recs

It's interesting

The standard line about them is that the first book was drawn from Leavitt’s own research, and was therefore reasonably sound, while the new book is entirely drawn from others’ research, and so is less sound (they picked their cases, and didn’t really learn about the underlying facts). But it turns out that, even with a topic where Leavitt spent more time (and he claims to be a bit of a seamhead), he fucks it up.

by JRoth95 on Oct 20, 2009 10:39 PM EDT up reply actions   0 recs

I haven't been impressed with their baseball knowledge in the past.

They wrote a “Billy Beane sucks at Moneyball” piece on their blog a few years back, which was particularly clueless.

by Vlad on Oct 21, 2009 10:17 AM EDT up reply actions   0 recs

It is an interesting issue

One thing I thought was goofy in the discussion at Inside the Book was that a lot of it (the parts not picking apart methodological flaws, such as how OPS was used) revolved around the idea that the premise must be wrong, because there simply couldn’t be non-minimax conditions in baseball, because players would exploit them, and bring it to minimax. That’s simply in direct opposition to the entire Bill James/seamhead/advanced stat/Moneyball project. You’d be hooted off that board for saying that MLB teams use the sac bunt optimally (and always have), but somehow the idea crept in that pitchers vs. batters must not have a flawed equilibrium. Uh, guys? The whole point is that baseball is hidebound and tradition-laden, and that there are lots of flawed equilibria. If everything were optimized, then looking at advanced stats would be no more than somewhat technically interesting, like studying in detail the engineering of buildings that stand up for several decades, then get razed for economic reasons.

by JRoth95 on Oct 20, 2009 10:45 PM EDT reply actions   0 recs

Sorry

I didn’t really finish my thought because I got off on a rant. What’s interesting is that there probably is a flaw in strategy, whether being too predictable or relying too heavily on FBs, but it’s hard to find – the data is all there, but, even when you look at it right, there’s so much to correct for (or not) that it’s nebulous.

Plus, it seems that the margins are slim – if Levitt and Kovash only got to 2 games of 162 using formula that probably exaggerates, then you’re realistically looking at a win per year – and there are a million things that can gain (or lose) you a win per season. It’s tantalizing, frankly.

by JRoth95 on Oct 20, 2009 10:50 PM EDT up reply actions   0 recs

One of us is reading that thread incorrectly. To me it doesn’t read like any of them are assuming that the premise was wrong. They note a bunch of problems with the way the study was conducted (most notably using at-count instead of through-count to evaluate the pitch and using an incorrect version of linear weights [which they also incorrectly call OPS]) and then based on a table of data which uses through-count and the correct linear weights, they conclude that the study is egregiously wrong.

by epoc on Oct 20, 2009 11:48 PM EDT up reply actions   0 recs

No, I got that part

It’s at the end, with the back-and-forth with 2×2 matrices LeBron James and such where it seems to divert to the idea that suboptimal strategies are unpossible. Maybe they were actually talking at a different level, but that’s how I was understanding that portion of the discussion.

by JRoth95 on Oct 21, 2009 9:47 AM EDT up reply actions   0 recs

I think that what they're saying there...

…is that any truly substantial imbalances (of the sort that Levitt was claiming to have found) will tend to be self-correcting in fairly short order, not that they can’t exist at all.

I could be wrong, though.

by Vlad on Oct 21, 2009 10:21 AM EDT up reply actions   0 recs

It wasn’t clear to me whether the claim was that no significant imbalance was possible, or that an imbalance as large as the alleged one couldn’t exist. Frankly, I don’t think the proposed imbalance was so large as to be unimaginable, especially since it was describing a very familiar traditionalist dynamic. After all, doesn’t it seem a bit odd that having infinitely more data about pitch selection hasn’t changed – even a little bit – the ancient beliefs about how to pitch to batters? It’s improved scouting a bit, but only because it’s easier to manage the data that scouts have always had access to, like “Player X can’t hit the curve.” Meanwhile, sacrificing, stealing, and basic evaluation of AVG vs. OBP have all been radically reconsidered. I think it would be more surprising to learn that there are no exploitable imbalances, although they might be small.

by JRoth95 on Oct 21, 2009 1:19 PM EDT up reply actions   0 recs

Dunno.

Tango has done polling for a fans’ scouting report of MLB players for several years now, and the results have generally correlated pretty closely with advanced defensive metrics. If crowds of observers can collectively figure out optimal defensive analysis, why couldn’t they figure out optimal pitch distribution in the same way?

by Vlad on Oct 21, 2009 2:14 PM EDT up reply actions   0 recs

sort of off topic but

when he polls the fans, is it before or after he releases his metrics? there’s some arguments for bias if he’s more likely to poll fans who pay attention to defensive stats than not.

by johnnycuff on Oct 21, 2009 2:46 PM EDT up reply actions   0 recs

Think he starts right after the end of the regular season.

I just got an e-mail about it the other day. Not sure when he releases fielding #s, though.

Contamination from knowledge of #s is a potential issue, and he’s expressed concern about it in the past. But the earliest years of the poll precede the common availability of PBP defensive metrics, and IIRC they correlate pretty well as well. For whatever that’s worth.

by Vlad on Oct 21, 2009 4:48 PM EDT up reply actions   0 recs

I don't think they're comparable

If I’m understanding you correctly, Tango’s poll basically asks fans to rate and/or rank defensive players. Since any given defender is only involved in a handful of non-routine plays per game, there’s not that much to keep track of – the main question is whether fans are observant enough not to be fooled by flashy, but actually mediocre, defenders. In contrast, every game involves a couple hundred pitches in myriad combinations – an observant fan might notice something useful about high-profile PAs or count-situations (like the efficacy of Lincecum’s changeup as an out pitch), but to glean marginal effects over every one of those pitches? It just doesn’t seem plausible, especially because the fan then has to do the same thing 161 more times just to have a sense of how 1/15 of the MLB season went.

It’s a good argument that there aren’t gross imbalances, but no one says there are – even the original claim is on the order of a few percent – maybe 2 pitches per team per game

Anyway, why wouldn’t this argument also work to say that the sac bunt has always been used optimally? After all, it’s easy for even a casual fan to keep track of whether a player on first is more likely to score if he’s bunted over than if the subsequent batters swing away.

by JRoth95 on Oct 21, 2009 10:22 PM EDT up reply actions   0 recs

Well, optimal sac bunt usage...

…is a variable, insofar as the run expectancy charts change with the league’s run environment. Pitch distribution is much more of a fixed quantity.

by Vlad on Oct 22, 2009 9:11 AM EDT up reply actions   0 recs

The thought exercises with the 2×2 matrices were just confirming that the axioms of game theory apply to pitch sequence just as they do to any other game. Someone suggested that it was possible that the pitcher-batter matchup might not conform to game theory axioms, but when he went through the matrix he realized that he was wrong. The thought exercise was not about whether or not an imbalance actually exists, but about whether a pitcher and batter who were both playing optimally would achieve a perfect balance, as game theory suggests. It was a theoretical question, in other words, not a practical one about whether or not imbalance actually exists.

Also, the correct interpretation of the correct data, which they’re discussing there, does not mean that imbalances don’t exist. It just means that in general a balance exists. There are probably certain pitchers and batters who play sub-optimally and can be exploited, but you’re not going to learn that from a study of the general population (of players). As the thread suggests, all you learn from the general population of players is that on the average, pitchers and batters are in equilibrium with regards to the game theory aspect of pitch selection. This is in contrast to the Levitt paper which apparently claims that on the whole a huge imbalance exists because pitchers aren’t pitching optimally. I’d be tempted to say that we’d be right in being skeptical of Levitt’s claim, prima facie. After all, despite the imbalances you note w/r/t bunting and stealing &c., they don’t amount to anything close to 2 whole wins per year per team. The claim is pretty dubious when you think about it. (I understand that this is a standard problem people have with Levitt’s work: his outrageous results are literally incredible and due to his methodologies, often incorrect.)

by epoc on Oct 21, 2009 2:32 PM EDT up reply actions   0 recs

OK, thanks

Now I understand what they were getting at with that part of the thread.

That said, I’m still not sure that the claim is prima facie dubious (well, maybe the scale is), because of the way pitch selection is done (at least the way announcers assure me it’s done; what do I know?). The following aspects strike me as red flags of a situation in which the parties may not be optimizing:

- generally catchers make pitch selections, with input from pitchers. Neither one has access to data as they make selections, so they will be combining their recollection of scouting reports plus their own experiences with heuristics

- both pitchers and catchers are prone to be overly influenced by anecdote – “this guy beat me on a FB last time, I won’t let him do it again.” Retired pitchers talk about remembering the pitch for every HR they gave up – which is impressive, but ultimately useless, as the sequence largely determines outcome (outside of hanging curves and the like).

- competitive psychology pushes in predictable directions – “if he’s going to beat me, it’s going to be on my best pitch.” That’s not to say this is a terrible idea – everyone in the park knows Mariano’s bringing the cutter, yet he still usually succeeds – but that it’s an emotional, not a rational, decision

- near as I can tell, the basics of how to sequence pitches are unchanged since the end of the Deadball Era. As I said above, baseball is prone to being hidebound, and it strikes me as more likely that players and coaches are reluctant to change than that pitching strategy was perfected 90 years ago

I agree that existing practices are probably quite close to optimal; I just see reasons to doubt that they’re exactly optimal, and wouldn’t be surprised to see an analysis similar to this one reveal inefficiencies.

by JRoth95 on Oct 21, 2009 10:38 PM EDT up reply actions   0 recs

I agree, mostly. There’s probably a certain amount of sub-optimal play going on in batter v. pitcher matchups, but it all evens out over a large enough sample. The sample size of the study is a big reason why the results seem dubious, and of course the scale of the results (2 wins per year) is the particularly dubious claim. I agree that there are probably inefficiencies in individual matchups, but I’d guess they’re pretty negligible. Sometimes you see a pitcher throw fastball after fastball because he’s “lost confidence” in his breaking ball or whatever. That’s an obvious inefficiency that’s going to get exploited, but it’s not endemic, and the pitcher will correct it in his next outing. I doubt there are significant systemic inefficiencies in any pitcher or batter’s approach to an at-bat, simply because reasonably intelligent adults who spend a lot of time playing a game will learn to play the game close to optimally just through experience. At any rate, all we can say definitively is that the data we have suggests that pitchers and batters have basically found equilibrium in regard to the game theory aspects of an at-bat.

by epoc on Oct 22, 2009 12:36 AM EDT up reply actions   0 recs

Indeed

It just occurred to me that pitchers (and, to a lesser extent, hitters) have access to information that an outside analyst never could – they know what pitches were “mistakes.” It’s not that this would trump analysis, but that it’s a critical layer to understanding strategy and what actually happens in a PA.

I’d imagine that an analyst would argue that mistake pitches cancel out once you have enough PAs, but I’m not sure it works like that. For each pitcher, different pitches are more likely to end up mistakes – a mislocated FB may be more or less likely than a hanging BP – and that will be a huge part of sequencing pitches. And since mistake pitches are (I can only assume) more likely to be hit hard, they’ll have a bigger impact on outcome (even when you use the correct metrics). IOW, hits happen on good pitches and mistake pitches alike (and hitters miss mistakes as well); but mistakes, when hit, are more likely to turn into XBHs. But “mistake pitch” isn’t a category in the data. And they happen to every pitcher, in every game.

Now, again, it’s possible that, in a large sample, that cancels out (I’m not sure it does, but it may). However, on the level of the individual pitcher optimizing his pitch selection, that knowledge – Pitch X is likely to be a mistake, Batter can’t hit Pitch X, but he can hit Mistake Pitch X – all goes into the sequence in a way that’s invisible to an outside analyst. So while the gross data may not be skewed by the mistakes, the data got the way it did due to the mistakes, and the pitchers know all about it.

by JRoth95 on Oct 23, 2009 2:28 PM EDT up reply actions   0 recs

All I know about economists ...

is that every time a government report comes out, economists are always surprised that it’s a) way over their estimates or b) way under their estimates.

They’re sort of like sports columnists that way: Every time they’re wrong they get to go, “Huh.” And move on to the next topic.

by bucdaddy on Oct 21, 2009 10:27 AM EDT reply actions   0 recs

I remember the use of statistics being dissected

when that book first came out. It was by statisticians and went well beyond the abortion controversy.

by ol Pete on Oct 23, 2009 11:16 AM EDT reply actions   0 recs

Comments For This Post Are Closed


User Tools

Welcome to the SB Nation blog about Pittsburgh Pirates.
Start posting about the Pirates »

Join SB Nation and dive into communities focused on all your favorite teams.

FanPosts

Community blog posts and discussion.

Recent FanPosts

101_0170_small
40-Man Moves
18470r_small
Rule 5 possibilities
20080124sgrammy_330_small
Small Market Teams Pocketing Spare Change
Small
Jeebus Cracker
Small
McCutchen's defense
Small
Roberto calling
20090612mf_fleury_cup_500hp_small
Pirates would trade Doumit!
Leo4_small
John Sickels' Comments on Donnie Veal
Pitt20_small
LaRoche to the Phils?
Bloody_mary_small
Ohlendorf Blogging: USDA can't keep him "down on the farm."

+ New FanPost All FanPosts >

FanShots

Quick hits of video, photos, quotes, chats, links and lists that you find around the web.

Recent FanShots

Mariners Eyeing Doumit
Tabata 5-5 w/3 Ribbies - Hitting .390 @ EOS
Bay rejects $60M over 4 years
"[Chase D'Arnaud] does everything well enough," said an American League...
Pirates hire Steve Williams as Major League scout
Jim Tracy, Baby! NL Manager of the Year!
2009 Minor League Six-Year Free Agent Listing
NL Rookie of the Year: Chris Coghlan
2010 CHONE Projections
2010 UZR Projections

+ New FanShot All FanShots >

SPONSORS


Managers

Charlie_small Charlie