Maybe Nate Silver Was Wrong

By James Kwak

I think Nate Silver does a good job aggregating polls to make meaningful quantitative predictions about upcoming elections. But as he said himself shortly before the election, if the polls he relies on are systematically biased, then his forecasts are going to be off.* Many people have noted that Silver (and other quantitative poll aggregators like Sam Wang) correctly predicted an Obama victory and the outcomes in most if not all states.

But the fact remains that Obama did modestly better than the polls, and hence the poll aggregators, expected (not to mention than the Romney campaign expected). We shouldn’t read too much into this, as even where Obama significantly overperformed—like in Iowa, where Silver forecast a 3.2 percentage point victory and the actual came in at 5.7 points—the results were within the confidence intervals. But it’s also possible that the polls really were systematically biased, only they were biased against Obama—not against Romney, as conservative pundits were claiming in the last days.

Why would that be? One possibility is turnout. Many polls incorporate a likely voter model, which weights the sample to try to approximate the expected composition of the electorate. Gallup’s problem, I believe I read somewhere, was that they expected the electorate to be whiter than it turned out to be. In retrospect, we have anecdotal evidence that the electorate was younger and less white than many people (such as Paul Ryan) expected. And one common explanation is that this was due to the strength of the Obama campaign’s get-out-the-vote operation.

One piece of evidence for this theory is that Obama’s performance relative to expectations was especially good in the swing states, where you would expect him to have devoted most of his GOTV efforts.** Of the nine major swing states (in order of competitiveness, according to Silver, Florida, North Carolina, Virginia, Colorado, Iowa, New Hampshire, Ohio, Nevada, and Wisconsin), Obama beat Silver’s poll-based forecast in seven; on average, including the two states where he underperformed, he beat the polls by 1.1 percentage points. Of the other forty-one states and the District of Columbia, by contrast, Obama overperformed in only twenty-three, or just over half, and on average he beat the polls by only 0.4 percentage points.
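For readers who want to check the arithmetic, here is a minimal sketch that uses only the figures quoted above. It assumes each quoted average is a simple per-state mean (not weighted by population or votes cast); the names `SWING`, `OTHER`, `share_beating_forecast`, and `pooled_mean_gap` are made up for illustration.

```python
# Figures quoted in the post; no new data is introduced here.
SWING = {"states": 9, "beat_forecast": 7, "mean_gap": 1.1}
OTHER = {"states": 42, "beat_forecast": 23, "mean_gap": 0.4}  # 41 states plus DC


def share_beating_forecast(group):
    """Fraction of states in the group where Obama beat the forecast."""
    return group["beat_forecast"] / group["states"]


def pooled_mean_gap(*groups):
    """Mean gap across all groups, weighting each group by its state count.

    Assumption: each group's quoted average is a simple per-state mean.
    """
    total_states = sum(g["states"] for g in groups)
    weighted_sum = sum(g["states"] * g["mean_gap"] for g in groups)
    return weighted_sum / total_states


if __name__ == "__main__":
    print(f"Swing states beating forecast: {share_beating_forecast(SWING):.1%}")
    print(f"Other states beating forecast: {share_beating_forecast(OTHER):.1%}")
    print(f"Pooled mean gap (points): {pooled_mean_gap(SWING, OTHER):.2f}")
```

Under that assumption, the pooled figure comes out at roughly half a percentage point across all fifty-one contests, with the swing states pulling the average up.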

Now, this is not something that Nate Silver was supposed to predict. Just before the election, his forecast is based almost entirely on the polls. And he freezes his model several months before the election, precisely because he doesn’t want it to be influenced by subjective judgments.

But this is exactly the kind of thing that journalists (and their subspecies known as pundits) are supposed to predict. That Obama would have the best turnout operation ever is not something that Nate Silver could predict in January. But all those people who don’t believe in polls, who think that old-fashioned beat reporting and gut instinct are the way to predict elections, could have done the work to figure out that Obama had the best turnout operation ever. Based on that research, the pundits and the political experts could then have said, “I expect Obama will do better than the polls, because the current generation of likely voter models does not take into account the strength of Obama’s turnout operation.” And that would have added value to what Silver was doing.

In other words, this was an election where old-fashioned reporting and punditry could have provided some insight into the outcome. But they didn’t, because the pundits were too busy spinning false stories about momentum (which were provably false, since momentum does show up in polls) instead of looking for relevant facts.

In theory, poll aggregation should not be the last word in election forecasting; there should be a place for political expertise. But with the “experts” we’ve got now, it is the last word.

* Does anyone know why all posts from the first six days of November have vanished from Silver’s blog?

** You would also expect him to have devoted most of his other efforts in those states, but those activities, such as TV advertising, should have shown up in the polls before Election Day.

  5. @James: I can see all the posts from early November in my rss feed, with links that open the full articles, even though the articles aren’t listed on the nytimes page you linked to. Here’s the post from November 6. You can get to the rest with repeated use of the “previous” links at the bottom.

  6. > Does anyone know why all posts from the first six days of November have vanished from Silver’s blog?

    The first post that doesn’t show up is the election liveblog, so my guess is the listing code chokes on that entry and stops. You can still see earlier posts if you visit the url for each individual day:


  7. In other countries, when polls diverge from election results, we call that a priori evidence of fraud.

    But only for other countries.

  8. Nemo: you really don’t understand how this stuff works very well, do you? I mean, what part of ‘within the confidence interval’ is unclear?

  9. The posts are still there. Your link filters by month, yes, but there’s still a result limit per page. At the bottom there’s a link to “older posts” that takes you to the rest of the November posts.

  11. I suspect I understand this stuff a lot better than you think. (Dr. Kwak is not the only person who attended the Math Olympiad Program.)

    If everything is “within the confidence interval”, then the entire post is pointless. On the other hand, if there is something to explain — which I suppose there is, since Dr. Kwak spent half a dozen paragraphs here on hypothetical explanations — then the simplest explanation is the old-fashioned one.

    That being said, as much as I admire the guy and how accurate his methods seemed to turn out (better than anyone I’m aware of), I still have a hard time respecting his method. Why?? After saying all the above and then I say I don’t respect his method?? How can you really respect any stats method that isn’t making first hand/eye-witness efforts to assure the quality of the sampling data (I mean the samples themselves) ??? The answer is, you cannot.

  14. “In theory, poll aggregation should not be the last word in election forecasting; there should be a place for political expertise.”

    It really is amusing how closely the Nate Silver Versus the Pundits discussion mirrors the analogous discussion that was going on about baseball six or seven years ago.


  15. But Obama’s GOTV effort was not as effective as in 2008. John McCain got more votes than Romney. From my memory, turnout was above expectation in 2008 also. It’s funny that polls relied largely on assumptions about turnout rates, when they could have been polling specifically about turnout. I don’t see any other way to analytically incorporate turnout – either pro-Obama or pro-Romney, the pre-election turnout arguments were just stories. I donated to Obama and was polled by phone at least 4 times – I imagine by different poll organizations because of the nature of the questions. I thought there had to be a problem with sampling if I was being targeted so frequently.

  16. Is this perhaps a problem similar to doing better than the market indices? While some people can, it is incredibly hard to keep biases from creeping in.

  17. Quite right, Nemo. Fraud is the simplest explanation.

    Here’s a good example of the merits of simplicity: A fellow by the name of Lamarck had a very simple explanation for the evolution of species. For example, Lamarck theorized that giraffes evolved to have long necks by stretching higher into the trees to reach fruit. This lengthened their necks, an acquired trait they passed on to their offspring who did the same over many generations until we have the animal we know today. Very simple and a bit off la Marc, if you will. Most of us know that Darwin’s more complex theory is correct.

  18. Ogden Wernstrom

    Per Nemo: “In other countries, when polls diverge from election results, we call that a priori evidence of fraud.”

    Nemo, you left out the word “exit”, as in “exit polls”, which are normally very reliable. I have not found Nate Silver’s analysis of exit polls yet.

    “But only for other countries.”

    True, that. I remember an election where one territory – which happened to have one candidate’s brother holding executive office in that region – had exit polls that said one thing and official results that said another. Then some activist judges took sides so the scandal would become moot. If we had seen that in a third-world country, we would have known the election results were fraudulent.

    “I suspect I understand this stuff a lot better than you think. (Dr. Kwak is not the only person who attended the Math Olympiad Program.)”

    It should be simple to point out any mistakes in the arithmetic. Go ahead.

  19. Fraud is NOT the simplest explanation for inaccurate predictions. The simplest explanation for inaccurate predictions is “The predictions were inaccurate.” It’s then the predictor’s job to figure out why, and refine his predictions for next time.

    Fraud WOULD BE the simplest explanation when (for example) the total number of votes in a precinct is larger than the number of adults in the precinct.

  20. As pointed out by Ogden Wernstrom, unexpected divergence between results and polling implying potential fraud applies only to exit polling. I am not sure why they didn’t teach this in whatever Math Olympiad Nemo is talking about, but that is because a good exit poll is essentially a random sample of the votes already cast, not a sample of people who may or may not vote for reasons that are difficult to model. As Kwak noted, just how much better Obama’s GOTV effort was than Romney’s might not have been clear to many observers, especially since the Romney campaign had plowed a lot of resources into a very sophisticated (but new and less well tested) computerized GOTV system called ORCA. Multiple accounts in the press pointed out that the system performed very poorly on election day, so in swing states you had the Obama GOTV effort versus a poorly functioning and apparently buggy new system. That had to be worth something.

  21. There were two other factors at work that may have had the effect of raising turnout above expectations based on the models:

    1) intense media coverage of voter suppression efforts in some of the swing states for weeks and months leading up to the election angered African-American and Latino voters enough to mobilize them in greater numbers than predicted,

    2) many of the models are based on surveys that under-poll cell-only voters, who tend to be young and minority voters.

  23. I’m currently reading Nate’s book and I admire that he realizes he has to continually assess the effectiveness of his methods; he’s not dogmatic. It’s easy to blame Obama’s turnout operation as the reason he won after the fact, but maybe the news organizations that the old-fashioned beat reporters work for were so drunk on all the Republican ad money, they wouldn’t have been interested in old-fashioned get-out-the-vote drives. Don’t kill the golden goose of the media (particularly, television) industry.

  24. I used to avoid watching MSNBC because I thought they’d just reinforce my opinion, like Fox does on the Right. But election coverage on the other cable and network channels was (mostly) so poor that I turned to MSNBC. Those “kids” get it. They followed Nate and Sam Wang closely and they did their homework. They love politics but aren’t such junkies that they ignore issues and governance. Add heavyweights like Andrea Mitchell, Chris Matthews, Steve Schmidt among others, and it’s a thinking person’s news source. The Right will never match this. This take is nuanced and complex. The Right must have a simplified version of the world spoon-fed to them. That’s why it’s so easy to keep them corralled in the bubble.

  25. I am sorry to say you might be right about the value of ordinary punditry. I am still hoping that we can smash the life out of mindless punditry (99%) through a greater understanding of polling and statistics.

  26. The criticism of the media failing to do its job seems to miss or not pay enough attention to the political economy of the mainstream media in the U.S., where (political, economics, and international affairs) reporters are responding principally to two incentives: 1) Broadcast revenues 2) Access to policymakers. This causes them to make less accurate statements than their membership in The Fourth Estate presumably obliges. With all mainstream reporters more or less operating within this environment, it seems unreasonable for you to expect them to augment Nate Silver’s analysis (I doubt reporters care half as much about Nate Silver as the academically/mathematically inclined like myself do) in the fairly technical manner you describe. Even then, maybe a reporter did do exactly what you’re wishing they would have. But would Nate Silver have responded? Would the NYT have wanted him to? Perhaps if Nate Silver cared about accurately predicting elections more than anything else, he would join academia.


    Folks. NEMO is on to something here. This type of statistical outcome (the high end of the confidence interval in so many states) is the equivalent of a statistical anomaly. Would the outcome of the election be changed if some of the less than reputable actions of GOTV had not occurred? No. The real story is that the proposed fraud (discovered by the same statisticians that called the election) was not necessary. Obama won this straight out and did not need to bend the rules. The Republicans are in some serious trouble here from an electoral college perspective. They do not want to admit that we are living through the modern version of the late 1920s and 1930s. Can you say 16 straight years of Democratic presidencies? That is where this is headed. Only to end because of the high probability of a negative outcome (would be the same of either party).

  28. Despite the incentives of ad revenue and access to policymakers, there are some reporters and analysts who manage to swim against the tide of conventional wisdom and maintain a significant degree of independence from the groupspeak of Washington DC. Matt Taibbi and James Fallows come immediately to mind.

  29. Nonsense. The problem is with the survey methodology: the under-sampling of Obama voters due to the use of landline-only and robocalling, the likely voter models, and the fact that more partisan GOP polls were run than partisan Dem polls, skewing the averages in some swing states.

    If you believe there was significant fraud favoring Obama, cough up the evidence. This is a well-worn GOP canard. Accuse the Dems of the very practices you’re using.

    The evidence is overwhelming. It was the GOP that attempted to rig the election through voter suppression tactics of many types in key swing states. In PA we have it on tape. In Ohio, Colorado and Florida, it’s plain as day. Even in my deep red Texas there were attempts to purge rolls of allegedly dead but very-much-living voters in heavily Dem precincts.

    Sorry, Charlie. The election just wasn’t close enough to steal.

