How to Cover a Future Election

As this election winds to a close the locations of the final stops are predictable, Ohio, Pennsylvania, Florida, and a smattering of Missouri and North Carolina.

In the late sixth party system, elections are contested on a very predictable axis that runs somewhere between Columbus and Orlando. Cover Pittsburgh for a little drama, perhaps Raleigh, and call it a campaign. This will not due in the future. At the opening of the mid-sixth Ohio and Florida were equal prizes. Now Florida dwarfs Ohio. After the next reapportionment, Florida will nearly double Ohio in size. Of course this doesn’t matter, right? They are still the same size. Florida is getting more liberal, Ohio is more conservative, also the rest of the country exists.

The base of the early sixth party system supposed that a candidate could align the south and the west. For Nixon and Regan, this included California. Aside from Carter in 76, who dealigned the South, the base map made it difficult if not impossible to elect a democratic President. Clinton was, in many ways, Carter 2.

California, Oregon, and Washington slipped out of alignment with the mountain west and Texas while substantial the rest of the country became more consistent. A southern Democrat could pick off enough votes to win with a base of the upper midwest, pacific, and north east plus a few more.

This was a reporters dream. A stable, consistent map with a tie almost built in and the key swing states a quick flight from New York or Washington. After two close Bush elections and two not so close Obama elections the map has changed. Safe Republican states like Colorado, New Mexico, Nevada, and Virginia have shifted alignment. Arizona is a battle ground, the great city of Atlanta suddenly matters. The basic alignment of the South and the West is breaking down and we simply don’t know how to report it.

The 2020 reapportionment won’t take effect until the 2024 Presidential election. This won’t be a dramatic shift, just a few seats moving in either direction. Density models suggest that this will make big states bigger and continue the trend of flattening the rest of the country into a Presidential desert. Likely losers, Ohio, Pennsylvania, Michigan, Illinois, West Virginia, Rhode Island, Alabama, and possibly New York and Minnesota. Gains likely in Texas, Florida, North Carolina, Virginia, Colorado, California, Arizona and Oregon.

For the most part this shift does little to the overall dynamics of the election. What it does is amplify the importance of states shifting party alignment. A swing campaign in 12 vote Arizona is very different than a 17 vote Ohio. A safe, Democratic, 14 vote Virginia transforms the map, as does a Washington-Oregon axis that finally outweighs Pennsylvania.

The implications?

Coverage will be shifting west. The national media will need to learn to cover Arizona. The campaigns, at least those paying attention, will spend less time in Ohio, moving toward North Carolina, Atlanta, and Denver. The issues covered will be shifting, people in the Southwest have different environmental needs than those in the cold Rust Belt. Replace that axis in the eastern timezone with a line between PHX and ATL and you see America’s future.

Last Day Twitter Activity

Clinton is now passing 160k hashtag uses per day, Trump has recovered to the mid-20k region. This could of course be an indicator of dislike for Clinton, but what it does indicate is that Clinton is the focal point as we arrive at the final calculation.

In the larger scheme of things, if Clinton wins the Romney curse (more twitter activity being negative) will be broken.

About Aggregation, a plea for transparency

Yesterday, Silver’s basic model in Florida went haywire. Clinton had gained in the previous four polls and his probability of Clinton win decreased. Poll Delta was positive for Clinton with negative Model Alpha (coefficient of victory). This means that other factors were outweighing new polling information. These could legitimately include lagged weighting, although any world where positive polls for one candidate could decrease their probability is quite rare. When combined with earlier misreads on the election and process, such as arguing that positive polls for Trump were a bad sign for his candidacy people started to question the model. My best guess would be that this was some combination of lag-weighting and salting state polls with national data. Low Clinton delta combined with an artificial salt score and a lag weight for some older state polls could explain it.

By morning his models had reset to track closer with actual polling data. Clinton appears to be well positioned to win the election. Sam Wang, a political scientist and aggregator, has been clear that this is not a particularly turbulent election. The polls have been remarkably consistent as has his model.

Here are a few important take aways:

A. Aggregated poll methods were an improvement over the simplistic horse-race, especially when those polls are matched to the actual unit of measurement, state votes.

B. Good modeling methods are not dramatic. No. The model won’t wildly swing based on a new poll. If 100 polls show Timmy winning, one poll with Billy winning won’t switch it back to 50/50.

C. This process is out of phase with the news cycle. Once the news cycle starts driving model runs, the model likely will be pushed to the breaking point.

Poll aggregation, like many predictive methods, have instead become the fodder for news the day of the event. This is not optimal. The domain of near future news is interesting, but should be restricted to the best methods. Sometimes there is no news. That is perfectly alright. The Presidential election algorithm is pretty straight forward. Basically, this election has not been particularly volatile, so why run the model after every poll?

Another Note about Policy Debate

One of the factors that keeps my peer group returning to Silver is his history as a high school debater. I debated in the same era and back then a popular argument was called the politics disadvantage. You would argue that the work of Clinton to pass your bill would preclude the passage of some other bill, like the African Growth and Opportunity Act, Debt Relief, KEDO, PNTR with China, or the CTBT. These are all important weighty matters. To get to this point one needs to win a few specific arguments: uniqueness – X will pass now, link – your plan would require substantial capital to pass, internal link – decreased capital means no passage of the other issue, impact – X is really good.

Of course one could argue defensively that the disadvantage is silly for any number of reasons, offensively one can turn the initial link (the plan increases capital/is popular) the impact (CTBT is bad) or the internal link by answering the theory of politics posed by the disadvantage. Some teams carried entire files devoted to responding exclusively on the internal link level. So you would say, political capital (or losers-lose) they could then read cards about Presidential losses causing policy wins (losers-win) or if the disadvantage was reversed they would read (winners-lose) or (winners-win) to argue that the act of spending capital makes more capital. Teams could plug and play the different internal link scenarios. A team with great internal link blocks could mange their research burden and exposure to risk by debating a known quantity.

Unlike an affirmative response to a politics disadvantage, there are not plug and play political theories that can make polls go backwards.

Surges and Such

As we reach the close of the data gathering phase of this research project, day t-7 saw the Clinton-Trump primary activity levels reach an all time high for division at Clinton x70. Historically, high levels of conversation such as this are bad for a candidate, although in 2015 they were very good for Trump. Something to consider, I am juxtaposing #ImWithHer to #Trump2016.

What do I think is happening? Without social network analysis and other approaches I don’t have a definitive answer. My guess is that the comparative evaluation phase of the process is over and that people are taking one, last, hard look at Clinton. Since this point, Trump activity has been rising and Clinton activity may be starting to normalize.

In terms of timing and election modeling, what does this mean? First, early voting is well underway in many states, so connecting anything related to polls with exit polls with actual results is fishy at this point. Second, For horse-race style analysis, this makes the election much less fun as exit polls are better than normal polls. Oregon for example is largely finished voting, if I vote at this point I will need to take my ballot to an official drop site. Third, even without running Cohen’s D on the daily twitter returns, I can tell you that Comey’s letter had a dramatic effect on Twitter traffic. Trump activity decreased to campaign record lows while Clinton surged to campaign highs.

Overall, I am reminded of the problems with producing social facts. Alain Desrosières The Politics of Large Numbers (a must read for computational critical/cultural studies) calls for a consideration of the categories and processes in statistical analysis. In social network analysis we tend to consider the census of a data set rather than a sample. It is nice to know that we have an inflection point to study and that we can understand the entire dynamic system around it, but it is discomforting to know that it can be enacted so quickly.