Examining voting turnout is the theme for this week’s blog, and previous turnout data will be utilized in predicting elections. At first glance, turnout is not an obvious predicting variable when it comes to forecasting elections. After all, one might believe increased turnout in a district, for example, just means that there will be the same vote share, and a proportional increase in each party’s total vote count. However, it could be an important factor in toss-up districts.
Speaker of the House Nancy Pelosi seems to believe in the notion that Democrats benefit from higher voter turnout, stating in October 2022, “The public is with the Democrats,” indicating her belief that more voters fall in line with her party. This is a phenomenon that I will explore in this blog. First I will examine how increased district turnout affects the Democratic vote share, then I will display it in tandem with incumbency effects. Lastly, I will analyze turnout in the context of the most recent midterm elections in 2018. All this analysis will feed into this week’s updated election forecast, with using 2018 turnout levels as stand-in data for 2022 (until I gather better/more accurate and predictive turnout data for 2022).
Is Speaker Pelosi Right
After plotting how turnout data on the congressional district level, it is evident that increased turnout has minimal benefits for the Democratic candidate. Our plot reveals that the relationship between the variables is not strong, let alone even related to an extent. This leads me to believe the notion that increased turnout does not benefit the Democrats, at least to the extent that Speaker Pelosi stated (as it almost being a campaign stratgey in and of itself).
The Incumbency Advantage Revisited
After plotting the relationship between turnout and the Democratic two-party vote share on a congressional district (and hardly seeing any relationship), I decided to see how this looks on a more granular level, highlighting how turnout affects incumbents and challengers alike. This resulted from the notion that increased turnout may indeed benefit an incumbent’s challenger, as evidenced by Professor Ryan D. Enos in Gov 1347 lecture. He displayed how 2018 congressional challengers who won boasted higher median voter turnout than an incumbent winner.
After visualizing this across congressional elections from 2012-2020, we see how each scenario is centered around the same turnout level of 54%, which refutes Professor Ryan D. Enos’ point. He examined one specific cycle, however, which may include other factors not seen through this analysis: referendum on the incumbent president, “increased get out the vote” efforts, etc.
What is most poignant in this visualization is just how rare an incumbent loses. This incumbency advantage is stark and inclines me to include this variable in this week’s forecast.
Examining Turnout in the Most Recent Midterm Elections
Now, as we examine turnout across districts in the 2018 cycle, the map below displays results that are unconvincing of how much turnout is correlated with races predicted to be competitive. There is no indication that predicted closer races experienced higher voter turnout, especially when you look at noncompetitive (solid R/Solid D) districts in Oregon and Colorado boasting higher turnout levels and many toss-up races boasting lower turnout.
The plot below further compares how Democratic vote share correlates with voter turnout, according to different expert ratings. And potentially to Speaker Nancy Pelosi’s surprise, increased turnout in the more competitive 2018 races actually hurt Democrats.
The map above and the plot below continue to sour my idea that turnout can become a reliable predictor variable in election forecasts. However, I believe it is worth examining in a model to see if the additional predictor variable improves my forecast (because my model has taken a turn for the worse in recent weeks).
Small Note on My Models Below
Due to statistical limitations in a model, I am required to create two of them: One where the incumbency district is included with all other variables, and another model where it is not included. This is because my turnout data only goes back to 2012, forcing each district to have a max 5 observations (2012 - 2020). This is significant because some congressmen and congresswomen have stayed in office all throughout that time period, preventing any variation in the incumbency variable over that time period (and thus the model is not able to see how a change in incumbency affects the resulting Democratic vote share). Therefore, my model with the incumbency variable included contains districts that have experienced a change in their representative, and the other model without an incumbency variable contains districts where the elected representative has not left office.
Limitations of My Model
When it comes to turnout, I experienced trouble in finding where it shows promise in predicting elections. Nonetheless, I progressed to including it into my model along with my incumbency variable so I could see how it changes from last week. And, maybe unsurprisingly, my models seemed to get worse. While the incumbency variable seems to check out as a predictor variable, the turnout variable does not seem to provide much value at all.
What could maybe explain the phenomenon that turnout does not matter when it comes to elections is that most campaigns actually counteract their opponents’ ground game since many campaigns have evolved to an advanced state, effectively eliminating turnout as an advantage for one side. In fact, scholars Joshua L. Kalla and David E. Broockman have detailed that persuasive effects rarely, if at all, emerge from ground campaigning, arguing that persuasion resulting from ground campaigning appears only in circumstances of compounding effects from early on in a race and campaigns investing heavily in an extremely unpopular position taken by their candidate. Thus, it does not seem that an advantage can be easily gained by ground campaigning, and maybe it is a zero-sum effort.
So after the past three weeks of decreasing confidence in my model, when I added average expert rating, incumbency, and turnout variables (all shown in my models), I believe it is time to return to more fundamental variables (like real disposable income change, national generic ballot margins, and possibly incumbency too). This decision of mine rests simply on the reality that I am working with data limited in scope. This reality means my models are sometimes working off only 2-3 elections/observations per district, which is far from ideal. Thus, going forward, I believe I am going to include only variables that include data from long ago, allowing for more observations, and data that is possibly more related to the national environment, so as to not depend upon unreliable district data (which could fluctuate as the quality varies district-to-district).
Notes:
This blog is part of a series of articles meant to progressively understand how election data can be used to predict future outcomes. I will add to this site on a weekly basis under the direction of Professor Ryan D. Enos. In the run-up to the midterm elections on November 8, 2022, I will draw on all I have learned regarding what best forecasts election results, and I will predict the outcome of the NE-02 U.S. Congressional Election.