Final Predictions

Final Predictions

Election day is this Tuesday, November 8th. In this blog, I will first explain each of the components of my predictive election model and then offer my final predictions for this election.

National Model

The first component of my model is an Economic Predictive model. One of the challenges, which I went over in previous Blog Posts, was that fitting a model using Economic data was difficult. However, the best model I was able to fit was using the GDPC1. The GDPC1, or Real Gross Domestic Product, is a measure of the actual value of GDP, adjusted for inflation. I am using the Quarter Cycle 5, meaning the first Quarter of the election year. In the table below, I have shown both the in-sample and out of-sample predictivty of the measure. As you can see, the R-Squared value is 0.4472, which is not completely terrible, but also is not necessarily as a high of an R-squared as we may want. This gives us an idea that this measure might not be the best part of the model. As far as the out-of sample predictivty, I tested the model on the 2018 House election. The model predicts the House incumbent two party, which in 2018 was the Republican party,to be 47. The actual result of the 2018 election was the Republicans picking up 46% of the two party vote share. The out of sample test shows the model in the range of where we want to be, so again it indicates that this is not necessarilly a bad model.
Finally, we can look at the predictions for 2022, which predict the Democrats to only get 44% of the two part vote share, but with a very wide interal ranging from 36 to 52, indicating that there is large uncertainty.

## 
## Economic Fundamentals Prediction (House Incumbent Predicted Voteshare)
## =======================================================
##                         Prediction     Lower    Upper  
## -------------------------------------------------------
## Prediction           43.6582511339047 35.59525 51.72125
## 2018 Test Prediction 46.5991276753862 39.38692 53.81134
## R-Squared                 0.4472                       
## -------------------------------------------------------


The next aspect of the model is the Generic Ballot. For the Generic Ballot, I made a model which predicts the Democratic predicted Democrat’s predicted two-party vote share based on the mean of Generic Poll results for Democrats in the last 100 days of the election. As we can see, the R-Squared is 0.612, which is higher than the Economic part of the model and again it indicates that it is a fairly decent model. The model shows a statistically significant coefficient of around 0.7, indicating that for every increase in the mean generic ballot poll in a year by a percent, the Democratic predicted voteshare increases by 0.7 percent, which seems to make sense. In terms of out of sample accuracy, I again tested the model on the 2018 election. The model predicted the Democrat’s getting 52% of the two-party voteshare, which again shows a pretty good model considering that the Democrat’s actually got around 54% of the two-party vote share. Being within 2% as we are for both of the models so far is really good for out-of-sample accuracy.
We can now look at the 2022 predictions which shows the Democrats getting 49% of the vote. This doesn’t exactly agree with the Economic model, but this makes sense because the Generic Ballot has been showing the Democrats doing better than a traditional fundamentals model might expect. The interval of this prediction is also less wide with a range of 45 to 53.

## [1] 45.025
## [1] 40.3
## 
## Generic Ballot Prediction (Democratic Predicted Voteshare)
## =======================================================
##                         Prediction     Lower    Upper  
## -------------------------------------------------------
## Prediction           49.1537529352792 45.38579 52.92172
## 2018 Test Prediction 51.847312911055  48.08031 55.61432
## R-Squared                 0.5931                       
## -------------------------------------------------------


Finally, we can look at the Expert Ratings part of the model. This uses Expert Predictions from the most recent election cycles to predict Democratic voteshare in each district that we have predictions for and then takes a mean to get the National Voteshare. While we have a lot less years for this model, we do have a lot of datapoints because there are so many races to predict in each cycle. As we can see, the R-Squared is the highest we have seen at 0.7023. This indicates a good model. The model shows a statistically significant coefficient on expert predictions of around -3, showing that as a rating changes by 1 away from Democrats (Likely to Lean D for example), the predicted voteshare decreases by around 3%, which also makes sense. The test prediction for 2018, showed Democrats getting 48% of the vote, which was a little bit further off than the first two models were. However, the actual Democrat voteshare was in the range of the interval and was not too far off, so it is not an awful out of sample model. It still however indicates that maybe the higher R-square of the model is due to a little bit more overfitting than the first two.


The prediction for 2022 shows the Democrats getting about 49% of the voteshare, with a wide interval ranging from 42 to 55.

## 
## Expert Prediction (Democratic Predicted Voteshare)
## =======================================================================
##                         Prediction         Lower            Upper      
## -----------------------------------------------------------------------
## Prediction           48.8033050906412 42.3885113565181 55.2180988247643
## 2018 Test Prediction 47.9929447562536 42.961060427511  55.9793564453987
## R-Squared                 0.7023                                       
## -----------------------------------------------------------------------


Now that I have explained and justified each piece of my model, I can offer my final predictions. To weight each part of the model, I chose to make the Expert Predictions and Generic Ballot worth a little bit more since they seemed beeter models to me, so they are each weighted at 0.35 and the Economic Model is weighted at 0.2 The last 0.1 of the weight is not a model but a measure of the mean Cook PVI of the country just to offer a measure of where the country is on average.
As you can see in the table, the overall model predicts Democrats getting 48% of the national vote, with an interval from 43% to 53%. This obviously shows the Democrats losing as most predictions would indicate at this time.

## 
## Overall Prediction (Dem Predicted Voteshare)
## ========================================================================
##                Weight    Prediction         Lower            Upper      
## ------------------------------------------------------------------------
## Overall          1    48.037080305968  42.8605152448963 53.2136463587824
## Generic Ballot  0.35  49.1537529352792     45.38579         52.92172    
## Expert          0.35  48.8033050906412 42.3885113565181 55.2180988247643
## Economic        0.2   43.6582511339047     35.59525         51.7215     
## Cook PVI        0.1   50.2045977011494 50.2045977011494 50.2045977011494
## ------------------------------------------------------------------------

Close Districts


Finally, I did a little bit of work to predict District level results for close districts. The problem with close districts is that one of our models, the Generic Ballot, is harder to do on a district level because districts have very poor and unpredictable polling. The district level polling model is not very effective, but I did want to use district polls where possible. The predictions that you see below for all of close districts uses the same economic model, the same expert predictions model, but just obviously predictions for a given district, and where possible it uses a district polling model. For districts where polling was not available, I simply increased the weight of the expert model. Also, I have increased the weight of the PVI, because for just one district the PVI is more important as it tells us the exact partisanship of a district. Therefore the weights for these districts are as follows. For districts with polling: 0.3 PVI + 0.3 Expert + 0.2 Polling + 0.2 Economic. For districts without enough polling: 0.3 PVI + 0.5 Expert + 0.2 Economic. This model is harder to make and likely a lot less useful or accurate than the National Model, which is my primary model as I previously justified.

## 
## District Predictions of Close Races
## ==============================================
## District Winning Party Dem Voteshare (2-Party)
## ----------------------------------------------
## AZ-2           R          45.7470851582591    
## CA-22          R          48.7924297143008    
## CA-27          R          48.7547837526603    
## CO-8           R          47.7627564350919    
## IA-3           R          47.5058427211595    
## IL-17          R          48.7917663721384    
## KS-3           R          48.7787975251267    
## MD-6           D           50.468004448844    
## ME-2           R          49.0432976863614    
## MI-10          R          46.4641210999723    
## MI-3           D          50.2224333767329    
## NC-13          R          47.6695632939094    
## NJ-7           R          47.5439896178719    
## NM-2           R          48.3163292900566    
## NV-3           R          48.9933306670497    
## NY-18          R           49.78956801633     
## NY-19          R          47.9841852168202    
## NY-1           R          45.8281489154832    
## NY-22          R          47.5349576438108    
## NY-3           D          50.4847897120962    
## OR-5           R          48.5908127123744    
## OR-6           R          49.4188689896569    
## PA-17          R          48.5598612823508    
## PA-7           R          47.7408442976346    
## PA-8           R          47.9598612823508    
## RI-2           R          48.9277947086825    
## TX-15          R          46.4970851582591    
## TX-34          D          50.3149388120414    
## VA-2           R          47.7242743580655    
## ----------------------------------------------