Open data and code behind Rethink Priorities's analysis of its 2020 Benchmarking Poll.
This poll was run by Rethink Priorities via the Prolific platform on 20 October 2020. After dropping for data quality checks, it includes 4933 people. People from California were intentionally upsampled, so the raw data includes 1022 people from California and 3911 people from other US states.
Analyses are done on weighted data that corrects this upsampling and corrects other ways that Prolific may not be representative. The weighted N is 1867, suggesting a raw margin of error of +/-2pts at 95% confidence.
All analysis was done before the election, so will not be biased by actual results.
- US national turnout is estimated to be 62.7% of the voting eligible population (80% NMCI: 57.4% to 67.0%). (Note that NMCIs are naive modeled confidence intervals that project actual election results using this poll alone and historical information about polling accuracy, but not other polls or other non-polling informaiton like fundamentals. Thus NMCI are not just the CIs implied by the raw polling margin of error. This is meant to imply that there is an 80% chance that the true value as observed on election day will fall between the two 80% NMCI values.)
- 80.4% of eligible US voters are estimated to be registered (80% NMCI: 75.5% to 85.3%).
- Biden at +9.8 support nationally (80% NMCI: Biden +5.3 to Biden +14.4). These naive modeled NMCIs alone suggest a 86.1% chance Biden wins the Presidency (again, taking into account this poll alone and information about historical accuracy, not taking into account any other polls or any non-polling fundamentals).
- Dems are at +5.3 nationally in the Generic Congressional Ballot (80% NMCI: Reps +2.6 to Dems +13.1), suggesting a 71.7% chance that Dems keep the House.
- Biden is at +2.3 in Texas (80% NMCI: Trump +4.0 to Biden +8.7), suggesting a 62.1% chance that Biden wins Texas.
- Biden is at +28.9 in California (80% NMCI: Biden +22.5 to Biden +35.3), suggesting a >99.9% chance that Biden wins California.
- Cornyn (R) is at +2.5 in the Texas Senate race (80% NMCI: Cornyn +11.3 to Hegar (D) +6.3), suggesting a 59.5% chance that Cornyn keeps his Texas Senate seat.
- CA Prop 14 (Stem Cell) is at 55.6% support (80% NMCI: 45.2 to 66.0), suggesting a 67.4% chance of passing.
- CA Prop 15 (Property Tax) is at 56.5% support (80% NMCI: 46.7 to 66.5), suggesting a 71.1% chance of passing.
- CA Prop 16 (Affirmative Action) is at 39.9% support (80% NMCI: 30.4 to 49.4), suggesting a 18.7% chance of passing.
- CA Prop 17 (Felon Voting) is at 61.2% support (80% NMCI: 53.7 to 68.7), suggesting a 89.5% chance of passing.
- CA Prop 18 (17yo Voting) is at 47.5% support (80% NMCI: 40.1 to 54.6), suggesting a 37.9% chance of passing.
- CA Prop 20 (Parole) is at 45.1% support (80% NMCI: 35.0 to 55.2), suggesting a 34.1% chance of passing.
- CA Prop 21 (Rent Control) is at 46.8% support (80% NMCI: 34.5 to 59.2), suggesting a 41.5% chance of passing.
- CA Prop 22 (Rideshare) is at 50.2% support (80% NMCI: 41.1 to 59.3), suggesting a 50.8% chance of passing.
- CA Prop 23 (Dialysis) is at 55.3% support (80% NMCI: 45.2 to 65.5), suggesting a 67.1% chance of passing.
- CA Prop 25 (Bail) is at 59.1% support (80% NMCI: 50.7 to 67.6), suggesting a 81.9% chance of passing.
The predictions above are the only ones I want to be held to, but see this sheet for more details on model outputs and other predictions. You can also investigate individual Jupyter noteboks (see "Models" below) for even more.
Writeups contain details on some of the above results as well as full details on the methodology.
-
RP 2020 Election Poll: Biden at +9.8 support nationally in Rethink Priorities poll, Dems +5.3 for Congress - Analyis complete 28 October 2020
-
RP 2020 Election Poll: California Propositions Poll - Analyis complete 28 October 2020
pip install -r requirements.txt
to install the pre-requisite packages.
Raw data is in repsonses_processed_national_weighted.csv
.
This data was produced with the following two scripts:
- (A) Process Data.ipynb - Load the data from Surveymonkey Raw CSV into processed CSVs. Use survey_dud_detector and other quality checks to remove bad data.
- (B) Weighting.ipynb - Use surveyweights package to apply US Census weighting
Note that running these two scripts on your own is unnecessary as the generated data is already included in the repo. Further note that running the first script is not possible publicly as the raw CSVs expose internal Prolific information that cannot be disclosed publicly.
See this sheet for details on model outputs and other predictions. You can also investigate the following individual Jupyter noteboks for even more:
- (C) Analysis - Trump-Biden - Model Trump vs. Biden. Also include estimates of turnout. (Analysis complete 28 Oct 2020. Note that this now contains additional post-hoc analysis done 11 Nov exploring alternative weighting schemes.)
- (D) Analysis - Generic Congressional Ballot - Model the Generic Congressional Ballot. (Analysis complete 28 Oct 2020.)
- (E) Analysis - Texas Biden + Senate - Model Trump vs. Biden presidential race and Cornyn (R) vs. Hegar (D) senate race in Texas. (Analysis complete 28 Oct 2020.)
- (F) Analysis - CA Propositions - Model California state propositions. Also model Trump-Biden in CA. (Analysis complete 28 Oct 2020.)
- (G) Analysis - 50 States + DC Models - Using national polling to infer state results for all 50 states + DC for Trump vs. Biden and Senate races. (Analyis complete 29 Oct 2020.)