Kayla Duskin (1, 2), Jevin West (1,2) and Joseph B. Bak-Coleman(3)
- University of Washington Center for an Informed Public
- eScience Institute
- Craig Newmark Center, Columbia University}]
This repository provides all code and data necessary to generates results, tables, and figures found in the article "Suspension of prominent accounts minimally impacts follower engagements",
The rise of problematic information online, including misinformation, is concerning to the public, social media companies, and governing bodies alike. As one form of intervention, social media platforms may temporarily or permanently suspended accounts that spread misinformation, at the risk of losing traffic vital to platform revenue and running head on into debates around free speech. Here we examine the impact on platform engagement following removal of six prominent accounts form Twitter during the COVID-19 pandemic. Focused on users who engaged with the removed accounts, we find that suspension did not meaningfully reduce activity on the platform. Moreover, we find that removal of the prominent accounts minimally impacted the diversity of information sources consumed.
If you plan on using this code for any purpose, please see the license and please cite our work as below:
Kayla Duskin, Jevin West, and Joseph B. Bak-Coleman. Suspension of prominent accounts minimally impacts follower engagement. Preprint (October 2023). https://osf.io/preprints/socarxiv/x4jau/
-
IdentifyAmplifiers.ipynb
: identify the highly engaged and moderately engaged users from internal database -
CollectAmplifierTweets.ipynb
: gather tweets from internal database and query twitter api -
Analysis.ipynb
: run the model, create figures -
src
: models (*.stan
), code used to clean the raw data, code for generating figures, and utilities used in the primary analysis.DBtools.py
helper functions for connecting to postgres databasedata_processing.py
processing data to get daily counts used in modelfetch_user_tweets.py
code for fetching user tweets from Twitter academic APIinequality_measurements.py
calculate shannon diversitymodel.py
Helper functions for running the main gaussian process model.*.stan
Stan model codeplotting.py
Code for creating figuresutil.py
Miscellaneous utilities.
-
data
: data files in comma-separated values (.csv
) formats./
: raw data files
-
out
: output files (generated by running)out/figures
: Figures generated from resultsout/posteriors
: Posterior objects for each follower saved as .json filesout/split_data
: The larger dataset split into individual followers for ease of processingprocessing_df
: DF used to keep track of followers, conditions, and processing statuschanges.csv
: Computed changes in activity across removed users and conditionschanges_grouped.csv
: Changes.csv grouped by outcome, group (med, top), and suspended user
You can reproduce the analysis of the aggregated date, including all figures and tables by following the guide below. Please note that minor, non-qualitative differences may exist due to difference in pseudorandom number generation.
First download this repository. Either download directly or open a command line and type:
git clone https://github.com/kduskin/TwitterSuspendedUsers