Skip to content

This project examines the difference in headlines between the paper and online versions of the New York Times articles surrounding and related to the withdrawal of U.S. troops from Afghanistan.

Notifications You must be signed in to change notification settings

kbec19/NYT-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

91 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

New York Times Headline Sentiment Analysis

Previous Research

For this project, I am using some data gathered in the DACSS 602 course "Research Design". In the project for that course, our research group posed the question:

How did the sentiment of news reporting on the U.S. withdrawal of Afghanistan shift over the period between when it was agreed upon between ex-President Trump and when it was executed by President Biden?

Basic Research Design: Manual coding of a stratified, representative sample of 300 articles after reaching an appropriate inter-coder reliability rating.

Data Sample: We coded articles that mention Afghanistan from the time of the Doha Agreement (February 29, 2020) through September 30, 2021, following the Congressional testimonies conducted September 28-29, 2021 regarding the withdrawal. The articles were collected from the New York Times and the Wall Street Journal World and News sections.

Text Coding Method: We used NVivo 12 to code news articles covering the U.S. withdrawal from Afghanistan during the period leading up to and following the day the last of the U.S. forces left Afghanistan.

Methods/Text Coding Categories:

  • Ground Source: When the reporting is on location
  • Military Source: When there is a quote from a U.S. military spokesperson, leader, or commander
  • Conflict Veteran Source: When there is a quote from a U.S. military veteran who served in Afghanistan
  • Framing: When there is a framing of the Taliban as a threat
  • Sentiment Rating: How the coder felt while reading the article

Outcomes:

The article source (NYT v. WSJ) served as a moderator, with the outcomes being the analysis of media frames ‘before and after’ (Trump admin v. Biden admin).

Current Project

I continued down the same path but with new data and a new direction through the DACSS 697D course "Text as Data".

This project examines the difference in headlines between the paper and online versions of the New York Times articles related to the withdrawal of U.S. troops from Afghanistan. The analysis includes articles that mention “Afghanistan” from the time of the Doha Agreement (February 29, 2020) through September 30, 2021, following the Congressional testimonies conducted September 28-29, 2021.

Analysis of a corpus compiled from data obtained through the New York Times API showed no statistically significant differences in the headlines using three widely used sentiment and emotion lexicons.

Topic modeling and examining a co-occurrence matrix of each set of headlines showed patterns in which types of words are chosen for the respective audience.

Specifically, this preliminary analysis showed that print headlines might carry fewer emotionally weighted words than online headlines.

Citations:

  • This research makes use of the NRC Word-Emotion Association Lexicon, created by Saif Mohammad and Peter Turney at the National Research Council Canada.

  • This research makes use of the Bing Lexicon. This dataset was first published in Minqing Hu and Bing Liu, ``Mining and summarizing customer reviews.'', Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD-2004), 2004.

  • This research makes use of the AFINN Lexicon, Nielsen, F. Å. (2011). A new ANEW: Evaluation of a word list for sentiment analysis in microblogs. arXiv preprint arXiv:1103.2903.

About

This project examines the difference in headlines between the paper and online versions of the New York Times articles surrounding and related to the withdrawal of U.S. troops from Afghanistan.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages