Skip to content

A formal report written in R that uses Natural Language Processing and Machine Learning to classify news article claims as either true or false.

License

Notifications You must be signed in to change notification settings

Oliver-BE/fake-news-nlp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Classifying Fake News Using NLP and ML

A formal report written in R that uses Natural Language Processing and Machine Learning to classify news article claims as either true or false.

This project used a dataset of 1,911 unique PolitiFact claims and their associated truth ratings. Features were extracted from each claim using a bag-of-n-grams model with tf-idf as the scoring metric (the vocabulary size was first reduced using using lemmatization and stop word removal among other text cleaning procedures).

Seven machine learning classification models (including a random forest, a multilayer perceptron, and a recurrent neural network) were fit and a maximum classification accuracy of 71% was achieved.

Viewing the Project

The project can be viewed either as a formal PDF here or as a Bookdown website here.

License

About

A formal report written in R that uses Natural Language Processing and Machine Learning to classify news article claims as either true or false.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published