Skip to content

Latest commit

 

History

History
27 lines (21 loc) · 1.53 KB

File metadata and controls

27 lines (21 loc) · 1.53 KB

Web-Scraping and Sentiment Analysis of JW Marriot Hotel Reviews

Description

  • Implemented a web scraper to fetch live hotel reviews from TripAdvisor website.
  • Conducted data pre-processing and cleaning by removing stop words, punctuation, special characters, numbers, and white-spaces from reviews.
  • Performed tokenization and stemming of reviews, and built a corpus out of it.
  • Calculated the sentiment score of each review by comparing it's tokens with positive and negative lexicon and the overall sentiment score of the hotel depending on the number of positive and negative reviews it received.
  • Visualized the results using bar-plot and box-plot charts.

Code

Data

Data Visualization

  1. Sentiment score of first hundred reviews
  2. Positive and negative words in a single review
  3. Positive word count in first hundred reviews
  4. Negative word count in first hunderd reviews
  5. Overall sentiment score of JW Marriot hotel

Programming Language

R

Tools/IDE

RStudio