Skip to content

pefreis/goodreads-data-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 

Repository files navigation

goodreads-data-analysis

Table of Contents

  1. Installation
  2. Project Motivation
  3. File Descriptions
  4. Results
  5. Licensing, Authors, and Acknowledgements

Installation

The following libraries should be installed for running the code in this project:

  • pandas
  • plotly
  • scipy

Project Motivation

My goal with this project was to find some insight on how a popular book should look like, and I decided to do this by exploring Goodreads data. More specifically, I wanted to answer the following questions:

  1. Are the format and the size of a book relevant to its popularity?
  2. What are the most popular genres?
  3. How are genres related to each other?

File Descriptions

There is a single notebook in this project which contains all the code used for preparing the data, doing the analysis and setting up the visualizations. The notebook also includes markdown cells and comments that detail each step and explain some decisions in the process.

Results

The most insightful results of this analysis were put together in this post.

Licensing, Authors, Acknowledgements

Thanks to Manav Dhamani for collecting the data and making it available easily for us! The data and other descriptive information can be found at Kaggle. Feel free to use the code here as you would like!

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published