Spotify Data Analysis in Python

This project analyzes a dataset containing information about Spotify tracks and explores various music attributes.

The data is comprised of two CSV files:

tracks.csv: Contains details about tracks such as name, popularity, duration, artist(s), and explicit content flag.
SpotifyFeatures.csv: Provides additional features for each track, including genre and audio properties like danceability, energy, and acousticness.

This project utilizes Python libraries including pandas, NumPy, seaborn, and matplotlib for data manipulation, analysis, and visualization.

Here's a breakdown of the key aspects covered in this project:

Data Loading and Cleaning:

Loads data from CSV files into pandas DataFrames.
Explores data for missing values and potential inconsistencies.

Data Exploration and Analysis:

Analyzes basic statistics of numerical features like popularity and duration.
Identifies most and least popular songs based on user popularity ratings.
Creates a correlation heatmap to understand relationships between audio features.
Performs sampling to reduce computational costs for visualizations.
Generates regression plots to explore relationships between specific features (e.g., loudness vs. energy, popularity vs. acousticness).

Time Series Analysis:

Extracts release dates from the 'tracks.csv' file and converts them into datetime format.
Creates a distribution plot (histogram) to visualize the distribution of songs across different years since 1922.
Generates a bar chart to analyze how average song duration has changed over the years.
Plots a line graph to depict the average duration of songs year-wise.

Genre Analysis:

Creates a bar chart to compare the average duration of songs across various genres.
Identifies the top five most popular music genres based on user popularity ratings.

This project provides valuable insights into music trends and user preferences on Spotify. Feel free to explore the code further, customize visualizations, and delve deeper into specific genres or audio features.

Datasets can be found at:

For tracks.csv
For SpotifyFeatures.csv

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
Spotify Data Analysis.ipynb		Spotify Data Analysis.ipynb
Spotify Data Analysis.pdf		Spotify Data Analysis.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spotify Data Analysis in Python

Data Loading and Cleaning:

Data Exploration and Analysis:

Time Series Analysis:

Genre Analysis:

Datasets can be found at:

About

Releases

Packages

Languages

Mehtab-Sidhu/spotify-data-analysis

Folders and files

Latest commit

History

Repository files navigation

Spotify Data Analysis in Python

Data Loading and Cleaning:

Data Exploration and Analysis:

Time Series Analysis:

Genre Analysis:

Datasets can be found at:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages