bhavanachitragar / Data-Analysis-using-Pyspark


This project uses the PySpark module in Python, running in a Google Colab environment, to apply queries to a dataset. The dataset consists of two CSV files, listening.csv and genre.csv. Query results are visualized with matplotlib.

Notebook.ipynb
README.md


Data Analysis using PySpark

Importing the first CSV file of our dataset
Using the PySpark SQL DataFrame
Running queries to extract useful information
Importing the second CSV file of our dataset
Merging the two data frames and preparing them for more advanced queries
Visualizing results using matplotlib

