This project uses the PySpark module in Python, run in a Google Colab environment, to query a dataset. The dataset consists of two CSV files, listening.csv and genre.csv. Query results are visualized with matplotlib.

bhavanachitragar/Data-Analysis-using-Pyspark

Data Analysis using Pyspark


  • Importing the first CSV file of the dataset
  • Using PySpark SQL data frames
  • Running queries to extract useful information
  • Importing the second CSV file of the dataset
  • Merging the two data frames and preparing them for more advanced queries
  • Visualizing results using matplotlib
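
The steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual notebook: the column names `artist` and `genre`, the join key, and the helper names `load_csv`, `top_counts`, `to_xy`, and `main` are all assumptions made for the example, since the README does not describe the CSV schemas. PySpark and matplotlib are imported inside the functions so the helpers can be loaded even where Spark is not installed.

```python
def load_csv(spark, path):
    """Read a CSV file with a header row into a Spark DataFrame."""
    return spark.read.csv(path, header=True, inferSchema=True)


def top_counts(df, column, n=10):
    """Count rows per value of `column` and keep the n most frequent."""
    from pyspark.sql.functions import desc

    return df.groupBy(column).count().orderBy(desc("count")).limit(n)


def to_xy(rows, label_col, value_col="count"):
    """Split collected rows into a list of labels and a list of values."""
    labels = [row[label_col] for row in rows]
    values = [row[value_col] for row in rows]
    return labels, values


def main():
    from pyspark.sql import SparkSession
    import matplotlib.pyplot as plt

    spark = SparkSession.builder.appName("listening-analysis").getOrCreate()

    # File names come from the README; the paths are assumed to be local.
    listening = load_csv(spark, "listening.csv")
    genre = load_csv(spark, "genre.csv")

    # Assumption: both files share an "artist" column to join on.
    merged = listening.join(genre, on="artist", how="inner")

    # Assumption: genre.csv provides a "genre" column.
    rows = top_counts(merged, "genre", n=10).collect()
    labels, values = to_xy(rows, "genre")

    plt.bar(labels, values)
    plt.xticks(rotation=45, ha="right")
    plt.ylabel("Number of listens")
    plt.tight_layout()
    plt.show()

    spark.stop()
```

In a Colab cell, calling `main()` after installing PySpark (for example with `pip install pyspark`) and uploading the two CSV files would run the whole pipeline. Adjust the column names to match the real files.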
