Jupyter Notebook Archive for Scalable Machine Learning on Big Data with Apache Spark on Coursera.
- Lab Environment: Jupyter Notebook with Python-with-Pixiedust_Spark-2.0 kernel(How to setup?)
- working with RDD: Parallel Programming
- Functional Programming Basics with RDDs: Functional Programming Notes
- Working with DataFrames: RDD(Resilient Distributed Dataset) versus SQL
- Statistical Moments & Correlation: Averages, Standard Deviation, Skewness, Kurtosis, Covariance
- Simple Statistical Analysis Handson with Dataframe and SQL
- Guidance: Install PixieDust Kernel for Jupyter Notebook on Ubuntu 18.04 Server