This is the companion repo to my Linked In Learning Courses on Apache Hadoop and Apache Spark.
🐘 1. Learning Hadoop - link
- this course demos I use mostly GCP Dataproc
- for running Hadoop & associated libraries (i.e. Hive, Pig, Spark...) workloads
🌩️ 2. Cloud Hadoop: Scaling Apache Spark - link & link to content area in this repo
- this course demos I use GCP DataProc, AWS EMR --or--
- I use Databricks on AWS or on GCP
⛈️ 3. Azure Databricks Spark Essential Training - link & link to content area in this repo
- this course demos I use Azure with Databricks
- for scaling Apache Spark workloads
There are ~ 10 courses on Hadoop/Spark topics on LinkedIn Learning. See graphic below