vivekveeramani/Spark-The-Definitive-Guide

Spark: The Definitive Guide

This is the central repository for all materials related to Spark: The Definitive Guide by Bill Chambers and Matei Zaharia.

This repository is currently a work in progress and new material will be added over time.

Instructions for how to run this code

Running on your local machine

To run this code on your local machine, either copy the contents of the "data" folder to /data on your computer or, when reading a dataset from the book, specify the path to that dataset on your local machine.
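
The two options above can be sketched as a small path helper. This is a minimal illustration, not part of the repository: the `SPARK_GUIDE_DATA` environment variable, the `dataset_path` and `read_csv` helpers, and the example file name are all assumptions made for this sketch. The only Spark call used is the standard `spark.read.csv` reader.

```python
import os

# Hypothetical environment variable for overriding the data location;
# the repository itself does not define it.
DATA_ROOT_ENV = "SPARK_GUIDE_DATA"


def dataset_path(relative, root=None):
    """Build the full path of a dataset.

    Uses `root` if given, else the SPARK_GUIDE_DATA environment variable,
    else /data (the first option described above).
    """
    base = root or os.environ.get(DATA_ROOT_ENV, "/data")
    return os.path.join(base, relative)


def read_csv(spark, relative, root=None):
    """Read a CSV dataset with an existing SparkSession (needs pyspark)."""
    return (
        spark.read
        .option("header", "true")       # first line holds column names
        .option("inferSchema", "true")  # let Spark guess column types
        .csv(dataset_path(relative, root))
    )
```

With the data copied to /data, a call such as `read_csv(spark, "flight-data/csv/2015-summary.csv")` would read from the default location. Passing `root="/path/to/your/clone/data"` covers the second option, where the dataset stays wherever it lives on your machine. The file name here is only an example; substitute whichever dataset the chapter uses.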

Running on Databricks

TODO: We will soon upload these instructions.

About

Spark: The Definitive Guide's Code Repository


Languages

  • Scala 91.1%
  • Python 4.9%
  • Java 4.0%