-
Keep it SIMPLE!
-
Goal of this book: enable you to write efficient and simple PySpark code for data algorithms on Spark
-
A lot of working PySpark code is provided so that the reader can understand how to use basic transformations on RDDs and DataFrames
-
As much as possible, I have avoided writing complex code and functions: keep it simple so that you can debug it easily and your co-workers can understand it.
-
CUT-and-PASTE: you may take portions of the code and tailor them to your needs