Welcome to my DE journey through books.
This repository contains code practices from books and some other personal codes.
Learning Spark Lightning-Fast Data Analytics (Jules S. Damji, Brooke Wenig, Tathagata Das, Denny Lee). Practices in the directory LearningSpark
Data Pipelines with Apache Airflow (Bas P. Harenslak, Julian Rutger de Ruite). Practices in the directory DataPipe
!! the practices from books aren't directly identical to the original code in the book as I constantly make my own modifications. Because I always strive to use the latest version of tools and libraries, you could use this repository as reference.