Notes:
The course agenda will include official lectures and tutorials and unofficial meetups. Student might trigger online-sessions that we can join and help students to study.
- bold: indicates the lead/presenter
- dates for lectures and tutorials are fixed, however, under some circumstances, the planned dates might be changed.
- meetups:
- the schedule could be flexible but we try to use the common slots
Remember to read online Course Management slides ( download PDF)
The Schedule is currently revised for Spring 2023
Date | Lecture/Tutorial/Meetup | Topics | Responsibles |
---|---|---|---|
11.01 | Lecture 1 | Introduction to Big Data Platforms (download PDF) and Architecting Big Data Platforms (download PDF) | Linh Truong |
12.01 | Meetup 1 | How to prepare and succeed on Big Data assignments: experiences and expectation: Slides from Zixuan Liu & Guangkai Jiang and Slides from Tri Nguyen | Zixuan Liu, Guangkai Jiang, Minh-Tri Nguyen, Linh Truong |
18.01 | Lecture 2 | Service and Integration Models in Big Data Platforms, (download PDF). Additional slides: Cloud Infrastructures for Big Data Platforms (download PDF) and a Recap on Performance, Dependability, and Fault Tolerance in Distributed Systems (download PDF) | Linh Truong |
25.01 | Lecture 3 | Big Data Storage and Database Services (download PDF). Additional slides: common systems & integration problems. and A short example of metadata | Linh Truong |
26.01 | Tutorial 1 | Hands-on examples with big database services | Zixuan Liu |
01.02 | Lecture 4 | Big Data Ingestion, Transformation and Orchestration (download PDF). Additional slides about Streaming Data Ingestion with Apache Kafka | Linh Truong |
01.02 | Release the first assignment | ||
02.02 | Tutorial 2 | Data Ingestion with Apache Nifi | Guangkai Jiang |
08.02 | Lecture 5 | Hadoop and its Big Data Ecosystems (download PDF) . Study some real cases of Hadoop and data ingestion | Linh Truong |
09.02 | Tutorial 3 | Kafka for Big Data | Guangkai Jiang |
15.02 | No lecture/Meetup | Backup | All |
17.02 | due of the first assignment -13.00 | ||
20-24.02 | No lecture week | Assignment grading | All |
01.03 | Lecture 6 | Big Data Processing with MapReduce/Spark Programming Models (download PDF) | Linh Truong |
01.03 | Release the 2nd assignment | ||
02.03 | Tutorial 4 | Hadoop | Minh-Tri Nguyen |
08.03 | Lecture 7 | Stream Processing and Big Data Platforms (download PDF) | Linh Truong |
9.03 | Tutorial 5 | Data Processing with Apache Spark | Minh-Tri Nguyen |
15.03 | No lecture/Meetup | Backup | |
16.03 | Industry tutorial | Data Lakehouse on Azure using Synapse Analytics and HDInsight | Aitor Murguzur, PhD |
17.03 | Due the 2nd assignment | ||
20-24.03 | No lecture week | Assignment grading | All |
29.03 | Lecture 8 | Workflows for Big Data Platforms (download PDF) | Linh Truong |
29.03 | Release the third assignment | ||
30.03 | Tutorial 6 | Stream Processing with Apache Flink | Tri Nguyen |
05.04 | No lecture/meetup | Backup | |
06.04 | Tutorial 7 | Data processing examples with Airflow | Zixuan Liu |
12.04 | Lecture 9 | Selected Trends/Issues for Big Data Platforms (download PDF) | Linh Truong |
14.04 | Due of the third assignment | ||
17-21.04 | Assignment grading | All |