MATES ED2MIT 2021 "Big Data Infrastructure Technologies for Data Analytics"
Sarah edited this page Mar 6, 2021 · 3 revisions
Tutorial 1: Big Data Technologies: Introduction, Reference Architecture, Big Data algorithms. Cloud based Big Data storage services
Thursday, January 20, 2021: 15:00-18:00 CET
- Course introduction: MATES project, Industry 4.0 and digitalisation, Digital and data competences and skills
- Introduction into Big Data concepts, architecture and technologies, use cases
- Discussion: Digitalisation aspects in your organisations
- Zoom recording divided into three parts (raw, not edited): Download and play in MediaPlayer or other MP4 player
- Practice and lecture material on Google Drive
Tutorial 2: Big Data Platforms for Data Analytics, Big Data service providers, Hadoop platform overview
Tuesday, January 26, 2021: 15:00-18:00 CET
- Big Data algorithms, Hadoop Big Data Platform
- Cloud based Big Data platforms and Providers
- Demo and practice: working with cloud services and Hadoop cluster
- Zoom recording divided into two parts (raw, not edited): Download and play in MediaPlayer or other MP4 player
- Practice and lecture material on Google Drive
Tutorial 3: SQL and NoSQL databases for Big Data processing. Hadoop data warehouse Hive and dataflow scripting language Pig
Thursday, January 28, 2021: 15:00-18:00 CET
- Data types and data models
- SQL databases
- Distributed systems: CAP theorem, ACID and BASE properties
- NoSQL databases overview
- Modern cloud databases
- Zoom recording divided into three parts (raw, not edited): Download and play in MediaPlayer or other MP4 player
- Practice and lecture material on Google Drive
Thursday, February 4, 2021: 15:00-18:00 CET
- Big Data and Cloud Security
  - Cloud security models, services, mechanisms
  - Cloud security best practices: AWS and Microsoft Azure
- Cloud Compliance and (Self-)Assessment
  - Compliance standards, security controls
  - CSA GRC Stack: Governance, Risk Management and Compliance
  - PCI DSS Cloud Computing Guidelines
- Practice with Big Data and Cloud Compliance
- Zoom recording: Download and play in MediaPlayer or other MP4 player
- Practice and lecture material on Google Drive
All lecture and supplementary materials are available in a shared folder on Google Drive.
Course format: 4 tutorials of 3 hours each, including two 15-minute breaks.
Time: 15:00-18:00 CET, online via Zoom.
Course materials are uploaded to the shared folder on Google Drive in advance of each tutorial day.
Lectures are recorded and the recordings are uploaded overnight after each lecture, then updated after processing (approx. 2-3 days).
EDISON Community: Supporting and developing the EDISON Data Science Framework (EDSF)