MATES ED2MIT 2021 "Big Data Infrastructure Technologies for Data Analytics"

Sarah edited this page Mar 6, 2021 · 3 revisions

MATES ED2MIT Training "Big Data Infrastructure Technologies for Data Analytics"

Tutorial 1: Big Data Technologies: Introduction, Reference Architecture, Big Data algorithms. Cloud based Big Data storage services

Thursday, January 20, 2021: 15:00-18:00 CET

Topics

  • Course introduction: MATES project, Industry 4.0 and digitalisation, Digital and data competences and skills
  • Introduction into Big Data concepts, architecture and technologies, use cases
  • Discussion: Digitalisation aspects in your organisations

Materials

  • Zoom recording divided into three parts (raw, not edited): Download and play in MediaPlayer or other MP4 player
  • Practice and lecture material on Google Drive

Tutorial 2: Big Data Platforms for Data Analytics, Big Data service providers, Hadoop platform overview

Tuesday, January 26, 2021: 15:00-18:00 CET

Topics

  • Big Data algorithms, Hadoop Big Data Platform
  • Cloud based Big Data platforms and Providers
  • Demo and practice: working with cloud services and Hadoop cluster

Materials

  • Zoom recording divided into two parts (raw, not edited): Download and play in MediaPlayer or other MP4 player
  • Practice and lecture material on Google Drive

Tutorial 3: SQL and NoSQL databases for Big Data processing. Hadoop data warehouse Hive and dataflow scripting language Pig

Thursday, January 28, 2021: 15:00-18:00 CET

Topics

  • Data types and data models
  • SQL databases
  • Distributed systems: CAP theorem, ACID and BASE properties
  • NoSQL databases overview
  • Modern cloud databases

Materials

  • Zoom recording divided into three parts (raw, not edited): Download and play in MediaPlayer or other MP4 player
  • Practice and lecture material on Google Drive

Tutorial 4: Security and Compliance of Big Data platforms, Data protection

Thursday, February 4, 2021: 15:00-18:00 CET

Topics

  • Big Data and Cloud Security
  • Cloud security models, services, mechanisms
  • Cloud security best practices: AWS and Microsoft Azure
  • Cloud Compliance and (Self-)Assessment
  • Compliance standards, security controls
  • CSA GRC Stack: Governance, Risk Management and Compliance
  • PCI DSS Cloud Computing Guidelines
  • Practice with Big Data and Cloud Compliance

Materials

Course Materials

All lecture and supplementary materials are available in a shared folder on Google Drive.

Logistics

Course format: 4 tutorials of 3 hours each, including two 15-minute breaks.

Time: 15:00-18:00 CET, online via Zoom.

Course materials are uploaded daily in advance to the shared folder on Google Drive.

Lectures will be recorded and uploaded overnight after each lecture, then updated after processing (approx. 2-3 days).
