
moritz-bauer/BigData

 
 

Lecture: Big Data

Here you can find all materials for the lecture "BIG DATA - An Introduction To The Fields Of Data Engineering, Development And Architecture Of Data-Intensive Applications.", held at the Cooperative State University Baden-Wuerttemberg in Stuttgart.

This lecture gives you a brief introduction to what is called ’Big Data’. We will quickly refresh the basics of databases, data models and data processing you have learned so far and compare them to the distributed world of Big Data. After that, we will take a deep dive into the foundations of distributed data storage and data processing, as well as the associated concepts and challenges of reliability, scalability, replication, partitioning, and batch and stream processing. Later on, we will look at the most commonly used software and frameworks (mostly the Hadoop ecosystem). At the end, once you know the basic concepts and are able to set up and work with distributed environments and huge data sets, there will be a short introduction to data science.

At the end of each lesson, there will be some hands-on exercises, which we will start together and which have to be finished by the next week. This lecture comprises only about 36 hours over 12 weeks (1 slot each week), which is very little time to cover such an extensive topic. So pay close attention, and if you can’t keep up, feel free to ask questions at the end of each lesson.

Materials

You can download everything directly, or install git and fetch everything with:

git clone https://github.com/marcelmittelstaedt/BigData.git

and, to fetch later updates, run the following inside the cloned directory:

git pull

If you find any mistakes or misspellings, feel free to send me a mail ([email protected]) or, if you are able to, open a pull request.

Practical Exam Topics for Winter Semester 2020/2021

Author

Marcel Mittelstädt

Contact:
