Skip to content

Latest commit

 

History

History
317 lines (210 loc) · 8.09 KB

data-management.md

File metadata and controls

317 lines (210 loc) · 8.09 KB

Data Management

learning

stack example

editor tooling

databases

environment handling

choice of database

storage systems

blob storage

scheduler

permissions

ETL

methodology

input validation

batch

DBT

schema

schema changes

streaming

flink deployments

unit testing

native code

spark

classical DB

cachings

ETL tools

ETL in python

dbt and governance

unit testing data - validating assumptions

ingestion

convenience

big data

orchestration

monitoring

k8s

failures

load tests

unified batch & streaming

hybrid cloud

data sync

self service tools

api design

end user productivity

quality

consistency

BI

scaling DBT

web automation

modern data startups

interesting databases

streaming

timeseries

key-value (like)

log structured data structures and brokers

text search

interesting message queues

on basis of PostgreSQL

visualization and admin

self service pipelines

end user onboarding

web based code editors