Skip to content

Latest commit

 

History

History
21 lines (16 loc) · 1.1 KB

README.md

File metadata and controls

21 lines (16 loc) · 1.1 KB

Real time analytics: Kafka + Druid + Superset

What is Druid?

Apache Druid (incubating) is a real-time analytics database designed for fast slice-and-dice analytics ("OLAP" queries) on large data sets. Druid is most often used as a database for powering use cases where real-time ingest, fast query performance, and high uptime are important. As such, Druid is commonly used for powering GUIs of analytical applications, or as a backend for highly-concurrent APIs that need fast aggregations.

  • Columnar storage format
  • Scalable distributed system
  • Realtime or batch ingestion
  • Self-healing, self-balancing, easy to operate
  • Cloud-native, fault-tolerant architecture that won't lose data
  • Indexes for quick filtering
  • Time-based partitioning
  • Approximate algorithms
  • Automatic summarization at ingest time

What is Superset?

Apache Superset is a data exploration and visualization web application and provides an intuitive interface to explore and visualize datasets, and create interactive dashboards.

  • A rich set of data visualizations
  • An easy-to-use interface for exploring and visualizing data