Real time analytics: Kafka + Druid + Superset

What is Druid?

Apache Druid (incubating) is a real-time analytics database designed for fast slice-and-dice analytics ("OLAP" queries) on large data sets. Druid is most often used as a database for powering use cases where real-time ingest, fast query performance, and high uptime are important. As such, Druid is commonly used for powering GUIs of analytical applications, or as a backend for highly-concurrent APIs that need fast aggregations.

Columnar storage format
Scalable distributed system
Realtime or batch ingestion
Self-healing, self-balancing, easy to operate
Cloud-native, fault-tolerant architecture that won't lose data
Indexes for quick filtering
Time-based partitioning
Approximate algorithms
Automatic summarization at ingest time

What is Superset?

Apache Superset is a data exploration and visualization web application and provides an intuitive interface to explore and visualize datasets, and create interactive dashboards.

A rich set of data visualizations
An easy-to-use interface for exploring and visualizing data

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Real time analytics: Kafka + Druid + Superset

What is Druid?

What is Superset?

Files

README.md

Latest commit

History

README.md

File metadata and controls

Real time analytics: Kafka + Druid + Superset

What is Druid?

What is Superset?