Why default to summary rather than histogram? #460

sp1rs · 2022-08-25T15:37:34Z

What is the reason behind converting the metric to a summary rather than a histogram by default?

SuperQ · 2022-08-25T16:05:30Z

Probably historical. A number of Prometheus things from the very early days defaulted to Summary.

glightfoot · 2022-08-25T16:14:47Z

Hey, that's a good question. Histograms in prometheus have a few main disadvantages that prevent them from being useful for statsd by default. The first and biggest downside is that histograms require some knowledge of what's being measured and the expected distribution in order to set decent bucket boundaries. Imagine you have timings that are expected to measure around a few milliseconds, and another set of timings that cluster around a few seconds. With a generic histogram using the default buckets, neither of these sets of timings would produce accurate data in default histograms. However, if you know these distributions, you can create buckets that will allow you to get meaningful percentiles.

Second, histograms have a higher cardinality than summaries, especially if you try to measure something with a wide distribution of values. Given we don't know what kind of timings people will send, in order to have meaningful histograms by default in the statsd exporter, we'd need a very wide set of buckets. This causes more load on prometheus.

Finally, summaries are accurate and produce meaningful data out of the box for any timing*, regardless of the distribution, since they directly calculate percentiles. Histograms use a linear estimation between bucket boundaries to get a percentile value, which inherently has error baked in that some people don't necessarily consider. This may change once prometheus supports sparse histograms, which significantly improve on these limitations.

Assuming there are frequent enough timings being sent to be able to sample them.

TL;DR Summaries are cheaper and more accurate for unknown distributions than histograms, which currently require some knowledge of the expected distribution.

SuperQ · 2022-08-25T18:04:31Z

The big down side of Summaries is that they can't be aggregated. If you have more than one statsd_exporter receiving data from the same app(s). The data will be essentially useless.

matthiasr · 2022-08-27T06:42:51Z

I thought about changing the default in the past but never tackled that.

With Histograms v2 in the works, I would rather not change the default now – they will alleviate a lot of the "must pick buckets" pain, and if we can make one breaking change rather than multiple all the better.

pedro-stanaka · 2024-02-19T21:58:44Z

Now that native histograms are more stable I would +1 here to make this default in the next major release. I have been using it as default and can just recommend the level of detail you get is impressive.

matthiasr · 2024-03-03T21:31:20Z

They're "more stable" but still experimental 😅 We still need a text format (prometheus/proposals#32), and it's behind a feature flag in Prometheus itself. Let's wait until it is really stable 😉

matthiasr · 2024-11-08T14:56:59Z

With Prometheus 3.0 just around the corner, let's do this 😄 What should the default configuration be? Should we still include some default classic histogram buckets, or only an exponential histogram factor?

SuperQ · 2024-11-08T15:18:45Z

I would leave the default standard timer bucket set.

pedro-stanaka · 2024-11-08T16:45:04Z

I would also be fine with keeping buckets, specially with the work around Native Histogram with Custom Buckets (NHCB) going on.

Regarding the native histogram configuration, I would just make sure we set the max buckets option. Have seen that showing up on profiles when the number is too permissive (anything over 500).

matthiasr · 2024-11-09T15:45:43Z

Oh yes, very good point. OTel has set a precedent with [160 buckets by default]( https://opentelemetry.io/docs/specs/otel/metrics/sdk/#base2-exponential-bucket-histogram-aggregation), so unless there's a strong reason for another specific number I would stick with that.

…

On Fri, 8 Nov 2024, 17:45 Pedro Tanaka, ***@***.***> wrote: I would also be fine with keeping buckets, specially with the work around Native Histogram with Custom Buckets (NHCB) going on. Regarding the native histogram configuration, I would just make sure we set the max buckets option. Have seen that showing up on profiles when the number is too permissive (anything over 500). — Reply to this email directly, view it on GitHub <#460 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AABAEBU3KCSVEBJEW3YR2D3Z7TTCLAVCNFSM6AAAAABRNVJ7J2VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDINRVGI2DQOJUGI> . You are receiving this because you commented.Message ID: ***@***.***>

matthiasr added enhancement question labels Aug 27, 2022

matthiasr added help wanted PRs for this issue are especially welcome and removed question labels Nov 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why default to summary rather than histogram? #460

Why default to summary rather than histogram? #460

sp1rs commented Aug 25, 2022 •

edited

Loading

SuperQ commented Aug 25, 2022

glightfoot commented Aug 25, 2022

SuperQ commented Aug 25, 2022

matthiasr commented Aug 27, 2022

pedro-stanaka commented Feb 19, 2024

matthiasr commented Mar 3, 2024

matthiasr commented Nov 8, 2024

SuperQ commented Nov 8, 2024

pedro-stanaka commented Nov 8, 2024

matthiasr commented Nov 9, 2024 via email

Why default to summary rather than histogram? #460

Why default to summary rather than histogram? #460

Comments

sp1rs commented Aug 25, 2022 • edited Loading

SuperQ commented Aug 25, 2022

glightfoot commented Aug 25, 2022

SuperQ commented Aug 25, 2022

matthiasr commented Aug 27, 2022

pedro-stanaka commented Feb 19, 2024

matthiasr commented Mar 3, 2024

matthiasr commented Nov 8, 2024

SuperQ commented Nov 8, 2024

pedro-stanaka commented Nov 8, 2024

matthiasr commented Nov 9, 2024 via email

sp1rs commented Aug 25, 2022 •

edited

Loading