Same alert rules are created for LXDs and metals #32

facundofc · 2023-11-10T08:59:28Z

Bug Description

If the same grafana-agent application is related to two principals that coexist on the same physical machine, one deployed on the metal and one deployed in an LXD container, the same set of alert rules is created in Prometheus for each. For some (all?) rules this is not desirable, because the metrics will be exactly the same for the metal and for the LXD container. One example of such a metric is node_cpu_seconds_total.

The problem is that several alerts will fire for the exact same underlying issue, such as an overloaded host.

To Reproduce

juju deploy ubuntu
juju deploy ubuntu ubuntu-lxd --to lxd:0
juju deploy grafana-agent
juju relate ubuntu grafana-agent
juju relate ubuntu-lxd grafana-agent

Environment

This was observed in latest/edge, revision 16.

Relevant log output

n/a

Additional context

No response

przemeklal · 2023-12-05T13:12:12Z

One possible workaround is to silence the duplicate alerts permanently, for example with these matchers:

job=~".*grafana-agent-container.*"

alertname!~"HostInterfaceMTUSize"

In the above example, only HostInterfaceMTUSize alerts will fire; everything else coming from the grafana-agent deployed as grafana-agent-container will be silenced.
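
To make the effect of the two matchers concrete, here is a minimal Python sketch of how a silence is evaluated: an alert is silenced only if every matcher matches its labels, and regex matchers are anchored to the whole label value. This is an illustration, not Alertmanager's code. The job values and the HostHighCpuLoad alert name are made up for the example, and Python's `re` stands in for the RE2 engine Alertmanager uses (the two agree for these simple patterns).

```python
import re


def matcher_matches(labels, name, op, value):
    """Evaluate one Alertmanager-style matcher against an alert's labels."""
    actual = labels.get(name, "")  # a missing label is treated as an empty string
    if op == "=":
        return actual == value
    if op == "!=":
        return actual != value
    if op == "=~":
        # Regex matchers are anchored: the whole label value must match.
        return re.fullmatch(value, actual) is not None
    if op == "!~":
        return re.fullmatch(value, actual) is None
    raise ValueError(f"unknown matcher operator: {op}")


def is_silenced(labels, matchers):
    """A silence applies only when ALL of its matchers match the alert."""
    return all(matcher_matches(labels, *m) for m in matchers)


# The two matchers from the workaround above.
MATCHERS = [
    ("job", "=~", ".*grafana-agent-container.*"),
    ("alertname", "!~", "HostInterfaceMTUSize"),
]

# Hypothetical alerts; the job and alertname values are illustrative only.
lxd_cpu = {"job": "model_grafana-agent-container_0", "alertname": "HostHighCpuLoad"}
lxd_mtu = {"job": "model_grafana-agent-container_0", "alertname": "HostInterfaceMTUSize"}
metal_cpu = {"job": "model_grafana-agent_0", "alertname": "HostHighCpuLoad"}

for alert in (lxd_cpu, lxd_mtu, metal_cpu):
    state = "silenced" if is_silenced(alert, MATCHERS) else "fires"
    print(alert["job"], alert["alertname"], state)
```

With these inputs, only the container-side HostHighCpuLoad alert is silenced. The container-side HostInterfaceMTUSize alert and everything from the metal-side agent still fire, which is the behaviour described above.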

lucabello · 2024-01-04T14:20:25Z

@dstathis want to take a look? Is this related to recent work?

(sidenote: this was observed in rev16; we are currently at rev29)

dstathis · 2024-01-31T15:34:16Z

One possible way to solve this could be to have a config variable that disables node_exporter metrics. It would require you to deploy 2 different grafana-agent applications, one for lxd and one for the guest machines. Would that work for you?

przemeklal · 2024-02-05T07:08:12Z

@dstathis This could work, as we usually deploy two or more different grafana-agent applications already (for example, so that hardware-observer is related only to the one running on physical machines).

lucabello mentioned this issue Oct 11, 2024

Relating to multiple principals on the same machine #11

Open
