Commit d6ac7f3

Merge pull request #4832 from CourtCourt521/SystemMemoryExceedsReservation
MCO-1492: Add new runbook for SystemMemoryExceedsReservation to alert
2 parents c5d210a + 5a2d40e commit d6ac7f3

1 file changed: +1 -0 lines changed

install/0000_90_machine-config_01_prometheus-rules.yaml

Lines changed: 1 addition & 0 deletions
@@ -125,6 +125,7 @@ spec:
           annotations:
             summary: "Alerts the user when, for 15 minutes, a specific node is using more memory than is reserved"
             description: "System memory usage of {{ $value | humanize }} on {{ $labels.node }} exceeds 95% of the reservation. Reserved memory ensures system processes can function even when the node is fully allocated and protects against workload out of memory events impacting the proper functioning of the node. The default reservation is expected to be sufficient for most configurations and should be increased (https://docs.openshift.com/container-platform/latest/nodes/nodes/nodes-nodes-managing.html) when running nodes with high numbers of pods (either due to rate of change or at steady state)."
+            runbook_url: https://github.com/openshift/runbooks/blob/master/alerts/machine-config-operator/SystemMemoryExceedsReservation.md
     - name: high-overall-control-plane-memory
       rules:
         - alert: HighOverallControlPlaneMemory
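
The alert description recommends increasing the reservation on nodes that run many pods. A minimal sketch of one way to do that with a KubeletConfig follows. It is not part of this commit. The object name and the 3Gi value are hypothetical, and the pool label is assumed to be the default one on the worker MachineConfigPool; check the linked documentation for values suited to a given cluster.

```yaml
# Illustrative only: raises the memory reserved for system processes on worker nodes.
apiVersion: machineconfiguration.openshift.io/v1
kind: KubeletConfig
metadata:
  name: set-system-reserved-memory   # hypothetical name
spec:
  machineConfigPoolSelector:
    matchLabels:
      # Assumed: default label on the worker MachineConfigPool
      pools.operator.machineconfiguration.openshift.io/worker: ""
  kubeletConfig:
    systemReserved:
      memory: 3Gi                     # hypothetical value; size to the node's workload
```

Applying such an object typically causes the Machine Config Operator to roll the change out to the selected pool, which can reboot nodes, so it is usually scheduled accordingly.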
