Skip to content

Commit

Permalink
add alert for system.slice rss going over system-reserved
Browse files Browse the repository at this point in the history
  • Loading branch information
sjenning committed Sep 16, 2020
1 parent 5d3fa47 commit 92fceb1
Showing 1 changed file with 9 additions and 0 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -43,3 +43,12 @@ spec:
severity: warning
annotations:
message: "Kubelet health failure threshold reached"
- name: system-memory-exceeds-reservation
rules:
- alert: SystemMemoryExceedsReservation
expr: |
sum by (node) (container_memory_rss{id="/system.slice"}) > ((sum by (node) (kube_node_status_capacity{resource="memory"} - kube_node_status_allocatable{resource="memory"})) * 0.9)
labels:
severity: warning
annotations:
message: "System memory usage of {{ $value | humanize }} on {{ $labels.node }} exceeds 90% of the reservation. Reserved memory ensures system processes can function even when the node is fully allocated and protects against workload out of memory events impacting the proper functioning of the node. The reservation may be increased (https://docs.openshift.com/container-platform/latest/nodes/nodes/nodes-nodes-managing.html) when running nodes with high numbers of pods."

0 comments on commit 92fceb1

Please sign in to comment.