kubernetes-mixin/alerts/resource_alerts.libsonnet
Lines 25 to 41 in dc563cb
    {
      alert: 'KubeCPUOvercommit',
      expr: |||
        sum(namespace:kube_pod_container_resource_requests_cpu_cores:sum{%(ignoringOverprovisionedWorkloadSelector)s})
          /
        sum(kube_node_status_allocatable_cpu_cores)
          >
        (count(kube_node_status_allocatable_cpu_cores)-1) / count(kube_node_status_allocatable_cpu_cores)
      ||| % $._config,
      labels: {
        severity: 'warning',
      },
      annotations: {
        message: 'Cluster has overcommitted CPU resource requests for Pods and cannot tolerate node failure.',
      },
      'for': '5m',
    },
The KubeCPUOvercommit alert doesn't take node pools or tolerations into account, and covering them might even be a stretch. Does anyone have thoughts on that?
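To make the concern concrete, here is a minimal Python sketch of the comparison the rule performs (total requests divided by total allocatable, compared against `(n - 1) / n`), evaluated once cluster-wide and once per pool. The pool names, node sizes, and request totals are hypothetical, and the sketch assumes that taints keep the requesting Pods off the second pool; it is not how the mixin itself evaluates anything.

```python
def overcommitted(requested_cores, allocatable_per_node):
    """Mirror the rule's comparison: requests / allocatable > (n - 1) / n."""
    n = len(allocatable_per_node)
    if n == 0:
        return False  # assumption: with no nodes there is nothing to compare
    ratio = requested_cores / sum(allocatable_per_node)
    return ratio > (n - 1) / n


# Hypothetical cluster: two pools of three 4-core nodes each.
# Assumption: the "tainted" pool rejects the Pods that run on "general".
pools = {
    "general": {"allocatable": [4, 4, 4], "requested": 11},
    "tainted": {"allocatable": [4, 4, 4], "requested": 0},
}

# Cluster-wide view, which is what the current expression computes.
all_nodes = [cores for p in pools.values() for cores in p["allocatable"]]
total_requested = sum(p["requested"] for p in pools.values())
cluster_wide = overcommitted(total_requested, all_nodes)

# Per-pool view, which the current expression does not compute.
per_pool = {
    name: overcommitted(p["requested"], p["allocatable"])
    for name, p in pools.items()
}

print(cluster_wide)  # False: 11/24 is below 5/6
print(per_pool)      # {'general': True, 'tainted': False}: 11/12 is above 2/3
```

In this made-up scenario the cluster-wide check stays quiet even though losing one node in the "general" pool would leave only 8 cores for 11 requested. Whether a per-pool variant is practical depends on having a reliable pool label on the node metrics, which this sketch does not address.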