Skip to content

Commit c2edb38

Browse files
committed
Raise alert on degraded network bonds
This will raise a alert when at least one of the bond members is down. Adapted from awesome-prometheus-alerts [1]. [1] https://samber.github.io/awesome-prometheus-alerts/rules.html#rule-host-and-hardware-1-34
1 parent a8fb2c8 commit c2edb38

File tree

2 files changed

+14
-0
lines changed

2 files changed

+14
-0
lines changed

etc/kayobe/kolla/config/prometheus/system.rules

Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -96,6 +96,15 @@ groups:
9696
summary: Host clock not synchronising (instance {{ $labels.instance }})
9797
description: "Clock not synchronising. Ensure NTP is configured on this host."
9898

99+
- alert: HostNetworkBondDegraded
100+
expr: (node_bonding_active - node_bonding_slaves) != 0
101+
for: 2m
102+
labels:
103+
severity: warning
104+
annotations:
105+
summary: Host network bond degraded (instance {{ $labels.instance }})
106+
description: "Bond {{ $labels.master }} degraded on {{ $labels.instance }}"
107+
99108
- alert: HostConntrackLimit
100109
expr: node_nf_conntrack_entries / node_nf_conntrack_entries_limit > 0.8
101110
for: 5m
Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,5 @@
1+
---
2+
features:
3+
- |
4+
Adds a new Prometheus alert ``HostNetworkBondDegraded`` which will be
5+
raised when at least one bond member is down.

0 commit comments

Comments
 (0)