commit a9f84e13b13b0f20f1511d0503bbcf2df9f0fced
Author: Bin Qian <email address hidden>
Date: Thu Nov 18 15:05:51 2021 -0500
Add new collectd plugin to monitor a service status
When openldap service status return 160, raise a major alarm
for the service is approaching its FD limit. When 161 is returned
raise critical alarm for the limit is reached.
TC passed:
Alarm is raised when FD limit is reached, or above 95% (approaching).
Alarm is cleared when FD usage is below 95% threshold.
Upgrade test. New alarm raised on controller-1 (N+1).
Alarm is cleared when collectd restarts or node reboot (alarm will
be re-raised if alarming situation is dected again)
SM detects 161 status code and degraded the node with service
degraded alarm.
Alarm raised after fm comes back up after being not available.
Alarm is cleared after fm comes backup after being not available.
Reviewed: https:/ /review. opendev. org/c/starlingx /monitoring/ +/819137 /opendev. org/starlingx/ monitoring/ commit/ a9f84e13b13b0f2 0f1511d0503bbcf 2df9f0fced
Committed: https:/
Submitter: "Zuul (22348)"
Branch: master
commit a9f84e13b13b0f2 0f1511d0503bbcf 2df9f0fced
Author: Bin Qian <email address hidden>
Date: Thu Nov 18 15:05:51 2021 -0500
Add new collectd plugin to monitor a service status
When openldap service status return 160, raise a major alarm
for the service is approaching its FD limit. When 161 is returned
raise critical alarm for the limit is reached.
SM will degrade the node when the FD reaches the limit. /review. opendev. org/c/starlingx /ha/+/819130
Ref SM changes:
https:/
TC passed:
Alarm is raised when FD limit is reached, or above 95% (approaching).
Alarm is cleared when FD usage is below 95% threshold.
Upgrade test. New alarm raised on controller-1 (N+1).
Alarm is cleared when collectd restarts or node reboot (alarm will
be re-raised if alarming situation is dected again)
SM detects 161 status code and degraded the node with service
degraded alarm.
Alarm raised after fm comes back up after being not available.
Alarm is cleared after fm comes backup after being not available.
Closes-bug: 1952126 /review. opendev. org/c/starlingx /fault/ +/819132
Depends-on: https:/
Change-Id: I78bb6ed6f24570 d68f62818e12422 86d638fd835
Signed-off-by: Bin Qian <email address hidden>