Expose monitoring threshold/severity for "slow requests"

Bug #1922602 reported by Drew Freiberger
28
This bug affects 5 people
Affects Status Importance Assigned to Milestone
Ceph Monitor Charm
Won't Fix
Undecided
Unassigned
Juju Lint
Fix Released
Undecided
Unassigned

Bug Description

The following alert from the nagios check_ceph is classified as a Warning, but should be Critical as it will typically require operator intervention to resolve.

396 slow requests are blocked > 32 sec (REQUEST_SLOW)

Can you please expose a configuration to allow for setting the alert severity of this to Critical?

Current workaround is to perform the following config change:

juju config ceph-mon nagios_additional_checks_critical=true nagios_additional_checks='{"ceph_slow_requests": "slow requests"}'

Tags: bseng-196

Related branches

Revision history for this message
Drew Freiberger (afreiberger) wrote :

Adding juju-lint project, as it should check for existence of ceph-mon nagios_additional_checks_critical=true and nagios_additional_checks='{"ceph_slow_requests": "slow requests"}' until this bug is addressed within ceph-mon charm.

Eric Chen (eric-chen)
tags: added: bseng-196
Changed in juju-lint:
milestone: none → 1.0.3
Tianqi Xiao (txiao)
Changed in juju-lint:
status: New → Fix Committed
Changed in juju-lint:
status: Fix Committed → Fix Released
Revision history for this message
Peter Jose De Sousa (pjds) wrote (last edit ):

The fix provided via juju lint appears to not be working as expected. The values will be parsed down to the nagios python check file as:

`--additional-checks "'slow' 'requests'"` which fails consitently, the correct value would be --additional-checks 'slow requests'

Revision history for this message
Peter Sabaini (peter-sabaini) wrote :

Going to mark this as wontfix as we're transitioning away from LMA. The new COS based metrics and alerting does support checking for blocked pgs https://discourse.ubuntu.com/t/metrics-and-alerts/45526

Changed in charm-ceph-mon:
status: New → Won't Fix
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Duplicates of this bug

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.