capture slow heart beats in ceph logs

Bug #1966616 reported by Linda Guo
10
This bug affects 2 people
Affects Status Importance Assigned to Milestone
prometheus-grok-exporter-charm
Won't Fix
High
Unassigned

Bug Description

we did not easily detect dropped packets on the ceph network

needs to look into using the grok exporter application to provide counts for slow heartbeats that would indicate a networking issues.

2022-03-21T10:14:17.974322+0000 mon.juju-a79b06-10-lxd-0 (mon.0) 9955308 : cluster [WRN] Health check failed: Slow OSD heartbeats on front (longest 2120.925ms) (OSD_SLOW_PING_TIME_FRONT)
2022-03-21T10:15:17.679289+0000 mon.juju-a79b06-10-lxd-0 (mon.0) 9955357 : cluster [INF] Health check cleared: OSD_SLOW_PING_TIME_FRONT (was: Slow OSD heartbeats on front (longest 2120.925ms))
2022-03-21T12:21:32.712067+0000 mon.juju-a79b06-10-lxd-0 (mon.0) 9961566 : cluster [WRN] Health check failed: Slow OSD heartbeats on back (longest 1130.745ms) (OSD_SLOW_PING_TIME_BACK)

Linda Guo (lihuiguo)
description: updated
Linda Guo (lihuiguo)
Changed in charm-prometheus-grok-exporter:
importance: Undecided → High
Revision history for this message
Eric Chen (eric-chen) wrote :

This charm is no longer being actively maintained. Please consider using the new Canonical Observability Stack instead. (https://charmhub.io/topics/canonical-observability-stack)

Changed in charm-prometheus-grok-exporter:
status: New → Won't Fix
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.