Better health checks for radosgw
Bug #1946280 reported by
James Troup
This bug affects 3 people
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Ceph RADOS Gateway Charm |
Fix Released
|
Medium
|
Cornellius Metto |
Bug Description
We recently had a situation where one of the 3 radosgw daemons had gotten wedged and apache was returning a stream of 503s. This manifested on the swift client side as long time outs.
It looks like Ceph has supported /swift/healthcheck since 2016, so...
a) as long as haproxy is in use, we might as well make use of it's check capabilities and ask it to check /swift/healthcheck
b) we should have an nrpe check of the same
(This was PS5, so Focal/Ussuri, if it matters)
Changed in charm-ceph-radosgw: | |
status: | New → Triaged |
importance: | Undecided → Medium |
Changed in charm-ceph-radosgw: | |
assignee: | nobody → Cornellius Metto (ckmetto) |
Changed in charm-ceph-radosgw: | |
milestone: | none → 22.04 |
tags: | added: bseng-71 |
Changed in charm-ceph-radosgw: | |
status: | Fix Committed → Fix Released |
To post a comment you must log in.
For what it's worth, "ceph status" was showing only 2 "rgw" daemons out of 3. The wedged one wasn't in the list.