Active ceph-mgr crashes on receiving report from a non-active mgr
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Ubuntu Cloud Archive |
Fix Released
|
High
|
Ponnuvel Palaniyappan | ||
Ussuri |
Fix Released
|
Undecided
|
Unassigned | ||
ceph (Ubuntu) |
Fix Released
|
High
|
Ponnuvel Palaniyappan | ||
Focal |
Fix Released
|
High
|
Ponnuvel Palaniyappan |
Bug Description
[Impact]
An active ceph-mgr crashes and another ceph-mgr takes over and becomes
the active mgr. But this could again hit same issue and crash and the cycle can continue indefinitely (previously crashed ceph-mgr gets restarted by systemd).
This could affect the cluster stability/usability as ceph mgr handles a number of essential operations (modules that control/change Ceph cluster behaviour, metrics, etc).
[Test Plan]
Deploy and operate a Ceph cluster normally.
Increase the log level of mgr to 20.
Observe MMgrReport sent from non-active mgrs get ignored (no crash).
[Where problems could occur]
Possibly the fix may not actually fix and mgr continue to crash as before.
Might incorrectly ignore reports from active mgrs.
[Other Info]
Upstream main bug: https:/
Octopus backport PR: https:/
Octopus backport bug: https:/
This has been already been fixed and available in Pacific.
So needed to backport only for Octopus.
Changed in ceph (Ubuntu): | |
assignee: | nobody → Ponnuvel Palaniyappan (pponnuvel) |
status: | New → In Progress |
description: | updated |
Changed in ceph (Ubuntu Focal): | |
assignee: | nobody → Ponnuvel Palaniyappan (pponnuvel) |
status: | New → In Progress |
Changed in ceph (Ubuntu): | |
importance: | Undecided → High |
Changed in ceph (Ubuntu Focal): | |
importance: | Undecided → High |
tags: | added: sts |
Changed in cloud-archive: | |
assignee: | nobody → Ponnuvel Palaniyappan (pponnuvel) |
assignee: | Ponnuvel Palaniyappan (pponnuvel) → nobody |
importance: | Undecided → High |
status: | New → In Progress |
assignee: | nobody → Ponnuvel Palaniyappan (pponnuvel) |
Changed in ceph (Ubuntu): | |
status: | In Progress → Fix Released |
Changed in ceph (Ubuntu Focal): | |
status: | In Progress → Fix Released |
Changed in cloud-archive: | |
status: | In Progress → Fix Released |
Attaching debdiff for Focal.