After replacing ceph-osd disk, blank directories /var/lib/ceph/osd/ceph-X that are unmounted cause hangup in ceph-osd upgrade
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Ceph OSD Charm |
Fix Released
|
High
|
Luciano Lo Giudice |
Bug Description
On one unit that was performing a luminous->mimic upgrade on charm version 21.04, experienced an infinite loop of the following in the unit debug-logs:
unit-ceph-osd-13: 20:42:16 DEBUG unit.ceph-
unit-ceph-osd-13: 20:42:16 DEBUG unit.ceph-
This exit status 22 is indicating the OSD doesn't exist or isn't running.
When investigating, I found the following /var/lib/ceph/osd/ directories and mounts:
myhost:
total 16
drwxr-xr-x 9 ceph ceph 4096 Jul 2 2020 .
drwxr-x--- 11 ceph ceph 4096 Sep 9 2019 ..
drwxr-xr-x 2 ceph ceph 4096 Sep 9 2019 ceph-51
drwxrwxrwt 2 ceph ceph 200 Jun 28 02:35 ceph-56
drwxrwxrwt 2 ceph ceph 200 Jun 28 02:35 ceph-60
drwxrwxrwt 2 ceph ceph 200 Jun 28 02:35 ceph-64
drwxrwxrwt 2 ceph ceph 200 Jun 28 02:35 ceph-68
drwxrwxrwt 2 ceph ceph 200 Jun 28 02:35 ceph-74
drwxrwxrwt 2 ceph ceph 200 Jun 28 02:35 ceph-89
-rw------- 1 ceph ceph 69 Sep 9 2019 ceph.client.
myhost:
tmpfs on /var/lib/
tmpfs on /var/lib/
tmpfs on /var/lib/
tmpfs on /var/lib/
tmpfs on /var/lib/
tmpfs on /var/lib/
running rmdir /var/lib/
tags: | added: openstack-upgrade |
Changed in charm-ceph-osd: | |
status: | New → Triaged |
importance: | Undecided → High |
Changed in charm-ceph-osd: | |
assignee: | nobody → Luciano Lo Giudice (lmlogiudice) |
Changed in charm-ceph-osd: | |
milestone: | none → 21.10 |
Changed in charm-ceph-osd: | |
status: | Fix Committed → Fix Released |
Fix proposed to branch: master /review. opendev. org/c/openstack /charm- ceph-osd/ +/806644
Review: https:/