Luminous -> Mimic upgrade fails on one osd unit, no alert in juju status
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Ceph OSD Charm | Triaged | High | Unassigned |
Bug Description
While upgrading the Ceph cluster in a Bionic Queens OpenStack deployment (latest stable charms) from luminous to mimic, 6 of the 7 ceph-osd units upgraded successfully, but one unit stayed on luminous:
root@obayifo:~# ceph-osd --version
ceph version 12.2.13 (584a20eb0237c6
root@obayifo:~# apt-cache policy ceph-osd
ceph-osd:
Installed: 12.2.13-
Candidate: 12.2.13-
Version table:
*** 12.2.13-
500 http://
100 /var/lib/
12.
500 http://
12.
500 http://
Whereas on a healthy node:
root@waldron:~# ceph-osd --version
ceph version 13.2.8 (5579a94fafbc1f
root@waldron:~# apt-cache policy ceph-osd
ceph-osd:
Installed: 13.2.8-
Candidate: 13.2.8-
Version table:
*** 13.2.8-
500 http://
100 /var/lib/
12.
500 http://
12.
500 http://
12.
500 http://
From what I can tell from the logs, the charm caught the upgrade request but somehow thought it was already on mimic:
2021-04-09 12:47:28 DEBUG juju.machinelock machinelock.go:172 machine lock acquired for ceph-osd/2 uniter (run config-changed hook)
2021-04-09 12:47:28 DEBUG juju.worker.
2021-04-09 12:47:28 DEBUG juju.worker.
2021-04-09 12:47:28 DEBUG juju.worker.uniter agent.go:20 [AGENT-STATUS] executing: running config-changed hook
2021-04-09 12:47:28 DEBUG juju.worker.
2021-04-09 12:47:28 DEBUG jujuc server.go:211 running hook tool "juju-log" for ceph-osd/
2021-04-09 12:47:28 DEBUG juju-log Hardening function 'config_changed'
2021-04-09 12:47:28 DEBUG jujuc server.go:211 running hook tool "config-get" for ceph-osd/
2021-04-09 12:47:28 DEBUG jujuc server.go:211 running hook tool "juju-log" for ceph-osd/
2021-04-09 12:47:28 DEBUG juju-log No hardening applied to 'config_changed'
2021-04-09 12:47:28 DEBUG jujuc server.go:211 running hook tool "juju-log" for ceph-osd/
2021-04-09 12:47:28 INFO juju-log old_version: mimic
2021-04-09 12:47:28 DEBUG jujuc server.go:211 running hook tool "juju-log" for ceph-osd/
2021-04-09 12:47:28 INFO juju-log new_version: mimic
2021-04-09 12:47:28 DEBUG jujuc server.go:211 running hook tool "juju-log" for ceph-osd/
2021-04-09 12:47:28 WARNING juju-log Support for use of upstream ``apt_pkg`` module in conjunctionwith charm-helpers is deprecated since 2019-06-25
2021-04-09 12:47:29 DEBUG jujuc server.go:211 running hook tool "juju-log" for ceph-osd/
2021-04-09 12:47:29 ERROR juju-log Invalid upgrade path from mimic to mimic. Valid paths are: ['firefly -> hammer', 'hammer -> jewel', 'jewel -> luminous', 'luminous -> mimic', 'mimic -> nautilus', 'nautilus -> octopus']
2021-04-09 12:47:29 DEBUG jujuc server.go:211 running hook tool "juju-log" for ceph-osd/
This is despite the version clearly being luminous in the output above. Unattended upgrades are turned off for this cloud, so nothing should have touched the ceph-osd packages between entering the new release configuration "cloud:
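To illustrate how the "Invalid upgrade path from mimic to mimic" error in the log can come about, here is a minimal sketch of a path check of that kind. It is not the charm's actual code: `UPGRADE_PATHS`, `is_valid_upgrade` and `pretty_paths` are hypothetical names, and the table simply mirrors the valid paths printed in the error message. The report does not establish where `old_version` is derived from; the point is only that if it does not come from the installed package, both values can resolve to mimic and the check rejects the upgrade.

```python
# Hypothetical sketch of an upgrade-path check; names are illustrative
# and are not taken from the ceph-osd charm.
UPGRADE_PATHS = {
    "firefly": "hammer",
    "hammer": "jewel",
    "jewel": "luminous",
    "luminous": "mimic",
    "mimic": "nautilus",
    "nautilus": "octopus",
}


def is_valid_upgrade(old_version, new_version):
    """Return True only for a single-step upgrade listed in UPGRADE_PATHS."""
    return UPGRADE_PATHS.get(old_version) == new_version


def pretty_paths():
    """Render the table the way the error message in the log lists it."""
    return ["{} -> {}".format(old, new) for old, new in UPGRADE_PATHS.items()]


# What the package on the failing unit would imply:
print(is_valid_upgrade("luminous", "mimic"))  # accepted
# What the hook actually logged (old_version: mimic, new_version: mimic):
print(is_valid_upgrade("mimic", "mimic"))     # rejected
```

With a check like this, a wrongly detected `old_version` only produces a logged error, which would be consistent with the unit staying "active" in juju status.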
Juju status for reference:
Model Controller Cloud/Region Version SLA Timestamp
openstack foundations-maas maas_cloud 2.8.10 unsupported 13:35:57Z
App Version Status Scale Charm Store Rev OS Notes
bcache-tuning active 7 bcache-tuning jujucharms 5 ubuntu
ceph-osd 13.2.8 active 7 ceph-osd jujucharms 308 ubuntu
lldpd active 0 lldpd jujucharms 7 ubuntu
neutron-openvswitch 12.1.1 active 0 neutron-openvswitch jujucharms 280 ubuntu
nova-compute 17.0.13 active 7 nova-compute jujucharms 325 ubuntu
nrpe-host active 7 nrpe jujucharms 70 ubuntu
ntp 3.2 active 7 ntp jujucharms 45 ubuntu
Unit Workload Agent Machine Public address Ports Message
ceph-osd/0* active idle 0 10.244.49.66 Unit is ready (1 OSD)
bcache-tuning/4 active idle 10.244.49.66 bcache devices tuned
nrpe-host/4 active idle 10.244.49.66 icmp,5666/tcp ready
ntp/5 active idle 10.244.49.66 123/udp chrony: Ready
ceph-osd/1 active idle 1 10.244.49.67 Unit is ready (1 OSD)
bcache-tuning/5 active idle 10.244.49.67 bcache devices tuned
nrpe-host/5 active idle 10.244.49.67 icmp,5666/tcp ready
ntp/6 active idle 10.244.49.67 123/udp chrony: Ready
ceph-osd/2 active idle 2 10.244.49.68 Unit is ready (1 OSD)
bcache-tuning/2 active idle 10.244.49.68 bcache devices tuned
nrpe-host/2* active idle 10.244.49.68 icmp,5666/tcp ready
ntp/3 active idle 10.244.49.68 123/udp chrony: Ready
ceph-osd/3 active idle 3 10.244.49.72 Unit is ready (1 OSD)
bcache-tuning/6 active idle 10.244.49.72 bcache devices tuned
nrpe-host/6 active idle 10.244.49.72 icmp,5666/tcp ready
ntp/7 active idle 10.244.49.72 123/udp chrony: Ready
ceph-osd/4 active idle 4 10.244.49.71 Unit is ready (1 OSD)
bcache-tuning/0* active idle 10.244.49.71 bcache devices tuned
nrpe-host/1 active idle 10.244.49.71 icmp,5666/tcp ready
ntp/2 active idle 10.244.49.71 123/udp chrony: Ready
ceph-osd/5 active idle 5 10.244.49.70 Unit is ready (1 OSD)
bcache-tuning/1 active idle 10.244.49.70 bcache devices tuned
nrpe-host/0 active idle 10.244.49.70 icmp,5666/tcp ready
ntp/1 active idle 10.244.49.70 123/udp chrony: Ready
ceph-osd/6 active idle 6 10.244.49.73 Unit is ready (1 OSD)
bcache-tuning/3 active idle 10.244.49.73 bcache devices tuned
nrpe-host/3 active idle 10.244.49.73 icmp,5666/tcp ready
ntp/4 active idle 10.244.49.73 123/udp chrony: Ready
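Since juju status reports every unit as ready, a per-host check of the installed release is one way to spot a straggler like this. The sketch below is an assumption-laden helper, not part of the charm: `release_from_version_output` and `units_not_on` are hypothetical names, and it expects the `ceph-osd --version` output to have been collected per host beforehand (for example by running the command on each machine). The sample strings are the truncated outputs quoted above.

```python
import re

# Ceph major version -> release name, for the releases relevant here.
RELEASE_BY_MAJOR = {12: "luminous", 13: "mimic", 14: "nautilus", 15: "octopus"}


def release_from_version_output(output):
    """Map `ceph-osd --version` output to a release name, or None if unknown."""
    match = re.search(r"ceph version (\d+)\.\d+\.\d+", output)
    if match is None:
        return None
    return RELEASE_BY_MAJOR.get(int(match.group(1)))


def units_not_on(expected, outputs_by_host):
    """Return the sorted hosts whose installed release differs from `expected`."""
    return sorted(
        host
        for host, output in outputs_by_host.items()
        if release_from_version_output(output) != expected
    )


outputs = {
    "obayifo": "ceph version 12.2.13 (584a20eb0237c6",
    "waldron": "ceph version 13.2.8 (5579a94fafbc1f",
}
print(units_not_on("mimic", outputs))
```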
tags: added: charm-upgrade
Subscribing field-high, as this has impacted an SQA upgrade test.