[Rocky to Stein] Galera bundle failing during the upgrade with: Could not determine galera name from pacemaker node <controller-2>.
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
tripleo |
Fix Released
|
High
|
Damien Ciabrini |
Bug Description
Launchpad based on BZ https:/
When upgrading from OSP14 to 15 we start by upgrading the first controller, upgrade it's OS and create a new cluster with that single node. Then the remaining controllers join the cluster.
When upgrading the second controller, if we do a pcs status we can see that the galera-bundle is stopped for that new node and an error appears:
Online: [ controller-0 controller-2 ]
GuestOnline: [ galera-
Full list of resources:
Container bundle set: galera-bundle [192.168.
galera-bundle-0 (ocf::heartbeat
galera-bundle-1 (ocf::heartbeat
Container bundle set: rabbitmq-bundle [192.168.
rabbitmq-
rabbitmq-
Container bundle set: redis-bundle [192.168.
redis-bundle-0 (ocf::heartbeat
redis-bundle-1 (ocf::heartbeat
ip-192.168.24.21 (ocf::heartbeat
ip-10.0.0.101 (ocf::heartbeat
ip-172.17.1.10 (ocf::heartbeat
ip-172.17.1.16 (ocf::heartbeat
ip-172.17.3.12 (ocf::heartbeat
ip-172.17.4.26 (ocf::heartbeat
Container bundle set: haproxy-bundle [192.168.
haproxy-
haproxy-
haproxy-
Container bundle: openstack-
openstack-
Container bundle: openstack-
openstack-
Failed Resource Actions:
* galera_start_0 on galera-bundle-1 'not configured' (6): call=39, status=complete, exitreason='Could not determine galera name from pacemaker node <controller-2>.',
last-
Daemon Status:
corosync: active/enabled
pacemaker: active/enabled
pcsd: active/enabled
The cause seems to be related to the pacemaker mapping variable cluster_host_map, which gets wrongly created:
When running the first controller upgrade (controller-0):
2020-01-16 01:07:40 | "Debug: try 1/10: pcs -f /var/lib/
However, when running the deploy steps for the second controller (controller-2):
2020-01-16 02:42:10 | "Debug: pcs_offline: pcs -f /var/lib/
The cluster_host_map is mapping controller-
Fix proposed to branch: master /review. opendev. org/702851
Review: https:/