Ocata -> Pike upgrade: upgrade gets stuck on split stack deployments during Deployment_Step2 because the cluster is in maintenance mode
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
tripleo |
Fix Released
|
Critical
|
Marius Cornea |
Bug Description
[also discussed at https:/
Ocata -> Pike upgrade: upgrade gets stuck on split stack deployments during Deployment_Step2 because the cluster is in maintenance mode
Version-Release number of selected component (if applicable):
openstack-
How reproducible:
100%
Steps to Reproduce:
1. Deploy Ocata split stack deployment with 3 ctrl, 3 messaging, 3 db, 2 compute node, 3 ceph nodes
2. Upgrade to Pike
Actual results:
While running major-upgrade-
(undercloud) [stack@undercloud-0 ~]$ openstack stack list --nested | grep PROGRESS
| 8e40a6c7-
| dab455a8-
| 2edb57b9-
| f63bd95d-
(undercloud) [stack@undercloud-0 ~]$
Going to the database nodes we can see that the mysql_init_bundle has been running for 23 minutes:
[root@database-0 ~]# docker ps
CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES
4aa5d1cb91f3 192.168.
b9d4c6209a8c 192.168.
[root@database-0 ~]# pcs status
Cluster name: tripleo_cluster
Stack: corosync
Current DC: messaging-1 (version 1.1.16-
Last updated: Wed Oct 18 16:06:43 2017
Last change: Wed Oct 18 15:43:53 2017 by root via cibadmin on controller-0
18 nodes configured
36 resources configured (1 DISABLED)
*** Resource management is DISABLED ***
The cluster will not attempt to start, stop or recover services
Online: [ controller-0 controller-1 controller-2 database-0 database-1 database-2 messaging-0 messaging-1 messaging-2 ]
Full list of resources:
ip-192.168.0.66 (ocf::heartbeat
ip-172.16.18.27 (ocf::heartbeat
ip-10.0.0.16 (ocf::heartbeat
ip-10.0.0.138 (ocf::heartbeat
ip-10.0.1.14 (ocf::heartbeat
openstack-
Docker container set: redis-bundle [192.168.
redis-bundle-0 (ocf::heartbeat
redis-bundle-1 (ocf::heartbeat
redis-bundle-2 (ocf::heartbeat
Docker container set: rabbitmq-bundle [192.168.
rabbitmq-
rabbitmq-
rabbitmq-
Docker container set: galera-bundle [192.168.
galera-bundle-0 (ocf::heartbeat
galera-bundle-1 (ocf::heartbeat
galera-bundle-2 (ocf::heartbeat
Docker container set: haproxy-bundle [192.168.
haproxy-
haproxy-
haproxy-
Daemon Status:
corosync: active/enabled
pacemaker: active/enabled
pcsd: active/enabled
[root@database-0 ~]# pcs property list
Cluster Properties:
cluster-
cluster-name: tripleo_cluster
dc-version: 1.1.16-
have-watchdog: false
maintenance-mode: true
redis_REPL_INFO: controller-0
stonith-enabled: false
Node Attributes:
controller-0: cinder-
controller-1: cinder-
controller-2: cinder-
database-0: galera-role=true
database-1: galera-role=true
database-2: galera-role=true
messaging-0: rabbitmq-role=true rmq-node-
messaging-1: rabbitmq-role=true rmq-node-
messaging-2: rabbitmq-role=true rmq-node-
Expected results:
Upgrade doesn't get stuck.
Additional info:
After setting pcs property set maintenance-
Cluster name: tripleo_cluster
Stack: corosync
Current DC: messaging-1 (version 1.1.16-
Last updated: Wed Oct 18 16:10:37 2017
Last change: Wed Oct 18 16:09:36 2017 by rabbitmq-bundle-2 via crm_attribute on messaging-2
18 nodes configured
36 resources configured (1 DISABLED)
Online: [ controller-0 controller-1 controller-2 database-0 database-1 database-2 messaging-0 messaging-1 messaging-2 ]
GuestOnline: [ galera-
Full list of resources:
ip-192.168.0.66 (ocf::heartbeat
ip-172.16.18.27 (ocf::heartbeat
ip-10.0.0.16 (ocf::heartbeat
ip-10.0.0.138 (ocf::heartbeat
ip-10.0.1.14 (ocf::heartbeat
openstack-
Docker container set: redis-bundle [192.168.
redis-bundle-0 (ocf::heartbeat
redis-bundle-1 (ocf::heartbeat
redis-bundle-2 (ocf::heartbeat
Docker container set: rabbitmq-bundle [192.168.
rabbitmq-
rabbitmq-
rabbitmq-
Docker container set: galera-bundle [192.168.
galera-bundle-0 (ocf::heartbeat
galera-bundle-1 (ocf::heartbeat
galera-bundle-2 (ocf::heartbeat
Docker container set: haproxy-bundle [192.168.
haproxy-
haproxy-
haproxy-
Daemon Status:
corosync: active/enabled
pacemaker: active/enabled
pcsd: active/enabled
description: | updated |
Changed in tripleo: | |
milestone: | none → queens-1 |
Changed in tripleo: | |
milestone: | queens-1 → queens-2 |
Fix proposed to branch: master /review. openstack. org/513654
Review: https:/