Cluster upgrade from 9.0 to 9.1 broke corosync cluster
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Fuel for OpenStack | Won't Fix | High | MOS Maintenance |
Mitaka | Won't Fix | High | MOS Maintenance |
Newton | Won't Fix | High | MOS Maintenance |
Bug Description
Cluster upgrade from 9.0 to 9.1 broke the pacemaker/corosync cluster
Steps to reproduce:
1. Create a 9.0 cluster (3 controllers, 3 compute+ceph-osd)
2. Change DNS_DOMAIN and DNS_SEARCH in /etc/fuel/
3. Launch /etc/puppet/
4. Successfully deployed changes in the environment
5. Everything worked after step 4
6. Upgrade to Fuel 9.1 using https:/
After step 11 ('fuel2 update --env <ENV_ID> install'), the deployment is in error state.
Pacemaker shows most services in stopped state.
# pcs status
Cluster name:
WARNING: corosync and pacemaker node names do not match (IPs used in setup?)
Last updated: Fri Nov 11 15:27:38 2016 Last change: Thu Nov 10 16:03:59 2016 by root via crm_resource on node-3.domain.local
Stack: corosync
Current DC: node-1.domain.local (version 1.1.14-70404b0) - partition with quorum
3 nodes and 46 resources configured
Online: [ node-1.domain.local node-2.domain.local node-3.domain.local ]
Full list of resources:
Clone Set: clone_p_vrouter [p_vrouter]
Started: [ node-2.domain.local node-3.domain.local ]
Stopped: [ node-1.domain.local ]
vip__management (ocf::fuel:
vip__vrouter_pub (ocf::fuel:
vip__vrouter (ocf::fuel:
vip__public (ocf::fuel:
Clone Set: clone_p_haproxy [p_haproxy]
Stopped: [ node-1.domain.local node-2.domain.local node-3.domain.local ]
Clone Set: clone_p_mysqld [p_mysqld]
sysinfo_
sysinfo_
Master/Slave Set: master_p_conntrackd [p_conntrackd]
Stopped: [ node-1.domain.local node-2.domain.local node-3.domain.local ]
Master/Slave Set: master_
Stopped: [ node-1.domain.local node-2.domain.local node-3.domain.local ]
Clone Set: clone_neutron-
Stopped: [ node-1.domain.local node-2.domain.local node-3.domain.local ]
Clone Set: clone_neutron-
Stopped: [ node-1.domain.local node-2.domain.local node-3.domain.local ]
Clone Set: clone_neutron-
Stopped: [ node-1.domain.local node-2.domain.local node-3.domain.local ]
Clone Set: clone_p_heat-engine [p_heat-engine]
Stopped: [ node-1.domain.local node-2.domain.local node-3.domain.local ]
Clone Set: clone_neutron-
Stopped: [ node-1.domain.local node-2.domain.local node-3.domain.local ]
Clone Set: clone_p_dns [p_dns]
Stopped: [ node-1.domain.local node-2.domain.local node-3.domain.local ]
sysinfo_
Clone Set: clone_ping_
Stopped: [ node-1.domain.local node-2.domain.local node-3.domain.local ]
Clone Set: clone_p_ntp [p_ntp]
Stopped: [ node-1.domain.local node-2.domain.local node-3.domain.local ]
PCSD Status:
node-
node-
node-
# crm
crm(live)# status
Last updated: Fri Nov 11 15:28:03 2016 Last change: Thu Nov 10 16:03:59 2016 by root via crm_resource on node-3.domain.local
Stack: corosync
Current DC: node-1.domain.local (version 1.1.14-70404b0) - partition with quorum
3 nodes and 46 resources configured
Online: [ node-1.domain.local node-2.domain.local node-3.domain.local ]
Clone Set: clone_p_vrouter [p_vrouter]
Started: [ node-2.domain.local node-3.domain.local ]
vip__vrouter_pub (ocf::fuel:
vip__vrouter (ocf::fuel:
crm(live)#
# crm configure show | grep location
location clone_p_
location clone_p_
location loc_ping_
location vip__management
location vip__management
location vip__management
location vip__public-
location vip__public-
location vip__public-
location vip__vrouter-
location vip__vrouter-
location vip__vrouter-
location vip__vrouter_
location vip__vrouter_
location vip__vrouter_
colocation conntrackd-
colocation dns-with-vrouter-ns inf: clone_p_dns clone_p_vrouter
colocation ntp-with-vrouter-ns inf: clone_p_ntp clone_p_vrouter
colocation vip__vrouter-
colocation vip_management-
colocation vip_public-
The cluster is non-operational.
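To make the scope of the outage easier to see, the stopped clone and master/slave sets can be pulled out of the `pcs status` text programmatically. The following is a minimal sketch, assuming output in the same layout as pasted above; the helper name `stopped_sets` and the `SAMPLE` text are illustrative only and are not part of pcs, crmsh, or Fuel.

```python
import re

# Illustrative helper (not part of pcs or Fuel): map each resource set name
# to the nodes listed on its "Stopped:" line in `pcs status`-style text.
SET_RE = re.compile(r"^\s*(?:Clone Set|Master/Slave Set):\s*(\S+)")
STOPPED_RE = re.compile(r"^\s*Stopped:\s*\[\s*(.*?)\s*\]")


def stopped_sets(status_text):
    result = {}
    current = None  # name of the set whose status lines are being read
    for line in status_text.splitlines():
        m = SET_RE.match(line)
        if m:
            current = m.group(1)
            continue
        m = STOPPED_RE.match(line)
        if m and current is not None:
            result[current] = m.group(1).split()
    return result


# Sample lines copied from the report above (assumed representative).
SAMPLE = """\
Online: [ node-1.domain.local node-2.domain.local node-3.domain.local ]
Clone Set: clone_p_vrouter [p_vrouter]
Started: [ node-2.domain.local node-3.domain.local ]
Stopped: [ node-1.domain.local ]
Clone Set: clone_p_haproxy [p_haproxy]
Stopped: [ node-1.domain.local node-2.domain.local node-3.domain.local ]
"""

if __name__ == "__main__":
    for name, nodes in stopped_sets(SAMPLE).items():
        print(f"{name}: stopped on {len(nodes)} node(s)")
```

Sets stopped on all three controllers (for example clone_p_haproxy) are the ones that make the cluster non-operational; primitives such as the vip__* resources are not covered by this sketch.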
Changed in fuel:
importance: Undecided → High
milestone: none → 11.0
status: Incomplete → Confirmed
assignee: nobody → Sergii Rizvan (srizvan)
Changed in fuel:
assignee: Sergii Rizvan (srizvan) → MOS Maintenance (mos-maintenance)
Changed in fuel:
status: In Progress → Won't Fix
Marking as Incomplete. Please attach a diagnostic snapshot.