HA: pacemaker resource lock impacts cluster transitions

Bug #1905585 reported by Damien Ciabrini
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
tripleo
In Progress
Undecided
Damien Ciabrini

Bug Description

A resource lock is used as a synchronization point between pacemaker
cluster nodes. It is currently implemented by adding an attribute in
an offline copy of CIB, and merging the update in the CIB only if no
concurrent updates has occurred in the mean time.

The problem with that approach is that - even if the concurrency is
enforced by pacemaker - the offline CIB contains a snapshot of the
cluster state; so pushing back the entire offline CIB pushes old
resources' state back into the cluster. This causes additional burden
on the cluster and sometimes caused unexpected cluster state
transition.

Internally we have observed unexpected resource transitions in
pacemaker when this resource lock was used to synchronize certificate
renewal for the galera resource agent.

Changed in tripleo:
milestone: wallaby-1 → wallaby-2
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-heat-templates 14.0.0

This issue was fixed in the openstack/tripleo-heat-templates 14.0.0 release.

Changed in tripleo:
milestone: wallaby-2 → wallaby-3
Changed in tripleo:
milestone: wallaby-3 → wallaby-rc1
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-heat-templates 13.2.0

This issue was fixed in the openstack/tripleo-heat-templates 13.2.0 release.

Changed in tripleo:
milestone: wallaby-rc1 → xena-1
Changed in tripleo:
milestone: xena-1 → xena-2
Changed in tripleo:
milestone: xena-2 → xena-3
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.