pcs on host patchset triggers a problem during FFU on IHA deployments
| Affects | Status | Importance | Assigned to | Milestone |
|---|---|---|---|---|
| tripleo | Fix Released | High | Unassigned | |
Bug Description
With the merging of the pcs on host patchset for 16.2 we are seeing a problem with FFUs on Instance HA environments.
Preamble: tripleo keeps the stonith-enabled cluster property set to false until step 5.
With the pcs on host patchset, the enablement still happens at step 5, but it is now triggered during the tripleo_ha_wrapper deployment task of cinder-volume, which tries to restart the cinder-volume service (during the leapp of the first controller). The restart hangs forever because pacemaker is in the following transition:
- stonith-
- pacemaker wants to call stonith on for controller-0 (which is probably dumb, but it is unlikely we'll be able to change that in the right timeframe as it seems a potentially involved change in behaviour)
- Any other action, like cinder-volume restart in this case, is stuck and the FFU fails.
If we simply move the stonith resource creation to step 2 (and change nothing else about the stonith-enabled property being set at step 5), we should be good.
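The ordering argument can be illustrated with a toy model. This is only a sketch and not tripleo or puppet-tripleo code: the function names and action labels are hypothetical, and it assumes that stonith resource creation currently happens at step 5 alongside the enablement, which the description implies but does not state outright.

```python
# Toy model of the deployment-step ordering discussed in this report.
# All names are hypothetical; this is not tripleo or puppet-tripleo code.

CREATE = "create stonith resources"
ENABLE = "set stonith-enabled=true"
RESTART = "restart cinder-volume via tripleo_ha_wrapper"


def plan(stonith_create_step):
    """Return (step, action) pairs ordered by deployment step."""
    actions = [
        (stonith_create_step, CREATE),
        (5, ENABLE),    # unchanged: the property is still set at step 5
        (5, RESTART),   # the action that was observed to hang
    ]
    # sorted() is stable, so actions within the same step keep their order
    return sorted(actions, key=lambda item: item[0])


def creation_precedes_enablement(actions):
    """True if stonith resources are created in a strictly earlier step
    than the one where stonith-enabled is set to true."""
    steps = {name: step for step, name in actions}
    return steps[CREATE] < steps[ENABLE]


assumed_current = plan(stonith_create_step=5)  # assumption, see lead-in
proposed = plan(stonith_create_step=2)         # change suggested above

print(creation_precedes_enablement(assumed_current))  # False
print(creation_precedes_enablement(proposed))         # True
```

The model only captures ordering. It says nothing about why pacemaker blocks other actions while the stonith transition is pending.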
| Changed in tripleo: | |
| status: | Triaged → In Progress |

Fix proposed to branch: stable/victoria
Review: https://review.opendev.org/c/openstack/puppet-tripleo/+/786114