config-download: ensure pacemaker nodes start steps at same time

Bug #1768238 reported by James Slagle
This bug affects 1 person
Affects: tripleo
Status: Fix Released
Importance: High
Assigned to: James Slagle

Bug Description

Ansible runs each task on nodes in batches whose size is set by ANSIBLE_FORKS. The default is 5. When deploying with a node count greater than the value of ANSIBLE_FORKS, different nodes can run and finish the same task at different times.

This causes a problem for Pacemaker nodes: every node must be on the same step at the same time, otherwise the cluster won't initialize properly. We need to fix this to ensure that Pacemaker nodes always run each step at the same time.
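
The batching effect described above can be sketched in Python. This is a simplified model, not Ansible's actual scheduler: it assumes hosts are processed in fixed consecutive groups of ANSIBLE_FORKS, and the host names and inventory order are hypothetical.

```python
def waves(hosts, forks=5):
    """Split hosts into consecutive groups of at most `forks` hosts."""
    return [hosts[i:i + forks] for i in range(0, len(hosts), forks)]


def wave_of(host, hosts, forks=5):
    """Return the index of the group in which `host` runs the task."""
    return hosts.index(host) // forks


# Hypothetical inventory order: one controller sorts after the computes.
hosts = [
    "controller-0", "controller-1",
    "compute-0", "compute-1", "compute-2",
    "controller-2",
]
pacemaker_nodes = ["controller-0", "controller-1", "controller-2"]

for index, wave in enumerate(waves(hosts)):
    print(index, wave)

# More than one index here means the Pacemaker nodes do not all
# start the task together under this model.
split = sorted({wave_of(h, hosts) for h in pacemaker_nodes})
print("pacemaker nodes span groups:", split)
```

With six hosts and the default of 5 forks, the last host in this ordering runs in a second group, so the three Pacemaker nodes are not on the task at the same time.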

Changed in tripleo:
status: New → Triaged
importance: Undecided → High
milestone: none → rocky-2
tags: added: config-download
James Slagle (james-slagle) wrote :
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-heat-templates (master)

Fix proposed to branch: master
Review: https://review.openstack.org/565543

Changed in tripleo:
assignee: nobody → James Slagle (james-slagle)
status: Triaged → In Progress
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-heat-templates (master)

Reviewed: https://review.openstack.org/565543
Committed: https://git.openstack.org/cgit/openstack/tripleo-heat-templates/commit/?id=1497da08d5cdce4c72e12b422a70905bf30bda6b
Submitter: Zuul
Branch: master

commit 1497da08d5cdce4c72e12b422a70905bf30bda6b
Author: James Slagle <email address hidden>
Date: Tue May 1 13:21:23 2018 -0400

    Run tasks on primary role first

    This patch adds the primary role name as the first host pattern in the
    individual plays in deploy-steps.j2. This will ensure that the primary
    role will execute tasks first, which is needed so that all Pacemaker
    nodes run the same step at the same time.

    Change-Id: I9c499be87ce51ae28914b013b4b91446a3a68015
    Closes-Bug: #1768238
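
The effect of putting the primary role first in the host pattern can be sketched in Python. This is an illustration under assumptions, not the template change itself: it assumes a colon-separated pattern such as `Controller:overcloud` is resolved as an ordered union of groups, and the group contents and host names are hypothetical.

```python
def resolve_pattern(pattern, groups):
    """Resolve a colon-separated pattern into an ordered, de-duplicated host list.

    Groups are visited left to right, so hosts from the first group come first.
    """
    ordered = []
    for name in pattern.split(":"):
        for host in groups[name]:
            if host not in ordered:
                ordered.append(host)
    return ordered


# Hypothetical inventory groups; "Controller" stands in for the primary role.
groups = {
    "overcloud": [
        "controller-0", "controller-1",
        "compute-0", "compute-1", "compute-2",
        "controller-2",
    ],
    "Controller": ["controller-0", "controller-1", "controller-2"],
}

before = resolve_pattern("overcloud", groups)
after = resolve_pattern("Controller:overcloud", groups)

forks = 5
print("first group before:", before[:forks])
print("first group after: ", after[:forks])
```

In this model all three controllers land within the first 5 hosts once the primary role leads the pattern. The sketch says nothing about deployments where the primary role itself has more nodes than ANSIBLE_FORKS.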

Changed in tripleo:
status: In Progress → Fix Released
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-heat-templates 9.0.0.0b3

This issue was fixed in the openstack/tripleo-heat-templates 9.0.0.0b3 development milestone.

OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-heat-templates (stable/queens)

Fix proposed to branch: stable/queens
Review: https://review.openstack.org/582427

OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-heat-templates (stable/queens)

Reviewed: https://review.openstack.org/582427
Committed: https://git.openstack.org/cgit/openstack/tripleo-heat-templates/commit/?id=ae528b662630d9950ee098a5f775489f7861f7f5
Submitter: Zuul
Branch: stable/queens

commit ae528b662630d9950ee098a5f775489f7861f7f5
Author: James Slagle <email address hidden>
Date: Tue May 1 13:21:23 2018 -0400

    Run tasks on primary role first

    This patch adds the primary role name as the first host pattern in the
    individual plays in deploy-steps.j2. This will ensure that the primary
    role will execute tasks first, which is needed so that all Pacemaker
    nodes run the same step at the same time.

    Change-Id: I9c499be87ce51ae28914b013b4b91446a3a68015
    Closes-Bug: #1768238
    (cherry picked from commit 1497da08d5cdce4c72e12b422a70905bf30bda6b)

tags: added: in-stable-queens
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-heat-templates 8.0.5

This issue was fixed in the openstack/tripleo-heat-templates 8.0.5 release.
