max failure percentage doesn't work if the node is unreachable from the start

Bug #1908573 reported by Alex Schultz
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
tripleo
Fix Released
High
Alex Schultz

Bug Description

If you attempt to deploy 3 nodes and 1 is unreachable from the start, the deployment will proceed on the available nodes but will error at the end. We should be failing fast if a node is unavailable and the max_fail_percentage for all roles expects 0 failures.

Changed in tripleo:
assignee: nobody → Zahid Hasan (akkim31)
Revision history for this message
Alex Schultz (alex-schultz) wrote :
Changed in tripleo:
assignee: Zahid Hasan (akkim31) → Alex Schultz (alex-schultz)
status: Triaged → In Progress
Changed in tripleo:
milestone: wallaby-2 → wallaby-3
Changed in tripleo:
status: In Progress → Fix Released
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Related fix proposed to tripleo-heat-templates (stable/train)

Related fix proposed to branch: stable/train
Review: https://review.opendev.org/c/openstack/tripleo-heat-templates/+/800406

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Related fix merged to tripleo-heat-templates (stable/train)

Reviewed: https://review.opendev.org/c/openstack/tripleo-heat-templates/+/800406
Committed: https://opendev.org/openstack/tripleo-heat-templates/commit/b8d80df42f4e82a7554f9a35a9ab9fde6590c13f
Submitter: "Zuul (22348)"
Branch: stable/train

commit b8d80df42f4e82a7554f9a35a9ab9fde6590c13f
Author: Alex Schultz <email address hidden>
Date: Fri Dec 18 07:55:43 2020 -0700

    Fix unreachable handling

    When we clear the cached facts with unreachable nodes, we attempt to
    gather facts by default. This can cause the node to be skipped for every
    future playbook. This ends up bypassing all our failure percentage
    logic.

    CONFLICT: was lines 15,52. This is changed in Ussuri. To maintain
    consistency with Train, I have kept the Train method for gathering
    facts and just backported the gather_facts: false for the
    "Clear cached facts" task.

    Change-Id: Ie240877496b73a37f553a84af47dfebdbaf899e5
    Related-Bug: 1908573
    (cherry picked from commit 969693e667aba8d893f10f7b03bafde3b3f66287)
    (cherry picked from commit c9ffe726f70e08f8fa564a0d86bcea01f348546c)

tags: added: in-stable-train
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.