openstack overcloud plan delete can get stuck forever

Bug #1777074 reported by Michele Baldessari
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
tripleo
Fix Released
Medium
Dougal Matthews

Bug Description

If the stack delete failed:
| 8ac16fe9-a8bc-4aba-99c6-f6244731af08 | overcloud | 4eb73f99e4ac4e9a9b904687023d2b3c | DELETE_FAILED | 2018-06-15T05:09:54Z | 2018-06-15T07:55:00Z |

It seems that the 'openstack overcloud plan delete' command can stay stuck forever.
(undercloud) [stack@undercloud ~]$ openstack overcloud delete --yes overcloud
Deleting stack overcloud...
Started Mistral Workflow tripleo.stack.v1.delete_stack. Execution ID: d8f03440-1e5b-4000-a6a1-6a4d54e808c7
Waiting for messages on queue 'tripleo' with no timeout.
.... hangs forever....

It should detect that the stack deletion failed and return an error or somehow display what it is trying to do as it is a bit confusing for an operator.

Tags: workflows
Changed in tripleo:
milestone: rocky-3 → rocky-rc1
Changed in tripleo:
milestone: rocky-rc1 → stein-1
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-common (master)

Fix proposed to branch: master
Review: https://review.openstack.org/587818

Changed in tripleo:
assignee: nobody → Dougal Matthews (d0ugal)
status: Triaged → In Progress
Changed in tripleo:
milestone: stein-1 → stein-2
Dougal Matthews (d0ugal)
Changed in tripleo:
assignee: Dougal Matthews (d0ugal) → nobody
Changed in tripleo:
assignee: nobody → Alex Schultz (alex-schultz)
Changed in tripleo:
assignee: Alex Schultz (alex-schultz) → Dougal Matthews (d0ugal)
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-common (master)

Reviewed: https://review.openstack.org/587818
Committed: https://git.openstack.org/cgit/openstack/tripleo-common/commit/?id=5399657bdd5a3866e8f4358b677cbfabe85cdeb5
Submitter: Zuul
Branch: master

commit 5399657bdd5a3866e8f4358b677cbfabe85cdeb5
Author: Dougal Matthews <email address hidden>
Date: Wed Aug 1 14:11:55 2018 +0100

    Exit wait_for_stack_does_not_exist if the status is delete failed

    The current implementation waits for 10 minutes, regardless of the stack
    status. This will exit earlier and fail the workflow, triggering the
    error handling in the delete_stack workflow.

    Change-Id: I1c38e9c4f1c8e5a7be8a354549ebfdcbd2e822cd
    Closes-Bug: #1777074

Changed in tripleo:
status: In Progress → Fix Released
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-common 10.3.0

This issue was fixed in the openstack/tripleo-common 10.3.0 release.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.