kayobe overcloud service destroy fails on compute nodes if any VMs are running, after controllers are already gone

Bug #2050091 reported by Martin Ananda Boeker
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
kayobe
Triaged
Medium
Unassigned

Bug Description

Environment: kayobe 14.1, openstack 2023.1

service destroy should stop all VMs before starting to destroy all containers, or destroy them regardless of running VMs. Failing half way through after controllers are already destroyed means the `openstack` command can't be used anymore, so you have to ssh to the compute nodes, docker exec into the nova containers, and kill the qemu processes.

summary: - 'kayobe overcloud service destroy' fails if any VMs are running, but not
- until controllers are destroyed
+ 'kayobe overcloud service destroy' fails if any VMs are running, but
+ after controllers are already gone.
summary: - 'kayobe overcloud service destroy' fails if any VMs are running, but
- after controllers are already gone.
+ 'kayobe overcloud service destroy' fails on compute nodes if any VMs are
+ running, but after controllers are already gone.
summary: - 'kayobe overcloud service destroy' fails on compute nodes if any VMs are
- running, but after controllers are already gone.
+ kayobe overcloud service destroy fails on compute nodes if any VMs are
+ running, after controllers are already gone
Revision history for this message
Will Szumski (willjs) wrote :

Good catch and thanks for the workaround. We should investigate moving the check to start of the process to avoid this situation.

Will Szumski (willjs)
Changed in kayobe:
importance: Undecided → Medium
Will Szumski (willjs)
Changed in kayobe:
status: New → Triaged
Revision history for this message
Martin Ananda Boeker (mboeker) wrote :

Why even bother checking for qemu processes? If the user decides that they want to do a `service destroy -and-they-really-mean-it` then shouldn't it be a given that all VMs are going away? Or are they somehow preserved even after all nova and all volumes are gone?

Revision history for this message
Will Szumski (willjs) wrote :

> Why even bother checking for qemu processes? If the user decides that they want to do a `service destroy -and-they-really-mean-it` then shouldn't it be a given that all VMs are going away? Or are they somehow preserved even after all nova and all volumes are gone?

It would be useful to have the safety check if you were destroying services on a particular compute node in a production environment, but maybe we could add a flag to destroy all instances (more of a kolla-ansible feature)

Revision history for this message
Martin Ananda Boeker (mboeker) wrote : Re: [Bug 2050091] Re: kayobe overcloud service destroy fails on compute nodes if any VMs are running, after controllers are already gone

Valid, especially if only destroying one node. However, that check should
definitely happen before the controllers and everything else are destroyed,
especially if no limit is set, and then force delete everything. Having
nothing left of OpenStack except nova containers is probably not useful in
any situation.

Mit freundlichen Grüßen

Martin Ananda Boeker
EDV-Consult IT GmbH

On Tue, Feb 20, 2024, 18:16 Will Szumski <email address hidden> wrote:

> > Why even bother checking for qemu processes? If the user decides that
> they want to do a `service destroy -and-they-really-mean-it` then
> shouldn't it be a given that all VMs are going away? Or are they somehow
> preserved even after all nova and all volumes are gone?
>
> It would be useful to have the safety check if you were destroying
> services on a particular compute node in a production environment, but
> maybe we could add a flag to destroy all instances (more of a kolla-
> ansible feature)
>
> --
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/2050091
>
> Title:
> kayobe overcloud service destroy fails on compute nodes if any VMs are
> running, after controllers are already gone
>
> Status in kayobe:
> Triaged
>
> Bug description:
> Environment: kayobe 14.1, openstack 2023.1
>
> service destroy should stop all VMs before starting to destroy all
> containers, or destroy them regardless of running VMs. Failing half
> way through after controllers are already destroyed means the
> `openstack` command can't be used anymore, so you have to ssh to the
> compute nodes, docker exec into the nova containers, and kill the qemu
> processes.
>
> To manage notifications about this bug go to:
> https://bugs.launchpad.net/kayobe/+bug/2050091/+subscriptions
>
>

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.