kayobe

kayobe overcloud service destroy fails on compute nodes if any VMs are running, after controllers are already gone

Bug #2050091 reported by Martin Ananda Boeker on 2024-01-22

This bug affects 1 person

Affects		Status	Importance	Assigned to	Milestone
	kayobe	Triaged	Medium	Unassigned

Bug Description

Environment: kayobe 14.1, openstack 2023.1

service destroy should stop all VMs before starting to destroy all containers, or destroy them regardless of running VMs. Failing half way through after controllers are already destroyed means the `openstack` command can't be used anymore, so you have to ssh to the compute nodes, docker exec into the nova containers, and kill the qemu processes.

Martin Ananda Boeker (mboeker) on 2024-01-22

summary:	- 'kayobe overcloud service destroy' fails if any VMs are running, but not - until controllers are destroyed + 'kayobe overcloud service destroy' fails if any VMs are running, but + after controllers are already gone.
summary:	- 'kayobe overcloud service destroy' fails if any VMs are running, but - after controllers are already gone. + 'kayobe overcloud service destroy' fails on compute nodes if any VMs are + running, but after controllers are already gone.

Martin Ananda Boeker (mboeker) on 2024-01-22

summary:

- 'kayobe overcloud service destroy' fails on compute nodes if any VMs are
- running, but after controllers are already gone.
+ kayobe overcloud service destroy fails on compute nodes if any VMs are
+ running, after controllers are already gone

Revision history for this message

Will Szumski (willjs) wrote on 2024-01-23:

Good catch and thanks for the workaround. We should investigate moving the check to start of the process to avoid this situation.

Will Szumski (willjs) on 2024-01-23

Changed in kayobe:
importance:	Undecided → Medium

Will Szumski (willjs) on 2024-01-23

Changed in kayobe:
status:	New → Triaged

Revision history for this message

Martin Ananda Boeker (mboeker) wrote on 2024-01-23:

Why even bother checking for qemu processes? If the user decides that they want to do a `service destroy -and-they-really-mean-it` then shouldn't it be a given that all VMs are going away? Or are they somehow preserved even after all nova and all volumes are gone?

Revision history for this message

Will Szumski (willjs) wrote on 2024-02-20:

> Why even bother checking for qemu processes? If the user decides that they want to do a `service destroy -and-they-really-mean-it` then shouldn't it be a given that all VMs are going away? Or are they somehow preserved even after all nova and all volumes are gone?

It would be useful to have the safety check if you were destroying services on a particular compute node in a production environment, but maybe we could add a flag to destroy all instances (more of a kolla-ansible feature)

Revision history for this message

Martin Ananda Boeker (mboeker) wrote on 2024-02-20: Re: [Bug 2050091] Re: kayobe overcloud service destroy fails on compute nodes if any VMs are running, after controllers are already gone

Valid, especially if only destroying one node. However, that check should
definitely happen before the controllers and everything else are destroyed,
especially if no limit is set, and then force delete everything. Having
nothing left of OpenStack except nova containers is probably not useful in
any situation.

Mit freundlichen Grüßen

Martin Ananda Boeker
EDV-Consult IT GmbH

On Tue, Feb 20, 2024, 18:16 Will Szumski <email address hidden> wrote:

> > Why even bother checking for qemu processes? If the user decides that
> they want to do a `service destroy -and-they-really-mean-it` then
> shouldn't it be a given that all VMs are going away? Or are they somehow
> preserved even after all nova and all volumes are gone?
>
> It would be useful to have the safety check if you were destroying
> services on a particular compute node in a production environment, but
> maybe we could add a flag to destroy all instances (more of a kolla-
> ansible feature)
>
> --
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/2050091
>
> Title:
> kayobe overcloud service destroy fails on compute nodes if any VMs are
> running, after controllers are already gone
>
> Status in kayobe:
> Triaged
>
> Bug description:
> Environment: kayobe 14.1, openstack 2023.1
>
> service destroy should stop all VMs before starting to destroy all
> containers, or destroy them regardless of running VMs. Failing half
> way through after controllers are already destroyed means the
> `openstack` command can't be used anymore, so you have to ssh to the
> compute nodes, docker exec into the nova containers, and kill the qemu
> processes.
>
> To manage notifications about this bug go to:
> https://bugs.launchpad.net/kayobe/+bug/2050091/+subscriptions
>
>

Report a bug

This report contains Public information

Everyone can see this information.

You are

Subscribing...

Edit bug mail

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.