Instances stuck in deleting task_state require n-cpu restart to remove

Bug #1299139 reported by Matt Riedemann
This bug affects 5 people
Affects: OpenStack Compute (nova) | Status: New | Importance: Undecided | Assigned to: Unassigned | Milestone: none

Bug Description

Bug 1248563 "Instance deletion is prevented when another component locks up" was partially fixed by https://review.openstack.org/#/c/55444/, but that change introduces another problem: subsequent delete requests are ignored.

When doing Tempest third-party CI runs, we see instances fail to build (a scheduling/resource problem, a timeout, whatever), get stuck in the 'deleting' task_state, and never get cleaned up.

The patch even says:

"Dealing with delete requests that never got executed is not in scope of this change and will be submitted separately."

That's the bug reported here. For example, this is several hours after our Tempest run finished:

http://paste.openstack.org/show/74584/

There is also some history after patch 55444 merged: we had this revert of a revert, https://review.openstack.org/#/c/70187/, which was itself reverted again later because it was causing race failures in the Hyper-V CI:

https://review.openstack.org/#/c/71363/

So there is a lot of half-baked code here. I haven't been able to get a response from Stan on bug 1248563, but it basically boils down to this: the original change 55444 depended on some later changes working, and those were ultimately reverted because they caused race conditions that broke the gate.

I would propose that, at least for icehouse-rc1, we revert the original patch, since it's not a complete solution and introduces another bug.
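To make the failure mode concrete, here is a minimal toy sketch (not nova's actual code; the names `request_delete` and `Instance` are invented for illustration) of the guard that change 55444 introduced: once task_state is 'deleting', later delete requests are silently ignored, so if the first delete is lost the instance is stuck forever.

```python
DELETING = 'deleting'


class Instance:
    """Toy stand-in for a nova instance record."""
    def __init__(self):
        self.task_state = None
        self.deleted = False


def request_delete(instance, compute_alive=True):
    # Guard (sketch of change 55444): assume a delete is already in
    # flight and ignore the request -- this is the reported bug.
    if instance.task_state == DELETING:
        return 'ignored'
    instance.task_state = DELETING
    if compute_alive:
        # terminate_instance runs normally on the compute node.
        instance.deleted = True
        instance.task_state = None
        return 'deleted'
    # The cast to compute was lost; nothing will ever finish the delete.
    return 'accepted'


inst = Instance()
print(request_delete(inst, compute_alive=False))  # 'accepted', but lost
print(request_delete(inst))                       # 'ignored'
print(request_delete(inst))                       # 'ignored' -- stuck
```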

Tags: api
Revision history for this message
Matt Riedemann (mriedem) wrote :

Further, this was the patch to cleanup instances stuck in 'deleting' task_state:

https://review.openstack.org/#/c/55660/

So the workaround here is that you have to restart the compute service, which is not an ideal solution.
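A rough sketch of why the restart works (hypothetical function name; this only approximates what nova's startup cleanup does): on startup the compute manager walks its instances and re-runs the termination path for any still marked 'deleting'.

```python
def init_host(instances):
    """Sketch: on service startup, finish any delete left in flight."""
    cleaned = []
    for inst in instances:
        if inst['task_state'] == 'deleting':
            # Re-run the termination path that the lost request never ran.
            inst['deleted'] = True
            inst['task_state'] = None
            cleaned.append(inst['uuid'])
    return cleaned


stuck = [{'uuid': 'abc', 'task_state': 'deleting', 'deleted': False},
         {'uuid': 'def', 'task_state': None, 'deleted': False}]
print(init_host(stuck))  # only the stuck instance gets cleaned up
```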

Changed in nova:
importance: Undecided → High
tags: added: icehouse-rc-potential
Revision history for this message
Matt Riedemann (mriedem) wrote :

Note that the reset-state API doesn't help: the instance just ends up in error/deleting state, and you still can't force the delete; you have to restart the compute service.

Revision history for this message
Matt Riedemann (mriedem) wrote :

Tracing the first bad instance I have in that paste, 1d278e77-9900-4b2b-85eb-3a6aac4198c1, I see the delete request in the nova api log (attached):

2014-03-27 17:25:33.431 12851 INFO nova.osapi_compute.wsgi.server [req-06c80607-8eef-409e-a29c-152d693635ad 7bdd970e41c14774b4dd8f4e0e92d05c 5ab6bffec8ae44beb1b69750a0e8ea00] 9.5.48.212 "DELETE /v2/5ab6bffec8ae44beb1b69750a0e8ea00/servers/1d278e77-9900-4b2b-85eb-3a6aac4198c1 HTTP/1.1" status: 204 len: 198 time: 0.5162849

But I never see any terminate_instance happening in the compute log for that instance uuid (compute log attached).

The VM is no longer on the hypervisor either, so the driver did delete it, but something blew up somewhere; I'm not sure why I can't find that in the logs.

Revision history for this message
Matt Riedemann (mriedem) wrote :
tags: removed: icehouse-rc-potential
Changed in nova:
importance: High → Undecided
Revision history for this message
Matt Riedemann (mriedem) wrote :

This is also related to bug 1296414: if you bomb out hard and go over quota, you can delete the instances from the database and restart n-cpu to clear them automatically on startup, but the quotas aren't cleaned up to match, so you can't boot any more instances at that point.
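A toy sketch of that quota mismatch (hypothetical `Quotas` class; not nova's real quota API): deleting instance rows straight from the database bypasses the usage decrement, so the quota stays consumed even though no instances exist.

```python
class OverQuota(Exception):
    pass


class Quotas:
    """Toy quota tracker: usage is only decremented via release()."""
    def __init__(self, limit):
        self.limit = limit
        self.in_use = 0

    def reserve(self):
        if self.in_use >= self.limit:
            raise OverQuota('instances quota exceeded')
        self.in_use += 1

    def release(self):
        self.in_use -= 1


q = Quotas(limit=2)
q.reserve()
q.reserve()  # two instances booted; usage == limit
# Deleting the instance rows directly from the DB never calls release(),
# so usage stays at 2 and the next boot is rejected as over-quota.
try:
    q.reserve()
except OverQuota as e:
    print('boot rejected:', e)
```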

summary: - Instances stuck in deleting task_state never cleaned up
+ Instances stuck in deleting task_state require n-cpu restart to remove
Revision history for this message
jichenjc (jichenjc) wrote :

Take e927334d-7a07-487c-9125-d2ddab9a8441 as an example.

I can see the claim succeed, but I can't see 'Instance spawned successfully.' (which would indicate a successful spawn) in the compute log. It looks to me like something went wrong in the build process on the PowerVM, so the instance stays stuck in the build state. That is why you see 'During sync_power_state the instance has a pending task. Skip.' for this instance in every subsequent period in the log.

I don't have any other detailed info to refer to, so I don't know what happened during the spawn; that is what we need to look at and handle.

My guess is that because the spawn is stuck in the build state, the delete does not succeed and the instance.terminated_at field in the DB is never updated.

This is a rough guess from the logs I can read, so let me know your opinion.

Revision history for this message
Darren Carpenter (wdarrenc) wrote :

I've run into the same issue in a Newton environment. The delete was sent about an hour or two before I took a look; "virsh list --all" shows the instance shut off:

[1] instance completes build

nova-compute.log:2018-03-14 20:57:52.304 46562 INFO nova.compute.manager [req-22d52f8e-e85d-4f64-b4ba-e938f4c49ed7 71aa816389589b718658b342803f9e3e9ab0baaae13597cf918694e0dbd6c0b9 a918bbf8b58a4d5bb12d7287b56fbb87 - - -] [instance: 2d404a4e-569f-4ad3-9f81-c00fd6045d2d] Took 18.21 seconds to build instance.

[2] delete call is sent

/var/log/nova/nova-compute.log:2018-03-14 20:57:32.293 22377 INFO nova.compute.manager [req-22d52f8e-e85d-4f64-b4ba-e938f4c49ed7 71aa816389589b718658b342803f9e3e9ab0baaae13597cf918694e0dbd6c0b9 a918bbf8b58a4d5bb12d7287b56fbb87 - - -] [instance: 2d404a4e-569f-4ad3-9f81-c00fd6045d2d] Terminating instance

[3] View from the compute (shows the compute choking, and the restart of the compute service to complete the delete)

nova-compute.log:2018-03-15 14:35:00.533 46562 INFO nova.compute.manager [-] [instance: 2d404a4e-569f-4ad3-9f81-c00fd6045d2d] During sync_power_state the instance has a pending task (deleting). Skip.
nova-compute.log:2018-03-15 14:45:18.541 46562 INFO nova.compute.manager [-] [instance: 2d404a4e-569f-4ad3-9f81-c00fd6045d2d] During sync_power_state the instance has a pending task (deleting). Skip.
nova-compute.log:2018-03-15 14:55:27.558 46562 INFO nova.compute.manager [-] [instance: 2d404a4e-569f-4ad3-9f81-c00fd6045d2d] During sync_power_state the instance has a pending task (deleting). Skip.
nova-compute.log:2018-03-15 14:56:10.768 46562 INFO nova.compute.manager [-] [instance: 2d404a4e-569f-4ad3-9f81-c00fd6045d2d] VM Stopped (Lifecycle Event)
nova-compute.log:2018-03-15 14:56:10.899 46562 INFO nova.compute.manager [req-0af61757-8dc4-4770-b7e3-69404ed31acd - - - - -] [instance: 2d404a4e-569f-4ad3-9f81-c00fd6045d2d] During sync_power_state the instance has a pending task (deleting). Skip.
nova-compute.log:2018-03-15 15:05:31.645 46562 INFO nova.compute.manager [-] [instance: 2d404a4e-569f-4ad3-9f81-c00fd6045d2d] During sync_power_state the instance has a pending task (deleting). Skip.
nova-compute.log:2018-03-15 15:15:58.511 46562 INFO nova.compute.manager [-] [instance: 2d404a4e-569f-4ad3-9f81-c00fd6045d2d] During sync_power_state the instance has a pending task (deleting). Skip.
nova-compute.log:2018-03-15 15:26:07.537 46562 INFO nova.compute.manager [-] [instance: 2d404a4e-569f-4ad3-9f81-c00fd6045d2d] During sync_power_state the instance has a pending task (deleting). Skip.
nova-compute.log:2018-03-15 15:36:10.533 46562 INFO nova.compute.manager [-] [instance: 2d404a4e-569f-4ad3-9f81-c00fd6045d2d] During sync_power_state the instance has a pending task (deleting). Skip.
nova-compute.log:2018-03-15 15:46:39.554 46562 INFO nova.compute.manager [-] [instance: 2d404a4e-569f-4ad3-9f81-c00fd6045d2d] During sync_power_state the instance has a pending task (deleting). Skip.
nova-compute.log:2018-03-15 15:57:12.604 46562 INFO nova.compute.manager [-] [instance: 2d404a4e-569f-4ad3-9f81-c00fd6045d2d] During sync_power_state the instance has a pending task (deleting). Skip.
nova-compute.log:2018-03-15 16:07:34.577...

