Cleanup pending instances in "building" state
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Compute (nova) |
Invalid
|
Undecided
|
Unassigned |
Bug Description
Following up on the ML thread [1], it was recommended to create a bug report.
After a network issue in a Victoria cluster (3 control nodes in HA mode, 26 compute nodes) some instance builds were interrupted. Some of them could be cleaned up with 'openstack server delete' but two of them can not. They already have a mapping but can not be removed (or "reset-state") by nova. Those are both amphora instances from octavia:
control01:~ # openstack server list --project service -c ID -c Name -c Status -f value | grep BUILD
0453a7e5-
dc8cdc3a-
control01:~ # openstack server delete amphora-
No server with a name or ID of
'amphora-
control01:~ # openstack server show 0453a7e5-
ERROR (CommandError): No server with a name or ID of
'0453a7e5-
The database tables referring to the UUID
0453a7e5-
nova_cell0/
nova_cell0/
nova_cell0/
nova_cell0/
nova_cell0/
octavia/amphora.ibd
nova_api/
nova_api/
I can provide both debug logs and database queries, just let me know what exactly is required.
The storage back end is ceph (Pacific), we use neutron with OpenVSwitch, the exact nova versions are:
control01:~ # rpm -qa | grep nova
openstack-
openstack-
openstack-
python3-
openstack-
openstack-
python3-
[1] https:/
description: | updated |
At least, can you verify whether you have a DB record for the instance_mappings table with the server UUID ?
If so, please tell us which cell this instance is using, so then you can also verify the cells DB for the instances table.