Evacuated instances should be completed when ComputeHostNotFound

Bug #1952745 reported by Konrad Cempura
This bug affects 1 person
Affects: OpenStack Compute (nova)
Status: In Progress
Importance: Undecided
Assigned to: Unassigned

Bug Description

Scenario 1:
- remove a compute node physically by formatting its disk
- evacuate the VMs from the removed compute
- remove the orphaned resource provider for the removed compute
- add a new compute with the same name as the removed one
- migrate the evacuated VMs to the new compute

Expected result
===============
VMs are working correctly.

Actual result
=============
Definitions of VMs are removed from libvirt.

Scenario 2:
- remove a compute node physically by formatting its disk
- evacuate the VMs from the removed compute
- remove the orphaned resource provider for the removed compute
- add a new compute with the same name as the removed one
- restart nova_compute

Expected result
===============
Evacuations are completed on the first run of nova_compute.

Actual result
=============
Evacuations are only completed after a restart, i.e. on the second run of nova_compute.

Scenario 3:
- remove a compute node physically by formatting its disk
- evacuate the VMs from the removed compute
- add a new compute with the same name as the removed one, but in capital letters
- migrate the evacuated VMs to the new compute

Expected result
===============
VMs are working correctly on the new compute.

Actual result
=============
Definitions of VMs are removed from libvirt.

Environment
===========
1. OpenStack Train
Commit-Id: 4cf72ea6bfc58d33da894f248184c08c36055884
Also occurs on master.

2. Libvirt + KVM
libvirtd (libvirt) 4.5.0

Proposed solution
=================

Evacuations should be completed when ComputeHostNotFound occurs.

Proposed patch
==============

diff --git a/nova/compute/manager.py b/nova/compute/manager.py
index eaedc0238f..df56430aa6 100644
--- a/nova/compute/manager.py
+++ b/nova/compute/manager.py
@@ -752,16 +752,16 @@ class ComputeManager(manager.Manager):
                         context, self.host, migration.source_node).uuid
                     compute_nodes[migration.source_node] = cn_uuid
                 except exception.ComputeHostNotFound:
-                    LOG.error("Failed to clean allocation of evacuated "
-                              "instance as the source node %s is not found",
-                              migration.source_node, instance=instance)
-                    continue
-            cn_uuid = compute_nodes[migration.source_node]
+                    LOG.warning("Failed to clean allocation of evacuated "
+                                "instance as the source node %s is not found",
+                                migration.source_node, instance=instance)
+
+            cn_uuid = compute_nodes.get(migration.source_node)

             # If the instance was deleted in the interim, assume its
             # allocations were properly cleaned up (either by its hosting
             # compute service or the API).
-            if (not instance.deleted and
+            if (cn_uuid and not instance.deleted and
                     not self.reportclient.
                         remove_provider_tree_from_instance_allocation(
                             context, instance.uuid, cn_uuid)):

Revision history for this message
Sylvain Bauza (sylvain-bauza) wrote :

What exactly do you mean by "Definitions of VMs are removed from libvirt."?

Which migration are you doing? I guess cold migrate? If so, are you confirming the resize, or do you have a config option value for it?

https://docs.openstack.org/api-ref/compute/?expanded=migrate-server-migrate-action-detail#migrate-server-migrate-action

TBH, I don't really see a problem with what you say: if you recreate a nova-compute service, you need to restart it to remove the evacuated instances, but maybe I am misunderstanding your concerns.

Changed in nova:
status: New → Incomplete
Revision history for this message
Konrad Cempura (kcem) wrote :

The XML definitions of the VMs in libvirt. It looks like the VMs are removed from the evacuated compute, but it is not that compute anymore; it only has the same name.

Yes, cold migration. I migrate the VMs to a new compute with the same name as the compute from which the VMs were evacuated earlier. I confirm the migration and everything looks OK until nova_compute is restarted.

I am sorry if the description is not detailed enough. I'll try to give more details.

Scenario 1:
===========

As described before, plus details on the cold migration:

Cold migrate 34f25e4c-6069-4ce0-ac28-79d28890d50a and a7f4c239-0df1-4b40-99d1-dcd1f71799f1 to vcmp1 and confirm (Ids 6 and 8).

[softiops@vosc1 ~]$ nova migration-list
+----+--------------------------------------+-------------+-----------+----------------+--------------+-------------+-----------+--------------------------------------+------------+------------+----------------------------+----------------------------+------------+
| Id | UUID | Source Node | Dest Node | Source Compute | Dest Compute | Dest Host | Status | Instance UUID | Old Flavor | New Flavor | Created At | Updated At | Type |
+----+--------------------------------------+-------------+-----------+----------------+--------------+-------------+-----------+--------------------------------------+------------+------------+----------------------------+----------------------------+------------+
| 8 | 46f06d16-e039-4bf7-8d2d-2df2e1c9ad48 | vcmp0 | vcmp1 | vcmp0 | vcmp1 | 128.10.0.21 | confirmed | 34f25e4c-6069-4ce0-ac28-79d28890d50a | 1 | 1 | 2021-11-30T15:49:09.000000 | 2021-11-30T15:49:26.000000 | migration |
| 6 | 4792706a-d9d9-4acc-a990-8d9bbc712f79 | vcmp0 | vcmp1 | vcmp0 | vcmp1 | 128.10.0.21 | confirmed | a7f4c239-0df1-4b40-99d1-dcd1f71799f1 | 1 | 1 | 2021-11-30T15:48:51.000000 | 2021-11-30T15:49:16.000000 | migration |
| 5 | 6f5acc58-a403-4325-8efb-f48eb10a6689 | vcmp1 | vcmp0 | vcmp1 | vcmp0 | 128.10.0.20 | done | a7f4c239-0df1-4b40-99d1-dcd1f71799f1 | None | None | 2021-11-29T11:50:54.000000 | 2021-11-29T11:51:03.000000 | evacuation |
| 3 | fd462cf4-a2a0-4d94-84c8-47dd5c5971fc | vcmp1 | vcmp0 | vcmp1 | vcmp0 | 128.10.0.20 | done | 34f25e4c-6069-4ce0-ac28-79d28890d50a | None | None | 2021-11-29T11:50:53.000000 | 2021-11-29T11:51:01.000000 | evacuation |
| 1 | 34cdb93e-b0cf-41ec-88b5-a5255392fcb1 | vcmp0 | vcmp1 | vcmp0 | vcmp1 | 128.10.0.21 | confirmed | 34f25e4c-6069-4ce0-ac28-79d28890d50a | 1 | 1 | 2021-11-29T10:38:24.000000 | 2021-11-29T10:38:51.000000 | migration |
+----+--------------------------------------+-------------+-----------+----------------+--------------+-------------+-----------+--------------------------------------+------------+------------+----------------------------+----------------------------+------------+

docker exec -it nova_libvirt virsh list
 Id Name State
------------------------------------------...

Revision history for this message
Konrad Cempura (kcem) wrote :

> "TBH, I don't really see a problem with what you say : if you recreate a nova-compute service, you need to restart it for removing the evacuated instances, but I could maybe misunderstand your concerns."

1. It is possible to cold migrate evacuated instances to a new compute that has the same name as the source compute from which the instances were evacuated earlier, and those instances will be removed from libvirt on the first compute restart (and you definitely don't want that).

2. Evacuations from compute-1 are not completed on the first run of nova_compute on the new compute-1 (which got the same name as the broken compute-1), and this is what makes scenario 1 possible. Nova tries to complete them, but there are errors in the logs, and it only succeeds after a restart.

3. It is possible to cold migrate evacuated instances to a compute whose name differs only in capitalization (e.g. compute-1 vs. COMPUTE-1), and the result will be the same as in point 1, except that the evacuations from compute-1 will never be completed. Instances evacuated from compute-1 will be deleted if they are moved to COMPUTE-1 in the future... but at an unexpected moment: after a COMPUTE-1 restart (it may never happen, but when it does, you will lose some or all of the instances).

Scenarios to point 1, 2, 3:

Scenario 1:
- remove compute-1 physically by formatting its disk
- evacuate instances from the removed compute-1 to compute-2
- remove the compute-1 service from the OpenStack cloud
- remove the orphaned resource provider for compute-1
- configure a new compute with the name compute-1 and add it to the OpenStack cloud
- cold migrate the evacuated instances from compute-2 to compute-1
- confirm the migrations
- restart compute-1
- the instances on compute-1 are GONE from libvirt (they still exist in OpenStack)

Scenario 2:
- remove compute-1 physically by formatting its disk
- evacuate instances from the removed compute-1
- remove the compute-1 service from OpenStack
- remove the orphaned resource provider for compute-1
- add a new compute with the same name: compute-1
- the evacuations are not completed; errors occur in the logs during the first start, and a second restart finishes the evacuations

Scenario 3:
- remove compute-1 physically by formatting its disk
- evacuate instances from the removed compute-1 to compute-2
- remove the compute-1 service from the OpenStack cloud
- remove the orphaned resource provider for compute-1
- configure a new compute with a slightly different name, COMPUTE-1, and add it to the OpenStack cloud
- restart COMPUTE-1, or the nova_compute service on COMPUTE-1, as many times as you like
- cold migrate the evacuated instances from compute-2 to COMPUTE-1
- confirm the migrations
- restart COMPUTE-1
- the instances on COMPUTE-1 are GONE from libvirt (they still exist in OpenStack)
- the evacuations from compute-1 will never be completed (until the instances land on COMPUTE-1 and COMPUTE-1 is restarted)

Konrad Cempura (kcem)
Changed in nova:
status: Incomplete → New
Revision history for this message
Artom Lifshitz (notartom) wrote :

I suspect this is a valid bug.

When nova-compute starts up, it looks for migration records with type 'evacuation', status 'done', and itself as the source host. It then destroys the associated libvirt instances from the hypervisor, and sets the migration record to 'completed' to avoid destroying them again on subsequent startups.
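
To make that concrete, here is a minimal sketch of the cleanup under the assumptions above. The object shape and helper names are hypothetical stand-ins, not nova's actual API; the real logic lives in ComputeManager._destroy_evacuated_instances() in nova/compute/manager.py:

from dataclasses import dataclass

@dataclass
class Migration:
    # Hypothetical stand-in for nova's Migration object.
    instance_uuid: str
    migration_type: str   # e.g. 'evacuation'
    status: str           # 'done' = cleanup pending, 'completed' = finished
    source_compute: str

def destroy_evacuated_instances(host, migrations, local_guests, destroy_guest):
    """Sketch of what nova-compute does at startup on an evacuated source.

    migrations:    Migration records involving this host
    local_guests:  dict mapping instance UUID -> local libvirt guest
    destroy_guest: callable that removes a guest definition from libvirt
    """
    for migration in migrations:
        # Only finished ('done') evacuations *from* this host matter.
        if (migration.migration_type != 'evacuation'
                or migration.status != 'done'
                or migration.source_compute != host):
            continue
        guest = local_guests.get(migration.instance_uuid)
        if guest is not None:
            # Remove the stale local copy left behind by the evacuation.
            destroy_guest(guest)
        # Mark the record completed so later restarts don't destroy the
        # guest again, e.g. after the instance is migrated back here.
        migration.status = 'completed'

The bug hinges on that last step: if the loop bails out before a record is flipped to 'completed', the stale 'done' record survives and a later restart will destroy the guest again.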

This is all well and good if it's the original compute host that comes back after being evacuated, but what if it's a brand new compute host with the same host name?

It'll find the 'done' evacuations, look for the associated libvirt instances to destroy on the hypervisor, not find any because it's a brand new compute, and I suspect at this point it will not set the migration record to 'completed', though code examination doesn't support this [1] unfortunately.

Assuming I'm correct, if previously-evacuated instances are now migrated back to the new (but with the old hostname) compute, and the nova-compute service is restarted, it'll pick up those 'done' migration records and destroy the libvirt instances.

I'm trying to reproduce this with a functional test, but not having any luck (unrelated issues) so far.

[1] https://opendev.org/openstack/nova/src/branch/master/nova/compute/manager.py#L837-L873

Changed in nova:
status: New → Confirmed
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to nova (master)

Fix proposed to branch: master
Review: https://review.opendev.org/c/openstack/nova/+/841170

Revision history for this message
sean mooney (sean-k-mooney) wrote :

To me this is more of a feature request than a bug.

The way the compute node was replaced is not, I believe, currently supported by nova.

We expect that if you use the same hostname you are actually preserving all the data on the server and just replacing the failed component, i.e. you are repairing the failed server, not replacing it.

If you are replacing the server, it should use a different hostname.

So I agree that for the sake of operator UX we could enhance nova to support this type of replacement, but it really feels like an RFE to me. Given the potential for data loss, I am willing to consider this a bug so that the fix can be backported, but normally I would suggest a specless blueprint to add this new feature.

Revision history for this message
Artom Lifshitz (notartom) wrote :

So this is what I see in the func test logs:

2022-05-10 09:48:27,041 WARNING [nova.compute.manager] Compute node compute0 not found in the database. If this is the first time this service is starting on this host, then you can ignore this warning.
2022-05-10 09:48:27,065 INFO [nova.compute.manager] Cleaning up allocations of the instance as it has been evacuated from this host
2022-05-10 09:48:27,065 ERROR [nova.compute.manager] Failed to clean allocation of evacuated instance as the source node compute0 is not found
2022-05-10 09:48:27,065 INFO [nova.compute.manager] Looking for unclaimed instances stuck in BUILDING status for nodes managed by this host
2022-05-10 09:48:27,075 WARNING [nova.compute.manager] No compute node record found for host compute0. If this is the first time this service is starting on this host, then you can ignore this warning.
2022-05-10 09:48:27,084 WARNING [nova.compute.resource_tracker] No compute node record for compute0:compute0
2022-05-10 09:48:27,087 INFO [nova.compute.resource_tracker] Compute node record created for compute0:compute0 with uuid: 934ed5eb-8426-4bc5-a614-9fee6d2068cc

In other words, at the time we do the destroy, there is no compute node record, which causes us to hit the following if+continue:

            if migration.source_node not in hostname_to_cn_uuid:
                LOG.error("Failed to clean allocation of evacuated "
                          "instance as the source node %s is not found",
                          migration.source_node, instance=instance)
                continue

Meaning we `continue` to the next iteration of the outer for loop and never reach this bit:

            migration.status = 'completed'
            migration.save()
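
Putting the two fragments together, the shape of the loop is roughly this (a simplified sketch, not the exact nova code; the names mirror the snippets quoted above):

def cleanup_evacuations(evacuations, hostname_to_cn_uuid):
    # Simplified shape of the init_host()-time cleanup loop.
    for migration in evacuations:
        if migration.source_node not in hostname_to_cn_uuid:
            # On a brand-new compute there is no compute node record yet,
            # so this branch is always taken ...
            continue
        # ... and control never reaches the rest of the loop body:
        # destroying the stale guest, cleaning up allocations, and finally
        migration.status = 'completed'
        migration.save()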

The compute node record is created by the resource tracker from update_available_resource(), which is called from the pre_start_hook() in the compute manager, which is called *after* init_host() at service init time:

        self.manager.init_host()
        self.model_disconnected = False
        ctxt = context.get_admin_context()
        self.service_ref = objects.Service.get_by_host_and_binary(
            ctxt, self.host, self.binary)
        if self.service_ref:
            _update_service_ref(self.service_ref)

        else:
            try:
                self.service_ref = _create_service_ref(self, ctxt)
            except (exception.ServiceTopicExists,
                    exception.ServiceBinaryExists):
                # NOTE(danms): If we race to create a record with a sibling
                # worker, don't fail here.
                self.service_ref = objects.Service.get_by_host_and_binary(
                    ctxt, self.host, self.binary)

        self.manager.pre_start_hook()

So I wonder if we could just move the pre_start_hook call further up?
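
For illustration, the ordering in question looks roughly like this (a sketch of the idea only, not a tested patch; ensure_service_ref is a hypothetical stand-in for the Service record handling shown above):

# Current order, simplified from the service start code above:
def start_today(manager, ensure_service_ref):
    manager.init_host()        # runs the evacuation cleanup, but the
                               # compute node record may not exist yet
    ensure_service_ref()
    manager.pre_start_hook()   # resource tracker creates the compute
                               # node record only here

# Hypothetical reordering:
def start_reordered(manager, ensure_service_ref):
    ensure_service_ref()
    manager.pre_start_hook()   # compute node record exists first
    manager.init_host()        # cleanup can now resolve the source node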

Revision history for this message
OpenStack Infra (hudson-openstack) wrote :

Fix proposed to branch: master
Review: https://review.opendev.org/c/openstack/nova/+/841308

Changed in nova:
status: Confirmed → In Progress