Performing live migration after make sure we have live_migration_wait_for_vif_plug=False in nova.conf on the computes. Migration aborts with:
2019-06-26 08:17:43.518 6 INFO nova.compute.manager [-] [instance: a20ab13e-b348-4dc2-b294-e23865e2fc4b] Took 6.75 seconds for pre_live_migration on destination host compute-0.localdomain.
2019-06-26 08:17:43.519 6 DEBUG nova.compute.manager [-] [instance: a20ab13e-b348-4dc2-b294-e23865e2fc4b] Not waiting for events after pre_live_migration: [('network-vif-plugged', 'fbb880ff-b834-4cca-b12f-77a60a1984b2')]. _do_live_migration /usr/lib/python3.6/site-packages/nova/compute/manager.py:6548
...
2019-06-26 08:17:43.728 6 INFO nova.virt.libvirt.migration [-] [instance: a20ab13e-b348-4dc2-b294-e23865e2fc4b] Increasing downtime to 50 ms after 0 sec elapsed time
2019-06-26 08:17:43.857 6 INFO nova.virt.libvirt.driver [-] [instance: a20ab13e-b348-4dc2-b294-e23865e2fc4b] Migration running for 0 secs, memory 100% remaining; (bytes processed=0, remaining=0, total=0)
2019-06-26 08:17:44.293 6 ERROR nova.virt.libvirt.driver [-] [instance: a20ab13e-b348-4dc2-b294-e23865e2fc4b] Live Migration failure: operation failed: Failed to connect to remote libvirt URI qemu+ssh://<email address hidden>:2022/system?keyfile=/etc/nova/migration/identity: End of file while reading data: Ncat: No such file or directory.: Input/output error: libvirt.libvirtError: operation failed: Failed to connect to remote libvirt URI qemu+ssh://<email address hidden>:2022/system?keyfile=/etc/nova/migration/identity: End of file while reading data: Ncat: No such file or directory.: Input/output error
2019-06-26 08:17:44.295 6 DEBUG nova.virt.libvirt.driver [-] [instance: a20ab13e-b348-4dc2-b294-e23865e2fc4b] Migration operation thread notification thread_finished /usr/lib/python3.6/site-packages/nova/virt/libvirt/driver.py:8015
2019-06-26 08:17:44.360 6 DEBUG nova.virt.libvirt.migration [-] [instance: a20ab13e-b348-4dc2-b294-e23865e2fc4b] VM running on src, migration failed find_job_type /usr/lib/python3.6/site-packages/nova/virt/libvirt/migration.py:360
2019-06-26 08:17:44.360 6 DEBUG nova.virt.libvirt.driver [-] [instance: a20ab13e-b348-4dc2-b294-e23865e2fc4b] Fixed incorrect job type to be 4 _live_migration_monitor /usr/lib/python3.6/site-packages/nova/virt/libvirt/driver.py:7844
2019-06-26 08:17:44.361 6 ERROR nova.virt.libvirt.driver [-] [instance: a20ab13e-b348-4dc2-b294-e23865e2fc4b] Migration operation has aborted
Additional info:
during the migration, the disk files get copied to the new compute, so it is not a general connection issue between the computes:
[root@compute-0 ~]# ll -R /var/lib/nova/instances/a20ab13e-b348-4dc2-b294-e23865e2fc4b
/var/lib/nova/instances/a20ab13e-b348-4dc2-b294-e23865e2fc4b:
total 396
-rw-r--r--. 1 42436 42436 196616 Jun 26 08:47 disk
-rw-r--r--. 1 42436 42436 196624 Jun 26 08:47 disk.eph0
-rw-r--r--. 1 42436 42436 161 Jun 26 08:47 disk.info
The issue was introduced with [1]. The nova-migration- wrapper inside the container needs access to the libvirt socket [1].
[1] https:/ /review. opendev. org/#/c/ 662109/ /github. com/rdo- packages/ nova-distgit/ blob/rpm- master/ nova-migration- wrapper# L31
[2] https:/