[ovn] No connection to VM during live-migration
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
neutron | In Progress | Medium | Stefan Hoffmann |
Bug Description
Problem: In environments with many hypervisors and VMs, a live-migration leaves VMs unreachable for several seconds (4-20s).
Description:
We run a big environment with many hypervisors and VMs, so northd reconcile cycles take some time.
At live-migration, even nova has live_migration_
Nova starts the live migration in libvirt, and the migration completes before the southbound database and the ovn-controller on the destination have received the change.
So the VM is started on the destination hypervisor, but the port setup is not done yet.
From what I saw, neutron generates the vif-plugged event when the transaction to the northbound ovsdb is finished [1].
Is there a way to wait till the change is propagated to southbound ovsdb?
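To illustrate the kind of wait being asked about, here is a minimal sketch of a polling loop that only returns once the southbound side reports the port as bound to the destination chassis. It is not neutron's or nova's implementation. The function name and the `get_bound_chassis` callable are hypothetical; the callable stands in for whatever lookup reads the `chassis` of the port's `Port_Binding` row in the southbound database.

```python
import time
from typing import Callable, Optional


def wait_for_southbound_binding(
    get_bound_chassis: Callable[[str], Optional[str]],
    port_id: str,
    dest_chassis: str,
    timeout: float = 30.0,
    interval: float = 0.5,
    clock: Callable[[], float] = time.monotonic,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Poll until the port is bound to dest_chassis or the timeout expires.

    get_bound_chassis is a hypothetical, caller-supplied lookup that returns
    the chassis name currently recorded for the port in the southbound
    database, or None if the port is not bound anywhere.
    """
    deadline = clock() + timeout
    while True:
        # Check first, so an already-propagated change returns immediately.
        if get_bound_chassis(port_id) == dest_chassis:
            return True
        if clock() >= deadline:
            return False
        sleep(interval)
```

A caller would send the vif-plugged event only after this returns `True`, and would have to decide what to do on `False` (for example, log and send the event anyway so the migration is not blocked indefinitely).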
Version:
neutron-server 21.2.1 zed / unmaintained/zed
ml2 plugin: ovn
at neutron: ovsdb-client (Open vSwitch) 3.3.0
Nova zed / unmaintained/zed
nova.conf: live_migration_
Hypervisor OS: Ubuntu 22.04 with newer kernel (but that shouldn't be relevant here)
Steps to Reproduce:
1. Run neutron with ovn setup and create a VM that you can ping (via FIP or other VM in same private network)
2. Stop northd
3. Start live-migration
4. Wait till live-migration is done - VM is not reachable anymore
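To put a number on the outage seen in step 4, a small helper like the following can be fed with reachability probes taken during the migration. This is only a sketch: the function name and the sample format (a timestamp in seconds plus a reachable flag, e.g. from pinging the FIP once per second) are assumptions made for illustration.

```python
from typing import Iterable, Optional, Tuple


def longest_outage(samples: Iterable[Tuple[float, bool]]) -> float:
    """Return the longest unreachable window in seconds.

    samples is a time-ordered sequence of (timestamp, reachable) pairs.
    An outage runs from the first failed probe to the next successful one;
    an outage still open at the end is measured up to the last sample.
    """
    longest = 0.0
    outage_start: Optional[float] = None
    last_timestamp: Optional[float] = None
    for timestamp, reachable in samples:
        last_timestamp = timestamp
        if not reachable and outage_start is None:
            outage_start = timestamp  # outage begins
        elif reachable and outage_start is not None:
            longest = max(longest, timestamp - outage_start)
            outage_start = None  # outage ends
    if outage_start is not None and last_timestamp is not None:
        longest = max(longest, last_timestamp - outage_start)
    return longest
```

The result is only as precise as the probe interval, so probing more often than once per second gives a tighter estimate of the gap.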
More info:
This problem has two parts.
First, Nova doesn't wait for the network-vif-plugged event when using the ovn backend, because the port binding options are missing some attribute.
Second, the Neutron OVN plugin currently sends vif-plugged events as soon as the northbound ovsdb has the update, and on LogicalSwitchPort events (that is, on northbound updates).
[1] https:/
[2] https:/
description: updated
tags: added: ovn
Changed in neutron:
assignee: nobody → Stefan Hoffmann (mr-hopeman)
description: updated
You raise a good question. I will bring it up at the next Neutron meeting to see what others think.