[OVN] The "neutron_sync_mode = repair" option breaks the whole cloud!
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
neutron | Won't Fix | Undecided | Unassigned |
Bug Description
Hello,
I'm using Ubuntu 16.04 with Kernel 4.8 (HWE), plus Ocata from Cloud Archive. I'm playing with Networking OVN and planning to deploy it into production in a couple of weeks.
After deploying everything and starting to use OVN, with Floating IPs, Security Groups, multiple compute nodes and everything else, I can say that it looks awesome! Way better than the "neutron-*-agents".
However, I noticed that after running:
---
systemctl restart neutron-server
---
Literally ALL my stacks, on all projects, become unreachable!!!
The Instances could not even ping their own Floating IPs anymore, nor, of course, the Internet.
Also, the Internet could not reach the Instances via their Floating IPs...
After double-checking the config files and comparing them with the doc*, I made one single change in my ml2_conf.ini, from:
---
neutron_sync_mode = repair
---
To:
---
# neutron_sync_mode = off
---
Then, problem solved!
Now, I can restart the neutron-server without any problems!
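As a minimal sketch of how the setting in effect could be checked before restarting neutron-server, the snippet below parses ml2_conf.ini-style text and reports the sync mode. It assumes the option lives in an `[ovn]` section, that the accepted values are `off`, `log` and `repair`, and that an unset option falls back to `log`; none of these are stated in this report, and `read_sync_mode` is a hypothetical helper, not part of networking-ovn.

```python
import configparser

# Assumption: the values networking-ovn accepts for neutron_sync_mode.
VALID_MODES = {"off", "log", "repair"}


def read_sync_mode(ini_text, section="ovn", default="log"):
    """Return the neutron_sync_mode found in ini_text.

    The section name and the default are assumptions for illustration.
    A line starting with '#' is a comment, so the default applies to it.
    """
    parser = configparser.ConfigParser(strict=False, interpolation=None)
    parser.read_string(ini_text)
    mode = parser.get(section, "neutron_sync_mode", fallback=default).strip()
    if mode not in VALID_MODES:
        raise ValueError("unexpected neutron_sync_mode: %r" % mode)
    return mode


if __name__ == "__main__":
    before = "[ovn]\nneutron_sync_mode = repair\n"
    after = "[ovn]\n# neutron_sync_mode = off\n"
    print("before:", read_sync_mode(before))
    print("after:", read_sync_mode(after))
```

Note that under these assumptions a commented-out line such as `# neutron_sync_mode = off` does not set the mode to `off`; the option is simply unset and whatever default the driver uses takes over.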
What to do if this happens again?
Ask ALL my customers to rebuild their stacks?!
How to REALLY repair OVN if something like this happens again?
Now that the problem is solved, I'll keep trying to use and stress test it even more, but I'm losing confidence in Networking OVN in its current state.
* doc: https:/
affects: neutron → networking-ovn
Changed in networking-ovn:
importance: Undecided → High
Hi Thiago,
You can use the "neutron-ovn-db-sync-util" utility to repair the OVN database.
You can run it as "neutron-ovn-db-sync-util --config-file /etc/neutron/neutron.conf --config-file /etc/neutron/plugins/ml2/ml2_conf.ini"
Make sure to change the mode to repair in ml2_conf.ini or pass neutron_sync_mode=repair on the command line.
Having said that, neutron-server should repair the OVN database as well.
The sync util and neutron-server share the same code for sync.
Can you please share the neutron server logs so we can see what is happening there?
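The sync utility invocation described in this comment can be sketched as a small Python wrapper that only assembles the command line. The utility name and the two config paths are taken from the comment; `build_sync_command` is a hypothetical helper, and the exact command-line spelling for overriding the sync mode is not given here, so it is left out.

```python
import shlex

SYNC_UTIL = "neutron-ovn-db-sync-util"
# Config files named in the comment above.
DEFAULT_CONFIGS = (
    "/etc/neutron/neutron.conf",
    "/etc/neutron/plugins/ml2/ml2_conf.ini",
)


def build_sync_command(config_files=DEFAULT_CONFIGS):
    """Build the argv list for the sync utility (hypothetical helper).

    The sync mode is expected to come from ml2_conf.ini itself.
    """
    argv = [SYNC_UTIL]
    for path in config_files:
        argv += ["--config-file", path]
    return argv


if __name__ == "__main__":
    # Print the command for review instead of running it.
    print(shlex.join(build_sync_command()))
```

Printing the command rather than executing it keeps the sketch safe to run on a machine without neutron installed; the list can be handed to `subprocess.run` once it has been reviewed.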