OVS tunneling between multiple neutron nodes misconfigured if amqp is restarted
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
neutron |
Won't Fix
|
Medium
|
Unassigned | ||
oslo.messaging |
Fix Released
|
Undecided
|
Unassigned | ||
tripleo |
Fix Released
|
High
|
Giulio Fidente |
Bug Description
At completion of a deployment with multiple controllers, by observing the gre tunnels created in OVS by the neutron ovs-agent, one will find that some neutron nodes may miss the tunnels in between them or to the computes.
This is due to ovs-agents getting disconnected from the rabbit cluster without them noticing and as a result, being unable to receive updates from other nodes or publish updates.
The disconnection may happen following a reconfig of a rabbit node, the VIP moving over a different node when rabbit is load balanced, or even _during_ tripleo overcloud deployment due to rabbit cluster configuration changes.
This was observed using Kombu 3.0.33 as well as 2.5.
Use of some aggressive (low) kernel keepalive probes interval seems to improve the reliability but a more appropriate fix seems to be support for heartbeat in oslo.messaging
summary: |
- OVS tunneling between multiple neutron nodes breaks if amqp is restarted + OVS tunneling between multiple neutron nodes misconfigured if amqp is + restarted |
description: | updated |
description: | updated |
description: | updated |
description: | updated |
Changed in neutron: | |
importance: | Undecided → High |
Changed in tripleo: | |
status: | New → Triaged |
Changed in neutron: | |
assignee: | nobody → Romil Gupta (romilg) |
tags: | added: ovs |
Changed in neutron: | |
importance: | High → Medium |
status: | New → Confirmed |
Changed in oslo.messaging: | |
status: | New → Incomplete |
Changed in tripleo: | |
status: | Fix Committed → Fix Released |
Changed in oslo.messaging: | |
status: | Incomplete → Fix Released |
Changed in neutron: | |
status: | Incomplete → Won't Fix |
related to https:/ /bugs.launchpad .net/oslo. messaging/ +bug/856764