[library] l3-agent active-passive failover takes 30+ seconds when a controller fails
Bug #1328970 reported by
Chris Clason
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Invalid
|
High
|
Sergey Vasilenko |
Bug Description
l3-agent takes 30+ seconds to failover to a standby controller when a controller node fails. Customers are asking for this time to be minimized as much as possible.
tags: | added: customer-found |
Changed in fuel: | |
status: | New → Confirmed |
importance: | Undecided → High |
tags: | added: ha |
Changed in fuel: | |
milestone: | none → 5.1 |
assignee: | nobody → Fuel Library Team (fuel-library) |
Changed in fuel: | |
assignee: | Fuel Library Team (fuel-library) → Sergey Vasilenko (xenolog) |
summary: |
- l3-agent active-passive failover takes 30+ seconds when a controller - fails + [puppet] l3-agent active-passive failover takes 30+ seconds when a + controller fails |
summary: |
- [puppet] l3-agent active-passive failover takes 30+ seconds when a + [library] l3-agent active-passive failover takes 30+ seconds when a controller fails |
Changed in fuel: | |
milestone: | 5.1 → 6.0 |
tags: | added: release-notes |
To post a comment you must log in.
There are hold down sleeps inside both the ocf script (+33 Sec) https:/ /github. com/stackforge/ fuel-library/ blob/master/ deployment/ puppet/ neutron/ files/ocf/ neutron- agent-l3# L411 and some in reconnect sleeps in https:/ /github. com/stackforge/ fuel-library/ blob/master/ deployment/ puppet/ neutron/ files/q- agent-cleanup. py#L420. Since we have cleaned up service recovery we should be able to remove most of these and have a fast re-connect attempt to fix this for us if there are still gaps.