L3 HA routers master state flapping between nodes after router updates or failovers when using 1.2.14 or 1.2.15 (-1.2.15-6)

Bug #1433172 reported by Assaf Muller
This bug affects 1 person
Affects            Status        Importance  Assigned to   Milestone
neutron            Won't Fix     Low         Unassigned
openstack-ansible  Fix Released  High        Major Hayden

Bug Description

keepalived 1.2.14 introduced a regression when running in no-preempt mode. More details are in a thread I started on the keepalived-devel list:
http://sourceforge.net/p/keepalived/mailman/message/33604497/

A fix was backported to 1.2.15-6, and is present in 1.2.16.

Current status (Updated on the 30th of April, 2015):
Fedora 20, 21 and 22 have 1.2.16.
CentOS and RHEL are on 1.2.13.

Ubuntu is using 1.2.10 or older.
Debian is using 1.2.13.

In summary, as long as you're not using 1.2.14 or 1.2.15 (excluding 1.2.15-6), you're OK, which should be the case if you're using the latest keepalived packaged for your distro.
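
The version rule in the summary above can be sketched as a small check. This is only an illustration, not part of any fix: the function name `is_affected` and the constants are hypothetical, and it assumes that any 1.2.15 package release numbered 6 or higher carries the backport (the report itself only confirms 1.2.15-6). The version string has to be obtained separately, for example from your package manager.

```python
import re

# Upstream versions the report names as affected.
AFFECTED_UPSTREAM = {(1, 2, 14), (1, 2, 15)}
# Package release of 1.2.15 that carries the backported fix.
# Assumption: later releases of 1.2.15 keep the backport.
FIXED_1_2_15_RELEASE = 6


def is_affected(version: str) -> bool:
    """Return True if this keepalived version falls in the bad range."""
    # Accepts forms like "1.2.15", "1.2.15-6", "1.2.15-6.fc21", "1:1.2.13-1".
    match = re.match(r"^(?:\d+:)?(\d+)\.(\d+)\.(\d+)(?:-(\d+))?", version.strip())
    if match is None:
        raise ValueError(f"unrecognized keepalived version: {version!r}")
    upstream = tuple(int(part) for part in match.group(1, 2, 3))
    release = int(match.group(4)) if match.group(4) else None
    if upstream not in AFFECTED_UPSTREAM:
        return False
    if (upstream == (1, 2, 15) and release is not None
            and release >= FIXED_1_2_15_RELEASE):
        return False
    return True
```

For example, `is_affected("1.2.14")` and `is_affected("1.2.15-5")` are true, while `is_affected("1.2.13")`, `is_affected("1.2.15-6")` and `is_affected("1.2.16")` are false.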

Tags: l3-ha
Assaf Muller (amuller)
Changed in neutron:
assignee: nobody → Assaf Muller (amuller)
status: New → In Progress
OpenStack Infra (hudson-openstack) wrote : Fix proposed to neutron (master)

Fix proposed to branch: master
Review: https://review.openstack.org/165162

OpenStack Infra (hudson-openstack) wrote : Change abandoned on neutron (master)

Change abandoned by Assaf Muller (<email address hidden>) on branch: master
Review: https://review.openstack.org/165162

Assaf Muller (amuller) wrote : Re: L3 HA routers master state flapping between nodes after router updates or failovers

Quick update: the most recent keepalived versions in Fedora, CentOS and RHEL have a backport applied. Ubuntu and Debian have an old version without the faulty behavior.

Changed in neutron:
status: In Progress → Triaged
importance: Undecided → Medium
importance: Medium → Low
Abhishek Raut (abhraut) wrote :

Is the backport applied in RHEL 7 Update 1?

Assaf Muller (amuller) wrote :

Sorry, I was actually confused. CentOS and RHEL, like Debian, are on 1.2.13, which is not affected. The most recent keepalived in Fedora 20, 21 and 22 is now 1.2.16, which includes the fix, and 1.2.15-6 is good too.

The bad versions are 1.2.14 and 1.2.15 (but not 1.2.15-6, which has the backport).

description: updated
Assaf Muller (amuller) wrote :

I updated the original bug description with up-to-date information.

Assaf Muller (amuller)
Changed in neutron:
assignee: Assaf Muller (amuller) → nobody
summary: L3 HA routers master state flapping between nodes after router updates
- or failovers
+ or failovers when using 1.2.14 or 1.2.15 (-1.2.15-6)
Changed in openstack-ansible:
assignee: nobody → Jean-Philippe Evrard (jean-philippe-evrard)
status: New → Confirmed
importance: Undecided → High
Changed in openstack-ansible:
status: Confirmed → In Progress
Changed in openstack-ansible:
assignee: Jean-Philippe Evrard (jean-philippe-evrard) → Jesse Pretorius (jesse-pretorius)
Changed in openstack-ansible:
assignee: Jesse Pretorius (jesse-pretorius) → Jean-Philippe Evrard (jean-philippe-evrard)
Changed in openstack-ansible:
assignee: Jean-Philippe Evrard (jean-philippe-evrard) → Major Hayden (rackerhacker)
OpenStack Infra (hudson-openstack) wrote : Fix merged to openstack-ansible-os_neutron (master)

Reviewed: https://review.openstack.org/338129
Committed: https://git.openstack.org/cgit/openstack/openstack-ansible-os_neutron/commit/?id=0b8721141f9b526ba4902f5cfc53f05c2fc0758e
Submitter: Jenkins
Branch: master

commit 0b8721141f9b526ba4902f5cfc53f05c2fc0758e
Author: Jean-Philippe Evrard <email address hidden>
Date: Wed Jul 6 10:10:52 2016 +0100

    Use UCA for non-OVS neutron

    This commit refactors tasks to allow the use of UCA for Linux Bridge.
    It also changes default behavior: now every neutron install will
    make use of Ubuntu Cloud Archive, unless mentioned.

    Closes-Bug: 1497272
    Closes-Bug: 1433172

    Change-Id: I4373f544eb178720f33795a71adae925a8b8cb03
    Signed-off-by: Jean-Philippe Evrard <email address hidden>

Changed in openstack-ansible:
status: In Progress → Fix Released
Doug Hellmann (doug-hellmann) wrote : Fix included in openstack/openstack-ansible-os_neutron 14.0.0.0b2

This issue was fixed in the openstack/openstack-ansible-os_neutron 14.0.0.0b2 development milestone.

Ihar Hrachyshka (ihar-hrachyshka) wrote :

The bug is in keepalived, not neutron; moving to Won't Fix.

Changed in neutron:
status: Triaged → Won't Fix