Keepalived not considered to declare unhealthy an amphorae on reboot

Bug #1695090 reported by Leandro Reox
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
octavia
Fix Released
Critical
Michael Johnson

Bug Description

Doing ACTIVE STANDBY reboot tests, due to a race condition keepalived failed to start but the amphorae wasnt considered as unhealthy so it can be replaced.

This can cause that if we:

1 - Reboot the master node
2 - Keepalived fail to start on master node
3 - Shutdown backup node
4 - Traffic dropped

Keepalived should be considered on a reboot as a health indicator, cause the rebooted node will not claim the VIP again if the backup fails in the middle.

summary: - Keepalived not considered to declare unhealthy an amphorae
+ Keepalived not considered to declare unhealthy an amphorae on reboot
Changed in octavia:
assignee: nobody → Michael Johnson (johnsom)
importance: Undecided → Critical
status: New → Triaged
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to octavia (master)

Fix proposed to branch: master
Review: https://review.openstack.org/470081

Changed in octavia:
status: Triaged → In Progress
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to octavia (master)

Reviewed: https://review.openstack.org/470081
Committed: https://git.openstack.org/cgit/openstack/octavia/commit/?id=c7a2babf545a30ac29f5dda91e19eaca933c106a
Submitter: Jenkins
Branch: master

commit c7a2babf545a30ac29f5dda91e19eaca933c106a
Author: Michael Johnson <email address hidden>
Date: Thu Jun 1 18:55:54 2017 -0700

    Check keepalived health in the amphora

    The health manager did not detect a keepalived failure inside the amphora.
    This patch will not send a health heartbeat if keepalived is configured but
    not running.

    This patch also allows the health checks to continue after an initial failure.

    Change-Id: Id21310bd5ded3747218d3872ab3c966e5ddf5356
    Closes-Bug: #1695090

Changed in octavia:
status: In Progress → Fix Released
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/octavia 1.0.0.0b2

This issue was fixed in the openstack/octavia 1.0.0.0b2 development milestone.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.