HA : service monitor failed on all three nodes when mysql is down

Bug #1454147 reported by venu kolli
This bug affects 1 person
Affects               Status           Importance   Assigned to     Milestone
Juniper Openstack     (status tracked in Trunk)
  R2.20               Fix Committed    High         Sanju Abraham
  Trunk               Fix Committed    High         Sanju Abraham

Bug Description

HA-configured topology running 3 nodes (openstack + config + control) plus compute nodes.

The service monitor (contrail-svc-monitor) failed on all three nodes when mysql went down during node failure tests.

Supervisor tried to restart the service and gave up after a certain number of attempts.

When mysql recovered, contrail-svc-monitor was still in a failed state because supervisor had already given up restarting the service.

Issue observed on 2.01 build 43, and it will also exist on other branches.
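
Editorial sketch, not the fix committed for this bug: one common way to avoid the "supervisor gave up" state is for a service to stay alive and keep probing its database with capped exponential backoff, rather than exiting and using up the supervisor's start retries (supervisord marks a program FATAL once `startretries` is exceeded). The function name `wait_for_dependency`, its parameters, and the `probe` callable below are hypothetical and are not taken from contrail-svc-monitor.

```python
import time
from typing import Callable, Optional


def wait_for_dependency(
    probe: Callable[[], bool],
    initial_delay: float = 1.0,
    max_delay: float = 30.0,
    max_attempts: Optional[int] = None,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Block until probe() succeeds; return the number of attempts made.

    With max_attempts=None the loop never gives up, so the process stays
    alive for the whole outage instead of crashing repeatedly.
    """
    delay = initial_delay
    attempts = 0
    while True:
        attempts += 1
        try:
            if probe():
                return attempts
        except Exception:
            # Treat any probe error (e.g. connection refused) as "not ready".
            pass
        if max_attempts is not None and attempts >= max_attempts:
            raise TimeoutError(f"dependency not ready after {attempts} attempts")
        sleep(delay)
        # Double the wait each round, but never exceed max_delay.
        delay = min(delay * 2, max_delay)
```

In practice `probe` would wrap a real connection attempt to mysql; it is left abstract here so the sketch does not assume any particular client library.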

Tags: ha
venu kolli (vkolli)
Changed in juniperopenstack:
assignee: nobody → Sanju Abraham (asanju)
importance: Undecided → Critical
information type: Proprietary → Public
Revision history for this message
OpenContrail Admin (ci-admin-f) wrote : R2.20

Review in progress for https://review.opencontrail.org/10396
Submitter: Sanju (<email address hidden>)

Sanju Abraham (asanju) wrote :

Fixed an issue where mysql was not able to restart after a double failure, which caused the cluster monitor's DB reads and the bootstrap to fail.
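
Editorial sketch, under the assumption (not stated in this report) that the mysql cluster is Galera-based: after quorum is lost, recovery usually means bootstrapping from the node with the most advanced saved state, which Galera records as `seqno` in its `grastate.dat` file. The helpers `parse_seqno` and `pick_bootstrap_node` are hypothetical and do not reproduce the logic of the committed change.

```python
import re
from typing import Dict, Optional


def parse_seqno(grastate_text: str) -> Optional[int]:
    """Return the seqno recorded in grastate.dat-style text, or None."""
    match = re.search(r"^seqno:\s*(-?\d+)\s*$", grastate_text, re.MULTILINE)
    return int(match.group(1)) if match else None


def pick_bootstrap_node(states: Dict[str, str]) -> Optional[str]:
    """Pick the node whose saved state has the highest known seqno.

    A seqno of -1 means the position is unknown (e.g. unclean shutdown),
    so such nodes are skipped. Returns None if no node has a usable seqno.
    """
    best_node, best_seqno = None, -1
    for node, text in sorted(states.items()):
        seqno = parse_seqno(text)
        if seqno is not None and seqno > best_seqno:
            best_node, best_seqno = node, seqno
    return best_node
```

If every node reports -1, a real recovery procedure would first have to recover the position on each node; this sketch simply returns None in that case.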

OpenContrail Admin (ci-admin-f) wrote : master

Review in progress for https://review.opencontrail.org/10398
Submitter: Sanju (<email address hidden>)

OpenContrail Admin (ci-admin-f) wrote : A change has been merged

Reviewed: https://review.opencontrail.org/10396
Committed: http://github.org/Juniper/contrail-provisioning/commit/2b4f4c416c6cfdf5d6406250a18c8961b1654feb
Submitter: Zuul
Branch: R2.20

commit 2b4f4c416c6cfdf5d6406250a18c8961b1654feb
Author: Sanju Abraham <email address hidden>
Date: Thu May 14 19:05:16 2015 -0700

Close-Bug: #1454147. Fix addresses the issue of double failures where cluster monitor is not able to connect to mysql due to loss of quorum and bootstrap does not complete.

Change-Id: I33b4832b1ff7b90f7e2b20f3ee268ca43b9eb8de

OpenContrail Admin (ci-admin-f) wrote :

Reviewed: https://review.opencontrail.org/10398
Committed: http://github.org/Juniper/contrail-provisioning/commit/f1d78ae7def9db037dad7f694b5160de2efc7141
Submitter: Zuul
Branch: master

commit f1d78ae7def9db037dad7f694b5160de2efc7141
Author: Sanju Abraham <email address hidden>
Date: Thu May 14 19:21:17 2015 -0700

Close-Bug: #1454147. Fix addresses the issue of double failures where cluster monitor is not able to connect to mysql due to loss of quorum and bootstrap does not complete.

Change-Id: I55ca3bc530dc8da31c7cc0350ed6c1f8dd8f6854
