split brain cluster from leader handling failure on older jujus

Bug #1486580 reported by JuanJo Ciarlante
This bug affects 1 person
Affects: percona-cluster (Juju Charms Collection)
Status: Expired
Importance: High
Assigned to: Unassigned

Bug Description

* Deployment:
- openstack HA with 1507 charms release
- percona-cluster r63 (service name: mysql )
- juju 1.20.14

This HA deployment reliably fails to properly
cluster percona units:

- wsrep_cluster_size = 1 on each unit
-> http://paste.ubuntu.com/12124888/

- openstack DBs only present on mysql/0
-> http://paste.ubuntu.com/12124891/

- log excerpts showing all units logging
 "Leader unit - bootstrap required=True"
-> http://paste.ubuntu.com/12124893/
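
The first symptom above can be checked mechanically. The following is a minimal sketch, assuming the wsrep_cluster_size value has already been collected from each unit by some other means; the function name find_split_brain and the unit names mysql/1 and mysql/2 are illustrative and not taken from the charm or the pastes.

```python
def find_split_brain(cluster_sizes, expected_size=None):
    """Return the units whose wsrep_cluster_size differs from the expected size.

    cluster_sizes: dict mapping unit name -> wsrep_cluster_size reported by that unit.
    expected_size: number of units that should be in one cluster;
                   defaults to the number of units given.
    """
    if expected_size is None:
        expected_size = len(cluster_sizes)
    return sorted(
        unit for unit, size in cluster_sizes.items() if size != expected_size
    )


# Hypothetical values matching the symptom described: every unit sees only itself.
reported = {"mysql/0": 1, "mysql/1": 1, "mysql/2": 1}
print(find_split_brain(reported))  # all three units are listed
```

An empty list would mean every unit agrees on the full cluster size.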

AFAICS this was introduced by the r62 changes: I have confirmed that the
same deployment clusters OK with r61.

JuanJo Ciarlante (jjo)
tags: added: canonical-bootstack
Ryan Beisner (1chb1n)
tags: added: openstack uosci
Liam Young (gnuoy) wrote :

Please could you try setting min-cluster-size when doing the deployment?

Changed in percona-cluster (Juju Charms Collection):
status: New → Incomplete
importance: Undecided → High
JuanJo Ciarlante (jjo) wrote :

Confirming that setting min-cluster-size=3 fixes the split-brain issue, i.e. I now find:
wsrep_cluster_size 3
However, I'm now seeing a never-ending loop on the 3 units calling the
cluster-relation-changed hook; filed lp#1488140. Feel free to close
this one.

David Ames (thedac) wrote :

JuanJo,

Have you had a chance to try min-cluster-size? The charm relies on this.

For the record, I just tested the amulet test with and without min-cluster-size set. When it is not set, the results are as you describe. With it set, the cluster comes up.
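
To picture why the setting matters, here is a minimal sketch of a bootstrap gate. It is an illustration only, not the charm's actual code: the function name ready_to_bootstrap and its parameters are hypothetical, and it assumes that an unset min-cluster-size means no waiting at all.

```python
def ready_to_bootstrap(units_seen, min_cluster_size=None):
    """Decide whether a unit may go ahead and bootstrap the cluster.

    units_seen: this unit plus the peers it currently knows about.
    min_cluster_size: configured minimum, or None when the option is unset.
    """
    if min_cluster_size is None:
        # Assumption: with no minimum configured there is nothing to wait for,
        # so a unit that only sees itself proceeds on its own.
        return True
    return units_seen >= min_cluster_size


# A unit that has not yet seen its peers:
print(ready_to_bootstrap(1))     # proceeds alone when the option is unset
print(ready_to_bootstrap(1, 3))  # waits when min-cluster-size=3
print(ready_to_bootstrap(3, 3))  # proceeds once all 3 units are visible
```

Under this assumption, leaving the option unset lets each unit bootstrap by itself, which matches the wsrep_cluster_size = 1 result reported above.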

Note you can run the test directly (without dealing with juju test):

 AMULET_VIP=$IP1 AMULET_OS_VIP=$IP2 python tests/42-test-bootstrap-multi-min.py

Launchpad Janitor (janitor) wrote :

[Expired for percona-cluster (Juju Charms Collection) because there has been no activity for 60 days.]

Changed in percona-cluster (Juju Charms Collection):
status: Incomplete → Expired