After a network split, a node can make a write progress and end-up with a diverged local seqno

Bug #1583521 reported by Bogdan Dobrelya
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
MySQL patches by Codership
Invalid
Undecided
Unassigned
Percona XtraDB Cluster moved to https://jira.percona.com/projects/PXC
Invalid
Undecided
Unassigned
percona-xtradb-cluster-5.6 (Ubuntu)
Invalid
Undecided
Unassigned

Bug Description

How to reproduce:

Given:
* A cluster of 5 Galera nodes (percona packages)
* running in the multi-master writes/reads layout,
* auto-recovery decisions made by the custom Pacemaker OCF RA, it can only monitor/stop/start the mysqld_safe, no more interference, it removes nothing in the data path.

With that, do multiple runs of custom jepsen tests with Nemesis in the random network-split partitions mode (all links and details about test cases I described here https://goo.gl/VHyIIE , although that's tl;dr).

Results:
Expected: nodes will always recover after partitions with a merged state, isolated nodes can't make write progress w/o quorum.

Actual: a node (the n5 here) have done write progress, diverged, and refuses to start with the errors like:
[ERROR] WSREP: Local state seqno (189675) is greater than group seq no (188050): states diverged.

Logs, package versions, configs and wsrep status/vars are attached.

Revision history for this message
Bogdan Dobrelya (bogdando) wrote :
description: updated
Revision history for this message
Krunal Bauskar (krunal-bauskar) wrote :
Revision history for this message
Bogdan Dobrelya (bogdando) wrote :
Revision history for this message
Krunal Bauskar (krunal-bauskar) wrote :

upstream is closed will close this issue too.

Changed in percona-xtradb-cluster:
status: New → Invalid
Changed in percona-xtradb-cluster-5.6 (Ubuntu):
status: New → Invalid
Changed in codership-mysql:
status: New → Invalid
Revision history for this message
Shahriyar Rzayev (rzayev-sehriyar) wrote :

Percona now uses JIRA for bug reports so this bug report is migrated to: https://jira.percona.com/browse/PXC-1903

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.