Simultanious Crash of two nodes
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
MySQL patches by Codership |
Fix Released
|
Medium
|
Seppo Jaakola | ||
Percona XtraDB Cluster moved to https://jira.percona.com/projects/PXC |
Fix Released
|
Undecided
|
Unassigned |
Bug Description
For some days now we have two (out of 4) nodes crash at exactly the same time with the following error messages in the log.
Recovery is only possible with a full SST.
-- snip --
-- snip --
today:
Slave SQL: Could not execute Delete_rows event on table somedb.
120630 1:30:10 [Warning] WSREP: RBR event 4 Delete_rows apply warning: 120, 30131914
120630 1:30:10 [ERROR] WSREP: Failed to apply trx: source: f879462c-
thursday:
box1:
120628 8:51:06 [ERROR] Slave SQL: Could not execute Delete_rows event on
table somedb.
Error_code: 1032; handler error HA_ERR_
log FIRST, end_log_pos 398, Error_code: 1032
120628 8:51:06 [Warning] WSREP: RBR event 4 Delete_rows apply warning:
120, 25220082
120628 8:51:06 [ERROR] WSREP: Failed to apply trx: source:
1337a766-
CERTIFYING flags: 1 conn_id: 52899913 trx_id: 3397040790 seqnos (l:
17019008, g: 25220082, s: 25220081, d: 25220080, ts: 134086626520285
120628 8:51:06 [ERROR] WSREP: Failed to apply app buffer:
ÃþëO^SÃ<8a>^A, seqno: 25220082, status: WSREP_FATAL
at galera/
at galera/
120628 8:51:06 [ERROR] WSREP: Node consistency compromized, aborting...
120628 8:51:06 [Note] WSREP: Closing send monitor...
120628 8:51:06 [Note] WSREP: Closed send monitor.
box2:
120628 8:51:06 [ERROR] Slave SQL: Could not execute Delete_rows event on
table somedb.
Error_code: 1032; handler error HA_ERR_
log FIRST, end_log_pos 9845, Error_code: 1032
120628 8:51:06 [Warning] WSREP: RBR event 10 Delete_rows apply warning:
120, 25220082
120628 8:51:06 [ERROR] WSREP: Failed to apply trx: source:
1337a766-
CERTIFYING flags: 1 conn_id: 52899913 trx_id: 3397040790 seqnos (l:
17537022, g: 25220082, s: 25220081, d: 25220080, ts: 134086626520285
120628 8:51:06 [ERROR] WSREP: Failed to apply app buffer:
ÃþëO^SÃ<8a>^A, seqno: 25220082, status: WSREP_FATAL
at galera/
at galera/
120628 8:51:06 [ERROR] WSREP: Node consistency compromized, aborting...
120628 8:51:06 [Note] WSREP: Closing send monitor...
120628 8:51:06 [Note] WSREP: Closed send monitor.
Changed in codership-mysql: | |
assignee: | nobody → Seppo Jaakola (seppo-jaakola) |
Changed in codership-mysql: | |
status: | New → Incomplete |
assignee: | Seppo Jaakola (seppo-jaakola) → nobody |
Changed in percona-xtradb-cluster: | |
status: | New → Incomplete |
tags: | added: i29618 |
Changed in codership-mysql: | |
status: | Confirmed → In Progress |
importance: | Undecided → Medium |
assignee: | nobody → Seppo Jaakola (seppo-jaakola) |
Changed in codership-mysql: | |
status: | Fix Committed → Fix Released |
Changed in percona-xtradb-cluster: | |
milestone: | none → 5.5.30-23.7.4 |
status: | Confirmed → Fix Released |
Please attach following information:
1. full mysql error logs from all the cluster nodes.
2. mysql configuration, preferably by:
mysql> show global variables\G
3. table definitions for: some_table_ one; some_table_ two;
mysql> show create table somedb.
mysql> show create table somedb.
If this is sensitive information, you can send directly to: <email address hidden>
What kind of SQL load has been issued for the cluster? Especially how does the DELETE statement for these tables look like?
What is the life time of the rows in 'some_table_one': and 'some_table_two'
* when is the row first inserted?
* are there later updates on the row?
* can several sessions try to delete the row?