Random crash on the same node
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Percona XtraDB Cluster moved to https://jira.percona.com/projects/PXC |
Fix Released
|
High
|
Krunal Bauskar |
Bug Description
This is the third time the same node crashes.
First 2 times were consistent at 2am, I then upgraded from Ubuntu Precise to Trusty to see if that would change anything, and now I'm running with a compiled version of Xtradb Cluster 5.6.17-65.0 on both servers (third is a garbd instance).
Memory does not seem to be an issue http://
Total memory 16GB.
innodb_
2014-06-08 13:03:19 27080 [Warning] WSREP: BF applier failed to open_and_
2014-06-08 13:03:19 27080 [Warning] WSREP: RBR event 3 Update_rows apply warning: 1615, 10486174
2014-06-08 13:03:19 27080 [Warning] WSREP: Failed to apply app buffer: seqno: 10486174, status: 1
at galera/
Retrying 2th time
2014-06-08 13:03:19 27080 [Warning] WSREP: BF applier failed to open_and_
2014-06-08 13:03:19 27080 [Warning] WSREP: RBR event 3 Update_rows apply warning: 1615, 10486174
2014-06-08 13:03:19 27080 [Warning] WSREP: Failed to apply app buffer: seqno: 10486174, status: 1
at galera/
Retrying 3th time
2014-06-08 13:03:19 27080 [Warning] WSREP: BF applier failed to open_and_
2014-06-08 13:03:19 27080 [Warning] WSREP: RBR event 3 Update_rows apply warning: 1615, 10486174
2014-06-08 13:03:19 27080 [Warning] WSREP: Failed to apply app buffer: seqno: 10486174, status: 1
at galera/
Retrying 4th time
2014-06-08 13:03:19 27080 [Warning] WSREP: BF applier failed to open_and_
2014-06-08 13:03:19 27080 [Warning] WSREP: RBR event 3 Update_rows apply warning: 1615, 10486174
2014-06-08 13:03:19 27080 [Warning] WSREP: failed to replay trx: source: 8739bc85-
2014-06-08 13:03:19 27080 [Warning] WSREP: Failed to apply trx 10486174 4 times
2014-06-08 13:03:19 27080 [ERROR] WSREP: trx_replay failed for: 6, query: void
2014-06-08 13:03:19 27080 [ERROR] Aborting
2014-06-08 13:03:21 27080 [Note] WSREP: killing local connection: 746345
2014-06-08 13:03:21 27080 [Note] WSREP: killing local connection: 746348
2014-06-08 13:03:21 27080 [Note] WSREP: killing local connection: 746349
2014-06-08 13:03:21 27080 [Note] WSREP: Closing send monitor...
2014-06-08 13:03:21 27080 [Note] WSREP: Closed send monitor.
2014-06-08 13:03:21 27080 [Note] WSREP: gcomm: terminating thread
2014-06-08 13:03:21 27080 [Note] WSREP: gcomm: joining thread
2014-06-08 13:03:21 27080 [Note] WSREP: gcomm: closing backend
2014-06-08 13:03:22 27080 [Note] WSREP: view(view_
} joined {
} left {
} partitioned {
})
2014-06-08 13:03:22 27080 [Note] WSREP: view((empty))
2014-06-08 13:03:22 27080 [Note] WSREP: New COMPONENT: primary = no, bootstrap = no, my_idx = 0, memb_num = 1
2014-06-08 13:03:22 27080 [Note] WSREP: gcomm: closed
2014-06-08 13:03:22 27080 [Note] WSREP: Flow-control interval: [16, 16]
2014-06-08 13:03:22 27080 [Note] WSREP: Received NON-PRIMARY.
2014-06-08 13:03:22 27080 [Note] WSREP: Shifting SYNCED -> OPEN (TO: 10486223)
2014-06-08 13:03:22 27080 [Note] WSREP: Received self-leave message.
2014-06-08 13:03:22 27080 [Note] WSREP: Flow-control interval: [0, 0]
2014-06-08 13:03:22 27080 [Note] WSREP: Received SELF-LEAVE. Closing connection.
2014-06-08 13:03:22 27080 [Note] WSREP: Shifting OPEN -> CLOSED (TO: 10486223)
2014-06-08 13:03:22 27080 [Note] WSREP: RECV thread exiting 0: Success
2014-06-08 13:03:22 27080 [Note] WSREP: recv_thread() joined.
2014-06-08 13:03:22 27080 [Note] WSREP: Closing replication queue.
2014-06-08 13:03:22 27080 [Note] WSREP: Closing slave action queue.
2014-06-08 13:03:22 27080 [Note] WSREP: Service disconnected.
2014-06-08 13:03:22 27080 [Note] WSREP: rollbacker thread exiting
2014-06-08 13:03:23 27080 [Note] WSREP: Some threads may fail to exit.
2014-06-08 13:03:23 27080 [Note] Binlog end
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'partition'
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'PERFORMANCE_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_SYS_FIELDS'
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_SYS_TABLES'
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_FT_CONFIG'
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_FT_DELETED'
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_METRICS'
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_CMPMEM'
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_CMP_RESET'
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_CMP'
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_LOCK_WAITS'
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_LOCKS'
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'INNODB_TRX'
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'XTRADB_RSEG'
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'XTRADB_
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'XTRADB_READ_VIEW'
2014-06-08 13:03:23 27080 [Note] Shutting down plugin 'InnoDB'
2014-06-08 13:03:23 27080 [Note] InnoDB: FTS optimize thread exiting.
2014-06-08 13:03:23 27080 [Note] InnoDB: Starting shutdown...
11:03:23 UTC - mysqld got signal 11 ;
This could be because you hit a bug. It is also possible that this binary
or one of the libraries it was linked against is corrupt, improperly built,
or misconfigured. This error can also be caused by malfunctioning hardware.
We will try our best to scrape up some info that will hopefully help
diagnose the problem, but since we have already crashed,
something is definitely wrong and this may fail.
Please help us make Percona XtraDB Cluster better by reporting any
bugs at https:/
key_buffer_
read_buffer_
max_used_
max_threads=502
thread_count=12
connection_count=12
It is possible that mysqld could use up to
key_buffer_size + (read_buffer_size + sort_buffer_
Hope that's ok; if not, decrease some variables in the equation.
Thread pointer: 0x10a04b10
Attempting backtrace. You can use the following information to find out
where mysqld died. If you see no messages after this, something went
terribly wrong...
stack_bottom = 7fb778271e20 thread_stack 0x30000
/usr/local/
/usr/local/
/lib/x86_
/usr/local/
/usr/local/
/usr/local/
/usr/local/
/usr/local/
/usr/local/
/usr/local/
/usr/local/
/usr/local/
/usr/local/
/usr/local/
/usr/local/
/lib/x86_
/lib/x86_
Trying to get some variables.
Some pointers may be invalid and cause the dump to abort.
Query (0): is an invalid pointer
Connection ID (thread ID): 746356
Status: NOT_KILLED
You may download the Percona XtraDB Cluster operations manual by visiting
http://
in the manual which will help you identify the cause of the crash.
140608 13:03:23 mysqld_safe Number of processes running now: 0
140608 13:03:23 mysqld_safe WSREP: not restarting wsrep node automatically
140608 13:03:23 mysqld_safe mysqld from pid file /var/run/
tags: | added: i66365 |
Changed in percona-xtradb-cluster: | |
milestone: | none → 5.6.29-25.15 |
Changed in percona-xtradb-cluster: | |
status: | Fix Committed → Fix Released |
Just to add, when starting the crashed node, it did a SST
140608 22:57:11 mysqld_safe WSREP: Recovered position fc34347d- e515-11e3- ab6a-5f5ace43d8 d1:10486173 position var submitted: 'fc34347d- e515-11e3- ab6a-5f5ace43d8 d1:10486173' defaults_ for_timestamp server option (see documentation for more details). mysql/share/ english/ . libgalera_ smm.so' 0000-0000- 0000-0000000000 00:-1 check_period = PT0.5S; evs.inactive_ timeout = PT15S; evs.join_ retrans_ period = PT
2014-06-08 22:57:11 0 [Note] WSREP: wsrep_start_
2014-06-08 22:57:11 0 [Warning] TIMESTAMP with implicit DEFAULT value is deprecated. Please use --explicit_
2014-06-08 22:57:11 7727 [Warning] Using pre 5.5 semantics to load error messages from /usr/local/
2014-06-08 22:57:11 7727 [Warning] If this is not intended, refer to the documentation for valid usage of --lc-messages-dir and --language parameters.
2014-06-08 22:57:11 7727 [Note] WSREP: Read nil XID from storage engines, skipping position init
2014-06-08 22:57:11 7727 [Note] WSREP: wsrep_load(): loading provider library '/usr/lib/
2014-06-08 22:57:11 7727 [Note] WSREP: wsrep_load(): Galera 3.5(r178) by Codership Oy <email address hidden> loaded successfully.
2014-06-08 22:57:11 7727 [Note] WSREP: CRC-32C: using hardware acceleration.
2014-06-08 22:57:11 7727 [Note] WSREP: Found saved state: 00000000-
2014-06-08 22:57:11 7727 [Note] WSREP: Passing config to GCS: base_host = 10.41.172.66; base_port = 4567; cert.log_conflicts = no; debug = no; evs.inactive_
2014-06-08 22:57:11 7727 [Note] WSREP: Service thread queue flushed.