Regression of bug 1232747

Bug #1677555 reported by Jonathan Cutting
This bug affects 2 people
Affects: Percona XtraDB Cluster moved to https://jira.percona.com/projects/PXC
Status: Fix Released
Importance: Undecided
Assigned to: Unassigned

Bug Description

Upgraded to version 5.6.35-26.20. A short while after a subsequent staggered reboot of the nodes, they failed, exhibiting the hallmarks of bug 1232747.

The cluster de-synced completely. Bootstrapping from a single node was attempted, but recovery was only possible by downgrading to version 5.6.35-26.19.

See log output below for stack trace:

Version: '5.6.35-80.0-56-log' socket: '/var/run/mysqld/mysqld.sock' port: 3306 Percona XtraDB Cluster (GPL), Release rel80.0, Revision 62752df, WSREP version 26.20, wsrep_26.20
2017-03-30 09:53:37 7815 [Note] WSREP: Initialized wsrep sidno 2
2017-03-30 09:53:37 7815 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
2017-03-30 09:53:37 7815 [Note] WSREP: REPL Protocols: 7 (3, 2)
2017-03-30 09:53:37 7815 [Note] WSREP: Assign initial position for certification: 166051560, protocol version: 3
2017-03-30 09:53:37 7815 [Note] WSREP: Service thread queue flushed.
2017-03-30 09:53:37 7815 [Note] WSREP: GCache history reset: old(12d5ca61-b1a1-11e6-9911-7ec4afa00e5e:0) -> new(12d5ca61-b1a1-11e6-9911-7ec4afa00e5e:166051560)
2017-03-30 09:53:37 7815 [Note] WSREP: Synchronized with group, ready for connections
2017-03-30 09:53:37 7815 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
terminate called after throwing an instance of 'std::out_of_range'
  what(): vector::_M_range_check: __n (which is 4294967295) >= this->size() (which is 0)
08:53:59 UTC - mysqld got signal 6 ;
This could be because you hit a bug. It is also possible that this binary
or one of the libraries it was linked against is corrupt, improperly built,
or misconfigured. This error can also be caused by malfunctioning hardware.
We will try our best to scrape up some info that will hopefully help
diagnose the problem, but since we have already crashed,
something is definitely wrong and this may fail.
Please help us make Percona XtraDB Cluster better by reporting any
bugs at https://bugs.launchpad.net/percona-xtradb-cluster

key_buffer_size=16777216
read_buffer_size=131072
max_used_connections=10
max_threads=402
thread_count=20
connection_count=3
It is possible that mysqld could use up to
key_buffer_size + (read_buffer_size + sort_buffer_size)*max_threads = 176557 K bytes of memory
Hope that's ok; if not, decrease some variables in the equation.

Thread pointer: 0x0
Attempting backtrace. You can use the following information to find out
where mysqld died. If you see no messages after this, something went
terribly wrong...
stack_bottom = 0 thread_stack 0x30000
/usr/sbin/mysqld(my_print_stacktrace+0x2c)[0x9144ac]
/usr/sbin/mysqld(handle_fatal_signal+0x479)[0x68fc59]
/lib/x86_64-linux-gnu/libpthread.so.0(+0xf890)[0x7f0221f37890]
/lib/x86_64-linux-gnu/libc.so.6(gsignal+0x37)[0x7f021febd067]
/lib/x86_64-linux-gnu/libc.so.6(abort+0x148)[0x7f021febe448]
/usr/lib/x86_64-linux-gnu/libstdc++.so.6(_ZN9__gnu_cxx27__verbose_terminate_handlerEv+0x15d)[0x7f02207aab3d]
/usr/lib/x86_64-linux-gnu/libstdc++.so.6(+0x5ebb6)[0x7f02207a8bb6]
/usr/lib/x86_64-linux-gnu/libstdc++.so.6(+0x5ec01)[0x7f02207a8c01]
/usr/lib/x86_64-linux-gnu/libstdc++.so.6(+0x5ee19)[0x7f02207a8e19]
/usr/lib/x86_64-linux-gnu/libstdc++.so.6(_ZSt24__throw_out_of_range_fmtPKcz+0x11f)[0x7f02207fecdf]
/usr/sbin/mysqld[0xaf01de]
/usr/sbin/mysqld[0xaf3599]
/usr/sbin/mysqld[0xaf62d5]
/usr/sbin/mysqld[0xaf7760]
/lib/x86_64-linux-gnu/libpthread.so.0(+0x8064)[0x7f0221f30064]
/lib/x86_64-linux-gnu/libc.so.6(clone+0x6d)[0x7f021ff7062d]
You may download the Percona XtraDB Cluster operations manual by visiting
http://www.percona.com/software/percona-xtradb-cluster/. You may find information
in the manual which will help you identify the cause of the crash.

Krunal Bauskar (krunal-bauskar) wrote :

* Not sure if this is a regression or a new issue.
* Can you help with a test case?

Changed in percona-xtradb-cluster:
status: New → Incomplete

Michael De Swert (kode) wrote :

I have now encountered the same crash on 2 different nodes of a 3-node cluster. I upgraded 28 days ago to 5.6.35-26.20-3.wheezy. Today it crashed again immediately after mysql was restarted following the first crash. It is now running again. The cluster is still intact, though.

log + stack trace:
Version: '5.6.35-81.0-56' socket: '/var/run/mysqld/mysqld.sock' port: 3306 Percona XtraDB Cluster (GPL), Release rel81.0, Revision 7f9b6ae, WSREP version 26.20, wsrep_26.20
2017-05-19 15:34:48 6455 [Note] WSREP: Receiving IST: 36986 writesets, seqnos 8272149026-8272186012
terminate called after throwing an instance of 'std::out_of_range'
  what(): vector::_M_range_check
13:34:48 UTC - mysqld got signal 6 ;
This could be because you hit a bug. It is also possible that this binary
or one of the libraries it was linked against is corrupt, improperly built,
or misconfigured. This error can also be caused by malfunctioning hardware.
We will try our best to scrape up some info that will hopefully help
diagnose the problem, but since we have already crashed,
something is definitely wrong and this may fail.
Please help us make Percona XtraDB Cluster better by reporting any
bugs at https://bugs.launchpad.net/percona-xtradb-cluster

key_buffer_size=16777216
read_buffer_size=131072
max_used_connections=0
max_threads=2050
thread_count=2
connection_count=0
It is possible that mysqld could use up to
key_buffer_size + (read_buffer_size + sort_buffer_size)*max_threads = 833246 K bytes of memory
Hope that's ok; if not, decrease some variables in the equation.

Thread pointer: 0x0
Attempting backtrace. You can use the following information to find out
where mysqld died. If you see no messages after this, something went
terribly wrong...
stack_bottom = 0 thread_stack 0x30000
/usr/sbin/mysqld(my_print_stacktrace+0x2e)[0x91260e]
/usr/sbin/mysqld(handle_fatal_signal+0x491)[0x6c8f81]
/lib/x86_64-linux-gnu/libpthread.so.0(+0xf0a0)[0x7f86cf6240a0]
/lib/x86_64-linux-gnu/libc.so.6(gsignal+0x35)[0x7f86cd64e125]
/lib/x86_64-linux-gnu/libc.so.6(abort+0x180)[0x7f86cd6513a0]
/usr/lib/x86_64-linux-gnu/libstdc++.so.6(_ZN9__gnu_cxx27__verbose_terminate_handlerEv+0x11d)[0x7f86cdea689d]
/usr/lib/x86_64-linux-gnu/libstdc++.so.6(+0x63996)[0x7f86cdea4996]
/usr/lib/x86_64-linux-gnu/libstdc++.so.6(+0x639c3)[0x7f86cdea49c3]
/usr/lib/x86_64-linux-gnu/libstdc++.so.6(+0x63bee)[0x7f86cdea4bee]
/usr/lib/x86_64-linux-gnu/libstdc++.so.6(_ZSt20__throw_out_of_rangePKc+0x5d)[0x7f86cdef66fd]
/usr/sbin/mysqld[0xade2a6]
/usr/sbin/mysqld[0xadfa4d]
/usr/sbin/mysqld[0xae0f98]
/lib/x86_64-linux-gnu/libpthread.so.0(+0x6b50)[0x7f86cf61bb50]
/lib/x86_64-linux-gnu/libc.so.6(clone+0x6d)[0x7f86cd6f9fbd]
You may download the Percona XtraDB Cluster operations manual by visiting
http://www.percona.com/software/percona-xtradb-cluster/. You may find information
in the manual which will help you identify the cause of the crash.

Markus F. (digilist) wrote :

I have been seeing the same error for some days now. I am running a 3-node cluster, and two nodes are crashing constantly (at least once every 24 hours). The third node is running without any problems.

terminate called after throwing an instance of 'std::out_of_range'
  what(): vector::_M_range_check: __n (which is 4294967295) >= this->size() (which is 0)
07:59:26 UTC - mysqld got signal 6 ;
This could be because you hit a bug. It is also possible that this binary
or one of the libraries it was linked against is corrupt, improperly built,
or misconfigured. This error can also be caused by malfunctioning hardware.
We will try our best to scrape up some info that will hopefully help
diagnose the problem, but since we have already crashed,
something is definitely wrong and this may fail.
Please help us make Percona XtraDB Cluster better by reporting any
bugs at https://bugs.launchpad.net/percona-xtradb-cluster

key_buffer_size=16777216
read_buffer_size=131072
max_used_connections=37
max_threads=502
thread_count=23
connection_count=20
It is possible that mysqld could use up to
key_buffer_size + (read_buffer_size + sort_buffer_size)*max_threads = 4200276 K bytes of memory
Hope that's ok; if not, decrease some variables in the equation.

Thread pointer: 0x0
Attempting backtrace. You can use the following information to find out
where mysqld died. If you see no messages after this, something went
terribly wrong...
stack_bottom = 0 thread_stack 0x30000
/usr/sbin/mysqld(my_print_stacktrace+0x3b)[0x9272ab]
/usr/sbin/mysqld(handle_fatal_signal+0x49a)[0x68ef6a]
/lib/x86_64-linux-gnu/libpthread.so.0(+0x11390)[0x7f7a7169c390]
/lib/x86_64-linux-gnu/libc.so.6(gsignal+0x38)[0x7f7a70a56428]
/lib/x86_64-linux-gnu/libc.so.6(abort+0x16a)[0x7f7a70a5802a]
/usr/lib/x86_64-linux-gnu/libstdc++.so.6(_ZN9__gnu_cxx27__verbose_terminate_handlerEv+0x16d)[0x7f7a7139884d]
/usr/lib/x86_64-linux-gnu/libstdc++.so.6(+0x8d6b6)[0x7f7a713966b6]
/usr/lib/x86_64-linux-gnu/libstdc++.so.6(+0x8d701)[0x7f7a71396701]
/usr/lib/x86_64-linux-gnu/libstdc++.so.6(+0x8d919)[0x7f7a71396919]
/usr/lib/x86_64-linux-gnu/libstdc++.so.6(_ZSt24__throw_out_of_range_fmtPKcz+0x107)[0x7f7a713bf3f7]
/usr/sbin/mysqld[0xb06f1a]
/usr/sbin/mysqld[0xb08577]
/usr/sbin/mysqld[0xb09bd0]
/lib/x86_64-linux-gnu/libpthread.so.0(+0x76ba)[0x7f7a716926ba]
/lib/x86_64-linux-gnu/libc.so.6(clone+0x6d)[0x7f7a70b2782d]
You may download the Percona XtraDB Cluster operations manual by visiting
http://www.percona.com/software/percona-xtradb-cluster/. You may find information
in the manual which will help you identify the cause of the crash.
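For readers unfamiliar with the message, the `what()` text in the logs above is what libstdc++ reports when `std::vector::at()` is called with an index past the end of the vector. The value 4294967295 is consistent with -1, or some "invalid" sentinel, stored in a 32-bit unsigned integer, and `size()` of 0 means the vector was empty. The following is a minimal C++ sketch of that mechanism only; it is not taken from the MySQL or PXC source, and `kInvalidIndex` and `checked_lookup` are hypothetical names chosen for illustration.

```cpp
#include <cstdint>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical sentinel: -1 converted to a 32-bit unsigned integer,
// which is the 4294967295 seen in the log.
const std::uint32_t kInvalidIndex = static_cast<std::uint32_t>(-1);

// Hypothetical helper: looks up an element with the bounds-checked at()
// and returns the exception text instead of letting the exception escape.
std::string checked_lookup(const std::vector<int>& items, std::uint32_t index) {
    try {
        return std::to_string(items.at(index));
    } catch (const std::out_of_range& e) {
        // at() throws std::out_of_range when index >= items.size().
        return std::string("out_of_range: ") + e.what();
    }
}
```

Calling `checked_lookup({}, kInvalidIndex)` returns a string that starts with `out_of_range: `. The exact `what()` text depends on the standard library version, which would explain why one of the logs above shows only `vector::_M_range_check`. In mysqld the exception was not caught, so `std::terminate` called `abort()`, which is the signal 6 reported in the logs.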

Krunal Bauskar (krunal-bauskar) wrote :

The bug is in upstream MySQL. Here is the reference:

https://bugs.mysql.com/bug.php?id=84940

It is fixed in MySQL 5.7.18, and PXC will inherit the fix in due course.

Jonathan Cutting (jcutting) wrote :

Please can you confirm that this bug is fixed in 5.6.36-82? It appears to be a duplicate of https://bugs.launchpad.net/percona-server/+bug/1664519 which is noted as fixed in the release notes https://www.percona.com/doc/percona-server/5.6/release-notes/Percona-Server-5.6.36-82.0.html#module-5.6.36-82.0

Krunal Bauskar (krunal-bauskar) wrote :

This bug is for PXC, and we have yet to release PXC 5.6.36 or PXC 5.7.18. It will be fixed once we release those versions.

Krunal Bauskar (krunal-bauskar) wrote :

PXC-5.6.36 and PXC-5.7.18 have been released with the upstream fix, so this bug has been marked as done.
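As a rough aid for checking whether a node runs a build at or above the releases named above, the following C++ sketch parses the leading `major.minor.patch` of a version string, such as the `5.6.35-80.0-56-log` shown in the logs, and compares it with 5.6.36 or 5.7.18. The names `FixStatus` and `fix_status` and the parsing approach are illustrative assumptions, not part of PXC. Series other than 5.6 and 5.7 are reported as unknown, since this report says nothing about them.

```cpp
#include <cstdio>
#include <string>

// Hypothetical result type for the check below.
enum class FixStatus { Unknown, Missing, Present };

// Hypothetical helper: reads the leading "major.minor.patch" of a version
// string and compares the patch level with the fixed releases named in
// this report (5.6.36 and 5.7.18). Any suffix after the patch is ignored.
FixStatus fix_status(const std::string& version) {
    int major_no = 0, minor_no = 0, patch_no = 0;
    if (std::sscanf(version.c_str(), "%d.%d.%d",
                    &major_no, &minor_no, &patch_no) != 3) {
        return FixStatus::Unknown;  // not in the expected format
    }
    if (major_no != 5) {
        return FixStatus::Unknown;  // series not covered by this report
    }
    if (minor_no == 6) {
        return patch_no >= 36 ? FixStatus::Present : FixStatus::Missing;
    }
    if (minor_no == 7) {
        return patch_no >= 18 ? FixStatus::Present : FixStatus::Missing;
    }
    return FixStatus::Unknown;
}
```

With the version string from the original description, `fix_status("5.6.35-80.0-56-log")` yields `FixStatus::Missing`, while a string beginning with `5.6.36` yields `FixStatus::Present`.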

Changed in percona-xtradb-cluster:
status: Incomplete → Fix Released

Shahriyar Rzayev (rzayev-sehriyar) wrote :

Percona now uses JIRA for bug reports, so this bug report has been migrated to: https://jira.percona.com/browse/PXC-1965
