xtradb cluster instance crashed in openshift origin 1.2
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Percona XtraDB Cluster moved to https://jira.percona.com/projects/PXC |
New
|
Undecided
|
Unassigned |
Bug Description
Sorry - I'm not sure what information you want here. So I have pasted the log file. The stacktrace is at the bottom.
The environment is running in a openshift origin cluster, v1.2
Host is CentOS Linux release 7.2.1511 (Core)
Using a custom container (extract from dockerfile):
FROM ubuntu:precise
...
RUN echo "deb http://
RUN echo "deb-src http://
RUN apt-key adv --keyserver keys.gnupg.net --recv-keys 1C4CBDCDCD2EFD2A
RUN apt-get update && apt-get upgrade -y
RUN apt-get install -y percona-
The datadir in the container uses a host path volume type in origin.
The volume is on a ext4 filesystem on the host.
I'm working on a staging cluster so am currently prototyping. I have seen this error a few times. Restarting the container results in the same error. The only way to resolve it is to rm -f the data dir then restart the container (which triggers a SST)
I'll hang onto the datadir so that I can run any further debugging with it if you direct me to.
Here is the log :
exec mysqld --log-error=
/lib/x86_
Trying to get some variables.
Some pointers may be invalid and cause the dump to abort.
Query (0): Connection ID (thread ID): 1
Status: NOT_KILLED
You may download the Percona XtraDB Cluster operations manual by visiting
http://
in the manual which will help you identify the cause of the crash.
2016-08-21 15:18:16 0 [Warning] option 'wsrep_
2016-08-21 15:18:16 0 [Warning] TIMESTAMP with implicit DEFAULT value is deprecated. Please use --explicit_
2016-08-21 15:18:16 0 [Note] mysqld (mysqld 5.6.30-76.3-56) starting as process 1 ...
2016-08-21 15:18:16 1 [Note] WSREP: Read nil XID from storage engines, skipping position init
2016-08-21 15:18:16 1 [Note] WSREP: wsrep_load(): loading provider library '/usr/lib/
2016-08-21 15:18:16 1 [Note] WSREP: wsrep_load(): Galera 3.16(r5c765eb) by Codership Oy <email address hidden> loaded successfully.
2016-08-21 15:18:16 1 [Note] WSREP: CRC-32C: using hardware acceleration.
2016-08-21 15:18:16 1 [Note] WSREP: Found saved state: 00000000-
2016-08-21 15:18:16 1 [Note] WSREP: Passing config to GCS: base_dir = /var/lib/mysql/; base_host = 10.128.1.3; base_port = 4567; cert.log_conflicts = no; debug = no; evs.auto_evict = 0; evs.delay_margin = PT1S; evs.delayed_
2016-08-21 15:18:16 1 [Note] WSREP: Service thread queue flushed.
2016-08-21 15:18:16 1 [Note] WSREP: Assign initial position for certification: -1, protocol version: -1
2016-08-21 15:18:16 1 [Note] WSREP: wsrep_sst_grab()
2016-08-21 15:18:16 1 [Note] WSREP: Start replication
2016-08-21 15:18:16 1 [Note] WSREP: Setting initial position to 00000000-
2016-08-21 15:18:16 1 [Note] WSREP: protonet asio version 0
2016-08-21 15:18:16 1 [Note] WSREP: Using CRC-32C for message checksums.
2016-08-21 15:18:16 1 [Note] WSREP: backend: asio
2016-08-21 15:18:16 1 [Note] WSREP: gcomm thread scheduling priority set to other:0
2016-08-21 15:18:16 1 [Warning] WSREP: access file(/var/
2016-08-21 15:18:16 1 [Note] WSREP: restore pc from disk failed
2016-08-21 15:18:16 1 [Note] WSREP: GMCast version 0
2016-08-21 15:18:16 1 [Warning] WSREP: Failed to resolve tcp://xtradb-
2016-08-21 15:18:16 1 [Note] WSREP: (7c144311, 'tcp://
2016-08-21 15:18:16 1 [Note] WSREP: (7c144311, 'tcp://
2016-08-21 15:18:16 1 [Note] WSREP: EVS version 0
2016-08-21 15:18:16 1 [Note] WSREP: gcomm: connecting to group 'xtradb', peer 'xtradb-
2016-08-21 15:18:16 1 [Note] WSREP: (7c144311, 'tcp://
2016-08-21 15:18:17 1 [Note] WSREP: declaring 600ebe01 at tcp://10.
2016-08-21 15:18:17 1 [Note] WSREP: declaring 95940ff9 at tcp://10.
2016-08-21 15:18:17 1 [Note] WSREP: Node 600ebe01 state prim
2016-08-21 15:18:17 1 [Note] WSREP: view(view_
600ebe01,0
7c144311,0
95940ff9,0
} joined {
} left {
} partitioned {
})
2016-08-21 15:18:17 1 [Note] WSREP: save pc into disk
2016-08-21 15:18:17 1 [Note] WSREP: gcomm: connected
2016-08-21 15:18:17 1 [Note] WSREP: Changing maximum packet size to 64500, resulting msg size: 32636
2016-08-21 15:18:17 1 [Note] WSREP: Shifting CLOSED -> OPEN (TO: 0)
2016-08-21 15:18:17 1 [Note] WSREP: Opened channel 'xtradb'
2016-08-21 15:18:17 1 [Note] WSREP: New COMPONENT: primary = yes, bootstrap = no, my_idx = 1, memb_num = 3
2016-08-21 15:18:17 1 [Note] WSREP: STATE EXCHANGE: Waiting for state UUID.
2016-08-21 15:18:17 1 [Note] WSREP: Waiting for SST to complete.
2016-08-21 15:18:17 1 [Note] WSREP: STATE EXCHANGE: sent state msg: 7c64dc2a-
2016-08-21 15:18:17 1 [Note] WSREP: STATE EXCHANGE: got state msg: 7c64dc2a-
2016-08-21 15:18:17 1 [Note] WSREP: STATE EXCHANGE: got state msg: 7c64dc2a-
2016-08-21 15:18:17 1 [Note] WSREP: STATE EXCHANGE: got state msg: 7c64dc2a-
2016-08-21 15:18:17 1 [Note] WSREP: Quorum results:
version = 4,
component = PRIMARY,
conf_id = 42,
members = 2/3 (joined/total),
act_id = 516900,
last_appl. = -1,
protocols = 0/7/3 (gcs/repl/appl),
group UUID = 84871112-
2016-08-21 15:18:17 1 [Note] WSREP: Flow-control interval: [28, 28]
2016-08-21 15:18:17 1 [Note] WSREP: Shifting OPEN -> PRIMARY (TO: 516900)
2016-08-21 15:18:17 1 [Note] WSREP: State transfer required:
Group state: 84871112-
Local state: 00000000-
2016-08-21 15:18:17 1 [Note] WSREP: New cluster view: global state: 84871112-
2016-08-21 15:18:17 1 [Warning] WSREP: Gap in state sequence. Need state transfer.
2016-08-21 15:18:17 1 [Note] WSREP: Running: 'wsrep_
WSREP_SST: [INFO] Streaming with xbstream (20160821 15:18:17.656)
WSREP_SST: [INFO] Using socat as streamer (20160821 15:18:17.658)
WSREP_SST: [INFO] Stale sst_in_progress file: /var/lib/
WSREP_SST: [INFO] Evaluating timeout -k 1810 1800 socat -u TCP-LISTEN:
2016-08-21 15:18:17 1 [Note] WSREP: Prepared SST request: xtrabackup-
2016-08-21 15:18:17 1 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
2016-08-21 15:18:17 1 [Note] WSREP: REPL Protocols: 7 (3, 2)
2016-08-21 15:18:17 1 [Note] WSREP: Service thread queue flushed.
2016-08-21 15:18:17 1 [Note] WSREP: Assign initial position for certification: 516900, protocol version: 3
2016-08-21 15:18:17 1 [Note] WSREP: Service thread queue flushed.
2016-08-21 15:18:17 1 [Warning] WSREP: Failed to prepare for incremental state transfer: Local state UUID (00000000-
at galera/
2016-08-21 15:18:17 1 [Warning] WSREP: Member 1.0 (xtradb-node02) requested state transfer from '10.128.0.9', but it is impossible to select State Transfer donor: No route to host
2016-08-21 15:18:17 1 [ERROR] WSREP: Requesting state transfer failed: -113(No route to host)
2016-08-21 15:18:17 1 [ERROR] WSREP: State transfer request failed unrecoverably: 113 (No route to host). Most likely it is due to inability to communicate with the cluster primary component. Restart required.
2016-08-21 15:18:17 1 [Note] WSREP: Closing send monitor...
2016-08-21 15:18:17 1 [Note] WSREP: Closed send monitor.
2016-08-21 15:18:17 1 [Note] WSREP: gcomm: terminating thread
2016-08-21 15:18:17 1 [Note] WSREP: gcomm: joining thread
2016-08-21 15:18:17 1 [Note] WSREP: gcomm: closing backend
2016-08-21 15:18:17 1 [Note] WSREP: view(view_
7c144311,0
} joined {
} left {
} partitioned {
600ebe01,0
95940ff9,0
})
2016-08-21 15:18:17 1 [Note] WSREP: view((empty))
2016-08-21 15:18:17 1 [Note] WSREP: New COMPONENT: primary = no, bootstrap = no, my_idx = 0, memb_num = 1
2016-08-21 15:18:17 1 [Note] WSREP: gcomm: closed
2016-08-21 15:18:17 1 [Note] WSREP: Flow-control interval: [16, 16]
2016-08-21 15:18:17 1 [Note] WSREP: Received NON-PRIMARY.
2016-08-21 15:18:17 1 [Note] WSREP: Shifting PRIMARY -> OPEN (TO: 516900)
2016-08-21 15:18:17 1 [Note] WSREP: Received self-leave message.
2016-08-21 15:18:17 1 [Note] WSREP: Flow-control interval: [0, 0]
2016-08-21 15:18:17 1 [Note] WSREP: Received SELF-LEAVE. Closing connection.
2016-08-21 15:18:17 1 [Note] WSREP: Shifting OPEN -> CLOSED (TO: 516900)
2016-08-21 15:18:17 1 [Note] WSREP: RECV thread exiting 0: Success
2016-08-21 15:18:17 1 [Note] WSREP: recv_thread() joined.
2016-08-21 15:18:17 1 [Note] WSREP: Closing replication queue.
2016-08-21 15:18:17 1 [Note] WSREP: Closing slave action queue.
2016-08-21 15:18:17 1 [Note] WSREP: mysqld: Terminated.
15:18:17 UTC - mysqld got signal 11 ;
This could be because you hit a bug. It is also possible that this binary
or one of the libraries it was linked against is corrupt, improperly built,
or misconfigured. This error can also be caused by malfunctioning hardware.
We will try our best to scrape up some info that will hopefully help
diagnose the problem, but since we have already crashed,
something is definitely wrong and this may fail.
Please help us make Percona XtraDB Cluster better by reporting any
bugs at https:/
key_buffer_size=0
read_buffer_
max_used_
max_threads=10002
thread_count=2
connection_count=0
It is possible that mysqld could use up to
key_buffer_size + (read_buffer_size + sort_buffer_
Hope that's ok; if not, decrease some variables in the equation.
Thread pointer: 0x7f54d4000990
Attempting backtrace. You can use the following information to find out
where mysqld died. If you see no messages after this, something went
terribly wrong...
stack_bottom = 7f54f405ea60 thread_stack 0x40000
mysqld(
mysqld(
/lib/x86_
/lib/x86_
/usr/lib/
/usr/lib/
/usr/lib/
/usr/lib/
/usr/lib/
/usr/lib/
/usr/lib/
/usr/lib/
/usr/lib/
mysqld[0x5e1d01]
mysqld(
/lib/x86_
/lib/x86_
Trying to get some variables.
Some pointers may be invalid and cause the dump to abort.
Query (0): Connection ID (thread ID): 1
Status: NOT_KILLED
You may download the Percona XtraDB Cluster operations manual by visiting
http://
in the manual which will help you identify the cause of the crash.
Percona now uses JIRA for bug reports so this bug report is migrated to: https:/ /jira.percona. com/browse/ PXC-1921