PXC crashes while running pt-table-checksum
Affects | Status | Importance | Assigned to | Milestone |
---|---|---|---|---|
Percona XtraDB Cluster moved to https://jira.percona.com/projects/PXC | Expired | Undecided | Unassigned | |
Bug Description
We have a PXC cluster of 4 nodes. These nodes are Docker containers based on CentOS 7, running
Percona-
Percona-
Percona-
Percona-
Percona-
percona-
percona-
NODE3 and NODE4 are running on the same Docker host. NODE4 is configured as a slave, but the slave had not been started yet. While running
pt-table-checksum --host=NODE3 --user=root --password=$pw --recursion-
NODE4 crashed with the following output:
2016-08-29 12:09:50 1 [Note] 'CHANGE MASTER TO executed'. Previous state master_host='', master_port= 3306, master_log_file='', master_log_pos= 4, master_bind=''. New state master_
2016-08-29 12:11:36 1 [Note] 'CHANGE MASTER TO executed'. Previous state master_
2016-08-29 14:11:22 1 [Warning] IP address '192.168.3.2' could not be resolved: Name or service not known
InnoDB: Page directory corruption: infimum not pointed to
2016-08-29 14:11:57 7f42347f8700 InnoDB: Page dump in ascii and hex (16384 bytes):
len 16384; hex [32768x'0', not printed here]
InnoDB: End of page dump
2016-08-29 14:11:57 7f42347f8700 InnoDB: uncompressed page, stored checksum in field1 0, calculated checksums for field1: crc32 536728786, innodb 1575996416, none 3735928559, stored checksum in field2 0, calculated checksums for field2: crc32 536728786, innodb 1371122432, none 3735928559, page LSN 0 0, low 4 bytes of LSN at page end 0, page number (if stored to page already) 0, space id (if created with >= MySQL-4.1.1 and stored already) 0
InnoDB: Page may be a freshly allocated page
InnoDB: Page directory corruption: supremum not pointed to
2016-08-29 14:11:57 7f42347f8700 InnoDB: Page dump in ascii and hex (16384 bytes):
len 16384; hex [32768x'0', not printed here]
InnoDB: End of page dump
2016-08-29 14:11:58 7f42347f8700 InnoDB: uncompressed page, stored checksum in field1 0, calculated checksums for field1: crc32 536728786, innodb 1575996416, none 3735928559, stored checksum in field2 0, calculated checksums for field2: crc32 536728786, innodb 1371122432, none 3735928559, page LSN 0 0, low 4 bytes of LSN at page end 0, page number (if stored to page already) 0, space id (if created with >= MySQL-4.1.1 and stored already) 0
InnoDB: Page may be a freshly allocated page
14:11:58 UTC - mysqld got signal 11 ;
This could be because you hit a bug. It is also possible that this binary
or one of the libraries it was linked against is corrupt, improperly built,
or misconfigured. This error can also be caused by malfunctioning hardware.
We will try our best to scrape up some info that will hopefully help
diagnose the problem, but since we have already crashed,
something is definitely wrong and this may fail.
Please help us make Percona XtraDB Cluster better by reporting any
bugs at https:/
key_buffer_
read_buffer_
max_used_
max_threads=153
thread_count=4
connection_count=1
It is possible that mysqld could use up to
key_buffer_size + (read_buffer_size + sort_buffer_
Hope that's ok; if not, decrease some variables in the equation.
Thread pointer: 0x7f4200000990
Attempting backtrace. You can use the following information to find out
where mysqld died. If you see no messages after this, something went
terribly wrong...
stack_bottom = 7f42347f79a0 thread_stack 0x40000
mysqld(
mysqld(
/usr/lib64/
mysqld[0x98f726]
mysqld[0xa3f81a]
mysqld[0x9e5619]
mysqld[0x9257d9]
mysqld(
mysqld(
mysqld(
mysqld(
mysqld[0x836b0d]
mysqld(
mysqld(
mysqld(
mysqld(
mysqld(
mysqld(
mysqld(
mysqld(
mysqld(
/usr/lib64/
/usr/lib64/
/usr/lib64/
/usr/lib64/
/usr/lib64/
/usr/lib64/
/usr/lib64/
/usr/lib64/
mysqld[0x5b1834]
mysqld(
/usr/lib64/
/usr/lib64/
Trying to get some variables.
Some pointers may be invalid and cause the dump to abort.
Query (7f4200008fd0): is an invalid pointer
Connection ID (thread ID): 3
Status: NOT_KILLED
You may download the Percona XtraDB Cluster operations manual by visiting
http://
in the manual which will help you identify the cause of the crash.
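The page dump above reports a 16384-byte page that consists entirely of zeros (`hex [32768x'0', not printed here]`, with all stored checksums and the LSN at 0). A minimal sketch for finding such all-zero pages in a copy of a tablespace file follows. It assumes the 16 KiB page size shown in the log. The function name `find_zero_pages` is hypothetical and not part of any Percona tool, and nothing in this report identifies which file the page came from.

```python
import io

PAGE_SIZE = 16384  # assumption: matches "16384 bytes" in the page dump above


def find_zero_pages(stream, page_size=PAGE_SIZE):
    """Return the numbers of all full pages that contain only zero bytes.

    `stream` is any binary file-like object, e.g. a copy of a tablespace
    file opened with open(path, "rb"). A trailing partial page is ignored.
    """
    zero_pages = []
    page_no = 0
    while True:
        page = stream.read(page_size)
        if len(page) < page_size:
            break  # end of file (or incomplete trailing page)
        if not any(page):  # every byte is 0x00
            zero_pages.append(page_no)
        page_no += 1
    return zero_pages


if __name__ == "__main__":
    # Synthetic three-page example: pages 0 and 2 are all zeros.
    sample = bytes(PAGE_SIZE) + b"\x01" + bytes(PAGE_SIZE - 1) + bytes(PAGE_SIZE)
    print(find_zero_pages(io.BytesIO(sample)))
```

An all-zero page is not necessarily corrupt; as the log itself notes, it "may be a freshly allocated page". The sketch only locates candidates for closer inspection and should be run against a copy, not a file that mysqld has open.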
Changed in percona-xtradb-cluster: | |
status: | New → Incomplete |
If I understand your setup correctly:
Setup:
-----
You have 4 nodes (3 PXC nodes and 1 independent slave using PXC binaries with wsrep_provider=none).
n1, n2, n3 (pxc-node)
n4 (independent slave configured to replicate from pxc-cluster-node n3).
Process:
-------
Cluster nodes are active and running, and you can execute the needed workload on a cluster node without any issues. Eventually you introduce n4.
a. How do you sync n4 from n3, or do you let it catch up on its own?
b. At what stage do you run pt-table-checksum? (After n4 has caught up completely?)
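
To make question (b) concrete, here is a minimal sketch of what "caught up" could mean for n4. It assumes you have fetched one row of `SHOW SLAVE STATUS` from n4 as a dictionary, by whatever client you use. The helper name `replica_caught_up` and the `max_lag_seconds` threshold are hypothetical; the field names are the standard `SHOW SLAVE STATUS` columns.

```python
def replica_caught_up(status, max_lag_seconds=0):
    """Decide whether a replica looks caught up, given one SHOW SLAVE STATUS row.

    `status` maps column names to values. Returns False when either
    replication thread is stopped or when the lag is unknown (NULL/None).
    """
    io_running = status.get("Slave_IO_Running") == "Yes"
    sql_running = status.get("Slave_SQL_Running") == "Yes"
    lag = status.get("Seconds_Behind_Master")
    if not (io_running and sql_running) or lag is None:
        return False
    return int(lag) <= max_lag_seconds


if __name__ == "__main__":
    # A replica whose threads were never started (as described in this report).
    not_started = {
        "Slave_IO_Running": "No",
        "Slave_SQL_Running": "No",
        "Seconds_Behind_Master": None,
    }
    print(replica_caught_up(not_started))
```

`Seconds_Behind_Master` is only a rough indicator, so treat a True result as "probably caught up" rather than as proof that the data is consistent.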