Activity log for bug #1042727

Date Who What changed Old value New value Message
2012-08-28 13:06:21 Baron Schwartz bug added bug
2012-08-29 01:00:14 Brian Fraser percona-toolkit: status New Confirmed
2012-09-10 19:04:04 Baron Schwartz summary pt-table-checksum doesn't keep the master DBH alive pt-table-checksum doesn't reconnect the slave $dbh
2012-09-10 19:04:53 Baron Schwartz description When replication is very delayed, pt-table-checksum will not keep its connection to the master alive, and when the replica catches up or if it dies for some reason, we get an error. It looks like this: ================ 08-27T09:44:10 Error waiting for the last checksum of table <...> to replicate to replica <...>: DBD::mysql::db selectrow_array failed: MySQL server has gone away [for Statement "SELECT MAX(chunk) FROM `percona`.`checksum` WHERE ... at pt-table-checksum line 8581. Check that the replica is running and has the replicate table `percona`.`checksum`. Checking the replica for checksum differences will probably cause another error. 08-27T09:44:10 Error checking for checksum differences of table <...> on replica <...>: DBD::mysql::db selectall_arrayref failed: MySQL server has gone away [for Statement "SELECT CONCAT(db, '.', tbl) AS `table`, chunk, chunk_index, lower_boundary, upper_boundary, COALESCE(this_cnt-master_cnt, 0) AS cnt_diff, COALESCE(this_crc <> master_crc OR ISNULL(master_crc) <> ISNULL(this_crc), 0) AS crc_diff, this_cnt, master_cnt, this_crc, master_crc FROM `rkdb`.`archivechecksum` WHERE (master_cnt <> this_cnt OR master_crc <> this_crc OR ISNULL(master_crc) <> ISNULL(this_crc)) AND (db='...' AND tbl='...')"] at pt-table-checksum line 4118. Check that the replica is running and has the replicate table `percona`.`checksum`. ================ I think the tool needs to do a keepalive SELECT 1 or something like that. When replication is very delayed, pt-table-checksum will not keep its connection to the replica [was:master] alive, and when the replica catches up or if it dies for some reason, we get an error. It looks like this: ================ 08-27T09:44:10 Error waiting for the last checksum of table <...> to replicate to replica <...>: DBD::mysql::db selectrow_array failed: MySQL server has gone away [for Statement "SELECT MAX(chunk) FROM `percona`.`checksum` WHERE ... at pt-table-checksum line 8581. Check that the replica is running and has the replicate table `percona`.`checksum`. Checking the replica for checksum differences will probably cause another error. 08-27T09:44:10 Error checking for checksum differences of table <...> on replica <...>: DBD::mysql::db selectall_arrayref failed: MySQL server has gone away [for Statement "SELECT CONCAT(db, '.', tbl) AS `table`, chunk, chunk_index, lower_boundary, upper_boundary, COALESCE(this_cnt-master_cnt, 0) AS cnt_diff, COALESCE(this_crc <> master_crc OR ISNULL(master_crc) <> ISNULL(this_crc), 0) AS crc_diff, this_cnt, master_cnt, this_crc, master_crc FROM `rkdb`.`archivechecksum` WHERE (master_cnt <> this_cnt OR master_crc <> this_crc OR ISNULL(master_crc) <> ISNULL(this_crc)) AND (db='...' AND tbl='...')"] at pt-table-checksum line 4118. Check that the replica is running and has the replicate table `percona`.`checksum`. ================ I think the tool needs to reconnect to replicas. [redacted: I think the tool needs to do a keepalive SELECT 1 or something like that.]
2012-10-01 20:22:12 Daniel Nichter tags pt-table-checksum error-recovery pt-table-checksum
2015-04-14 11:54:27 Brian Bird bug added subscriber Brian Bird
2015-06-24 22:07:19 Daniel Nichter percona-toolkit: status Confirmed In Progress
2015-06-24 22:07:22 Daniel Nichter percona-toolkit: assignee Daniel Nichter (daniel-nichter)
2015-06-24 22:07:31 Daniel Nichter percona-toolkit: importance Undecided High
2015-06-25 03:20:12 Daniel Nichter percona-toolkit: status In Progress Fix Committed
2015-07-10 21:34:45 Frank Cizmich percona-toolkit: milestone 2.2.15
2015-08-28 12:42:55 Hrvoje Matijakovic percona-toolkit: status Fix Committed Fix Released
2015-09-17 17:44:58 Frank Cizmich percona-toolkit: importance High Medium
2015-09-17 17:45:02 Frank Cizmich percona-toolkit: importance Medium High