Test suite "wait_for_slaves()" doesn't check replication is running

Bug #1011380 reported by Baron Schwartz
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Percona Toolkit moved to https://jira.percona.com/projects/PT
Triaged
Undecided
Unassigned

Bug Description

I found that the test suite will hang indefinitely if replication is actually broken and wait_for_slaves() is called. This happens, for example, if the sandboxes are set with BINLOG_FORMAT=ROW during setup. The 12346 server will die like this:

               Last_SQL_Error: Could not execute Update_rows event on table test.t3; Can't find record in 't3', Error_code: 1032; handler error HA_ERR_END_OF_FILE; the event's master log mysql-bin.000001, end_log_pos 228033

To avoid this, we should make wait_for_slaves check if replication is running, and BAIL_OUT if not, so we stop as soon as possible after breakage, and can thus try to fix it.

Tags: test-failure
tags: added: test-failure
Changed in percona-toolkit:
status: New → Triaged
Revision history for this message
Shahriyar Rzayev (rzayev-sehriyar) wrote :

Percona now uses JIRA for bug reports so this bug report is migrated to: https://jira.percona.com/browse/PT-986

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.