MMM dealing with events out-of-sync?

Bug #668178 reported by Arjen Lentz
This bug affects 1 person
Affects: mysql-mmm
Status: New
Importance: High
Assigned to: Unassigned
Milestone: (none)

Bug Description

This timeline does not make sense:
 a) cat has a problem
 b) cat is ok
 c) cat state changed because it is not ok
 d) cat state changed to AWAITING_RECOVERY

Step c should have happened before step b. If it did not, it should not have happened at all, since by then it was already clear that cat was ok again (as per b). Right?

2010/10/29 03:43:44 ERROR Check 'rep_threads' on 'eggs' has failed for 21 seconds! Message: ERROR: Timeout
2010/10/29 03:43:44 ERROR Check 'rep_backlog' on 'eggs' has failed for 21 seconds! Message: ERROR: Timeout
2010/10/29 03:43:44 ERROR Check 'mysql' on 'eggs' has failed for 21 seconds! Message: ERROR: Timeout
2010/10/29 03:44:09 ERROR Check 'rep_threads' on 'cat' has failed for 41 seconds! Message: ERROR: Timeout
2010/10/29 03:44:09 ERROR Check 'rep_backlog' on 'cat' has failed for 41 seconds! Message: ERROR: Timeout
2010/10/29 03:44:15 ERROR Check 'mysql' on 'cat' has failed for 46 seconds! Message: ERROR: Timeout
2010/10/29 03:44:44 FATAL State of host 'eggs' changed from ONLINE to HARD_OFFLINE (ping: OK, mysql: not OK)
2010/10/29 03:44:58 INFO Removing all roles from host 'eggs':
2010/10/29 03:45:09 INFO Removed role 'writer(74.207.240.17)' from host 'eggs'
2010/10/29 03:45:32 INFO Check 'rep_threads' on 'cat' is ok!
2010/10/29 03:45:32 INFO Check 'rep_backlog' on 'cat' is ok!
2010/10/29 03:45:32 INFO Check 'mysql' on 'cat' is ok!
2010/10/29 03:45:35 FATAL State of host 'cat' changed from ONLINE to HARD_OFFLINE (ping: OK, mysql: not OK)
2010/10/29 03:45:35 INFO Removing all roles from host 'cat':
2010/10/29 03:45:45 FATAL State of host 'cat' changed from HARD_OFFLINE to AWAITING_RECOVERY
2010/10/29 03:46:04 INFO Check 'mysql' on 'eggs' is ok!
2010/10/29 03:46:04 INFO Check 'rep_backlog' on 'eggs' is ok!
2010/10/29 03:46:16 FATAL State of host 'eggs' changed from HARD_OFFLINE to AWAITING_RECOVERY
2010/10/29 03:46:33 ERROR Check 'rep_backlog' on 'eggs' has failed for 18 seconds! Message: ERROR: Timeout
2010/10/29 03:46:33 ERROR Check 'mysql' on 'eggs' has failed for 18 seconds! Message: ERROR: Timeout
2010/10/29 03:46:37 FATAL State of host 'eggs' changed from AWAITING_RECOVERY to HARD_OFFLINE
2010/10/29 03:46:38 INFO Check 'rep_threads' on 'eggs' is ok!
2010/10/29 03:46:38 INFO Check 'mysql' on 'eggs' is ok!
2010/10/29 03:46:38 INFO Check 'rep_backlog' on 'eggs' is ok!
2010/10/29 03:46:40 FATAL State of host 'eggs' changed from HARD_OFFLINE to AWAITING_RECOVERY
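
To make the anomaly easier to spot in longer logs, here is a minimal Python sketch that scans monitor log lines and flags any transition to HARD_OFFLINE that is logged while every check last reported for that host was ok. It is not part of MMM. The function name `find_stale_offline_transitions` and the regular expressions are assumptions based only on the line shapes shown in the log above.

```python
import re

# Patterns are assumptions derived from the log lines in this report.
LINE = re.compile(r"^(\d{4}/\d{2}/\d{2} \d{2}:\d{2}:\d{2}) (\w+) (.*)$")
CHECK_FAIL = re.compile(r"Check '(\w+)' on '(\w+)' has failed")
CHECK_OK = re.compile(r"Check '(\w+)' on '(\w+)' is ok!")
TO_OFFLINE = re.compile(r"State of host '(\w+)' changed from (\w+) to HARD_OFFLINE")


def find_stale_offline_transitions(lines):
    """Return (timestamp, host) for each HARD_OFFLINE transition that was
    logged after the host's checks had all been reported ok again."""
    failing = {}     # host -> set of check names currently reported as failing
    seen_ok = set()  # hosts with at least one "is ok!" line so far
    stale = []
    for raw in lines:
        parsed = LINE.match(raw.strip())
        if parsed is None:
            continue
        timestamp, _level, message = parsed.groups()

        failed = CHECK_FAIL.search(message)
        if failed:
            check, host = failed.groups()
            failing.setdefault(host, set()).add(check)
            continue

        recovered = CHECK_OK.search(message)
        if recovered:
            check, host = recovered.groups()
            failing.setdefault(host, set()).discard(check)
            seen_ok.add(host)
            continue

        offline = TO_OFFLINE.search(message)
        if offline:
            host = offline.group(1)
            # Suspicious only if the log already said the host recovered
            # and no check is currently marked as failing.
            if host in seen_ok and not failing.get(host):
                stale.append((timestamp, host))
    return stale


sample = [
    "2010/10/29 03:44:15 ERROR Check 'mysql' on 'cat' has failed for 46 seconds! Message: ERROR: Timeout",
    "2010/10/29 03:45:32 INFO Check 'mysql' on 'cat' is ok!",
    "2010/10/29 03:45:35 FATAL State of host 'cat' changed from ONLINE to HARD_OFFLINE (ping: OK, mysql: not OK)",
]
print(find_stale_offline_transitions(sample))  # [('2010/10/29 03:45:35', 'cat')]
```

Run against the full log above, this approach would flag only the 03:45:35 transition for 'cat'. The transitions for 'eggs' happen while at least one check is still marked as failing, so they are not flagged.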

Changed in mysql-mmm:
importance: Undecided → High