IST is not interrupted properly during shutdown

Bug #1176852 reported by Alex Yurchenko
This bug affects 3 people
Affects                          Status      Importance  Assigned to    Milestone
Galera (status tracked in 3.x)
  2.x                            Confirmed   Medium      Teemu Ollakka
  3.x                            Confirmed   Medium      Teemu Ollakka
Percona XtraDB Cluster (moved to https://jira.percona.com/projects/PXC; status tracked in 5.6)
  5.5                            Incomplete  High        Unassigned
  5.6                            Incomplete  High        Unassigned

Bug Description

People report:

May 3 00:46:08 localhost mysqld: 130503 0:46:08 [Note] WSREP: view((empty))
May 3 00:46:08 localhost mysqld: 130503 0:46:08 [Note] WSREP: New COMPONENT: primary = no, bootstrap = no, my_idx = 0, memb_num = 1
May 3 00:46:08 localhost mysqld: 130503 0:46:08 [Note] WSREP: gcomm: closed
May 3 00:46:08 localhost mysqld: 130503 0:46:08 [Note] WSREP: Flow-control interval: [64, 64]
May 3 00:46:08 localhost mysqld: 130503 0:46:08 [Note] WSREP: Received NON-PRIMARY.
May 3 00:46:08 localhost mysqld: 130503 0:46:08 [Note] WSREP: Shifting JOINER -> OPEN (TO: 4095760)
May 3 00:46:08 localhost mysqld: 130503 0:46:08 [Note] WSREP: Received self-leave message.
May 3 00:46:08 localhost mysqld: 130503 0:46:08 [Note] WSREP: Flow-control interval: [64, 64]
May 3 00:46:08 localhost mysqld: 130503 0:46:08 [Note] WSREP: Received SELF-LEAVE. Closing connection.
May 3 00:46:08 localhost mysqld: 130503 0:46:08 [Note] WSREP: Shifting OPEN -> CLOSED (TO: 4095760)
May 3 00:46:08 localhost mysqld: 130503 0:46:08 [Note] WSREP: RECV thread exiting 0: Success
May 3 00:46:08 localhost mysqld: 130503 0:46:08 [Note] WSREP: recv_thread() joined.
May 3 00:46:08 localhost mysqld: 130503 0:46:08 [Note] WSREP: Closing slave action queue.
May 3 02:19:11 localhost mysqld: 130503 2:19:11 [Note] WSREP: IST received: 92f5a26e-aeaf-11e2-0800-fcc64d722c65:4093185
May 3 02:19:11 localhost mysqld: 130503 2:19:11 [ERROR] WSREP: gcs/src/gcs.c:_join():800: Sending JOIN failed: -103 (Software caused connection abort).
May 3 02:19:11 localhost mysqld: 130503 2:19:11 [ERROR] WSREP: Failed to JOIN the cluster after SST
May 3 02:19:12 localhost mysqld: 130503 2:19:12 [Warning] WSREP: Failed to report last committed 4094167, -77 (File descriptor in bad state)
May 3 02:19:14 localhost mysqld: 130503 2:19:14 [Warning] WSREP: Failed to report last committed 4095153, -77 (File descriptor in bad state)

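The node won't shut down until IST is over, which in this case took about 1.5 hours (the connection was closed at 00:46:08, but "IST received" was not logged until 02:19:11). IST should be easily interruptible, and the node should be able to shut down cleanly.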
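
To illustrate what "easily interruptible" could mean here, the following is a minimal sketch and not Galera's actual IST code. It assumes a receiver thread that applies incoming write-sets from a queue; the names `receive_ist`, `events`, `stop`, and `applied` are hypothetical. The idea is that the receiver never blocks longer than a short poll interval, so a shutdown request is noticed even if the sender stalls.

```python
import queue
import threading
import time


def receive_ist(events, stop, applied, poll_interval=0.05):
    """Apply queued write-sets until the sentinel None arrives or stop is set.

    Returns True if the transfer completed, False if it was interrupted.
    All names here are hypothetical; this is not Galera's API.
    """
    while not stop.is_set():
        try:
            # Bounded wait: never block longer than poll_interval,
            # so a shutdown request is noticed promptly.
            seqno = events.get(timeout=poll_interval)
        except queue.Empty:
            continue
        if seqno is None:  # sentinel: the sender has finished
            return True
        applied.append(seqno)
    return False


def demo():
    """Simulate a sender that stalls mid-transfer, then request shutdown."""
    events = queue.Queue()
    stop = threading.Event()
    applied = []
    result = {}

    worker = threading.Thread(
        target=lambda: result.update(completed=receive_ist(events, stop, applied))
    )
    worker.start()

    # The sender delivers a few write-sets and then goes silent.
    for seqno in (1, 2, 3):
        events.put(seqno)

    # Wait (with a deadline) until the receiver has applied them.
    deadline = time.monotonic() + 2.0
    while len(applied) < 3 and time.monotonic() < deadline:
        time.sleep(0.01)

    stop.set()                 # shutdown request
    worker.join(timeout=2.0)   # should return almost immediately
    return result.get("completed"), applied, worker.is_alive()
```

Calling `demo()` shows the receiver exiting shortly after `stop` is set, rather than waiting for the transfer to finish. A real implementation would also need to interrupt a blocking socket read, for example with a receive timeout or by closing the socket from the shutdown path.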

Changed in galera:
assignee: nobody → Teemu Ollakka (teemu-ollakka)
status: New → Confirmed
Changed in galera:
milestone: 23.2.6 → 23.2.7
Changed in galera:
milestone: 23.2.7 → 24.2.8
Changed in galera:
milestone: 25.2.8 → 25.2.9
Changed in galera:
milestone: 25.2.9 → 25.2.10
Krunal Bauskar (krunal-bauskar) wrote :

Can you help explain the TCs?

Shahriyar Rzayev (rzayev-sehriyar) wrote :

Percona now uses JIRA for bug reports, so this bug report has been migrated to: https://jira.percona.com/browse/PXC-962
