Cluster stalls while distributing transaction

Bug #1197771 reported by Dmitry Gribov on 2013-07-04
This bug affects 5 people
Affects: Percona XtraDB Cluster
Importance: Medium
Assigned to: Unassigned

Bug Description

We have a 3-node cluster, 5.5.31 for Linux on x86_64 (Source distribution, wsrep_23.7.5.rXXXX), just updated.

And we have quite a simple sequence of SQL. After executing it (in fact, after the commit) on node A, nodes B and C stall in ~2 seconds; node A takes about 6 seconds to stall, and then the whole cluster stops executing queries.
On each daemon we can see several "wsrep in pre-commit stage" processes (writing queries) with the longest times, and many "statistics", "Sending data" and "Copying to tmp table" processes. These look like pure selects, no updates.

Needless to say, they all just sit in the processlist without any movement for at least 5 minutes. New connections hang as well.

In the previous release, 5.5.30, most of the queries hung in the "Query end" state; in the new version there is significant diversity in the states, but the server still hangs just as thoroughly.

I attach a database dump and the exact SQL we execute (dumped from the program). Here is the "show status like '%wsrep%';" output:
+----------------------------+-------------------------------------------------------------+
| Variable_name | Value |
+----------------------------+-------------------------------------------------------------+
| wsrep_local_state_uuid | e25852ac-d618-11e2-0800-6c53c8061134 |
| wsrep_protocol_version | 4 |
| wsrep_last_committed | 747713579 |
| wsrep_replicated | 6543 |
| wsrep_replicated_bytes | 3720579 |
| wsrep_received | 125479 |
| wsrep_received_bytes | 137181653 |
| wsrep_local_commits | 6542 |
| wsrep_local_cert_failures | 0 |
| wsrep_local_bf_aborts | 6 |
| wsrep_local_replays | 0 |
| wsrep_local_send_queue | 0 |
| wsrep_local_send_queue_avg | 0.223533 |
| wsrep_local_recv_queue | 0 |
| wsrep_local_recv_queue_avg | 440.600028 |
| wsrep_flow_control_paused | 0.000000 |
| wsrep_flow_control_sent | 0 |
| wsrep_flow_control_recv | 0 |
| wsrep_cert_deps_distance | 105.620690 |
| wsrep_apply_oooe | 0.203314 |
| wsrep_apply_oool | 0.003034 |
| wsrep_apply_window | 42.035215 |
| wsrep_commit_oooe | 0.000000 |
| wsrep_commit_oool | 0.000102 |
| wsrep_commit_window | 36.303709 |
| wsrep_local_state | 4 |
| wsrep_local_state_comment | Synced |
| wsrep_cert_index_size | 79 |
| wsrep_causal_reads | 0 |
| wsrep_incoming_addresses | 192.168.100.65:8661,192.168.100.66:8661,192.168.100.67:8661 |
| wsrep_cluster_conf_id | 11 |
| wsrep_cluster_size | 3 |
| wsrep_cluster_state_uuid | e25852ac-d618-11e2-0800-6c53c8061134 |
| wsrep_cluster_status | Primary |
| wsrep_connected | ON |
| wsrep_local_index | 0 |
| wsrep_provider_name | Galera |
| wsrep_provider_vendor | Codership Oy <email address hidden> |
| wsrep_provider_version | 23.2.2rc2(r136) |
| wsrep_ready | ON |
+----------------------------+-------------------------------------------------------------+
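One detail worth flagging in the status above: wsrep_local_recv_queue_avg is 440.6, i.e. received write-sets were on average queuing far faster than the appliers drained them, which is consistent with the slave nodes stalling first. A minimal sketch of such a check (not from the report; the thresholds 1.0 and 0.1 are arbitrary examples, and in practice you would pipe `mysql -N -e "SHOW GLOBAL STATUS LIKE 'wsrep%'"` into it instead of the inline sample data):

```shell
# Flag wsrep status values that usually indicate applier lag.
# The sample data below reproduces two lines from the report's output.
check_wsrep() {
  awk '
    $1 == "wsrep_local_recv_queue_avg" && $2 + 0 > 1.0 {
      print "WARN: recv queue avg " $2 " -> appliers falling behind"
    }
    $1 == "wsrep_flow_control_paused" && $2 + 0 > 0.1 {
      print "WARN: flow control active " $2 " of the time"
    }
  '
}

printf '%s\t%s\n' \
  wsrep_local_recv_queue_avg 440.600028 \
  wsrep_flow_control_paused 0.000000 | check_wsrep
# prints: WARN: recv queue avg 440.600028 -> appliers falling behind
```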

PS: The hardware is more than adequate; there is no chance the issue depends on IO or anything similar. Comparable tasks usually complete instantly or nearly so.

Dmitry Gribov (grib-d) wrote :

Sorry, it looks like I made the dump and the SQL log from different databases.
Here is the right dump with the matching killer SQL (double-checked this time).

Alex Yurchenko (ayurchen) wrote :

Dmitry,
1) You're saying connections hang for at least 5 minutes. Does that mean they "unhang" eventually, or do you just not wait that long?
2) Have you noticed that you're using a terribly old Galera version, 2.2rc2? The latest one (2.6) fixes one case of software deadlock, among other things. So I'd suggest you upgrade that too and report whether you're still seeing this.

Dmitry Gribov (grib-d) wrote :

1. The first time we hit the issue, connections hung for about a minute, then the cluster unhung, yes. After a certain point (the number of inserts grows from day to day) it lasted 5 minutes or longer several times, and we had to restart nodes B and/or C; this releases the lock somehow. The problem happens on the production server, so unfortunately we can't wait an hour to see whether the hang would end on its own. I guess that after some time the server would recover.
2. Every Galera upgrade we have tested smashed the system to dust one way or another, so we stopped trying at some point. We could try playing with it if you really believe it may be involved.

Dmitry Gribov (grib-d) wrote :

Tried Galera 2.6 in the test environment; it seems to work. Tomorrow we will try it with the subject issue.

Dmitry Gribov (grib-d) wrote :

Updated; nothing changed: the cluster still hangs.
Assuming MySQL finishes the update transactions during a normal shutdown, I believe the "hang" would last ~6 minutes and then end by itself. And when I say "the number of inserts grows from day to day", I mean it is NOT growing fast: say, when it worked fine the volume was 100%; with a 1-minute hang it was 300%, and with the 6-minute hang it is about 330% of the base. There is some exponential dependency.

After the restart, both nodes (B, C) rejoined the cluster with a normal IST.
In the log I see this:

130705 17:02:03 [Note] /usr/local/mysql/bin/mysqld: Normal shutdown

130705 17:02:03 [Note] WSREP: Stop replication
130705 17:02:03 [Note] WSREP: Closing send monitor...
130705 17:02:03 [Note] WSREP: Closed send monitor.
130705 17:02:03 [Note] WSREP: gcomm: terminating thread
130705 17:02:03 [Note] WSREP: gcomm: joining thread
130705 17:02:03 [Note] WSREP: gcomm: closing backend
130705 17:02:03 [Note] WSREP: view(view_id(NON_PRIM,1f8117dc-e56e-11e2-b18b-be707def1a41,17) memb {
        46f7b72e-e56e-11e2-93b4-eb7ea19e049f,
} joined {
} left {
} partitioned {
        1f8117dc-e56e-11e2-b18b-be707def1a41,
        77acdf64-e56e-11e2-af2b-4771634ab74b,
})
130705 17:02:03 [Note] WSREP: view((empty))
130705 17:02:03 [Note] WSREP: New COMPONENT: primary = no, bootstrap = no, my_idx = 0, memb_num = 1
130705 17:02:03 [Note] WSREP: gcomm: closed
130705 17:02:03 [Note] WSREP: Flow-control interval: [16, 16]
130705 17:02:03 [Note] WSREP: Received NON-PRIMARY.
130705 17:02:03 [Note] WSREP: Shifting SYNCED -> OPEN (TO: 753333185)
130705 17:02:03 [Note] WSREP: Received self-leave message.
130705 17:02:03 [Note] WSREP: Flow-control interval: [0, 0]
130705 17:02:03 [Note] WSREP: Received SELF-LEAVE. Closing connection.
130705 17:02:03 [Note] WSREP: Shifting OPEN -> CLOSED (TO: 753333185)
130705 17:02:03 [Note] WSREP: RECV thread exiting 0: Success
130705 17:02:03 [Note] WSREP: recv_thread() joined.
130705 17:02:03 [Note] WSREP: Closing replication queue.
130705 17:02:03 [Note] WSREP: Closing slave action queue.
130705 17:02:03 [ERROR] /usr/local/mysql/bin/mysqld: Sort aborted: Server shutdown in progress
130705 17:02:05 [Note] WSREP: killing local connection: 6953
130705 17:02:05 [Note] WSREP: killing local connection: 6952
--//--
130705 17:02:05 [Note] WSREP: killing local connection: 402
130705 17:02:05 [Note] WSREP: killing local connection: 346
130705 17:02:05 [Note] WSREP: killing local connection: 258
130705 17:08:00 [Note] WSREP: New cluster view: global state: e25852ac-d618-11e2-0800-6c53c8061134:753333185, view# -1: non-Primary, number of nodes: 1, my index: 0, protocol version 2
130705 17:08:00 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
130705 17:08:00 [Note] WSREP: New cluster view: global state: e25852ac-d618-11e2-0800-6c53c8061134:753333185, view# -1: non-Primary, number of nodes: 0, my index: -1, protocol version 2
130705 17:08:00 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
130705 17:08:00 [Note] WSREP: applier thread exiting (code:0)
130705 17:08:00 [Note] WSREP: applier thread exiting (code:5)
130705 17:08:00 [Note] WSREP: applier thread exiting (code:5)
130705 17:08:...


Dmitry Gribov (grib-d) wrote :

s/I mean it is now growing FAST./I mean it is _NOT_ growing FAST/

There is no way to edit or to preview; sorry for my spelling.

Dmitry Gribov (grib-d) wrote :

The issue is still here :(

@Dmitry,

Several stall-related issues have been fixed in PXC 5.5.33. It has not been released yet, but it has been pushed to the Percona experimental repo. Can you please test it and give us your feedback?

Dmitry Gribov (grib-d) wrote :

The issue is still here: the time to apply a transaction on the master and on the slaves differs drastically, and this "distribution" time grows exponentially.
The lock happens after a successful commit on the master node.

Alex Yurchenko (ayurchen) wrote :

Dmitry, I can't seem to find any mention of system limits here, so just in case I'm asking what the following report, first of all on the slaves:
- cat /proc/$(pidof mysqld)/limits | grep "open files"
- cat /proc/sys/vm/dirty_ratio
- cat /sys/block/<your data drive>/queue/scheduler

Dmitry Gribov (grib-d) wrote :

# cat /proc/$(pidof mysqld)/limits | grep "open files"
Max open files 268155 268155 files
# cat /proc/sys/vm/dirty_ratio
20
# cat /sys/block/sda/queue/scheduler
noop deadline [cfq]

Dmitry Gribov (grib-d) wrote :

PS: All servers are identical; "master" and "slave" only denote roles in a transaction. Any server may start a transaction (perform as master) and lock down the cluster.

Daniel Ylitalo (danigl) wrote :

I believe I'm hitting this bug too.

Although I'm running 5.5.33 as follows:

mysql> show status like '%wsrep%';
+----------------------------+-------------------------------------------+
| Variable_name | Value |
+----------------------------+-------------------------------------------+
| wsrep_local_state_uuid | 6296f7ce-0067-11e3-8c0b-87899827493a |
| wsrep_protocol_version | 4 |
| wsrep_last_committed | 1142049610 |
| wsrep_replicated | 197311 |
| wsrep_replicated_bytes | 148631191 |
| wsrep_received | 412172 |
| wsrep_received_bytes | 172662669 |
| wsrep_local_commits | 197308 |
| wsrep_local_cert_failures | 3 |
| wsrep_local_bf_aborts | 0 |
| wsrep_local_replays | 0 |
| wsrep_local_send_queue | 0 |
| wsrep_local_send_queue_avg | 0.000000 |
| wsrep_local_recv_queue | 0 |
| wsrep_local_recv_queue_avg | 0.022472 |
| wsrep_flow_control_paused | 0.000000 |
| wsrep_flow_control_sent | 0 |
| wsrep_flow_control_recv | 0 |
| wsrep_cert_deps_distance | 378.693525 |
| wsrep_apply_oooe | 0.083333 |
| wsrep_apply_oool | 0.000000 |
| wsrep_apply_window | 1.089744 |
| wsrep_commit_oooe | 0.000000 |
| wsrep_commit_oool | 0.000000 |
| wsrep_commit_window | 1.038462 |
| wsrep_local_state | 4 |
| wsrep_local_state_comment | Synced |
| wsrep_cert_index_size | 637 |
| wsrep_causal_reads | 0 |
| wsrep_incoming_addresses | 10.0.8.8:3306,10.0.8.9:3306,10.0.8.6:3306 |
| wsrep_cluster_conf_id | 9 |
| wsrep_cluster_size | 3 |
| wsrep_cluster_state_uuid | 6296f7ce-0067-11e3-8c0b-87899827493a |
| wsrep_cluster_status | Primary |
| wsrep_connected | ON |
| wsrep_local_index | 2 |
| wsrep_provider_name | Galera |
| wsrep_provider_vendor | Codership Oy <email address hidden> |
| wsrep_pro...


Alex Yurchenko (ayurchen) wrote :

Dmitry, the open-files limit must be fine, but the other two parameters are not.
IIRC, dirty_ratio=20 means that the kernel may use up to 20% of RAM for caching writes, and the CFQ scheduler is not recommended for database loads; use deadline. Not sure whether fixing those will help, but it's worth a try.
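For reference, applying those two suggestions looks roughly like this (a sketch, run as root; `sda` and the dirty_ratio value of 5 are example choices, not values prescribed in this thread):

```shell
# Switch the data drive's IO scheduler from cfq to deadline
echo deadline > /sys/block/sda/queue/scheduler

# Cap the dirty page write-back cache at 5% of RAM (example value)
sysctl -w vm.dirty_ratio=5

# To persist across reboots: put "vm.dirty_ratio = 5" in /etc/sysctl.conf
# and add "elevator=deadline" to the kernel boot parameters.
```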

Alex Yurchenko (ayurchen) wrote :

Daniel, the first thing to check is the presence of any table without a primary key. ROW-format DELETEs on such tables are notoriously slow. Yes, THAT slow: they do full table scans.
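A query along these lines can list such tables up front (a sketch built on the standard information_schema views, not something given in this thread); feed it to the server with `mysql -N -e "$Q"`:

```shell
# Q selects base tables that lack a PRIMARY KEY constraint, skipping
# the system schemas.
Q="SELECT t.table_schema, t.table_name
   FROM information_schema.tables t
   LEFT JOIN information_schema.table_constraints c
          ON  c.table_schema    = t.table_schema
         AND  c.table_name      = t.table_name
         AND  c.constraint_type = 'PRIMARY KEY'
   WHERE t.table_type = 'BASE TABLE'
     AND t.table_schema NOT IN ('mysql', 'information_schema', 'performance_schema')
     AND c.constraint_name IS NULL"
echo "$Q"   # inspect, or run: mysql -N -e "$Q"
```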

Daniel Ylitalo (danigl) wrote :

Alex, thank you!

That seems like a probable explanation, as the job that hangs the cluster does indeed try to delete rows from a table without a primary key.

If I may suggest a feature, a config option: something like "wsrep_deny_delete_without_index", which would reject DELETE queries against tables without a primary key.

Dmitry Gribov (grib-d) wrote :

> Not sure if fixing those will help, but worth a try.
We will check this, but if the problem were IO, I'd expect that a) the master would slow down exponentially as well, while it only slows linearly, and b) it would noticeably affect lots of other cases. Instead, only a very rare query is capable of producing this kind of hang. There is some hidden trigger; it is not just raw slowness. In general our load is only 15% CPU and 7% IO; everything is nice and fast.

Also, I believed that MySQL and its forks force direct IO on their file handles on Linux anyway, so with InnoDB I can safely ignore default IO caching, as MySQL will forcibly bypass it. Am I wrong?

PPS: One more (probably) important thing: we use a RAM disk for the MySQL temporary directory.

Alex Yurchenko (ayurchen) wrote :

> pps. One more (probably) important thing: we use ram disk for MySQL temporary drive.

Are you sure in this case that you're not simply running out of memory? Do you have swap enabled?

Alex Yurchenko (ayurchen) wrote :

> So with InnoDB I can safely ignore default IO caching as MySQL will forcibly bypass it. Am I wrong?

With InnoDB perhaps you can, but with the Galera cache file you can't. And to my understanding the scheduler affects non-cached IO as well.

Dmitry Gribov (grib-d) wrote :

No, there is enough memory/space. And yes, we will check the IO options.

Also, the latest release stalls (in addition to stalls on some big transactions) with a different error, and this stall looks "total": it is locked forever, and the error log just writes "BF lock wait long" + "INNODB MONITOR OUTPUT" over and over again.

Perhaps I should file a new ticket for this, but I'm still not sure how to reproduce it, or whether it is a separate bug or all about the same one.
_Perhaps_ this is caused by small but highly concurrent requests on several nodes, like "REPLACE INTO catalit_bookmark_locks ( user_id, art_id, lock_date, lock_id ) VALUES ( ?, ?, NOW(), UUID() )", where the table looks like this:
CREATE TABLE `catalit_bookmark_locks` (
  `user_id` int(10) unsigned NOT NULL,
  `art_id` varchar(100) NOT NULL DEFAULT '',
  `lock_date` datetime NOT NULL,
  `lock_id` varchar(36) NOT NULL,
  PRIMARY KEY (`user_id`,`art_id`),
  CONSTRAINT `catalit_bookmark_locks_ibfk_1` FOREIGN KEY (`user_id`) REFERENCES `users` (`id`) ON DELETE CASCADE
) ENGINE=InnoDB DEFAULT CHARSET=utf8

@Dmitry,

 >>_Perhaps_ this is caused by small, but highly-concurrent requests on several nodes like "REPLACE INTO catalit_bookmark_locks ( user_id, art_id, lock_date, lock_id ) VALUES ( ?, ?, NOW(), UUID() )" where table looks like this:

Yes, from the fragment of the error it looks like you are seeing conflicts.

------------------------
LATEST DETECTED DEADLOCK
------------------------
131008 21:17:23
*** (1) TRANSACTION:
TRANSACTION 130A88E499, ACTIVE 0 sec starting index read
mysql tables in use 1, locked 1
LOCK WAIT 2 lock struct(s), heap size 376, 1 row lock(s)
MySQL thread id 224217, OS thread handle 0x7fe1cba0e700, query id 201781327 192.168.100.31 fbhub Updating
UPDATE users SET
                                utc_offset = IF(NULL IS NULL, utc_offset, NULL),
                                last_used=NOW(),
                                last_host_id = if(last_host_id = 19,19,'1735')
                        WHERE id='18700125' and last_used < now() - interval 30 minute
*** (1) WAITING FOR THIS LOCK TO BE GRANTED:
RECORD LOCKS space id 0 page no 6423238 n bits 136 index `PRIMARY` of table `lib_area_100`.`users` trx id 130A88E499 lock_mode X locks rec but not gap waiting
*** (2) TRANSACTION:
TRANSACTION 130A88DEA3, ACTIVE 1 sec starting index read, thread declared inside InnoDB 500
mysql tables in use 1, locked 1
8 lock struct(s), heap size 1248, 3 row lock(s), undo log entries 1
MySQL thread id 225033, OS thread handle 0x7fe1db71c700, query id 201781913 192.168.100.31 fbhub Updating
UPDATE users SET
                                s_subscr_text_authors = '<87>\r\n',
                                subscr_type = IF(subscr_type = 1, 1, 2),
                                subscribe_new_buys = 1
                        WHERE id = '18700125'
*** (2) HOLDS THE LOCK(S):
RECORD LOCKS space id 0 page no 6423238 n bits 136 index `PRIMARY` of table `lib_area_100`.`users` trx id 130A88DEA3 lock mode S locks rec but not gap
*** (2) WAITING FOR THIS LOCK TO BE GRANTED:
RECORD LOCKS space id 0 page no 6423238 n bits 136 index `PRIMARY` of table `lib_area_100`.`users` trx id 130A88DEA3 lock_mode X locks rec but not gap waiting
*** WE ROLL BACK TRANSACTION (1)

============================================================================================

As you can see, both conflicting statements refer to the same id for update.
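Since InnoDB resolves the conflict by rolling one transaction back ("WE ROLL BACK TRANSACTION (1)" above), and Galera brute-force aborts surface to the client the same way, the usual client-side remedy is to retry the victim. A minimal retry wrapper as a sketch (the attempt count and one-second back-off are arbitrary examples, not values from this thread):

```shell
# retry N cmd [args...] : rerun cmd up to N times; return 0 on the first
# success, 1 if every attempt fails (e.g. a deadlock-rollback victim).
retry() {
  attempts=$1; shift
  i=1
  while true; do
    if "$@"; then
      return 0               # success: stop retrying
    fi
    if [ "$i" -ge "$attempts" ]; then
      return 1               # out of attempts
    fi
    i=$((i + 1))
    sleep 1                  # brief back-off between tries
  done
}

# Hypothetical usage:
#   retry 3 mysql -e "UPDATE users SET last_used = NOW() WHERE id = 18700125"
```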

Dmitry Gribov (grib-d) wrote :

So what do we do about that? New bug report?

@Dmitry,

Yes please.

Dmitry Gribov (grib-d) wrote :

We have now had a stall without "BF lock wait long", just as described in the original report: a stall without any visible symptoms in the log file. I'll assume there are at least two bugs, yes.

I filed ticket #1243156 about the conflicting statements. I hope it's clear enough; I'll add more details to it when (if) the problem happens again. Since we trimmed down our transactions, all kinds of stalls happen quite seldom now: once a week instead of once a day.

I now have pt-stalk output, containing some private data, captured during the stall. I can put it into the issue tracker on customers.percona.com or send it by e-mail.

Dmitry Gribov (grib-d) wrote :

In PXC 5.5.34 with Galera 23.2.8 the subject problem is not reproduced. Nor have we seen "BF lock wait long" for some time; hopefully it is gone as well.

There are still some stall cases with updates on tables with no primary key; Daniel hit that as well. I believe this should be handled somehow too, at least by logging warnings in detailed logging modes when such updates are made.

But all this has nothing to do with the subject issue anyway.

@Dmitry,

Thanks for the confirmation. Yes, having tables without a primary key can stress performance. Anyway, closing this issue for now.

Changed in percona-xtradb-cluster:
status: New → Fix Committed
Srand (cyril-scetbon) wrote :

We have the same issue. I'll let you know ASAP whether using the new version helps us avoid it.

Dmitry Yu Okunev (dyokunev) wrote :

I have a similar issue with different versions of Galera/MySQL and OS:
The working node has a lot of "wsrep in pre-commit stage" processes (without any CPU/HDD/network load), but the receiving node has a process in the "System lock" state with 100% CPU load.

I tried attaching gdb at the moment of the problem and got this backtrace:

#0 0x00007fa4c1bf34a3 in poll () from /lib/x86_64-linux-gnu/libc.so.6
#1 0x00000000005f92b2 in handle_connections_sockets () at /mnt/workspace/percona-xtradb-cluster-5.6-debs/label_exp/debian-wheezy-x64/target/Percona-XtraDB-Cluster-5.6.15/sql/mysqld.cc:7099
#2 0x00000000006001f6 in mysqld_main (argc=68, argv=0x137f028) at /mnt/workspace/percona-xtradb-cluster-5.6-debs/label_exp/debian-wheezy-x64/target/Percona-XtraDB-Cluster-5.6.15/sql/mysqld.cc:6494
#3 0x00007fa4c1b40ead in __libc_start_main () from /lib/x86_64-linux-gnu/libc.so.6
#4 0x00000000005f30fd in _start ()

I'll try to recompile without HAVE_POLL now.

Dmitry Yu Okunev (dyokunev) wrote :

As expected, the same problem with select() instead of poll() :)

Srand (cyril-scetbon) wrote :

Damn it, no workaround for now :(

Dmitry Gribov (grib-d) wrote :

We have set up pt-stalk to catch "BF lock wait long" in the log and restart the affected mysqld; that worked nicely as a workaround (except that "BF lock wait long" only shows up after the cluster has already been stopped for some time).

Dmitry Gribov (grib-d) wrote :

The "BF lock wait long" stall has been reproduced twice since then, so I suppose either it was not solved completely or there is another cause for it. It is now so rare that I have no idea how to find the reason (but we should have pt-stalk output for these cases).

Dmitry Gribov (grib-d) wrote :

Noted: there used to be many "BF lock wait long" messages before; now there is a single one in the log, then the node stalls, just as it did before. We restart it and all is fine. It happens less often, but it still happens.
And I feel I will soon be able to guess which query triggers it.

Dmitry Gribov (grib-d) wrote :

PS: I have never seen "BF lock wait long" in the log without the cluster hanging. Perhaps these waiting transactions should simply be handled as deadlocks, with an error returned to the application?

Things have improved in the meantime, as the related issues are now fixed.
Please upgrade to 5.6 for a better experience.
I will close this issue for now.

Changed in percona-xtradb-cluster:
importance: Undecided → Medium
Rodrigo (rodri-bernardo) wrote :

I have this issue in Percona 5.6. Is there a new bug report?
