pt-kill kills prepared statements without checking busy-time
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Percona Toolkit moved to https://jira.percona.com/projects/PT |
Fix Committed
|
Medium
|
Carlos Salguero |
Bug Description
We're using pt-kill with the following options:
pt-kill --busy-time 120s --print --match-info "^(select|SELECT)" --interval 10
The idea is to only kill select queries running longer than 120 seconds.
However there's a problem with the logic that pt-kill uses to kill queries. If the Command is not equal to Query, then busy time is ignored, and other match parameters are free to take effect.
In this case, a prepared statement is in the "Execute" Command state, and running a SELECT in the Info part. It gets killed immediately by pt-kill every time.
The current assumption that only items in the processlist with Command=Query are "busy" But that isn't the case.
I think the right fix is to say that any query which is *not idle* is busy.
That is to change the current busy time check from:
if ( $find_spec{
to
if ( $find_spec{
However, I understand that this isn't a trivial change. The smallest change which would fix this issue is to add Execute to the if statement as well as Query, but that seems like it could leave the script vulnerable to similar issues.
The MySQL documentation in this case seems to agree with my case: that many queries not having Command=Query should actually be considered busy: http://
affects: | perconatools → percona-toolkit |
tags: | added: pt-kill |
Changed in percona-toolkit: | |
status: | New → Triaged |
Changed in percona-toolkit: | |
importance: | Undecided → Medium |
assignee: | nobody → Carlos Salguero (carlos-salguero) |
tags: | added: pt167 |
Changed in percona-toolkit: | |
status: | Triaged → In Progress |
By the way, we can work around this by adding --ignore-command Execute, but it seems the default behavior does not work as intended.