Reset deployment task (and maybe some others) sometimes hangs
Bug #1529613 reported by
Vitaly Kramskikh
This bug report is a duplicate of:
Bug #1549750: Async tasks (cluster deletion, resetting, snapshot generation) sometimes fail.
Edit
Remove
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Confirmed
|
Medium
|
Fuel Python (Deprecated) | ||
8.0.x |
Won't Fix
|
Medium
|
Fuel Python (Deprecated) | ||
Mitaka |
Confirmed
|
Medium
|
Fuel Python (Deprecated) |
Bug Description
Please see the logs and artifacts here:
https:/
This bug started to occur ~3 weeks ago and lead to hanging of cluster reset (and maybe deployment, but I'm not sure if it wasn't random failure of UI tests - we didn't have nailgun logs that time) during UI functional tests. After adding nailgun logs in case of UI test failure it seems there are deadlocks causing this:
Possible deadlock found: Possible deadlock found while attempting to lock table: 'tasks'. Lock transition is not allowed: clusters, nodes, tasks.
Changed in fuel: | |
status: | New → Confirmed |
tags: | added: team-bugfix |
Changed in fuel: | |
assignee: | Fuel Python Team (fuel-python) → Sergey Slipushenko (sslypushenko) |
Changed in fuel: | |
status: | Confirmed → In Progress |
Changed in fuel: | |
milestone: | 8.0 → 9.0 |
Changed in fuel: | |
assignee: | Sergey Slipushenko (sslypushenko) → nobody |
assignee: | nobody → Fuel Python Team (fuel-python) |
status: | In Progress → Confirmed |
tags: |
added: team-bugfix removed: tech-debt |
tags: |
added: tech-debt removed: team-bugfix |
To post a comment you must log in.
@Vitaly there are no DB deadlocks in the console logs. Only warnings from the deadlock detector. Warnings caused by differences between actual locking order in the code and allowed by detector. It should be fixed, but it is not the High bug. On the production environment we have disabled detector.
When DB deadlock occurs we see ShareLock exception from PostgreSQL in the logs. Have you an example of failed tests with ShareLock exception?