Cleaning may leave nodes locked and require manual intervention to unlock
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Ironic |
Fix Released
|
High
|
Josh Gachnang |
Bug Description
When attempting to clean multiple nodes at the same time, the conductors become sluggish and the logs are filled with errors failing to acquire locks at the beginning of conductor.
With rpc.call(), I was having issues with more than 2 or 3 nodes getting through cleaning (some would make it fine, others would deadlock). With rpc.cast() I got 20 to go through simultaneously without any issues I could see, with each node running 8 steps, each requiring the use of the continue_
As a note, this only occurs in the agent driver currently, but would happen in any driver doing asynchronous cleaning steps.
description: | updated |
description: | updated |
Changed in ironic: | |
importance: | Undecided → High |
milestone: | none → kilo-rc1 |
summary: |
- Cleaning results in deadlocks + Cleaning may leave nodes locked and require manual intervention to + unlock |
Changed in ironic: | |
status: | Fix Committed → Fix Released |
Changed in ironic: | |
milestone: | kilo-rc1 → 2015.1.0 |
Fix proposed to branch: master /review. openstack. org/172582
Review: https:/