Cells: Race between instance 'unlock' and 'stop' can cause 'stop' to fail
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Compute (nova) |
Won't Fix
|
Low
|
Unassigned |
Bug Description
Observed in the tempest-dsvm-cells job during tempest.
The test locks an instance, attempts to stop it, makes sure that fails, unlocks it, attempts to stop it, and makes sure that succeeds.
The problem happens during the succession of actions "unlock" and "stop". The "unlock" does an instance.save() of the locked state at the top cell which will sync to the child. If the "stop" request reaches the child cell before the instance.save() state locked = False syncs to the child cell, the "stop" will fail with the following trace in screen-
2015-06-18 19:09:23.852 ERROR nova.cells.
2015-06-18 19:09:23.852 16161 ERROR nova.cells.
2015-06-18 19:09:23.852 16161 ERROR nova.cells.
2015-06-18 19:09:23.852 16161 ERROR nova.cells.
2015-06-18 19:09:23.852 16161 ERROR nova.cells.
2015-06-18 19:09:23.852 16161 ERROR nova.cells.
2015-06-18 19:09:23.852 16161 ERROR nova.cells.
2015-06-18 19:09:23.852 16161 ERROR nova.cells.
2015-06-18 19:09:23.852 16161 ERROR nova.cells.
2015-06-18 19:09:23.852 16161 ERROR nova.cells.
2015-06-18 19:09:23.852 16161 ERROR nova.cells.
2015-06-18 19:09:23.852 16161 ERROR nova.cells.
2015-06-18 19:09:23.852 16161 ERROR nova.cells.
2015-06-18 19:09:23.852 16161 ERROR nova.cells.
Logstash query: message:
tags: | added: liberty-backport-potential |
Changed in nova: | |
status: | Confirmed → Won't Fix |
tags: | removed: liberty-backport-potential |
Could it be related to https:/ /bugs.launchpad .net/tempest/ +bug/1468623 ?