Race in resource tracker causes 500 response on deleting during verify_resize state
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Compute (nova) | Fix Released | Critical | Dan Smith | | |
Bug Description
During a tempest run, the tempest. test occasionally fails when it attempts to delete a server in the verify_resize state. The failure is caused by a 500 response returned from nova. Looking at the nova-api log, this is caused by an rpc call that never receives a response:
Looking at the n-cpu logs for the handling of that rpc call yields:
This looks like it comes from an attempt to update the resource tracker, triggered by the server deletion. However, according to the tempest log, the volume in that failure comes from a different test, in the test class ServerRescueNeg
Full logs for an example run that tripped this are here:
http://
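To make the suspected interleaving easier to reason about, here is a minimal sketch of it. This is not nova code: `Host`, `VolumeGone`, `audit`, `remove_backing` and `api_delete` are hypothetical names, and the sketch assumes the resource update walks every instance on the host rather than only the one being deleted. It also collapses the rpc call that never gets a response into a direct error for brevity.

```python
class VolumeGone(Exception):
    """Raised when a disk recorded for an instance no longer exists."""


class Host:
    """Hypothetical stand-in for a compute host; not nova's resource tracker."""

    def __init__(self):
        self.disks = {}       # instance name -> set of attached disk paths
        self.present = set()  # disk paths that currently exist on the host

    def attach(self, instance, path):
        self.disks.setdefault(instance, set()).add(path)
        self.present.add(path)

    def remove_backing(self, path):
        # Another request removes the volume, but the owning instance's
        # record has not been updated yet: this is the race window.
        self.present.discard(path)

    def audit(self):
        # Host-wide audit: inspects every instance, not just the caller's.
        total = 0
        for instance, paths in self.disks.items():
            for path in paths:
                if path not in self.present:
                    raise VolumeGone(f"{instance}: {path}")
                total += 1
        return total

    def delete_instance(self, instance):
        for path in self.disks.pop(instance, set()):
            self.present.discard(path)
        # Deleting one server triggers the host-wide audit.
        return self.audit()


def api_delete(host, instance):
    # An unexpected backend failure surfaces to the client as a 500.
    try:
        host.delete_instance(instance)
        return 204
    except VolumeGone:
        return 500


host = Host()
host.attach("resize-server", "/dev/vda")
host.attach("rescue-server", "/dev/vdb")
host.remove_backing("/dev/vdb")  # unrelated test tears down its volume
status = api_delete(host, "resize-server")
```

In this sketch the delete of `resize-server` fails only because of state belonging to `rescue-server`, which would match the observation that the volume in the failure belongs to a different test class.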
tags: | added: libvirt |
Changed in nova: | |
milestone: | none → juno-rc1 |
Changed in nova: | |
milestone: | juno-rc1 → none |
Changed in nova: | |
status: | Fix Committed → Fix Released |
Changed in nova: | |
milestone: | juno-rc1 → 2014.2 |
Yeah I was just looking at this coincidentally.