gate-tempest-dsvm-large-ops fails with RPC MessagingTimeout in _try_deallocate_network
| Affects | Status | Importance | Assigned to | Milestone | |
|---|---|---|---|---|---|
| | OpenStack Compute (nova) |
Critical
|
Matt Riedemann | ||
| | PBR |
Undecided
|
Monty Taylor | ||
Bug Description
61 hits in 7 days, check and gate, all failures.
2015-01-27 10:55:34.025 ERROR nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
2015-01-27 10:55:34.025 32626 TRACE nova.compute.
This job uses nova-network, it's doing an rpc call on the network deallocate so if something is holding a lock in the same area would could be timing out that way, need to check the code to see what's all using locks at the same time in these flows.
| tags: | added: network |
| Changed in nova: | |
| status: | New → Confirmed |
| Matt Riedemann (mriedem) wrote : | #1 |
| Joe Gordon (jogo) wrote : | #2 |
2015-01-27 22:24:46.812 DEBUG oslo_concurrenc
| Changed in nova: | |
| importance: | Undecided → Critical |
| Matt Riedemann (mriedem) wrote : | #3 |
This might be suspect:
https:/
| Joe Gordon (jogo) wrote : | #4 |
This bug started on the 19th
Fix proposed to branch: master
Review: https:/
| Changed in nova: | |
| assignee: | nobody → Matt Riedemann (mriedem) |
| status: | Confirmed → In Progress |
| Joe Gordon (jogo) wrote : | #6 |
Based on logs from before this patch was introduced http://
We are holding the lock for the same amount of time, but waiting longer to acquire it.
| Joe Gordon (jogo) wrote : | #7 |
Contrary to my previous comment:
average time the lock was held with this issue:
Average: 0.442208
And before:
Average: 0.261565
| Joe Gordon (jogo) wrote : | #8 |
grep "restart_dhcp\" \:\: held" screen-
| Joe Gordon (jogo) wrote : | #9 |
After further investigation, it looks like https:/
| Davanum Srinivas (DIMS) (dims-v) wrote : | #10 |
Here's an old one from Nov
stack@bigblue:~$ cat /usr/local/
#!/usr/bin/python
# PBR Generated from u'console_scripts'
import sys
from oslo.rootwrap.cmd import main
if __name__ == "__main__":
sys.
stack@bigblue:~$
stack@bigblue:~$ ls -altr /usr/local/
-rwxr-xr-x 1 root root 156 Nov 12 13:21 /usr/local/
| Joe Gordon (jogo) wrote : | #11 |
It looks like this may be due to a setuptools change
| Joe Gordon (jogo) wrote : | #12 |
current rootwrap:
#!/usr/bin/python
# EASY-INSTALL-
__requires__ = 'nova==
import sys
from pkg_resources import load_entry_point
if __name__ == '__main__':
sys.exit(
)
| Joe Gordon (jogo) wrote : | #13 |
looks like this is the issue:
https:/
pip install -e hg+https:/
from python-novaclient:
python setup.py develop && cat /usr/local/bin/nova
| Davanum Srinivas (DIMS) (dims-v) wrote : | #14 |
| Joe Gordon (jogo) wrote : | #15 |
So it looks like setuptools patch https:/
Related fix proposed to branch: master
Review: https:/
Change abandoned by Matt Riedemann (<email address hidden>) on branch: master
Review: https:/
Reason: This is supposed to fix it:
| Changed in pbr: | |
| status: | New → In Progress |
| Changed in nova: | |
| status: | In Progress → Invalid |
| Changed in pbr: | |
| assignee: | nobody → Monty Taylor (mordred) |
Reviewed: https:/
Committed: https:/
Submitter: Jenkins
Branch: master
commit 10c510ec922792c
Author: Chet Burgess <email address hidden>
Date: Wed Jan 28 10:20:24 2015 -0800
Use a workarounds option to disable rootwrap
Create a workarounds option to disable rootwrap to work around
recent setuptools changes that impacted PBRs ability to make
rootwrap performance not terrible.
DocImpact
Related-Bug: #1415106
Change-Id: Iad1e57b18e490a
Reviewed: https:/
Committed: https:/
Submitter: Jenkins
Branch: feature/0.10
commit 1457aafa6095da4
Author: Monty Taylor <email address hidden>
Date: Wed Jan 28 06:57:06 2015 -0800
Support script text override for newer develop
setuptools > 12 has a new flow for writing out generated script text.
It's nicer, actually, because it means we can just subclass and extend
one method instead of monekypatching.
Closes-Bug: 1415106
Change-Id: I56e7bea60df8a5
| Changed in pbr: | |
| status: | In Progress → Fix Released |


From logstash it looks like this showed up on 1/19.