ironic tempest tests periodically failing: No valid host was found
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Ironic | Invalid | Undecided | Unassigned |
OpenStack Compute (nova) | Invalid | Undecided | Unassigned |
devstack | Fix Released | Undecided | Unassigned |
Bug Description
This was noticed on the stable/juno ironic sideways grenade jobs, but is also confirmed to be happening on the check-tempest-
http://
A number of the early compute tests will fail to spawn an instance, getting a scheduling error on the client side:
BuildErrorExcep
Details: Server eb81ee40-
Details: {u'message': u'No valid host was found. ', u'created': u'2014-
Looking through the nova logs, the request never even makes it to the nova-scheduler. The last error is reported in conductor:
2014-11-27 17:44:01.005 WARNING nova.scheduler.
Looking at the timestamps of the requests, the first instance is requested at 17:44:00:
2014-11-27 17:44:00.944 24730 DEBUG tempest.
However, on the nova-compute side, the resource tracker is not updated to include the enlisted Ironic nodes until much later. The first time the tracker contains any of the ironic resources is at 17:44:06:
2014-11-27 17:44:06.224 21645 AUDIT nova.compute.
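The size of the window can be read straight off the two timestamps quoted above. A minimal sketch of that arithmetic, assuming both log lines use the same clock and the `YYYY-MM-DD HH:MM:SS.mmm` format shown (the variable names are illustrative only):

```python
from datetime import datetime

# Timestamp format used by the log lines quoted in this report.
LOG_TIME_FORMAT = "%Y-%m-%d %H:%M:%S.%f"

# First tempest request for an instance.
first_request = datetime.strptime("2014-11-27 17:44:00.944", LOG_TIME_FORMAT)
# First resource tracker audit that includes the ironic resources.
first_tracker_update = datetime.strptime("2014-11-27 17:44:06.224", LOG_TIME_FORMAT)

# Seconds during which a boot request could not be scheduled.
gap_seconds = (first_tracker_update - first_request).total_seconds()
print(f"tracker lagged the first request by {gap_seconds:.3f}s")
```

Any instance requested inside that gap of roughly five seconds sees no available resources.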
So there's a race between the resource tracker's initial inclusion of available resources and Tempest running the first set of tests that require an instance. This can be worked around in a couple of ways:
* Adjust the periodic task interval on nova-compute so the resource tracker updates much more frequently, though this only narrows the window.
* Have tempest run an admin 'nova hypervisor-stats' call on the client side and wait for resources before booting any instances (in the baremetal case only).
* Adjust devstack's nova cpu deployment to spin until hypervisor-stats reflects the ironic node parameters.
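The second and third workarounds both come down to polling the hypervisor stats until the ironic resources show up. The sketch below shows that idea only and is not the fix that was proposed for devstack. It assumes a caller-supplied `fetch_stats` callable (hypothetical) that returns a dict shaped like the `nova hypervisor-stats` output, with `count`, `vcpus` and `memory_mb` keys. The function name, timeout and interval are likewise illustrative.

```python
import time


def wait_for_hypervisor_resources(fetch_stats, expected_count=1, timeout=120.0,
                                  interval=2.0, sleep=time.sleep,
                                  clock=time.monotonic):
    """Poll fetch_stats() until the hypervisor stats show usable resources.

    fetch_stats is a hypothetical callable returning a dict with the
    count, vcpus and memory_mb fields of 'nova hypervisor-stats'.
    Returns the first stats dict that satisfies the check, or raises
    TimeoutError once the deadline has passed.
    """
    deadline = clock() + timeout
    while True:
        stats = fetch_stats()
        ready = (stats.get("count", 0) >= expected_count
                 and stats.get("vcpus", 0) > 0
                 and stats.get("memory_mb", 0) > 0)
        if ready:
            return stats
        if clock() >= deadline:
            raise TimeoutError(
                "hypervisor stats never reported resources: %r" % (stats,))
        # Resource tracker has not run yet; wait and ask again.
        sleep(interval)
```

A caller would pass whatever wraps the admin stats query, for example `wait_for_hypervisor_resources(get_stats, expected_count=3)` where `get_stats` is the caller's own function, before launching the first test that boots an instance. Injecting `sleep` and `clock` keeps the loop easy to exercise without real delays.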
tags: | removed: in-stable-juno |
Changed in ironic: | |
status: | New → Confirmed |
status: | Confirmed → New |
Fix proposed to branch: master
Review: https://review.openstack.org/138158