Instance creation sometimes fails after host aggregate deletion
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Compute (nova) |
Fix Released
|
Low
|
Markus Zoeller (markus_z) | ||
Mitaka |
In Progress
|
Low
|
Roman Podoliaka |
Bug Description
Instance creation starts failing if nova scheduler gets in an inconsistent state wrt host aggregates. If remove_
This is because it cleans up state based on aggregate.hosts which is empty when aggregate is deleted, but the prior aggregate updates to remove individual hosts could have incorrect list of hosts added to the host_aggregates
Instance creation fails with below error once scheduler gets in this state:
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.780 15692 ERROR oslo_messaging.
2016-07-21 18:20:16.784 15692 ERROR oslo_messaging.
Sounds valid. The issue should be somewhere around here: https:/ /github. com/openstack/ nova/blob/ 7755b759b4a0ba0 6378afd7b64153e 4a83f2ce9e/ nova/scheduler/ host_manager. py#L383- L390