Activity log for bug #1724589

Date Who What changed Old value New value Message
2017-10-18 14:39:26 John Garbutt bug added bug
2017-10-18 14:39:42 John Garbutt tags ironic placement
2017-10-18 14:39:49 John Garbutt nova: importance Undecided High
2017-10-18 14:57:22 John Garbutt nova: assignee John Garbutt (johngarbutt)
2017-10-18 14:57:25 John Garbutt nova: status New In Progress
2017-10-18 15:23:15 John Garbutt description

Old value:

In Pike we ask people to:
* Update Ironic Node with a Resource Class
* Update flavors to request the new Resource Class (and not request VCPU, RAM, DISK), using the docs: https://docs.openstack.org/ironic/latest/install/configure-nova-flavors.html#scheduling-based-on-resource-classes

Consider this case:
* some old instances are running from before the updates
* some new instances are created after the updates

In placement:
* all inventory is correct, new resource class and legacy resource classes are both present
* old instance allocations: only request

In nova db:
* old instances and new instances correctly request the new resource class in their flavor
* new instances also include the anti-request for VCPU, DISK and RAM

Now this is the flow that shows the problem:
* get list of candidate allocations
* this includes nodes that already have instances on them (the old instances only claim part of the inventory, and the new instance only requests the part of the inventory the old instance isn't using)
* boom, scheduling new instances fails after you hit the retry count, unless you got lucky and found a free slot by accident

Possible reason for this:
* Pike no longer updated instance allocations. If we updated the allocations of old instances to request the new custom resource class, we would fix the above issue.

Possible workaround:
* in the new flavor, keep requesting VCPU, RAM and DISK resources for pike, fix that up in queens?

New value:

So the scenario is:
* upgraded to pike
* have ironic with multiple flavors
* attempting to transition to resource class based scheduling, now pike is installed

In Pike we ask people to:
* Update Ironic Node with a Resource Class
* Update flavors to request the new Resource Class (and not request VCPU, RAM, DISK), using the docs: https://docs.openstack.org/ironic/latest/install/configure-nova-flavors.html#scheduling-based-on-resource-classes

Consider this case:
* some old instances are running from before the updates
* some new instances are created after the updates

In placement:
* all inventory is correct, new resource class and legacy resource classes are both present
* old instance allocations: only request

In nova db:
* old instances and new instances correctly request the new resource class in their flavor
* new instances also include the anti-request for VCPU, DISK and RAM

Now this is the flow that shows the problem:
* get list of candidate allocations
* this includes nodes that already have instances on them (the old instances only claim part of the inventory, and the new instance only requests the part of the inventory the old instance isn't using)
* boom, scheduling new instances fails after you hit the retry count, unless you got lucky and found a free slot by accident

Possible reason for this:
* Pike no longer updated instance allocations. If we updated the allocations of old instances to request the new custom resource class, we would fix the above issue.

Possible workaround:
* in the new flavor, keep requesting VCPU, RAM and DISK resources for pike, fix that up in queens?
2017-10-18 19:06:39 Matt Riedemann nominated for series nova/pike
2017-10-18 19:06:39 Matt Riedemann bug task added nova/pike
2017-10-18 19:06:44 Matt Riedemann nova/pike: status New Triaged
2017-10-18 19:06:47 Matt Riedemann nova/pike: importance Undecided High
2017-10-18 20:14:56 OpenStack Infra nova: assignee John Garbutt (johngarbutt) Dan Smith (danms)
2017-10-18 22:33:05 OpenStack Infra nova/pike: status Triaged In Progress
2017-10-18 22:33:05 OpenStack Infra nova/pike: assignee Matt Riedemann (mriedem)
2017-10-19 04:58:33 OpenStack Infra nova: status In Progress Fix Released
2017-10-19 18:25:37 OpenStack Infra nova/pike: status In Progress Fix Committed
2017-10-19 20:20:32 Matt Riedemann nova: assignee Dan Smith (danms) John Garbutt (johngarbutt)
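
The scheduling flow in the bug description can be illustrated with a small toy model. This is only a sketch under stated assumptions, not Placement's or Nova's actual code: the `Node` class, the `candidates` helper, the node names, the inventory amounts, and the resource class name `CUSTOM_BAREMETAL_GOLD` are all hypothetical. It shows why a node whose old instance holds only legacy allocations still looks free to a request for the custom resource class alone, and how either keeping the legacy resources in the flavor or adding the custom class to the old allocation would exclude it.

```python
from dataclasses import dataclass, field

# Hypothetical custom resource class name, chosen for illustration only.
CUSTOM_RC = "CUSTOM_BAREMETAL_GOLD"


@dataclass
class Node:
    """Toy stand-in for a resource provider: inventory plus allocations."""
    name: str
    inventory: dict
    allocations: list = field(default_factory=list)

    def used(self, rc):
        # Total amount of one resource class consumed by all allocations.
        return sum(alloc.get(rc, 0) for alloc in self.allocations)

    def can_fit(self, request):
        # A request fits if every requested class has enough unused inventory.
        return all(
            self.inventory.get(rc, 0) - self.used(rc) >= amount
            for rc, amount in request.items()
        )


def candidates(nodes, request):
    """Return the names of nodes that appear able to satisfy the request."""
    return [node.name for node in nodes if node.can_fit(request)]


# Both legacy and custom resource classes are present in inventory.
inventory = {"VCPU": 8, "MEMORY_MB": 16384, "DISK_GB": 100, CUSTOM_RC: 1}

# Old instance: its allocation covers only the legacy resource classes.
legacy_alloc = {"VCPU": 8, "MEMORY_MB": 16384, "DISK_GB": 100}

busy = Node("node-busy", dict(inventory), [legacy_alloc])
free = Node("node-free", dict(inventory))

# New-style flavor: requests only the custom resource class.
new_request = {CUSTOM_RC: 1}
print(candidates([busy, free], new_request))

# Workaround from the description: keep requesting the legacy resources too.
workaround_request = {CUSTOM_RC: 1, "VCPU": 8, "MEMORY_MB": 16384, "DISK_GB": 100}
print(candidates([busy, free], workaround_request))

# Alternative: the old allocation also claims the custom resource class.
healed = Node("node-busy", dict(inventory), [{**legacy_alloc, CUSTOM_RC: 1}])
print(candidates([healed, free], new_request))
```

In this model the first call lists the occupied node as a candidate, which corresponds to the failed scheduling attempts and retries described in the report; the other two calls return only the free node.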