Evacuation will corrupt instance allocations

Bug #1661312 reported by Dan Smith
8
This bug affects 1 person
Affects Status Importance Assigned to Milestone
OpenStack Compute (nova)
Invalid
Undecided
Unassigned

Bug Description

The following sequence of events will result in a corrupted instance allocation in placement:

1. Instance running on host A, placement has allocations for instance on host A
2. Host A goes down
3. Instance is evacuated to host B, host B creates duplicated allocations in placement for instance
4. Host A comes up, notices that instance is gone, deletes all allocations for instance on both hosts A and B
5. Instance now has no allocations for a period
6. Eventually, host B will re-create the allocations for the instance

The period between #4 and #6 will have the scheduler making bad decisions because it thinks host B is less loaded than it is.

Tags: placement
Sujitha (sujitha-neti)
tags: added: placement
Revision history for this message
Chris Dent (cdent) wrote :

I guess a generic question here is: Do we want it to be possible for a compute-node to be able to delete allocations it hasn't created itself? Or should it only be able to delete those it created itself?

Revision history for this message
Matt Riedemann (mriedem) wrote :

Where does this happen today?

4. Host A comes up, notices that instance is gone, deletes all allocations for instance on both hosts A and B

Where in the code I mean?

Revision history for this message
Sean Dague (sdague) wrote :

Clarification question from mriedem

Changed in nova:
status: New → Incomplete
Revision history for this message
Chris Dent (cdent) wrote :

https://bugs.launchpad.net/nova/+bug/1709902 duplcates this, and that one has code, so invalidating this one.

Changed in nova:
status: Incomplete → Invalid
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.