Instances stuck in Image_snapshot/Queued should be cleaned up
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Compute (nova) |
Invalid
|
Wishlist
|
Unassigned |
Bug Description
Scenario:
If various systems like RabbitMQ, mysql DB server ot nova-compute go down during snapshot, the instance is stuck either in queued or Image_snapshot state. (Depending on the timing of when a specific component went down)
Expected Response:
Instance should be eventually brought back to ACTIVE state.
If there are snapshot entries in glance DB and/or on disk, they should be cleaned up.
Actual Response:
Instance remains stuck in Image_snapshot state.
This is problematic because once it is stuck in this state, no snapshot is allowed on this instance till it returns in either ACTIVE or SHUTOFF state.
in nova/compute/api.py
@check_
def snapshot(self, context, instance, name, extra_propertie
Notes :
This was reproduced forcefully for testing purposes by putting breakpoint at appropriate place(s) and then shutting down rabbitmq or mysql servers from other terminal window.
Branch: master
Changed in nova: | |
importance: | Undecided → Wishlist |
linked to blueprint https:/ /blueprints. launchpad. net/nova/ +spec/compute- instance- cleanup- service