Instances stuck in Image_snapshot/Queued should be cleaned up

Bug #966107 reported by Mandar Vaze
This bug affects 1 person
Affects: OpenStack Compute (nova)
Status: Invalid
Importance: Wishlist
Assigned to: Unassigned
Milestone: (not set)

Bug Description

Scenario:
If a component such as RabbitMQ, the MySQL DB server, or nova-compute goes down during a snapshot, the instance is stuck in either the queued or the Image_snapshot state, depending on when the component went down.

Expected Response:
Instance should be eventually brought back to ACTIVE state.
If there are snapshot entries in glance DB and/or on disk, they should be cleaned up.
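
The expected behaviour above could be sketched roughly as a periodic cleanup pass. This is only an illustration, not nova code: the function name cleanup_stuck_snapshots, the dict layout of the instance and image records, the 30 minute timeout, and the rule that any image not yet "active" counts as a leftover are all assumptions made for the example.

```python
from datetime import datetime, timedelta

# Hypothetical state names, following the wording of this report.
STUCK_STATES = {"queued", "image_snapshot"}


def cleanup_stuck_snapshots(instances, images, now,
                            timeout=timedelta(minutes=30)):
    """Reset instances stuck in a snapshot state and list leftover images.

    instances: list of dicts with "uuid", "state", "updated_at" (assumed layout)
    images:    list of dicts with "id", "instance_uuid", "status" (assumed layout)
    Returns the ids of incomplete snapshot images that should be deleted.
    """
    orphaned = []
    for inst in instances:
        stuck = inst["state"] in STUCK_STATES
        too_old = now - inst["updated_at"] > timeout
        if not (stuck and too_old):
            continue
        # Bring the instance back to ACTIVE, as the report asks.
        inst["state"] = "active"
        # Collect snapshot images of this instance that never completed.
        orphaned.extend(
            img["id"] for img in images
            if img["instance_uuid"] == inst["uuid"]
            and img["status"] != "active"
        )
    return orphaned
```

The timeout is there so that a snapshot which is merely slow is not mistaken for a stuck one; a suitable value would depend on the deployment.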

Actual Response:
Instance remains stuck in Image_snapshot state.
This is problematic because once the instance is stuck in this state, no further snapshot is allowed on it until it returns to either the ACTIVE or the SHUTOFF state.

    # in nova/compute/api.py
    @check_instance_state(vm_state=[vm_states.ACTIVE, vm_states.SHUTOFF])
    def snapshot(self, context, instance, name, extra_properties=None):
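
To illustrate why a stuck instance is rejected, here is a simplified stand-in for such a state-check decorator. It is not the actual nova implementation: the string state values, the InstanceInvalidState exception, the dict-based instance record, and the API class are assumptions for the sake of a self-contained example.

```python
import functools

# Hypothetical state values standing in for nova's vm_states constants.
ACTIVE = "active"
SHUTOFF = "shutoff"
IMAGE_SNAPSHOT = "image_snapshot"


class InstanceInvalidState(Exception):
    """Raised when an operation is not allowed in the current state."""


def check_instance_state(vm_state):
    """Allow the wrapped method only if the instance is in one of vm_state."""
    def outer(func):
        @functools.wraps(func)
        def inner(self, context, instance, *args, **kwargs):
            if instance["vm_state"] not in vm_state:
                raise InstanceInvalidState(
                    "instance %s is in state %s"
                    % (instance["uuid"], instance["vm_state"]))
            return func(self, context, instance, *args, **kwargs)
        return inner
    return outer


class API(object):
    @check_instance_state(vm_state=[ACTIVE, SHUTOFF])
    def snapshot(self, context, instance, name, extra_properties=None):
        # Placeholder body; a real implementation would create the image.
        return {"name": name, "instance_uuid": instance["uuid"]}
```

With a check like this, an instance left in the Image_snapshot state fails before the snapshot code runs at all, so it cannot recover by simply retrying the snapshot.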

Notes:
This was reproduced deliberately for testing purposes by setting breakpoints at the appropriate places and then shutting down the RabbitMQ or MySQL server from another terminal window.

Branch: master

Tags: ntt
Mandar Vaze (mandarvaze) wrote :
Sean Dague (sdague) wrote :

Confirmed as it is part of a blueprint

Changed in nova:
status: New → Confirmed
Thierry Carrez (ttx)
Changed in nova:
importance: Undecided → Wishlist
Joe Gordon (jogo) wrote :

Is this still valid? The blueprint was superseded.

Changed in nova:
status: Confirmed → Invalid