Improve instance state recovery for Compute service failure during Create Server

Bug #1072734 reported by Rohit Karajgi on 2012-10-29
This bug affects 1 person
Affects: OpenStack Compute (nova)

Bug Description


The Compute service spawns an instance but crashes just before the instance's state is updated to Active in the database, even though the instance has already started running on the hypervisor.

In this situation, the recovery of the instance requires admin intervention:

- When the compute service resumes, the check_instance_build_time periodic task sets the VM state to Error, while the task state is still Spawning
- To recover the instance, the admin then has to reset the instance's state to Active (the task state gets reset to None)

The instance is then usable. The sync power state periodic task eventually sets the power state to Running.

However, this is a tedious workflow that requires admin intervention, and it should be handled in the code.
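
A minimal sketch of the kind of automatic recovery being asked for, written as a standalone Python model rather than actual nova code. The `Instance` class, the state enums, and `recover_interrupted_spawn` are all hypothetical names made up for illustration; the only assumption taken from this report is that an instance left with task state Spawning while the hypervisor reports it as running can safely be marked Active.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class VmState(Enum):
    BUILDING = "building"
    ACTIVE = "active"
    ERROR = "error"


class TaskState(Enum):
    SPAWNING = "spawning"


class PowerState(Enum):
    NOSTATE = "nostate"
    RUNNING = "running"


@dataclass
class Instance:
    """Hypothetical stand-in for an instance's database record."""
    uuid: str
    vm_state: VmState
    task_state: Optional[TaskState]
    power_state: PowerState = PowerState.NOSTATE


def recover_interrupted_spawn(instance: Instance,
                              hypervisor_power_state: PowerState) -> bool:
    """Mark the instance Active if its spawn was interrupted after the
    guest had already started on the hypervisor.

    Returns True when the record was repaired, False when it was left alone.
    """
    # The spawn was interrupted if the task state never left Spawning,
    # whether or not the build-time check has already flipped it to Error.
    interrupted = (
        instance.task_state is TaskState.SPAWNING
        and instance.vm_state in (VmState.BUILDING, VmState.ERROR)
    )
    # Only recover when the hypervisor confirms the guest is running.
    if not interrupted or hypervisor_power_state is not PowerState.RUNNING:
        return False

    # Same end result as the manual admin reset described above.
    instance.vm_state = VmState.ACTIVE
    instance.task_state = None
    instance.power_state = hypervisor_power_state
    return True
```

Such a check could run when the compute service starts up, before the build-time check gets a chance to mark the instance as Error. Where exactly it would belong in nova is left open here.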

Michael Still (mikal) on 2012-10-30
Changed in nova:
status: New → Triaged
importance: Undecided → Medium
Grzegorz Grasza (xek) on 2015-03-09
Changed in nova:
assignee: nobody → Grzegorz Grasza (xek)
status: Triaged → In Progress
Grzegorz Grasza (xek) wrote :

To reproduce the error, I stopped the compute service in the _update_instance_after_spawn method.

wuhao (wuhao) wrote :

Grzegorz Grasza,

Is this work still in progress?

I wonder if it's ok to implement this work after your patch.

Change abandoned by Michael Still (<email address hidden>) on branch: master
Reason: This patch has been stalled for a long time, so I am abandoning it. Please feel free to restore it when the code is ready for review.

Grzegorz Grasza (xek) on 2015-11-26
Changed in nova:
assignee: Grzegorz Grasza (xek) → nobody
Changed in nova:
status: In Progress → Confirmed