Improve instance state recovery for Compute service failure during Create Server

Bug #1072734 reported by Rohit Karajgi on 2012-10-29
This bug affects 1 person
Affects Status Importance Assigned to Milestone
OpenStack Compute (nova)
Grzegorz Grasza

Bug Description


Compute service spawns an instance but crashes just before instance's state is updated in database to Active, but instance has started running on the hypervisor.

In this situation, the recovery of the instance requires admin intervention:

- When compute service resumes, the check_instance_build_time periodic task sets the VM State to Error, while task state is still Spawning
- To recover the instance, Admin now has to reset the instance's state to Active (task state gets reset to None)

The instance can now be usable. The sync power state periodic task eventually sets the Power state to Running.

However , this is a tedious workflow needing admin intervention and should be handled in the code.

Changed in nova:
status: New → Triaged
importance: Undecided → Medium
Grzegorz Grasza (xek) on 2015-03-09
Changed in nova:
assignee: nobody → Grzegorz Grasza (xek)
status: Triaged → In Progress
Grzegorz Grasza (xek) wrote :

To reproduce the error, I stopped the compute in _update_instance_after_spawn method.

To post a comment you must log in.
This report contains Public information  Edit
Everyone can see this information.

Other bug subscribers