Snapshot creation is racey
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Compute (nova) |
Fix Released
|
High
|
Rick Harris |
Bug Description
A race condition exists in the creation of snapshots since we check the task_state in compute/api but don't set the task_state until compute/manager receives the message.
This can allow two (or more) snapshots to be created at virtually the same time, which, in the case of XenServer may end up causing VHDs to not coalesce properly.
Even worse, when the VHD doesn't coalesce, this can cause the creation of "bad" images which ultimately corrupt the SR on the machine which they are restored to. This can have a cascading effect across the cluster where one bad image corrupts multiple machines.
The two pronged solution is to prevent "bad" images from getting created in the first place, and, 2), if we detect a bad image, bail on it before corrupting the SR.
Changed in nova: | |
assignee: | nobody → Rick Harris (rconradharris) |
importance: | Undecided → High |
status: | New → In Progress |
Changed in nova: | |
milestone: | none → essex-rc1 |
status: | Fix Committed → Fix Released |
Changed in nova: | |
milestone: | essex-rc1 → 2012.1 |
Fix proposed to branch: master /review. openstack. org/5059
Review: https:/