Make RBD Usable for Ephemeral Storage
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
| OpenStack Compute (nova) |
Medium
|
Dmitry Borodaenko |
Bug Description
Currently in Havana development, RBD as ephemeral storage has serious stability
and performance issues that makes the Ceph cluster a bottleneck for using an
image as a source.
Nova has to currently communicate with the external service Glance, which has
to talk to the separate Ceph storage backend to fetch path information. The
entire image is then downloaded to local disk, and then imported from local
disk to RBD. This leaves a stability concern, especially with large images for
the instance to be successfully created.
This can be eliminated by instead having Nova's RBD image backend utility
communicate directly with the Ceph backend to do a copy-on-write of the image.
Not only does this greatly improve stability, but performance is drastically
improved by not having to do a full copy of the image.
Changed in nova: | |
assignee: | nobody → Josh Durgin (jdurgin) |
status: | New → In Progress |
tags: | added: ceph rbd |
Jesse Pretorius (jesse-pretorius) wrote : | #2 |
This seems to have stalled?
Josh Durgin (jdurgin) wrote : | #3 |
hudson didn't update this, but it's up for review: https:/
I expect it will be merged soon since several rounds of review have already been done.
Andrew Woodward (xarses) wrote : | #4 |
I think that this patch is very useful to the community, What can we do to help push this through?
Can we get this tagged for inclusion into icehouse and an importance set?
Changed in nova: | |
importance: | Undecided → Medium |
Reviewed: https:/
Committed: https:/
Submitter: Jenkins
Branch: master
commit c25c60f6a9ab1cc
Author: Josh Durgin <email address hidden>
Date: Wed Jan 22 15:07:17 2014 -0800
enable cloning for rbd-backed ephemeral disks
Currently when using rbd as an image backend, nova downloads the
glance image to local disk and then copies it again into rbd. This
can be very slow for large images, and wastes bandwidth as well as
disk space.
When the glance image is stored in the same ceph cluster, the data is
being pulled out and pushed back in unnecessarily. Instead, create a
copy-on-write clone of the image. This is fast, and does not depend
on the size of the image. Instead of taking minutes, booting takes
seconds, and is not limited by the disk copy.
Add some rbd utility functions from cinder to support cloning and
let the rbd imagebackend rely on librbd instead of the rbd
command line tool for checking image existence.
Add an ImageHandler for rbd that does the cloning if an applicable
image location is available. If no such location is available, or rbd
is not configured for ephemeral disks, this handler does nothing, so
enable it by default.
blueprint rbd-clone-
Closes-bug: 1226351
Change-Id: I9b77a50206d0ed
Signed-off-by: Josh Durgin <email address hidden>
Signed-off-by: Zhi Yan Liu <email address hidden>
Changed in nova: | |
status: | In Progress → Fix Committed |
Changed in nova: | |
milestone: | none → icehouse-rc1 |
Changed in nova: | |
status: | Fix Committed → In Progress |
Changed in nova: | |
milestone: | icehouse-rc1 → none |
Bart Vanbrabant (bart-vanbrabant) wrote : | #6 |
Any status on this patch? Because it was reverted before icehouse was released.
Shuquan Huang (shuquan) wrote : | #7 |
I merge the patch from master. But it doesn't work. Is there any other place should be configured?
Changed in nova: | |
assignee: | Josh Durgin (jdurgin) → Dmitry Borodaenko (dborodaenko) |
Dmitry Borodaenko (angdraug) wrote : | #8 |
Current incarnation of this patch:
https:/
Changed in nova: | |
assignee: | Dmitry Borodaenko (dborodaenko) → Jay Pipes (jaypipes) |
Changed in nova: | |
assignee: | Jay Pipes (jaypipes) → Dmitry Borodaenko (dborodaenko) |
Changed in nova: | |
assignee: | Dmitry Borodaenko (dborodaenko) → Michael Still (mikalstill) |
Changed in nova: | |
assignee: | Michael Still (mikalstill) → Dmitry Borodaenko (dborodaenko) |
OpenStack Infra (hudson-openstack) wrote : | #9 |
Reviewed: https:/
Committed: https:/
Submitter: Jenkins
Branch: master
commit 112b140e2daa720
Author: Dmitry Borodaenko <email address hidden>
Date: Mon May 19 16:30:14 2014 -0700
Enable cloning for rbd-backed ephemeral disks
Currently when using rbd as an image backend, nova downloads the glance
image to local disk and then copies it again into rbd. This can be very
slow for large images, and wastes bandwidth as well as disk space.
When the glance image is stored in the same ceph cluster, the data is
being pulled out and pushed back in unnecessarily. Instead, create a
copy-on-write clone of the image. This is fast, and does not depend on
the size of the image. Instead of taking minutes, booting takes seconds,
and is not limited by the disk copy.
Add some rbd utility functions from cinder to support cloning and let
the rbd imagebackend rely on librbd instead of the rbd command line tool
for checking image existence.
Adds a new clone() method to the image backend, so backends like rbd can
make optimizations like this. Try to use clone() for the root disk when
it comes from an image, but fall back to fetch_to_raw() if clone()
fails.
Instead of calling disk.get_
verify_
method that is overridden by the Rbd subclass to get the disk size.
DocImpact
Implements: blueprint rbd-clone-
Closes-Bug: 1226351
Co-Authored-By: Josh Durgin <email address hidden>
Signed-Off-By: Josh Durgin <email address hidden>
Signed-Off-By: Zhi Yan Liu <email address hidden>
Signed-Off-By: Dmitry Borodaenko <email address hidden>
Change-Id: I0f50659b54a92f
Changed in nova: | |
status: | In Progress → Fix Committed |
Changed in nova: | |
milestone: | none → juno-3 |
status: | Fix Committed → Fix Released |
Abel Lopez (al592b) wrote : | #10 |
Would love to see this in Icehouse...
Xav Paice (xavpaice) wrote : | #11 |
+1 for Icehouse. The patch does apply cleanly - there's a good source for the patch in the Debian packages for Icehouse (plus a bunch of other useful ones for Ceph users).
Dmitry Borodaenko (angdraug) wrote : | #12 |
The whole patch series for Icehouse is available on Github here:
https:/
If there's a will from the Nova team to review and merge this to stable/icehouse, I can post this patch series to gerrit.
tags: | added: icehouse-backport-potential |
Changed in nova: | |
milestone: | juno-3 → 2014.2 |
cristi1979 (cristi-falcas) wrote : | #13 |
What files do I have to update from the git repo (icehouse) for this to work?
Dmitry Borodaenko (angdraug) wrote : | #14 |
$ git whatchanged --oneline 2014.1.
nova/compute/
nova/compute/
nova/exception.py
nova/virt/
nova/virt/driver.py
nova/virt/fake.py
nova/virt/
nova/virt/
nova/virt/images.py
nova/virt/
nova/virt/
nova/virt/
nova/virt/
nova/virt/
nova/virt/
nova/virt/
Fix proposed to branch: master /review. openstack. org/46879
Review: https:/