NetApp cDOT driver fails Tempest cleanup on clone workflows

Bug #1447339 reported by Clinton Knight
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
OpenStack Shared File Systems Service (Manila)
Fix Released
Undecided
Clinton Knight

Bug Description

During the create-from-snapshot workflow, the cDOT driver clones the flexvol backing a share, and a job is started to split the clone from its parent. The Tempest tests that exercise this feature attempt to delete all artifacts created for the test (child flexvol, snapshot, parent flexvol) immediately after the test assertions. This rapid deletion wouldn't normally happen in the field, but during the Tempest runs the clone split is still occurring during the cleanup phase, so the cleanup fails. There isn't a great way to detect and prevent the issue, so the workaround is to intercept the error and retry a few times.

Revision history for this message
Valeriy Ponomaryov (vponomaryov) wrote :

Quote: "There isn't a great way to detect and prevent the issue"

Why? There is the way! Share driver just should return "ok" response only when some resource is deleted indeed. So, it should not only send request for deletion, but also verify resource absense for some time and only then return answer "ok".

Revision history for this message
Clinton Knight (clintonk) wrote :

Hi, Valeriy, I think we're saying the same thing. My point was that there isn't an API we can use to unequivocally know the clone split is done and that subsequent delete APIs won't fail. So the simplest fix is to retry for a few seconds in the failing share-delete workflow step (volume unmount).

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to manila (master)

Fix proposed to branch: master
Review: https://review.openstack.org/178825

Changed in manila:
status: New → In Progress
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to manila (master)

Reviewed: https://review.openstack.org/178825
Committed: https://git.openstack.org/cgit/openstack/manila/commit/?id=dd2047a886f48969d543018809b826c117a0ec92
Submitter: Jenkins
Branch: master

commit dd2047a886f48969d543018809b826c117a0ec92
Author: Clinton Knight <email address hidden>
Date: Fri Apr 17 11:17:06 2015 -0400

    NetApp cDOT driver fails Tempest cleanup on clone workflows

    During the create-from-snapshot workflow, the cDOT driver clones
    the flexvol backing a share, and a job is started to split the
    clone from its parent. The Tempest tests that exercise this
    feature attempt to delete all artifacts created for the test
    (child flexvol, snapshot, parent flexvol) immediately after
    the test assertions. This rapid deletion wouldn't normally
    happen in the field, but during the Tempest runs the clone
    split is still occurring during the cleanup phase, so the
    cleanup fails. There isn't a great way to detect and prevent
    the issue, so the workaround is to intercept the error and
    retry a few times.

    Closes-Bug: #1447339
    Change-Id: If272eb3c8060c95510226672a8d1928bea0cc6cf

Changed in manila:
status: In Progress → Fix Committed
Thierry Carrez (ttx)
Changed in manila:
milestone: none → liberty-1
status: Fix Committed → Fix Released
Thierry Carrez (ttx)
Changed in manila:
milestone: liberty-1 → 1.0.0
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.