Tempest test for dr/readable replication fails because share has two active replicas

Bug #1607150 reported by Goutham Pacha Ravi
This bug affects 1 person
Affects: OpenStack Shared File Systems Service (Manila)
Status: Confirmed
Importance: Low
Assigned to: Douglas Viroel
Milestone: (none listed)

Bug Description

The tempest test:

manila_tempest_tests.tests.api.test_replication.ReplicationTest#test_promote_in_sync_share_replica

fails sporadically at the gate for both ZFSonLinux and NetApp cDOT Single SVM drivers because of the update sequence in the share manager.

The test executes only when the backend_replication_type is 'dr' or 'readable'. It performs the following actions:

* Creates a share
* Creates a replica
* Waits for replica to become 'in_sync'
* Promotes the replica
* As soon as the promoted replica becomes available, it requests the list of replicas for the share

At this point, if the share manager is still updating the replicas in the database, the list request returns them in an inconsistent state (for example, two replicas reported as active). We should either wait before requesting the list of replicas, or change the order in which data is written to the database, i.e., update the promoted replica last.
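
The first option (waiting before trusting the replica list) could look roughly like the sketch below. It is not the tempest test's actual code: `wait_for_single_active_replica`, `make_fake_lister`, and the `list_replicas` callable are hypothetical names, and the `replica_state` key is assumed to be how each listed replica reports its state.

```python
import time


def wait_for_single_active_replica(list_replicas, share_id,
                                   timeout=60.0, interval=1.0):
    """Poll list_replicas(share_id) until exactly one replica is 'active'.

    list_replicas is a caller-supplied callable (hypothetical) returning a
    list of dicts, each carrying a 'replica_state' key (assumed field name).
    """
    deadline = time.monotonic() + timeout
    while True:
        replicas = list_replicas(share_id)
        active = [r for r in replicas if r["replica_state"] == "active"]
        if len(active) == 1:
            return replicas
        if time.monotonic() >= deadline:
            raise TimeoutError(
                "share %s has %d active replicas after %.0fs"
                % (share_id, len(active), timeout))
        time.sleep(interval)


def make_fake_lister(responses):
    """Stand-in for the API: replays canned listings, repeating the last."""
    remaining = list(responses)

    def lister(share_id):
        return remaining.pop(0) if len(remaining) > 1 else remaining[0]

    return lister
```

With a lister that first returns two 'active' replicas and then a consistent listing, the helper keeps polling until the consistent listing arrives, and raises `TimeoutError` if it never does.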

Automation is likely to keep finding bugs like this because our API service is not coordinated with updates made by the other services in manila.
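
The second option from the description (updating the promoted replica last) can be illustrated with a toy model. This is only a sketch under simplifying assumptions, not the share manager's code: the "database" is a dict of replica id to state, each write is visible to a reader immediately, and all names (`promote_steps`, `first_view_after_promotion`, `replica-a`, `replica-b`) are made up for illustration.

```python
def promote_steps(db, old_active, promoted, promoted_last):
    """Apply the two state writes one at a time, yielding a snapshot after
    each write to model what a concurrent API reader could observe."""
    writes = [(old_active, "in_sync"), (promoted, "active")]
    if not promoted_last:
        writes.reverse()  # promoted replica is written first
    for replica_id, state in writes:
        db[replica_id] = state
        yield dict(db)


def first_view_after_promotion(promoted_last):
    """Return the first snapshot in which the promoted replica is 'active',
    i.e. the moment the test stops waiting and lists the replicas."""
    db = {"replica-a": "active", "replica-b": "in_sync"}
    for snapshot in promote_steps(db, "replica-a", "replica-b", promoted_last):
        if snapshot["replica-b"] == "active":
            return snapshot
    return None


def count_active(snapshot):
    return sum(1 for state in snapshot.values() if state == "active")
```

In this model, writing the promoted replica first lets a reader see two active replicas, while writing it last means that by the time the promoted replica appears active, the old one has already been demoted.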

Goutham Pacha Ravi (gouthamr) wrote :

Sample failure from NetApp CI attached below.

tags: added: netapp
tags: added: tempest
Changed in manila:
importance: Undecided → Low
importance: Low → Medium
Changed in manila:
assignee: nobody → NidhiMittalHada (nidhimittal19)
Tom Barron (tpb)
tags: added: replication
tags: added: concurrency
tags: added: races
removed: concurrency
Jason Grosso (jgrosso) wrote :

Goutham, how goes this defect? Need any assistance?

Jason Grosso (jgrosso)
Changed in manila:
assignee: NidhiMittalHada (nidhimittal19) → nobody
Jason Grosso (jgrosso) wrote :

Goutham, do we have a milestone set for this?

Goutham Pacha Ravi (gouthamr) wrote :

Jason: we'll add a milestone if someone picks this up.

Douglas Viroel (dviroel)
Changed in manila:
assignee: nobody → Douglas Viroel (dviroel)
Changed in manila:
importance: Medium → Low
milestone: none → victoria-3
Goutham Pacha Ravi (gouthamr) wrote :

This bug was discussed during the upstream openstack-manila meeting on Thu 25th June 2020: http://eavesdrop.openstack.org/meetings/manila/2020/manila.2020-06-25-15.01.log.html#l-63

Changed in manila:
milestone: victoria-3 → victoria-rc1
Changed in manila:
status: New → Confirmed