tempest

Delay for safety in ObjectExpiryTest is not sufficient in all test environments

Bug #1452915 reported by Daryl Walleck on 2015-05-07

This bug affects 1 person

Affects		Status	Importance	Assigned to	Milestone
	tempest	Fix Released	Undecided	Ievgeniia Zadorozhna

Bug Description

In the _test_object_expiry function of ObjectExpiryTest, an extra delay for latency is hard coded in:

https://github.com/openstack/tempest/blob/master/tempest/api/object_storage/test_object_expiry.py#L66

This appears to be sufficient for smaller test environments, but not enough for public environments under load (trial and error led me to determine that 7 seconds was sufficient for Rackspace's public environments). This value could be increased, but that would be a temporary patch until someone with another environment with different latency ran into the same problem. Would it make sense to turn this into a polling loop checking periodically over a configurable period of time? Or would increasing this value to a larger number be sufficient?

Revision history for this message

Attila Fazekas (afazekas) wrote on 2015-05-19:

What is the source of the delay ?

The object was stored and read back with `x-delete-at` , as I remember the server just checks the current time and this header for serving 404.

Is it still working like that?

Is it possible the clocks are not in sync ?

Changed in tempest:
status:	New → Incomplete

Revision history for this message

Ievgeniia Zadorozhna (izadorozhna) wrote on 2015-06-10:

I suggest adding a configuration parameter, that allows to set this delay in tempest.conf. If it is ann appropriate solution, I can propose a fix for it.

Changed in tempest:
assignee:	nobody → izadorozhna (izadorozhna)

Revision history for this message

Daryl Walleck (dwalleck) wrote on 2015-06-10:

I'm not sure what the exact cause of the delay is. My guess is that it has to do with propagation time, which is what the existing hardcoded leeway of 3 seconds was trying to solve. 7 seconds seems to be enough for my production regions, but that's as arbitrary of a number of 3. We could make this value configurable per environment as mentioned, or should there be no propagation delay and the API should be immediately up to date, in which case the sleep should be removed and a bug with Swift opened. Thoughts?

Revision history for this message

Ievgeniia Zadorozhna (izadorozhna) wrote on 2015-06-16:

I suggest to make this couple of seconds configurable. In tempest.conf in [object-storage] group there is a parameter "container_sync_interval". I suggest to use it instead of creating new parameter. By default it is 5 seconds.
As for me, on my cloud the test passes only when I am adding 10 seconds (so even 7 is not enough).
Review please my commit:
https://review.openstack.org/#/c/192111/

OpenStack Infra (hudson-openstack) on 2015-06-17

Changed in tempest:
status:	Incomplete → In Progress

Revision history for this message

Ievgeniia Zadorozhna (izadorozhna) wrote on 2015-06-17:

Changed the logic from the configurable parameter to the checking whether the object still exists and waiting for several seconds in a loop (waiting for 1 minute in whole).

OpenStack Infra (hudson-openstack) on 2015-06-18

Changed in tempest:
status:	In Progress → Fix Released

Report a bug

This report contains Public information

Everyone can see this information.

You are

Subscribing...

Edit bug mail

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.