test_router_rescheduling failed with unexpected FIP status after rescheduling
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
tempest |
Fix Released
|
High
|
Ihar Hrachyshka |
Bug Description
Traceback (most recent call last):
File "tempest/test.py", line 119, in wrapper
return func(*func_args, **func_kwargs)
File "tempest/test.py", line 100, in wrapper
return f(self, *func_args, **func_kwargs)
File "tempest/
msg='After router rescheduling')
File "tempest/
self.
File "tempest/
st=status))
File "/opt/stack/
self.
File "/opt/stack/
raise mismatch_error
testtools.
In tempest log, we see that we rescheduled the same agent for a router, then check it's indeed in the list of scheduled agents, then check FIP status to be ACTIVE. The last check fails (initially it's ACTIVE, but then just before we check for the last time, it flips back to DOWN).
Looking into l3 agent logs, it seems like the agent flipped the status to DOWN while processing the previous unscheduling event.
It seems like we should wait for the FIP status to flip to DOWN after unscheduling, to make sure the agent is done with unscheduling event processing, so that we are safe to proceed with rescheduling without a risk of some previous update events still sitting in router update queue.
Changed in neutron: | |
status: | New → Confirmed |
assignee: | nobody → Ihar Hrachyshka (ihar-hrachyshka) |
importance: | Undecided → High |
tags: | added: gate-failure |
affects: | neutron → tempest |
Fix proposed to branch: master /review. openstack. org/403289
Review: https:/