All tempest tests fail on traceroute call with "TimeoutException: Request timed out"
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
networking-sfc |
Fix Released
|
Undecided
|
Bernard Cafarelli |
Bug Description
Since around Rocky mid-cycle, the tempest gates always fail on all tests. Sample failure:
http://
VMs creation looks OK, but the test fails when trying to connect to a VM and run traceroute to the other:
2018-07-23 17:56:56.323 6755 INFO tempest.
2018-07-23 17:56:56.333 6755 INFO paramiko.transport [-] Connected (version 2.0, client dropbear_2012.55)
2018-07-23 17:56:56.607 6755 INFO paramiko.transport [-] Authentication (publickey) successful!
2018-07-23 17:56:56.608 6755 INFO tempest.
2018-07-23 18:00:13.667 6755 ERROR tempest.
Details: Command: 'set -eu -o pipefail; PATH=$PATH:/sbin; traceroute -n -I 10.1.0.13' executed on host '172.24.5.20'.: TimeoutException: Request timed out
After some digging I suspect some security group issue, as I deployed a master devstack and manually tested SFC, still working fine. But I disable port security in my manual tests
Changed in networking-sfc: | |
status: | New → In Progress |
assignee: | nobody → Bernard Cafarelli (bcafarel) |
While tempest test is running, I made a quick test and run "openstack port set --disable- port-security --no-security- group" on all ports related to the test.
This allowed traceroute to finally report in:
traceroute to 10.0.0.5 (10.0.0.5), 30 hops max, 46 byte packets
1 * * *
2 * * *
3 * * *
4 * * *
5 * * *
6 * 10.0.0.5 2.316 ms 1.935 ms
2018-07-27 15:07:36,557 16774 ERROR [networking_ sfc.tests. tempest_ plugin. tests.scenario. base] length mismatch:
[u' 1 * * *', u' 2 * * *', u' 3 * * *', u' 4 * * *', u' 5 * * *']
vs
[[u'10.0.0.8']]
The first '* * *' were timeouts until I disabled port security