mixed Centos-8-9 job - os_tempest unable to ping neutron router
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
tripleo |
Won't Fix
|
Undecided
|
Unassigned |
Bug Description
At [1] we are trying to create a mixed OS wallaby job with centos9 controller/
The second/compute deployment is completing and during os_tempest setup an overcloud neutron router is created, but we are unable to ping it from the undercloud. You can see this in the logs at [2]
2022-07-11 05:18:46.224731 | primary | TASK [os_tempest : Ping router ip address] *******
2022-07-11 05:18:46.224765 | primary | Monday 11 July 2022 09:18:46 +0000 (0:00:00.114) 0:02:05.574 ***********
2022-07-11 05:18:51.231873 | primary | FAILED - RETRYING: Ping router ip address (5 retries left).
2022-07-11 05:19:06.061119 | primary | FAILED - RETRYING: Ping router ip address (4 retries left).
2022-07-11 05:19:20.998833 | primary | FAILED - RETRYING: Ping router ip address (3 retries left).
2022-07-11 05:19:36.024830 | primary | FAILED - RETRYING: Ping router ip address (2 retries left).
2022-07-11 05:19:51.161485 | primary | FAILED - RETRYING: Ping router ip address (1 retries left).
2022-07-11 05:20:06.168826 | primary | fatal: [undercloud]: FAILED! => {"attempts": 5, "changed": true, "cmd": "set -e\nping -c2 \"192.168.
Communication between the undercloud and overcloud nodes is OK (e.g. ping works) and clearly undercloud can talk to overcloud neutron as the router is created fine - you can see the query on the overcloud router details in the attached file from comment #1 below.
The environment is using the ci/multinode.j2 [3] template (also used used by the c9 multinode job - for comparison example logs at [4]) and we are not facing the same issue there i.e. the neutron router can be reached. So with multinode.j2 we are adding the br-ex and have [5] to pass in the explicit datacentre:br-ex bridge mappings (though that is default anyway).
The attached file (comment #1 below) is the output from "ovs-vsctl show", "ip a", "ip r" on the 3 nodes (though you can find the logs at e.g. [6][7] where subnode-1 is control and subnode-2 is compute).
What am I missing please why cant we ping the overcloud neutron router IP address from the undercloud?
[1] https:/
[2] https:/
[3] https:/
[4] https:/
[5] https:/
[6] https:/
[7] https:/
description: | updated |
marking this as invalid - still not clear why the router ping was not working
however after skipping this the job has gone on to run tempest to completion so network seems to be OK at least from user point of view
https:/ /logserver. rdoproject. org/58/ 43558/13/ check/tripleo- ci-centos- 8-9-multinode- mixed-os/ e621742/ logs/undercloud /var/log/ tempest/ stestr_ results. html.gz