DVR multinode job intermittently failing
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
neutron |
Invalid
|
Medium
|
Brian Haley |
Bug Description
Occasionally the DVR multinode jobs are failing in the check queue, typically one of the test VMs fails to get an IP address according to it's console output.
Looking in the dhcp-agent logs I sometimes see a failure in setting things up for a network, for example, http://
Looking back in the log I can see these operations (snipped for readability):
1. (no port existed, so one is created, including namespace)
DEBUG neutron.
['ip', 'netns', 'exec', 'qdhcp-
['ip', 'netns', 'exec', 'qdhcp-
['ip', '-o', 'link', 'show', 'br-int']
['ip', 'link', 'set', 'tapd5a978a6-e7', 'address', 'fa:16:
['ip', '-o', 'netns', 'list']
['ip', 'netns', 'add', 'qdhcp-
['ip', 'netns', 'exec', 'qdhcp-
['ip', 'netns', 'exec', 'qdhcp-
['ip', 'link', 'set', 'tapd5a978a6-e7', 'netns', 'qdhcp-
['ip', 'netns', 'exec', 'qdhcp-
['ip', 'netns', 'exec', 'qdhcp-
2. iptables rules applied
['ip', 'netns', 'exec', 'qdhcp-
['ip', 'netns', 'exec', 'qdhcp-
IPTablesManager
3. init_l3() is called to configure IP on device in namespace
['ip', 'netns', 'exec', 'qdhcp-
['ip', 'netns', 'exec', 'qdhcp-
(there should have been an 'ip addr add ...' here for the IP)
['ip', 'netns', 'exec', 'qdhcp-
Setting gateway for dhcp netns on net ff97a28f-
['ip', 'netns', 'exec', 'qdhcp-
Exit code: 2; Stdin: ; Stdout: ; Stderr: RTNETLINK answers: Network is unreachable
That will fail since there isn't an interface in the 10.100.0.1/24 subnet.
I have a debug patch up now, still investigating, https:/
Changed in neutron: | |
status: | New → Invalid |
Actually I see the full job hedging upwards:
http:// grafana. openstack. org/dashboard/ db/neutron- failure- rate?panelId= 8&fullscreen
Any idea of the most offending test?