"test_dualnet_dhcp6_stateless_from_os" failures seen in the gate

Bug #1509004 reported by Swaminathan Vasudevan
34
This bug affects 7 people
Affects Status Importance Assigned to Milestone
neutron
Invalid
High
Miguel Lavalle
tags: added: gate-failure
Revision history for this message
Ritesh Anand (ritesh-anand) wrote :

While triaging the issue, I found that:
In tempest/scenario/manager.py ScenarioTest:create_server() calls
servers_client.create_server(name=name, imageRef=image,
                                                   flavorRef=flavor,
                                                   **create_kwargs)['server']
With ports (1 on each subnet) passed in create_kwargs.

However, the returned server does not consistently have proper port mapping.

This port_mapping is needed for fip creation.
As queried here:
ports = self._list_ports(device_id=server['id'], status='ACTIVE',
                                 fixed_ip=ip_addr)

I also noticed some comments in the same function that may be relevant here:
# The instance retrieved on creation is missing network
# details, necessitating retrieval after it becomes active to
# ensure correct details.

The trace in the failed test:
Traceback (most recent call last):
  File "tempest/test.py", line 127, in wrapper
    return f(self, *func_args, **func_kwargs)
  File "tempest/scenario/test_network_v6.py", line 248, in test_dualnet_multi_prefix_dhcpv6_stateless
    dualnet=True)
  File "tempest/scenario/test_network_v6.py", line 158, in _prepare_and_test
    sshv4_2, ips_from_api_2, sid2 = self.prepare_server(networks=net_list)
  File "tempest/scenario/test_network_v6.py", line 126, in prepare_server
    fip = self.create_floating_ip(thing=srv)
  File "tempest/scenario/manager.py", line 778, in create_floating_ip
    port_id, ip4 = self._get_server_port_id_and_ip4(thing)
  File "tempest/scenario/manager.py", line 759, in _get_server_port_id_and_ip4
    % port_map)
  File "/opt/stack/new/tempest/.tox/full/local/lib/python2.7/site-packages/testtools/testcase.py", line 350, in assertEqual
    self.assertThat(observed, matcher, message)
  File "/opt/stack/new/tempest/.tox/full/local/lib/python2.7/site-packages/testtools/testcase.py", line 435, in assertThat
    raise mismatch_error
testtools.matchers._impl.MismatchError: 0 != 1: Found multiple IPv4 addresses: []. Unable to determine which port to target.

Revision history for this message
Matt Riedemann (mriedem) wrote :

http://logs.openstack.org/71/231271/8/gate/gate-tempest-dsvm-neutron-full/1be12e3/console.html#_2015-10-29_09_18_25_892

2015-10-29 09:18:25.892 | Captured traceback:
2015-10-29 09:18:25.892 | ~~~~~~~~~~~~~~~~~~~
2015-10-29 09:18:25.892 | Traceback (most recent call last):
2015-10-29 09:18:25.892 | File "tempest/test.py", line 127, in wrapper
2015-10-29 09:18:25.892 | return f(self, *func_args, **func_kwargs)
2015-10-29 09:18:25.892 | File "tempest/scenario/test_network_v6.py", line 242, in test_dualnet_dhcp6_stateless_from_os
2015-10-29 09:18:25.892 | self._prepare_and_test(address6_mode='dhcpv6-stateless', dualnet=True)
2015-10-29 09:18:25.893 | File "tempest/scenario/test_network_v6.py", line 199, in _prepare_and_test
2015-10-29 09:18:25.893 | self.subnets_v6[i].gateway_ip)
2015-10-29 09:18:25.893 | File "tempest/scenario/test_network_v6.py", line 211, in _check_connectivity
2015-10-29 09:18:25.893 | (dest, source.ssh_client.host)
2015-10-29 09:18:25.893 | File "/opt/stack/new/tempest/.tox/full/local/lib/python2.7/site-packages/unittest2/case.py", line 702, in assertTrue
2015-10-29 09:18:25.893 | raise self.failureException(msg)
2015-10-29 09:18:25.893 | AssertionError: False is not true : Timed out waiting for 2003::1 to become reachable from 172.24.5.129

http://logstash.openstack.org/#dashboard/file/logstash.json?query=message:%5C%22AssertionError:%20False%20is%20not%20true%20:%20Timed%20out%20waiting%20for%202003::1%20to%20become%20reachable%20from%5C%22%20AND%20tags:%5C%22console%5C%22%20AND%20NOT%20build_queue:%5C%22periodic-qa%5C%22

16 hits in 7 days, check and gate, all failures.

Changed in neutron:
status: New → Confirmed
importance: Undecided → Medium
Revision history for this message
Matt Riedemann (mriedem) wrote :

It looks like when this failed here:

http://logs.openstack.org/71/231271/8/gate/gate-tempest-dsvm-neutron-full/1be12e3/console.html#_2015-10-29_09_18_25_821

The port status was DOWN on 172.24.5.129 which is what we're trying to ssh into.

Revision history for this message
Matt Riedemann (mriedem) wrote :

Maybe the floating IP status being DOWN doesn't matter since I see that happens in cases that the test passes too:

http://logs.openstack.org/17/227517/15/check/gate-tempest-dsvm-neutron-full/fc0aae8/logs/tempest.txt.gz#_2015-10-22_20_35_29_627

Changed in neutron:
importance: Medium → Critical
Revision history for this message
Armando Migliaccio (armando-migliaccio) wrote :

This failed 0 times during the past 7 days in the gate queue.

Changed in neutron:
importance: Critical → High
Revision history for this message
Armando Migliaccio (armando-migliaccio) wrote :

Again, this failed 0 times during the past 7 days in any queue.

Changed in neutron:
status: Confirmed → Incomplete
importance: High → Undecided
Revision history for this message
Armando Migliaccio (armando-migliaccio) wrote :

Marking incomplete, unless this shows up again and new evidence is collected for triaging/troubleshooting.

Revision history for this message
Armando Migliaccio (armando-migliaccio) wrote :
Changed in neutron:
status: Incomplete → New
Revision history for this message
Jordan Pittier (jordan-pittier) wrote :

Stumbled upon this today: http://logs.openstack.org/00/295900/3/check/gate-tempest-dsvm-neutron-dvr/93918ea/console.html#_2016-03-27_18_53_46_331

Logstash query

message:"AssertionError: False is not true : Timed out waiting for 2003::1 to become reachable from" AND tags:console

100% failure, 10 hits in the past 7 days.

Revision history for this message
Oleg Bondarev (obondarev) wrote :
Revision history for this message
Ihar Hrachyshka (ihar-hrachyshka) wrote :

Just happened in stable/mitaka with: http://logs.openstack.org/88/305788/3/check/gate-tempest-dsvm-neutron-dvr/f7c6d69/logs/testr_results.html.gz Since the dnsmasq patch Oleg mentioned was already backported to the branch at the time the job was triggered, I doubt that fixes it.

Changed in neutron:
status: New → Confirmed
tags: added: l3-ipam-dhcp
Revision history for this message
Carl Baldwin (carl-baldwin) wrote :

0 hits

Changed in neutron:
status: Confirmed → Incomplete
Revision history for this message
Launchpad Janitor (janitor) wrote :

[Expired for neutron because there has been no activity for 60 days.]

Changed in neutron:
status: Incomplete → Expired
Revision history for this message
Dr. Jens Harbott (j-harbott) wrote :

54 hits in the last 7 days. Checked a couple and they all have an error like this:

http://logs.openstack.org/96/384696/1/check/gate-tempest-dsvm-neutron-placement-full-ubuntu-xenial-nv/c19ea11/logs/screen-q-svc.txt.gz?level=ERROR#_2016-10-10_20_45_30_546

/usr/local/lib/python2.7/dist-packages/sqlalchemy/util/langhelpers.py:68: SAWarning: An exception has occurred during handling of a previous exception. The previous exception is:
 <class 'oslo_db.exception.DBDeadlock'> (pymysql.err.InternalError) (1213, u'Deadlock found when trying to get lock; try restarting transaction') [SQL: u'INSERT INTO ipallocations (port_id, ip_address, subnet_id, network_id) VALUES (%(port_id)s, %(ip_address)s, %(subnet_id)s, %(network_id)s)'] [parameters: {'network_id': u'e345189c-5ab3-4696-a860-51874e215e0a', 'subnet_id': '0d3d3a6a-5957-4cd1-bfa3-009a45957589', 'port_id': u'84722cc3-107b-49bb-b422-e858a8fa7b79', 'ip_address': '2003::f816:3eff:feb9:b9a1'}]

Changed in neutron:
status: Expired → Confirmed
Revision history for this message
Anindita Das (anindita-das) wrote :

http://logs.openstack.org/72/385072/3/check/gate-tempest-dsvm-neutron-full-ubuntu-xenial/9f79eb8/console.html#_2016-10-17_15_50_55_879518

2016-10-17 15:50:55.879518 | Captured traceback:
2016-10-17 15:50:55.879534 | ~~~~~~~~~~~~~~~~~~~
2016-10-17 15:50:55.879565 | Traceback (most recent call last):
2016-10-17 15:50:55.879593 | File "tempest/test.py", line 100, in wrapper
2016-10-17 15:50:55.879618 | return f(self, *func_args, **func_kwargs)
2016-10-17 15:50:55.879657 | File "tempest/scenario/test_network_v6.py", line 256, in test_dualnet_multi_prefix_dhcpv6_stateless
2016-10-17 15:50:55.879674 | dualnet=True)
2016-10-17 15:50:55.879706 | File "tempest/scenario/test_network_v6.py", line 203, in _prepare_and_test
2016-10-17 15:50:55.879729 | self.subnets_v6[i]['gateway_ip'])
2016-10-17 15:50:55.879761 | File "tempest/scenario/test_network_v6.py", line 213, in _check_connectivity
2016-10-17 15:50:55.879783 | (dest, source.ssh_client.host)
2016-10-17 15:50:55.879826 | File "/opt/stack/new/tempest/.tox/tempest/local/lib/python2.7/site-packages/unittest2/case.py", line 702, in assertTrue
2016-10-17 15:50:55.879857 | raise self.failureException(msg)
2016-10-17 15:50:55.879898 | AssertionError: False is not true : Timed out waiting for 2003::1 to become reachable from 172.24.5.4

http://logstash.openstack.org/#/dashboard/file/logstash.json?query=message:%5C%22AssertionError:%20False%20is%20not%20true%20:%20Timed%20out%20waiting%20for%202003::1%20to%20become%20reachable%20from%5C%22%20AND%20tags:%5C%22console%5C%22%20AND%20NOT%20build_queue:%5C%22periodic-qa%5C%22

53 hits in last 7days

Changed in neutron:
importance: Undecided → High
Revision history for this message
Miguel Lavalle (minsel) wrote :
Changed in neutron:
status: Confirmed → Incomplete
Revision history for this message
Dr. Jens Harbott (j-harbott) wrote :
Changed in neutron:
status: Incomplete → Confirmed
Revision history for this message
Ihar Hrachyshka (ihar-hrachyshka) wrote :

4 hits in last 7 days.

Revision history for this message
Miguel Lavalle (minsel) wrote :

Using this search:

http://logstash.openstack.org/#/dashboard/file/logstash.json?query=message:%5C%22AssertionError:%20False%20is%20not%20true%20:%20Timed%20out%20waiting%20for%202003::1%20to%20become%20reachable%20from%5C%22%20AND%20tags:%5C%22console%5C%22%20AND%20NOT%20build_queue:%5C%22periodic-qa%5C%22

We are finding 0 hits for this bug. Also, Elastic recheck has 0 hits as of today:

Bug 1509004 - "test_dualnet_dhcp6_stateless_from_os" failures seen in the gate
0 fails in 24 hrs / 0 fails in 10 days
Projects: (neutron - Confirmed)
No matches
Logstash Launchpad

We are going to monitor this bug for one more week and close it if we don't see any hits.

Changed in neutron:
assignee: nobody → James Anziano (janzian)
Miguel Lavalle (minsel)
Changed in neutron:
assignee: James Anziano (janzian) → Miguel Lavalle (minsel)
Revision history for this message
Miguel Lavalle (minsel) wrote :

I have continued watching this bug over the past 14 days. The search:

http://logstash.openstack.org/#/dashboard/file/logstash.json?query=message:%5C%22AssertionError:%20False%20is%20not%20true%20:%20Timed%20out%20waiting%20for%202003::1%20to%20become%20reachable%20from%5C%22%20AND%20tags:%5C%22console%5C%22%20AND%20NOT%20build_queue:%5C%22periodic-qa%5C%22

Only produced 2 hits, both in networking-sfc. One of the hits is is in a non-voting job: gate-tempest-dsvm-networking-sfc-multinode-ubuntu-xenial-nv. The other one is in gate-tempest-dsvm-networking-sfc-ubuntu-xenial, with a patchset that is failing many other tests.

Closing this bug

Changed in neutron:
status: Confirmed → Invalid
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Duplicates of this bug

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.