neutron-l3-agent crashes on boot without a running neutron-server
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
tripleo |
Fix Released
|
High
|
Unassigned |
Bug Description
Restarting the neutron-l3-agent without a running neutron server (or presumably before its ready for connections) results in the traceback below and the l3 agent terminating after 1 minute
This in turn is causing VM's on the overcloud to fail the boot process (we just see)
2014-09-11 19:08:49.993 | + wait_for 30 10 ping -c 1 192.0.2.46
2014-09-11 19:15:20.422 | Timing out after 300 seconds:
2014-09-11 19:15:20.422 | COMMAND=ping -c 1 192.0.2.46
2014-09-11 19:15:20.422 | OUTPUT=PING 192.0.2.46 (192.0.2.46) 56(84) bytes of data.
2014-09-11 19:15:20.422 | From 192.168.1.110 icmp_seq=1 Destination Host Unreachable
2014-09-11 19:15:20.422 |
2014-09-11 19:15:20.422 | --- 192.0.2.46 ping statistics ---
2014-09-11 19:15:20.422 | 1 packets transmitted, 0 received, +1 errors, 100% packet loss, time 0ms
2014-09-11 19:15:20.463 | + get_state_
Which can mean any number of things
== l3 agent traceback
CRITICAL neutron [req-1c4f9f4a-
TRACE neutron Traceback (most recent call last):
TRACE neutron File "/opt/stack/
TRACE neutron sys.exit(main())
TRACE neutron File "/opt/stack/
TRACE neutron manager=manager)
TRACE neutron File "/opt/stack/
TRACE neutron periodic_
TRACE neutron File "/opt/stack/
TRACE neutron self.manager = manager_
TRACE neutron File "/opt/stack/
TRACE neutron super(L3NATAgen
TRACE neutron File "/opt/stack/
TRACE neutron self.plugin_
TRACE neutron File "/opt/stack/
TRACE neutron version='1.3')
TRACE neutron File "/opt/stack/
TRACE neutron return method(*args, **kwargs)
TRACE neutron File "/opt/stack/
TRACE neutron context, msg, rpc_method='call', **kwargs)
TRACE neutron File "/opt/stack/
TRACE neutron return func(context, msg['method'], **msg['args'])
TRACE neutron File "/opt/stack/
TRACE neutron retry=self.retry)
TRACE neutron File "/opt/stack/
TRACE neutron timeout=timeout, retry=retry)
TRACE neutron File "/opt/stack/
TRACE neutron retry=retry)
TRACE neutron File "/opt/stack/
TRACE neutron result = self._waiter.
TRACE neutron File "/opt/stack/
TRACE neutron reply, ending = self._poll_
TRACE neutron File "/opt/stack/
TRACE neutron % msg_id)
TRACE neutron MessagingTimeout: Timed out waiting for a reply to message ID e42af92d590b43c
TRACE neutron
Example from logs.openstack. org/99/ 120799/ 3/check- tripleo/ check-tripleo- novabm- overcloud- f20-nonha/ 66d3c9e/ logs/overcloud- controller0_ logs/
http://
2014-09-11 19:08:49.993 | + wait_for 30 10 ping -c 1 192.0.2.46
2014-09-11 19:15:20.422 | Timing out after 300 seconds:
2014-09-11 19:15:20.422 | COMMAND=ping -c 1 192.0.2.46
2014-09-11 19:15:20.422 | OUTPUT=PING 192.0.2.46 (192.0.2.46) 56(84) bytes of data.
2014-09-11 19:15:20.422 | From 192.168.1.110 icmp_seq=1 Destination Host Unreachable
2014-09-11 19:15:20.422 |
2014-09-11 19:15:20.422 | --- 192.0.2.46 ping statistics ---
-- Logs begin at Thu 2014-09-11 18:53:55 UTC, end at Thu 2014-09-11 19:10:07 UTC. -- controller0- vktbhu7b7qww systemd[1]: Starting neutron-l3-agent Service... controller0- vktbhu7b7qww systemd[1]: Started neutron-l3-agent Service. controller0- vktbhu7b7qww neutron- l3-agent[ 30996]: 2014-09-11 18:58:18.723 30996 INFO oslo.messaging. _drivers. impl_rabbit [req-25335a75- 0bc8-4224- 935b-4b29b349a6 d7 ] Connecting to AMQP server on 192.0.2.3:5672 controller0- vktbhu7b7qww neutron- l3-agent[ 30996]: 2014-09-11 18:58:18.840 30996 INFO oslo.messaging. _drivers. impl_rabbit [req-25335a75- 0bc8-4224- 935b-4b29b349a6 d7 ] Connected to AMQP server on 192.0.2.3:5672 controller0- vktbhu7b7qww neutron- l3-agent[ 30996]: 2014-09-11 18:58:18.864 30996 INFO oslo.messaging. _drivers. impl_rabbit [req-25335a75- 0bc8-4224- 935b-4b29b349a6 d7 ] Connecting to AMQP server on 192.0.2.3:5672 controller0- vktbhu7b7qww neutron- l3-agent[ 30996]: 2014-09-11 18:58:18.923 30996 INFO oslo.messaging. _drivers. impl_rabbit [req-25335a75- 0bc8-4224- 935b-4b29b349a6 d7 ] Connected to AMQP server on 192.0.2.3:5672 controller0- vktbhu7b7qww neutron- l3-agent[ 30996]: 2014-09-11 18:59:18.989 30996 CRITICAL neutron [req-25335a75- 0bc8-4224- 935b-4b29b349a6 d7 None] MessagingTimeout: Timed out waiting for a reply to message ID 3ca27749373d452 0b6bcb197aa928b b4 controller0- vktbhu7b7qww neutron- l3-agent[ 30996]: 2014-09-11 18:59:18.989 30996 TRACE neutron Traceback (most recent call last): controller0- vktbhu7b7qww neutron- l3-agent[ 30996]: 2014-09-11 18:59:18.989 30996 TRACE neutron File "/opt/stack/ venvs/openstack /bin/neutron- l3-agent" , line 10, in <module> controller0- vktbhu7b7qww neutron- l3-agent[ 30996]: 2014-09-11 18:59:18.989 30996 TRACE neutron sys.exit(main()) controller0- vktbhu7b7qww neutron- l3-agent[ 30996]: 2014-09-11 18:59:18.989 30996 TRACE neutron File "/opt/stack/ venvs/openstack /lib/python2. 7/site- packages/ neutron/ agent/l3_ agent.py" , line 1936, in main controller0- vktbhu7b7qww neutron- l3-agent[ 30996]: 2014-09-11 18:59:18.989 30996 TRACE neutron manager=manager) controller0- vktbhu7b7qww neutron- l3-agent[ 30996]: 2014-09-11 18:59:18.989 30996 TRACE neutron File "/opt/stack/ venvs/openstack /lib/python2. 7/site- packages/ neutron/ service. py", line 264, in create controller0- vkt...
Sep 11 18:58:06 overcloud-
Sep 11 18:58:06 overcloud-
Sep 11 18:58:18 overcloud-
Sep 11 18:58:18 overcloud-
Sep 11 18:58:18 overcloud-
Sep 11 18:58:18 overcloud-
Sep 11 18:59:19 overcloud-
Sep 11 18:59:19 overcloud-
Sep 11 18:59:19 overcloud-
Sep 11 18:59:19 overcloud-
Sep 11 18:59:19 overcloud-
Sep 11 18:59:19 overcloud-
Sep 11 18:59:19 overcloud-
Sep 11 18:59:19 overcloud-