After restarting keepalived , Most of the newly launched instances get stuck in build
Affects | Status | Importance | Assigned to | Milestone | ||
---|---|---|---|---|---|---|
Juniper Openstack | Status tracked in Trunk | |||||
R2.20 |
Fix Committed
|
Critical
|
Jeya ganesh babu J | |||
Trunk |
Fix Committed
|
Critical
|
Jeya ganesh babu J |
Bug Description
On a HA cluster, after restating keepalived , most of the instances gets stuck in Build
Launching instances becomes very slow.. It becomes slight better after 20-30 mins
LOGS in : http://
version : 2.20 build 59 Juno with patches
Logs: http://
Someimes the below trace is also seen in nova- conductor
root@cs-
tailf /var/log/
2015-06-22 21:41:34.933 769 ERROR nova.scheduler.
2015-06-22 21:41:34.936 769 INFO oslo.messaging.
2015-06-22 21:41:34.933 769 ERROR nova.scheduler.
2015-06-22 21:41:34.936 769 INFO oslo.messaging.
root@cs-
+------
| Property | Value |
+------
| OS-DCF:diskConfig | MANUAL |
| OS-EXT-
| OS-EXT-
| OS-EXT-
| OS-EXT-
| OS-EXT-
| OS-EXT-
| OS-EXT-STS:vm_state | error |
| OS-SRV-
| OS-SRV-
| accessIPv4 | |
| accessIPv6 | |
| config_drive | |
| created | 2015-06-
| fault | {"message": "Build of instance eeca56e7-
| | filter_properties) |
| | File \"/usr/
| | 'create.error', fault=e) |
| | File \"/usr/
| | six.reraise(
| | File \"/usr/
| | block_device_
| | File \"/usr/
| | return self.gen.next() |
| | File \"/usr/
| | reason=msg) |
| | ", "created": "2015-06-
| flavor | V1 (10) |
| hostId | b0f49b19c9d1a90
| id | eeca56e7-
| image | A1-SNAP1 (3da3258d-
| key_name | - |
| metadata | {} |
| name | VIN2--eeca56e7-
| os-extended-
| status | ERROR |
| tenant_id | 1657ff89c9c54aa
| updated | 2015-06-
| user_id | a9b90ce5977d44e
+------
root@cs-
description: | updated |
description: | updated |
description: | updated |
description: | updated |
information type: | Proprietary → Public |
Cinder api shows timeouts from amqp - messages are logged in nova-api, cinder-api and cinder-volume
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging. _drivers. impl_rabbit routing_ key=self. routing_ key) _drivers. impl_rabbit File "/usr/lib/ python2. 7/dist- packages/ kombu/messaging .py", line 82, in __init__ _drivers. impl_rabbit self.revive( self._channel) _drivers. impl_rabbit File "/usr/lib/ python2. 7/dist- packages/ kombu/messaging .py", line 216, in revive _drivers. impl_rabbit self.declare() _drivers. impl_rabbit File "/usr/lib/ python2. 7/dist- packages/ kombu/messaging .py", line 102, in declare _drivers. impl_rabbit self.exchange. declare( ) _drivers. impl_rabbit File "/usr/lib/ python2. 7/dist- packages/ kombu/entity. py", line 166, in declare _drivers. impl_rabbit nowait=nowait, passive=passive, _drivers. impl_rabbit File "/usr/lib/ python2. 7/dist- packages/ amqp/channel. py", line 613, in exchange_declare _drivers. impl_rabbit self._send_ method( (40, 10), args) _drivers. impl_rabbit File "/usr/lib/ python2. 7/dist- packages/ amqp/abstract_ channel. py", line 56, in _send_method _drivers. impl_rabbit self.channel_id, method_sig, args, content, _drivers. impl_rabbit File "/usr/lib/ python2. 7/dist- packages/ amqp/method_ framing. py", line 221, in write_method _drivers. impl_rabbit write_frame(1, channel, payload) _drivers. impl_rabbit File "/usr/lib/ python2. 7/dist- packages/ amqp/transport. py", line 177, in write_frame _drivers. impl_rabbit frame_type, channel, size, payload, 0xce, _drivers. impl_rabbit File "/usr/lib/ python2. 7/dist- packages/ eventlet/ greenio. py", line 307, in sendall _drivers. impl_rabbit tail = self.send(data, flags) _drivers. impl_rabbit File "/usr/lib/ python2. 7/dist- packages/ eventlet/ greenio. py", line 293, in send _drivers. impl_rabbit total_sent += fd.send( data[total_ sent:], flags) _drivers. impl_rabbit error: [Errno 104] Connection reset by peer
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:30:13.432 38170 TRACE oslo.messaging.
2015-06-22 21:31:24.102 26908 TRACE cinder. api.middleware. fault File "/usr/lib/ python2. 7/dist- packages/ cinder/ api/contrib/ volume_ actions. py", line 197, in _initialize_ connection
2015-06-22 21:31:24.102 26908 TRA...