Contrail - API time>50 secs causing Connection reset
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Juniper Openstack |
New
|
Undecided
|
Sachin Bansal |
Bug Description
Observing contrail api time > 50 secs - due to which there a lot of heat operations failure (heat V2)
Setup:
3 contrail control nodes, 16 compute nodes.
root@ccra-17:~# contrail-version
Package Version Build-ID | Repo | Package Name
-------
contrail-analytics 3.0.2.0-51 51
contrail-config 3.0.2.0-51 51
contrail-
contrail-control 3.0.2.0-51 51
contrail-dns 3.0.2.0-51 51
contrail-docs 3.0.2.0-51 51
contrail-f5 3.0.2.0-51 51
contrail-
contrail-heat 3.0.2.0-51 51
Test:
scale upto 500 vsrx instances at 50 per iteration.
Logs:
root@ccra-
39336:97.0.0.17 - - [2017-03-28 08:53:25] "POST /ref-update HTTP/1.1" socket - 73.988813
39697:97.0.0.17 - - [2017-03-28 08:53:31] "POST /ref-update HTTP/1.1" socket - 79.441704
40257:97.0.0.17 - - [2017-03-28 08:53:43] "POST /ref-update HTTP/1.1" socket - 91.056502
40580:97.0.0.17 - - [2017-03-28 08:53:46] "POST /ref-update HTTP/1.1" socket - 91.616593
40832:97.0.0.17 - - [2017-03-28 08:53:48] "POST /ref-update HTTP/1.1" socket - 95.972355
41071:97.0.0.17 - - [2017-03-28 08:53:48] "POST /ref-update HTTP/1.1" socket - 94.105240
41420:97.0.0.17 - - [2017-03-28 08:53:52] "POST /ref-update HTTP/1.1" socket - 97.865286
41702:97.0.0.17 - - [2017-03-28 08:53:54] "POST /ref-update HTTP/1.1" socket - 96.856899
41992:97.0.0.17 - - [2017-03-28 08:53:57] "POST /ref-update HTTP/1.1" socket - 98.459944
42422:97.0.0.17 - - [2017-03-28 08:54:04] "POST /ref-update HTTP/1.1" socket - 100.484664
42749:97.0.0.17 - - [2017-03-28 08:54:08] "POST /ref-update HTTP/1.1" socket - 107.162964
43000:97.0.0.17 - - [2017-03-28 08:54:09] "POST /ref-update HTTP/1.1" socket - 102.370817
43271:97.0.0.17 - - [2017-03-28 08:54:09] "POST /ref-update HTTP/1.1" socket - 100.639601
43554:97.0.0.17 - - [2017-03-28 08:54:13] "POST /ref-update HTTP/1.1" socket - 106.021679
43807:97.0.0.17 - - [2017-03-28 08:54:13] "POST /ref-update HTTP/1.1" socket - 100.504785
44179:97.0.0.17 - - [2017-03-28 08:54:18] "POST /ref-update HTTP/1.1" socket - 97.419570
44460:97.0.0.17 - - [2017-03-28 08:54:20] "POST /ref-update HTTP/1.1" socket - 98.502710
44912:97.0.0.17 - - [2017-03-28 08:54:26] "POST /ref-update HTTP/1.1" socket - 79.879059
45161:97.0.0.17 - - [2017-03-28 08:54:26] "POST /ref-update HTTP/1.1" socket - 79.403091
45395:97.0.0.17 - - [2017-03-28 08:54:27] "POST /ref-update HTTP/1.1" socket - 77.856576
45634:97.0.0.17 - - [2017-03-28 08:54:27] "POST /ref-update HTTP/1.1" socket - 78.348658
45874:97.0.0.17 - - [2017-03-28 08:54:29] "POST /ref-update HTTP/1.1" socket - 79.848005
46177:97.0.0.17 - - [2017-03-28 08:54:32] "POST /ref-update HTTP/1.1" socket - 75.647759
46447:97.0.0.17 - - [2017-03-28 08:54:34] "POST /ref-update HTTP/1.1" socket - 62.705114
46708:97.0.0.17 - - [2017-03-28 08:54:34] "POST /ref-update HTTP/1.1" socket - 56.943514
112065:97.0.0.17 - - [2017-03-28 09:32:17] "POST /neutron/subnet HTTP/1.1" socket - 114.207590
117360:97.0.0.17 - - [2017-03-28 09:33:27] "POST /neutron/subnet HTTP/1.1" socket - 67.087183
118066:97.0.0.17 - - [2017-03-28 09:33:34] "POST /neutron/subnet HTTP/1.1" socket - 60.053293
125194:97.0.0.17 - - [2017-03-28 09:36:11] "POST /neutron/subnet HTTP/1.1" socket - 52.694634
131268:97.0.0.17 - - [2017-03-28 09:38:07] "POST /neutron/subnet HTTP/1.1" socket - 61.057328
149899:97.0.0.17 - - [2017-03-28 09:42:39] "POST /neutron/subnet HTTP/1.1" socket - 52.098861
151194:97.0.0.17 - - [2017-03-28 09:42:50] "POST /neutron/subnet HTTP/1.1" socket - 50.425593
root@ccra-
127.0.0.1 - - [2017-03-28 09:42:50] "GET /virtual-
127.0.0.1 - - [2017-03-28 09:42:50] "POST /useragent-kv HTTP/1.1" 200 172 0.008971
<pre>Traceback (most recent call last):
File "/usr/lib/
subnet_info = cfgdb.subnet_
File "/usr/lib/
return func(self, *args, **kwargs)
File "/usr/lib/
subnet_
File "/usr/lib/
bottle.
File "/usr/lib/
raise HTTPError(code, text)
HTTPError
</pre>
Traceback (most recent call last):
File "/usr/lib/
self.
File "/usr/lib/
self.
File "/usr/lib/
self.
File "/usr/lib/
self.
File "/usr/lib/
self.
File "/usr/lib/
self.
File "/usr/lib/
data_sent += self.send(
File "/usr/lib/
return sock.send(data, flags)
error: [Errno 104] Connection reset by peer
Changed in juniperopenstack: | |
assignee: | nobody → Sachin Bansal (sbansal) |
milestone: | none → r3.2.6.0 |
We are still seeing timeouts in heat from vnc_api. Please see the log below.
2017-04-03 15:31:06.461 47874 INFO heat.engine. resource [-] CREATE: AttachPolicy "ns-entry" Stack "NSO-d10789f7- ba21-4a0e- bec0-0e974c463e 2a" [edfa7c59- 496d-4adf- 9f91-e3ebd18928 75] resource Traceback (most recent call last): resource File "/usr/lib/ python2. 7/dist- packages/ heat/engine/ resource. py", line 525, in _action_recorder resource yield resource File "/usr/lib/ python2. 7/dist- packages/ heat/engine/ resource. py", line 595, in _do_action resource yield self.action_ handler_ task(action, args=handler_args) resource File "/usr/lib/ python2. 7/dist- packages/ heat/engine/ scheduler. py", line 296, in wrapper resource step = next(subtask) resource File "/usr/lib/ python2. 7/dist- packages/ heat/engine/ resource. py", line 566, in action_handler_task resource handler_data = handler(*args) resource File "/usr/lib/ python2. 7/dist- packages/ vnc_api/ gen/heat/ resources/ attach_ policy. py", line 86, in handle_create resource 'network-policy', policy_obj.uuid, None, 'ADD', policy_order) resource File "/usr/lib/ python2. 7/dist- packages/ vnc_api/ vnc_api. py", line 39, in wrapper resource return func(self, *args, **kwargs) resource File "/usr/lib/ python2. 7/dist- packages/ vnc_api/ vnc_api. py", line 871, in ref_update resource content = self._request_ server( rest.OP_ POST, uri, data=json_body) resource File "/usr/lib/ python2. 7/dist- packages/ vnc_api/ vnc_api. py", line 712, in _request_server resource retry_count= retry_count) resource File "/usr/lib/ python2. 7/dist- packages/ vnc_api/ vnc_api. py", line 760, in _request resource raise TimeOutError( 'Gateway Timeout 504') resource TimeOutError: Timed out due to: Gateway Timeout 504 resource scheduler [-] Task update_task from Stack "NSO-d10789f7- ba21-4a0e- bec0-0e974c463e 2a" [edfa7c59- 496d-4adf- 9f91-e3ebd18928 75] sleeping _sleep /usr/lib/ python2. 7/dist- packages/ heat/engine/ scheduler. py:164 client [req-bb8b8a53- fdb9-4e85- 8885-eb7b8b4e4e 89 e54f4104921a457 d88835972125493 ba e81dedd775c74c9 6939bd3d7f866fc 65] 10.102. 2...
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.461 47874 TRACE heat.engine.
2017-04-03 15:31:06.524 47874 DEBUG heat.engine.
2017-04-03 15:31:06.931 47921 DEBUG neutronclient.
REQ: curl -i http://