Config services svc-monitor, device-manager and schema are not coming up in 5.1.0
Affects | Status | Importance | Assigned to | Milestone | ||
---|---|---|---|---|---|---|
Juniper Openstack | Status tracked in Trunk | |||||
R5.0 |
Fix Released
|
High
|
Michael Henkel | |||
Trunk |
Fix Released
|
Critical
|
Michael Henkel |
Bug Description
The setup is freshly brought with 5.1.0 -215 - ocata. In this build config services like svc-monitor, device-manager and schema are not coming to active while checking through contrail-status.
Setup also available. It is a multi node multi interface setup.
nodec7 (root/c0ntrail123)
instances:
nodec57:
ip: 10.204.216.153
provider: bms
roles:
config: null
control: null
webui: null
nodec7:
ip: 10.204.216.64
provider: bms
roles:
config: null
control: null
webui: null
nodec8:
ip: 10.204.216.65
provider: bms
roles:
config: null
control: null
webui: null
nodei1:
ip: 10.204.216.150
provider: bms
roles:
vrouter:
nodei2:
ip: 10.204.217.114
provider: bms
roles:
vrouter:
nodei3:
ip: 10.204.217.115
provider: bms
roles:
vrouter:
The same error is seen in device-mgr and schema logs. Complete logs for both of the services are attached
== Contrail control ==
control: active
nodemgr: active
named: active
dns: active
== Contrail config-database ==
nodemgr: active
zookeeper: active
rabbitmq: active
cassandra: active
== Contrail database ==
kafka: active
nodemgr: active
zookeeper: active
cassandra: active
== Contrail analytics ==
snmp-collector: active
query-engine: active
api: active
alarm-gen: active
nodemgr: active
collector: active
topology: active
== Contrail webui ==
web: active
job: active
== Contrail config ==
HTTPSConnection
svc-monitor: initializing
nodemgr: active
HTTPSConnection
device-manager: initializing
api: active
HTTPSConnection
schema: initializing
device-mgr log:
===============
dm_logger, args)
File "/usr/lib/
self.
File "/usr/lib/
func(*args, **kwargs)
File "/usr/lib/
DeviceManag
File "/usr/lib/
self._object_db = DMCassandraDB.
File "/usr/lib/
cls.
File "/usr/lib/
ca_
File "/usr/lib/
ca_certs,
File "/usr/lib/
self.
File "/usr/lib/
self.
File "/usr/lib/
**create_
File "/usr/lib/
self.
File "/usr/lib/
return self._schema_
File "/usr/lib/
schema_version = schema_func(*args)
File "/usr/lib/
return self.recv_
File "/usr/lib/
(fname, mtype, rseqid) = self._iprot.
File "/usr/lib64/
sz = self.readI32()
File "/usr/lib64/
buff = self.trans.
File "/usr/lib64/
chunk = self.read(sz - have)
File "/usr/lib64/
self.
File "/usr/lib64/
buff = self.__
File "/usr/lib64/
chunk = self.read(sz - have)
File "/usr/lib64/
buff = self.handle.
File "/usr/lib64/
self.
File "/usr/lib64/
self.
File "/usr/lib64/
result = waiter.get()
File "/usr/lib64/
return self.hub.switch()
File "/usr/lib64/
return greenlet.
timeout: timed out
This issue is seen in 5.0 173
[root@nodem14 ~]# contrail-status analytics- alarm-gen running Up 24 hours analytics- api running Up 2 days analytics- collector running Up 2 days analytics- query-engine running Up 2 days analytics- snmp-collector running Up 24 hours analytics- topology running Up 2 days controller- config- api running Up 2 days controller- config- devicemgr running Up 24 hours controller- config- schema running Up 24 hours controller- config- svcmonitor running Up 24 hours external- cassandra running Up 2 days external- rabbitmq running Up 2 days external- zookeeper running Up 2 days controller- control- control running Up 2 days controller- control- dns running Up 2 days controller- control- named running Up 2 days external- cassandra running Up 2 days external- kafka running Up 2 days external- zookeeper running Up 2 days controller- webui-job running Up 2 days controller- webui-web running Up 2 days
Pod Service Original Name State Status
analytics alarm-gen contrail-
analytics api contrail-
analytics collector contrail-
analytics nodemgr contrail-nodemgr running Up 2 days
analytics query-engine contrail-
analytics snmp-collector contrail-
analytics topology contrail-
config api contrail-
config device-manager contrail-
config nodemgr contrail-nodemgr running Up 2 days
config schema contrail-
config svc-monitor contrail-
config-database cassandra contrail-
config-database nodemgr contrail-nodemgr running Up 2 days
config-database rabbitmq contrail-
config-database zookeeper contrail-
control control contrail-
control dns contrail-
control named contrail-
control nodemgr contrail-nodemgr running Up 2 days
database cassandra contrail-
database kafka contrail-
database nodemgr contrail-nodemgr running Up 2 days
database zookeeper contrail-
webui job contrail-
webui web contrail-
== Contrail control ==
control: active
nodemgr: active
named: active
dns: active
== Contrail config-database ==
nodemgr: active
zookeeper: active
rabbitmq: active
cassandra: active
== Contrail database ==
kafka: active
nodemgr: active
zookeeper: active
cassandra: active
== Contrail analytics == UVE:10. 204.216. 96:6379[ None] connection down) UVE:10. 204.216. 96:6379[ None], Zookeeper: AlarmGenerator[ ] connection down)
snmp-collector: active
query-engine: active
api: initializing (Redis-
alarm-gen: initializing (Redis-
nodemgr: active
collector: active
topology: active
== Contrail webui ==
web: active
job: active
== Contrail config == Pool(host= 'nodem. ..
HTTPSConnection