Build 2697: On a multi-node setup, when zookeeper on one of the db nodes goes down, contrail-topology/contrail-snmp-collector are not able to connect to a remote zookeeper
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Juniper Openstack | Status tracked in Trunk | | |
Trunk | Fix Committed | High | ted ghose |
Bug Description
Local node: zookeeper stopped/down as shown below:
root@nodeg20:
zookeeper stop/waiting <----------Local zookeeper down
root@nodeg20:
root@nodeg20:
== Contrail vRouter ==
supervisor-vrouter: active
contrail-
contrail-
== Contrail Control ==
supervisor-control: active
contrail-control active
contrail-
contrail-dns active
contrail-named active
== Contrail Analytics ==
supervisor-
contrail-alarm-gen active
contrail-
contrail-
contrail-collector active
contrail-
contrail-
contrail-topology initializing (Zookeeper:
== Contrail Config ==
supervisor-config: active
contrail-api:0 active
contrail-
contrail-
contrail-
contrail-schema backup
contrail-
ifmap active
== Contrail Database ==
contrail-database: active
supervisor-
contrail-
kafka active
== Contrail Support Services ==
supervisor-
rabbitmq-server active
Remote node:
root@nodeg13:~# contrail-status
== Contrail Analytics ==
supervisor-
contrail-alarm-gen active
contrail-
contrail-
contrail-collector active
contrail-
contrail-
contrail-topology active
== Contrail Config ==
supervisor-config: active
contrail-api:0 active
contrail-
contrail-
contrail-
contrail-schema active
contrail-
ifmap active
== Contrail Web UI ==
supervisor-webui: active
contrail-webui active
contrail-
== Contrail Database ==
contrail-database: active
supervisor-
contrail-
kafka active
== Contrail Support Services ==
supervisor-
rabbitmq-server active
root@nodeg13:~# service zookeeper status
zookeeper start/running, process 15108 <----remote zookeeper is up
root@nodeg13:~#
testbed:
host1 = 'root@10.
host2 = 'root@10.
host3 = 'root@10.
'cfgm': [host1,
'webui': [host1],
'openstack': [host1],
'control': [host2, host3],
'collector': [host1, host2, host3],
'database': [host1, host2, host3],
'compute': [host2, host3],
Changed in juniperopenstack:
milestone: none → r3.0-fcs
def fixup_contrail_topology(self):
    conf_fl = '/etc/contrail/contrail-topology.conf'
    with settings(warn_only=True):
        local("[ -f %s ] || > %s" % (conf_fl, conf_fl))
    self.set_config(conf_fl, 'DEFAULTS', 'zookeeper',
                    self.cassandra_server_list[0][0] + ':2181') <<<< Only the first node in cassandra_server_list [database nodes] is provisioned for zookeeper in contrail-topology and contrail-snmp-collector
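
The annotation above points out that only the first entry of cassandra_server_list is written to the zookeeper setting. This explains the symptom: if that one node's zookeeper is down, the service has no other endpoint to try. A minimal sketch of building the value from every database node instead is given below. The helper name zookeeper_server_list, the comma separator, and the sample addresses are assumptions made for illustration. They are not taken from the provisioning code or from the committed fix, and the format that contrail-topology actually accepts for this option would need to be checked.

```python
ZOOKEEPER_PORT = '2181'  # same port as in the snippet above


def zookeeper_server_list(cassandra_server_list, port=ZOOKEEPER_PORT, sep=','):
    """Hypothetical helper: return 'host:port' for every database node.

    Assumes each entry is indexable and that entry[0] is the host,
    mirroring the cassandra_server_list[0][0] access in the snippet above.
    The separator is an assumption; adjust it to what the service expects.
    """
    return sep.join('%s:%s' % (server[0], port)
                    for server in cassandra_server_list)


if __name__ == '__main__':
    # Placeholder documentation addresses, not the testbed's real hosts.
    nodes = [('192.0.2.1', '9160'), ('192.0.2.2', '9160'), ('192.0.2.3', '9160')]
    print(zookeeper_server_list(nodes))
```

With a value like this, the provisioning call would pass zookeeper_server_list(self.cassandra_server_list) in place of self.cassandra_server_list[0][0] + ':2181', so that a single zookeeper outage leaves other endpoints available.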