Contrail :: R4.1 16.04 27 newton :: at times zookeeper fails to come up on port 2182 in analyticsdb.
Affects | Status | Importance | Assigned to | Milestone | ||
---|---|---|---|---|---|---|
Juniper Openstack | Status tracked in Trunk | |||||
R4.1 |
Incomplete
|
High
|
Ritam Gangopadhyay | |||
R5.0 |
Incomplete
|
High
|
Ritam Gangopadhyay | |||
Trunk |
Incomplete
|
High
|
Ritam Gangopadhyay |
Bug Description
Once in a while, we see this issue where analytics services report initializing state due kafka connection.
root@nodec28:~# docker exec -it analytics contrail-status
== Contrail Analytics ==
contrail-collector: initializing (KafkaPub:
contrail-
On investigation it is seen that zookeeper on controller is up on port 2181, but zookeeper on analyticsdb, to which kafka connects is not up on port 2182.
On restarting zookeeper on analyticsdb it was able to aquire the port 2182 and kafka issue was no longer seen.
root@nodec28:~# docker exec -it analytics contrail-status
== Contrail Analytics ==
contrail-collector: active
contrail-
contrail-
contrail-alarm-gen: active
contrail-
contrail-topology: active
contrail-
root@nodec28:~#
*******
CONTRAIL ANALYTICS API LOGS
*******
2017-10-25 Wed 03:55:34:858.501 IST nodec28 [Thread 140144673298176, Pid 3355]: Failed to acquire metadata: Local: Broker transport failure
2017-10-25 Wed 03:55:39:859.149 IST nodec28 [Thread 140144669099776, Pid 3355]: No Kafka Callbacks
2017-10-25 Wed 03:55:39:859.217 IST nodec28 [Thread 140144669099776, Pid 3355]: Kafka Needs Restart
2017-10-25 Wed 03:55:44:859.129 IST nodec28 [Thread 140144669099776, Pid 3355]: Failed to acquire metadata: Local: Broker transport failure
2017-10-25 Wed 03:55:49:859.675 IST nodec28 [Thread 140144673298176, Pid 3355]: No Kafka Callbacks
2017-10-25 Wed 03:55:49:859.741 IST nodec28 [Thread 140144673298176, Pid 3355]: Kafka Needs Restart
*******
KAFKA LOGS
*******
[2017-10-25 11:00:04,715] FATAL Fatal error during KafkaServerStar
org.I0Itec.
at org.I0Itec.
at org.I0Itec.
at org.I0Itec.
at kafka.utils.
at kafka.utils.
at kafka.server.
at kafka.server.
at kafka.server.
at kafka.Kafka$
at kafka.Kafka.
[2017-10-25 11:00:04,716] INFO shutting down (kafka.
*******
ZOOKEEPER LOGS ON ANALYTICSDB
*******
2017-10-25 00:44:23,339 - ERROR [main:ZooKeeper
java.io.
at org.apache.
at org.apache.
at org.apache.
at org.apache.
at org.apache.
at org.apache.
2017-10-25 11:29:00,197 - INFO [main:QuorumPee
2017-10-25 11:29:00,208 - INFO [main:QuorumPee
2017-10-25 11:29:00,208 - ERROR [main:QuorumPee
2017-10-25 11:29:00,209 - INFO [main:DatadirCl
1. Is this an upgrade case or fresh install? ownership of dataDir= /var/lib/ zookeeper although I don’t have an answer for why it works fine when you restart zookeeper manually.
2. All-in-one node, multi-node or HA case?
3. Is it seen on ubuntu14.04 or other openstack releases?
4. Does it happen only on initial installation or have you seen it on reboot too?
5. Once you see the system is in good state, does it go back to bad state on its own?
6. After you restart zookeeper and system is in good state, does it go back to bad state on its own?
7. Please provide the combined.json files used for provisioning.
8. The issue you reported might be due to incorrect permission/
9. If you hit this issue again, please leave the box in that state and send login credentials.
Please provide below details when you hit the issue. u14d5(analytics db):/etc/ zookeeper/ conf_example# cat /etc/zookeeper/ conf_example/ zoo.cfg /var/lib/ zookeeper /var/log/ zookeeper ut=120000
root@sangupta-
tickTime=2000
dataDir=
dataLogDir=
clientPort=2182
initLimit=10
syncLimit=5
maxSessionTimeo
autopurge. purgeInterval= 3 snapRetainCount =3
autopurge.
server. 1=192.168. 0.44:2889: 3889 u14d5(analytics db):/# ls -asl /var/lib/zookeeper/ conf//myid u14d5(analytics db):/# ls -asl /var/lib/ zookeeper/ version- 2/
root@sangupta-
total 12
4 drwxr-xr-x 3 zookeeper zookeeper 4096 Oct 30 17:55 .
4 drwxr-xr-x 32 root root 4096 Oct 30 03:40 ..
0 lrwxrwxrwx 1 root root 25 Oct 30 17:55 myid -> /etc/zookeeper/
4 drwxr-xr-x 2 zookeeper zookeeper 4096 Nov 1 17:25 version-2
root@sangupta-
total 344
4 drwxr-xr-x 2 zookeeper zookeeper 4096 Nov 1 17:25 .
4 drwxr-xr-x 3 zookeeper zookeeper 4096 Oct 30 17:55 ..
4 -rw-r--r-- 1 zookeeper zookeeper 296 Oct 30 17:55 snapshot.0
332 -rw-r--r-- 1 zookeeper zookeeper 336862 Nov 1 17:25 snapshot.3477