SM:mainline:3036:centos: contrail-control process gets into timeout state with error in cassandra
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Juniper Openstack |
Won't Fix
|
High
|
Dheeraj Gautam | ||
R3.2 |
Won't Fix
|
High
|
Dheeraj Gautam | ||
R4.0 |
Won't Fix
|
High
|
Dheeraj Gautam | ||
Trunk |
Won't Fix
|
High
|
Dheeraj Gautam |
Bug Description
SM:mainline:
1) Install SM mitaka mainline build 3036 ;
2) add a cluster with roles as follows
root@nodej5:~# server-
+------
| id | cluster_id | ip_address | roles | mac_address |
+------
| nodec57 | cluster_multi | 10.204.221.61 | [u'compute'] | 00:25:90:C5:58:6E |
| nodec33 | cluster_multi | 10.204.221.59 | [u'webui', u'database', u'control', u'collector'] | 00:25:90:C4:82:28 |
| nodec35 | cluster_multi | 10.204.221.58 | [u'config', u'control', u'openstack'] | 00:25:90:C4:7A:70 |
| nodea4 | cluster_multi | 10.204.221.60 | [u'compute'] | 00:25:90:A5:3B:12 |
+------
3) Reimage the target with centos72; Reimaged succesfully;
4) Issue Provision; Provision gets completed;
5) But the contrail control node gets into timeout state
root@nodec33 ~]# contrail-status
== Contrail Control ==
supervisor-control: active
contrail-control timeout
contrail-
6) restart of contrail-control did not help to recover the process to active state
7) /var/log/
2017-02-06 Mon 19:55:54:918.111 PST nodec33 [Thread 47244092547904, Pid 31972]: DisconnectSync:
2017-02-06 Mon 19:55:54:918.017 PST nodec33 [Thread 47244265494272, Pid 31972]: SANDESH: Send FAILED: 1486439754917900 ConfigCassSm [SYS_DEBUG]: ConfigCassInitE
2017-02-06 Mon 19:55:59:918.509 PST nodec33 [Thread 47244092547904, Pid 31972]: SyncFutureWait:
7) Another issue seen in this topology is , contrail-status shows following traceback
root@nodec35 ~]# contrail-status
== Contrail Control ==
supervisor-control: active
contrail-control active
contrail-
contrail-dns active
contrail-named active
== Contrail Config ==
supervisor-config: active
contrail-api:0 active
contrail-
contrail-
contrail-discovery active
contrail-schema active
contrail-
ifmap active
== Contrail Database ==
contrail-database: active
supervisor-
Traceback (most recent call last):
File "/usr/bin/
main()
File "/usr/bin/
contrail_
File "/usr/bin/
check_
File "/usr/bin/
check_
File "/usr/bin/
raise Exception("%s does not exist! Cannot check supervisor status." % service_sock)
Exception: /var/run/
[root@nodec35 ~]#
Notes:
------
1) This is seen in kilo/liberty/mitaka
2) Issue not seen in Single node setup
3) Seen in both ubuntu/centos Distros
description: | updated |
summary: |
- SM:mainline:3036:Centos: contrail-control process gets into timeout - state with error in cassandra + SM:mainline:3036: contrail-control process gets into timeout state with + error in cassandra |
description: | updated |
tags: | added: sanity |
tags: | added: blocker |
tags: | removed: blocker sanity |
tags: | added: sanity |
tags: | removed: sanity |
Changed in juniperopenstack: | |
status: | Incomplete → Won't Fix |
Seen with mainline build 3039 also