Process failure seen in 2 controllers/3 when deleting 2k VN's and process contrail-device-manager,contrail-schema,contrail-svc-monitor failed on the 2 controllers
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenContrail |
New
|
Undecided
|
Unassigned |
Bug Description
I have the following controllers and TSN's in the cluster and HA is enabled (I have attacahed the testbed.py)
host1 = 'root@10.94.63.102' >> Controller (leader)
host2 = 'root@10.94.63.103' >> Backup
host3 = 'root@10.94.63.133' >> Backup
host4 = 'root@10.
host5 = 'root@10.
At present the controllers had 4K VN's
1-2000 VNs configured under alpha naming convention and 2001 - 4000 VNs configured under bravo naming convention
Now I delete the VNs from 2001 - 4000 .I started this on Jan 10 evening and I saw process crash in Controller 1 and controller 2
Below is the conntrail-status before and after crash on all three Controllers
Before crash on controller 1
root@NTTC-
== Contrail Control ==
supervisor-control: active
contrail-control active
contrail-
contrail-dns active
contrail-named active
== Contrail Analytics ==
supervisor-
contrail-
contrail-
contrail-
contrail-collector active
contrail-
contrail-
contrail-topology active
== Contrail Config ==
supervisor-config: active
contrail-api:0 active
contrail-
contrail-
contrail-
contrail-schema backup
contrail-
ifmap active
== Contrail Web UI ==
supervisor-webui: active
contrail-webui active
contrail-
== Contrail Database ==
contrail-database: active
== Contrail Supervisor Database ==
supervisor-
contrail-
kafka active
== Contrail Support Services ==
supervisor-
rabbitmq-server active
After crash on controller 1
root@NTTC-
== Contrail Control ==
supervisor-control: active
contrail-control active
contrail-
contrail-dns active
contrail-named active
== Contrail Analytics ==
supervisor-
contrail-
contrail-
contrail-
contrail-collector active
contrail-
contrail-
contrail-topology active
== Contrail Config ==
supervisor-config: active
contrail-api:0 active
contrail-
contrail-
contrail-
contrail-schema failed
contrail-
ifmap active
== Contrail Web UI ==
supervisor-webui: active
contrail-webui active
contrail-
== Contrail Database ==
contrail-database: active
== Contrail Supervisor Database ==
supervisor-
contrail-
kafka active
== Contrail Support Services ==
supervisor-
rabbitmq-server active
Before crash on Controller2
root@NTTC-
== Contrail Control ==
supervisor-control: active
contrail-control active
contrail-
contrail-dns active
contrail-named active
== Contrail Analytics ==
supervisor-
contrail-
contrail-
contrail-
contrail-collector active
contrail-
contrail-
contrail-topology active
== Contrail Config ==
supervisor-config: active
contrail-api:0 active
contrail-
contrail-
contrail-
contrail-schema backup
contrail-
ifmap active
== Contrail Web UI ==
supervisor-webui: active
contrail-webui active
contrail-
== Contrail Database ==
contrail-database: active
== Contrail Supervisor Database ==
supervisor-
contrail-
kafka active
== Contrail Support Services ==
supervisor-
rabbitmq-server active
After crash on Controller 2
root@NTTC-
== Contrail Control ==
supervisor-control: active
contrail-control active
contrail-
contrail-dns active
contrail-named active
== Contrail Analytics ==
supervisor-
contrail-
contrail-
contrail-
contrail-collector active
contrail-
contrail-
contrail-topology active
== Contrail Config ==
supervisor-config: active
contrail-api:0 active
contrail-
contrail-
contrail-
contrail-schema backup
contrail-
ifmap active
== Contrail Web UI ==
supervisor-webui: active
contrail-webui active
contrail-
== Contrail Database ==
contrail-database: active
== Contrail Supervisor Database ==
supervisor-
contrail-
kafka active
== Contrail Support Services ==
supervisor-
rabbitmq-server active
No failure seen in controller3
root@NTTC-
== Contrail Control ==
supervisor-control: active
contrail-control active
contrail-
contrail-dns active
contrail-named active
== Contrail Analytics ==
supervisor-
contrail-
contrail-
contrail-
contrail-collector active
contrail-
contrail-
contrail-topology active
== Contrail Config ==
supervisor-config: active
contrail-api:0 active
contrail-
contrail-
contrail-
contrail-schema active
contrail-
ifmap active
== Contrail Web UI ==
supervisor-webui: active
contrail-webui active
contrail-
== Contrail Database ==
contrail-database: active
== Contrail Supervisor Database ==
supervisor-
contrail-
kafka active
== Contrail Support Services ==
supervisor-
rabbitmq-server active
========Run time service failures=
/var/crashes/
/var/crashes/
root@NTTC-
== Contrail Control ==
supervisor-control: active
contrail-control active
contrail-
contrail-dns active
contrail-named active
== Contrail Analytics ==
supervisor-
contrail-
contrail-
contrail-
contrail-collector active
contrail-
contrail-
contrail-topology active
== Contrail Config ==
supervisor-config: active
contrail-api:0 active
contrail-
contrail-
contrail-
contrail-schema active
contrail-
ifmap active
== Contrail Web UI ==
supervisor-webui: active
contrail-webui active
contrail-
== Contrail Database ==
contrail-database: active
== Contrail Supervisor Database ==
supervisor-
contrail-
kafka active
== Contrail Support Services ==
supervisor-
rabbitmq-server active
Logs will be root@NTTC-
== Contrail Control ==
supervisor-control: active
contrail-control active
contrail-
contrail-dns active
contrail-named active
== Contrail Analytics ==
supervisor-
contrail-
contrail-
contrail-
contrail-collector active
contrail-
contrail-
contrail-topology active
== Contrail Config ==
supervisor-config: active
contrail-api:0 active
contrail-
contrail-
contrail-
contrail-schema active
contrail-
ifmap active
== Contrail Web UI ==
supervisor-webui: active
contrail-webui active
contrail-
== Contrail Database ==
contrail-database: active
== Contrail Supervisor Database ==
supervisor-
contrail-
kafka active
== Contrail Support Services ==
supervisor-
rabbitmq-server active
========Run time service failures=
/var/crashes/
/var/crashes/
root@NTTC-
root@NTTC-
total 1206696
-rw------- 1 contrail contrail 1993863168 Jan 6 07:42 core.contrail-
-rw------- 1 contrail contrail 69795840 Jan 6 12:16 core.contrail-
The logs will be copied to
/volume/
Changed in opencontrail: | |
assignee: | Arun Paul (ampul) → nobody |
The logs are copied to
@sp-ulnx2: /volume/ dcg-systest/ PRS/PR1656115> ls /volume/ dcg-systest/ PRS/PR1656115>
controller1 controller2 controller3
@sp-ulnx2:
The controller ip address is https:/ /10.94. 63.102: 8143
I started deleting the VNs on Jan 10 17:07 .The crash happened on Jan 10 in the night so I have copied the logs of Jan 10 20:06 from /var/log/contrail folder .
If you need any other logs please let me know .The Controller is also in the state where it shows process failure on 2 CN's