vrouter-agent and tor-agent crash at SendProuterMsgFromPhyInterface on contrail-api restart in a scale setup
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Juniper Openstack |
Fix Committed
|
High
|
Vedamurthy Joshi | ||
R2.1 |
Fix Committed
|
High
|
Vedamurthy Joshi |
Bug Description
R2.10 39 Ubuntu 14.04 Multi-node setup
nodei38 is a node with 128 tor-agents (110 active TORs) and 11K VMIs (1K active endpoints)
There are two contrail-api nodes (nodei34 and nodei35)
On restarting contrail-api on both these nodes one after the other, vrouter-agent and tor-agents crashed
Cores will be in http://
root@nodei38:
total 1107500
-rw------- 1 root root 162619392 Feb 27 10:17 core.contrail-
-rw------- 1 root root 153624576 Feb 27 10:18 core.contrail-
-rw------- 1 root root 158982144 Feb 27 10:25 core.contrail-
-rw------- 1 root root 153276416 Feb 27 10:48 core.contrail-
-rw------- 1 root root 2637926 Feb 27 11:32 core.contrail-
-rw------- 1 root root 1868295 Feb 27 11:32 core.contrail-
-rw------- 1 root root 1874133 Feb 27 11:33 core.contrail-
-rw------- 1 root root 1855571 Feb 27 11:35 core.contrail-
-rw------- 1 root root 174551040 Mar 6 13:04 core.contrail-
-rw------- 1 root root 171933696 Mar 6 13:04 core.contrail-
-rw------- 1 root root 174628864 Mar 6 13:04 core.contrail-
-rw------- 1 root root 175931392 Mar 6 13:04 core.contrail-
-rw------- 1 root root 172654592 Mar 6 13:04 core.contrail-
-rw------- 1 root root 184504320 Mar 6 13:04 core.contrail-
-rw------- 1 root root 791908352 Mar 6 13:04 core.contrail-
-rw------- 1 root root 252092416 Mar 6 13:08 core.contrail-
root@nodei38:
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_
Core was generated by `/usr/bin/
Program terminated with signal SIGSEGV, Segmentation fault.
#0 0x0000000000c03e2c in ProuterUveTable
(gdb) bt
#0 0x0000000000c03e2c in ProuterUveTable
#1 0x0000000000c03ee1 in ProuterUveTable
#2 0x0000000000c04491 in ProuterUveTable
#3 0x0000000000deb412 in DBTableBase:
#4 0x0000000000ded268 in DBTablePartBase
#5 0x0000000000dea0cd in DBPartition:
#6 0x0000000000ed4b90 in TaskImpl::execute() ()
#7 0x00007ff254b1fb3a in ?? () from /usr/lib/
#8 0x00007ff254b1b816 in ?? () from /usr/lib/
#9 0x00007ff254b1af4b in ?? () from /usr/lib/
#10 0x00007ff254b170ff in ?? () from /usr/lib/
#11 0x00007ff254b172f9 in ?? () from /usr/lib/
#12 0x00007ff254d3b182 in start_thread () from /lib/x86_
#13 0x00007ff254013fbd in clone () from /lib/x86_
(gdb) quit
Changed in juniperopenstack: | |
assignee: | Hari Prasad Killi (haripk) → Ashok Singh (ashoksr) |
tags: | added: bms |
[Thread debugging using libthread_db enabled] 64-linux- gnu/libthread_ db.so.1" . contrail- vrouter- agent'. ::EnqueueProute rMsg(PhysicalDe vice const*) () ::EnqueueProute rMsg(PhysicalDe vice const*) () ::AddLogicalInt erface( Interface const*, LogicalInterface const*) () ::InterfaceNoti fy(DBTablePartB ase*, DBEntryBase*) () :RunNotify( DBTablePartBase *, DBEntryBase*) () ::RunNotify( ) () :QueueRunner: :Run() () libtbb. so.2 libtbb. so.2 libtbb. so.2 libtbb. so.2 libtbb. so.2 64-linux- gnu/libpthread. so.0 64-linux- gnu/libc. so.6
Using host libthread_db library "/lib/x86_
Core was generated by `/usr/bin/
Program terminated with signal SIGSEGV, Segmentation fault.
#0 0x0000000000c03c7b in ProuterUveTable
(gdb) bt
#0 0x0000000000c03c7b in ProuterUveTable
#1 0x0000000000c041c3 in ProuterUveTable
#2 0x0000000000c044a8 in ProuterUveTable
#3 0x0000000000deb412 in DBTableBase:
#4 0x0000000000ded268 in DBTablePartBase
#5 0x0000000000dea0cd in DBPartition:
#6 0x0000000000ed4b90 in TaskImpl::execute() ()
#7 0x00007fe8a816db3a in ?? () from /usr/lib/
#8 0x00007fe8a8169816 in ?? () from /usr/lib/
#9 0x00007fe8a8168f4b in ?? () from /usr/lib/
#10 0x00007fe8a81650ff in ?? () from /usr/lib/
#11 0x00007fe8a81652f9 in ?? () from /usr/lib/
#12 0x00007fe8a8389182 in start_thread () from /lib/x86_
#13 0x00007fe8a7661fbd in clone () from /lib/x86_
(gdb)