VRouter kernel Oops
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
OpenContrail | Fix Committed | Undecided | Anand H. Krishnan |
Bug Description
During source NAT development, I got some vrouter kernel traces.
To confirm, I ran a simple test:
$ cd /opt/stack/
$ while true; do echo -n "creating... "; python contrail_
And 12 hours later I got that kernel trace:
44321 Jul 26 23:17:32 localhost kernel: [118557.448900] RTNL: assertion failed at /build/
44322 Jul 26 23:17:32 localhost kernel: [118557.450256] Pid: 13339, comm: contrail-vroute Tainted: G O 3.2.0-64-virtual #97-Ubuntu
44323 Jul 26 23:17:32 localhost kernel: [118557.450259] Call Trace:
44324 Jul 26 23:17:32 localhost kernel: [118557.450272] [<ffffffff8153f
44325 Jul 26 23:17:32 localhost kernel: [118557.450313] [<ffffffffa0171
44326 Jul 26 23:17:32 localhost kernel: [118557.450320] [<ffffffffa017a
44327 Jul 26 23:17:32 localhost kernel: [118557.450331] [<ffffffffa017a
44328 Jul 26 23:17:32 localhost kernel: [118557.450340] [<ffffffffa017b
44329 Jul 26 23:17:32 localhost kernel: [118557.450345] [<ffffffffa016c
44330 Jul 26 23:17:32 localhost kernel: [118557.450352] [<ffffffffa016d
44331 Jul 26 23:17:32 localhost kernel: [118557.450358] [<ffffffffa016c
44332 Jul 26 23:17:32 localhost kernel: [118557.450364] [<ffffffffa016d
44333 Jul 26 23:17:32 localhost kernel: [118557.450369] [<ffffffffa016c
44334 Jul 26 23:17:32 localhost kernel: [118557.450374] [<ffffffffa016c
44335 Jul 26 23:17:32 localhost kernel: [118557.450379] [<ffffffffa016c
44336 Jul 26 23:17:32 localhost kernel: [118557.450384] [<ffffffffa016d
44337 Jul 26 23:17:32 localhost kernel: [118557.450389] [<ffffffffa016c
44338 Jul 26 23:17:32 localhost kernel: [118557.450395] [<ffffffffa016d
44339 Jul 26 23:17:32 localhost kernel: [118557.450400] [<ffffffffa016d
44340 Jul 26 23:17:32 localhost kernel: [118557.450405] [<ffffffffa016c
44341 Jul 26 23:17:32 localhost kernel: [118557.450410] [<ffffffffa016d
44342 Jul 26 23:17:32 localhost kernel: [118557.450415] [<ffffffffa016c
44343 Jul 26 23:17:32 localhost kernel: [118557.450420] [<ffffffffa016d
44344 Jul 26 23:17:32 localhost kernel: [118557.450425] [<ffffffffa016c
44345 Jul 26 23:17:32 localhost kernel: [118557.450430] [<ffffffffa016d
44346 Jul 26 23:17:32 localhost kernel: [118557.450435] [<ffffffffa016d
44347 Jul 26 23:17:32 localhost kernel: [118557.450440] [<ffffffffa016d
44348 Jul 26 23:17:32 localhost kernel: [118557.450445] [<ffffffffa016d
44349 Jul 26 23:17:32 localhost kernel: [118557.450450] [<ffffffffa016d
44350 Jul 26 23:17:32 localhost kernel: [118557.450455] [<ffffffffa016d
44351 Jul 26 23:17:32 localhost kernel: [118557.450460] [<ffffffffa016d
44352 Jul 26 23:17:32 localhost kernel: [118557.450465] [<ffffffffa016d
44353 Jul 26 23:17:32 localhost kernel: [118557.450469] [<ffffffffa016d
44354 Jul 26 23:17:32 localhost kernel: [118557.450474] [<ffffffffa016c
44355 Jul 26 23:17:32 localhost kernel: [118557.450479] [<ffffffffa016d
44356 Jul 26 23:17:32 localhost kernel: [118557.450484] [<ffffffffa016d
44357 Jul 26 23:17:32 localhost kernel: [118557.450489] [<ffffffffa016d
44358 Jul 26 23:17:32 localhost kernel: [118557.450494] [<ffffffffa016d
44359 Jul 26 23:17:32 localhost kernel: [118557.450500] [<ffffffffa016d
44360 Jul 26 23:17:32 localhost kernel: [118557.450504] [<ffffffffa016d
44361 Jul 26 23:17:32 localhost kernel: [118557.450510] [<ffffffffa016c
44362 Jul 26 23:17:32 localhost kernel: [118557.450515] [<ffffffffa016d
44363 Jul 26 23:17:32 localhost kernel: [118557.450520] [<ffffffffa016d
44364 Jul 26 23:17:32 localhost kernel: [118557.450525] [<ffffffffa016d
44365 Jul 26 23:17:32 localhost kernel: [118557.450530] [<ffffffffa016d
44366 Jul 26 23:17:32 localhost kernel: [118557.450535] [<ffffffffa016d
44367 Jul 26 23:17:32 localhost kernel: [118557.450540] [<ffffffffa016d
44368 Jul 26 23:17:32 localhost kernel: [118557.450545] [<ffffffffa016d
44369 Jul 26 23:17:32 localhost kernel: [118557.450550] [<ffffffffa016d
44370 Jul 26 23:17:32 localhost kernel: [118557.450554] [<ffffffffa016d
44371 Jul 26 23:17:32 localhost kernel: [118557.450559] [<ffffffffa016d
44372 Jul 26 23:17:32 localhost kernel: [118557.450564] [<ffffffffa016d
44373 Jul 26 23:17:32 localhost kernel: [118557.450569] [<ffffffffa016d
44374 Jul 26 23:17:32 localhost kernel: [118557.450574] [<ffffffffa016d
44375 Jul 26 23:17:32 localhost kernel: [118557.450579] [<ffffffffa016d
44376 Jul 26 23:17:32 localhost kernel: [118557.450584] [<ffffffffa016d
44377 Jul 26 23:17:32 localhost kernel: [118557.450589] [<ffffffffa016d
44378 Jul 26 23:17:32 localhost kernel: [118557.450593] [<ffffffffa016d
44379 Jul 26 23:17:32 localhost kernel: [118557.450598] [<ffffffffa016d
44380 Jul 26 23:17:32 localhost kernel: [118557.450603] [<ffffffffa016d
44381 Jul 26 23:17:32 localhost kernel: [118557.450608] [<ffffffffa016d
44382 Jul 26 23:17:32 localhost kernel: [118557.450613] [<ffffffffa016d
44383 Jul 26 23:17:32 localhost kernel: [118557.450618] [<ffffffffa016d
44384 Jul 26 23:17:32 localhost kernel: [118557.450622] [<ffffffffa016d
44385 Jul 26 23:17:32 localhost kernel: [118557.450628] [<ffffffffa016e
44386 Jul 26 23:17:32 localhost kernel: [118557.450634] [<ffffffffa016e
44387 Jul 26 23:17:32 localhost kernel: [118557.450639] [<ffffffffa016e
44388 Jul 26 23:17:32 localhost kernel: [118557.450644] [<ffffffffa016e
44389 Jul 26 23:17:32 localhost kernel: [118557.450649] [<ffffffffa016e
44390 Jul 26 23:17:32 localhost kernel: [118557.450654] [<ffffffffa016e
44391 Jul 26 23:17:32 localhost kernel: [118557.450659] [<ffffffffa016e
44392 Jul 26 23:17:32 localhost kernel: [118557.450664] [<ffffffffa016e
44393 Jul 26 23:17:32 localhost kernel: [118557.450669] [<ffffffffa016c
44394 Jul 26 23:17:32 localhost kernel: [118557.450674] [<ffffffffa016c
44395 Jul 26 23:17:32 localhost kernel: [118557.450681] [<ffffffffa0174
44396 Jul 26 23:17:32 localhost kernel: [118557.450688] [<ffffffffa0174
44397 Jul 26 23:17:32 localhost kernel: [118557.450694] [<ffffffffa0173
44398 Jul 26 23:17:32 localhost kernel: [118557.450699] [<ffffffff8156e
44399 Jul 26 23:17:32 localhost kernel: [118557.450702] [<ffffffff8156e
44400 Jul 26 23:17:32 localhost kernel: [118557.450705] [<ffffffff8156e
44401 Jul 26 23:17:32 localhost kernel: [118557.450708] [<ffffffff8156e
44402 Jul 26 23:17:32 localhost kernel: [118557.450710] [<ffffffff8156e
44403 Jul 26 23:17:32 localhost kernel: [118557.450713] [<ffffffff8156e
44404 Jul 26 23:17:32 localhost kernel: [118557.450716] [<ffffffff8156e
44405 Jul 26 23:17:32 localhost kernel: [118557.450719] [<ffffffff81536
44406 Jul 26 23:17:32 localhost kernel: [118557.450722] [<ffffffff8156e
44407 Jul 26 23:17:32 localhost kernel: [118557.450726] [<ffffffff8152c
44408 Jul 26 23:17:32 localhost kernel: [118557.450730] [<ffffffff81056
44409 Jul 26 23:17:32 localhost kernel: [118557.450733] [<ffffffff81056
44410 Jul 26 23:17:32 localhost kernel: [118557.450737] [<ffffffff8152e
44411 Jul 26 23:17:32 localhost kernel: [118557.450740] [<ffffffff8153a
44412 Jul 26 23:17:32 localhost kernel: [118557.450743] [<ffffffff8152e
44413 Jul 26 23:17:32 localhost kernel: [118557.450752] [<ffffffff81060
44414 Jul 26 23:17:32 localhost kernel: [118557.450755] [<ffffffff81060
44415 Jul 26 23:17:32 localhost kernel: [118557.450764] [<ffffffff8109e
44416 Jul 26 23:17:32 localhost kernel: [118557.450767] [<ffffffff8109f
44417 Jul 26 23:17:32 localhost kernel: [118557.450771] [<ffffffff8152f
44418 Jul 26 23:17:32 localhost kernel: [118557.450774] [<ffffffff8152f
44419 Jul 26 23:17:32 localhost kernel: [118557.450780] [<ffffffff81665
I got that trace twice (the second time 12 hours after the first), but the test continued to run for 2 more days without any errors.
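To check how often the assertion recurs over a multi-day run, the kernel log can be scanned for the message seen in the trace. The following is a minimal sketch, not part of the original report: the function name `rtnl_assert_uptimes` is hypothetical, and the log file path is passed on the command line because where kernel messages are stored depends on the host.

```python
import re
import sys

# Matches the assertion message as it appears in the trace above and
# captures the kernel uptime printed in square brackets.
RTNL_ASSERT = re.compile(r"kernel: \[\s*(\d+\.\d+)\] RTNL: assertion failed")


def rtnl_assert_uptimes(log_lines):
    """Return the kernel uptime (in seconds) of every RTNL assertion line."""
    uptimes = []
    for line in log_lines:
        match = RTNL_ASSERT.search(line)
        if match:
            uptimes.append(float(match.group(1)))
    return uptimes


if __name__ == "__main__" and len(sys.argv) > 1:
    # Assumption: the first argument is the path to a syslog-style file.
    with open(sys.argv[1], errors="replace") as log:
        for uptime in rtnl_assert_uptimes(log):
            print(f"RTNL assertion at uptime {uptime:.0f}s ({uptime / 3600:.1f}h)")
```

Comparing the reported uptimes would show whether the two occurrences are really about 12 hours apart.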
I also noticed another problem: 3 days after I started the test, each loop iteration takes about 30 seconds longer to run (5-6 seconds at the beginning, 35 seconds 3 days later).
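The slowdown could be quantified by recording the wall-clock duration of each iteration instead of estimating it by eye. This is a rough sketch under assumptions: the helper name `time_iterations` is hypothetical, and the command to time is whatever the test loop runs, supplied by the caller rather than reproduced here.

```python
import subprocess
import sys
import time


def time_iterations(command, iterations):
    """Run `command` repeatedly and return the duration in seconds of each run."""
    durations = []
    for _ in range(iterations):
        start = time.monotonic()
        # check=False: a failing iteration is still timed rather than aborting.
        subprocess.run(command, check=False)
        durations.append(time.monotonic() - start)
    return durations


if __name__ == "__main__" and len(sys.argv) > 1:
    # Assumption: the command to time is given as the script's arguments.
    for index, seconds in enumerate(time_iterations(sys.argv[1:], 10), start=1):
        print(f"iteration {index}: {seconds:.1f}s")
```

Logging these durations over the whole run would show whether the time per iteration grows steadily or jumps at a particular point, such as around the kernel trace.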
description: updated
Changed in opencontrail:
assignee: nobody → Divakar Dharanalakota (ddivakar)
I also needed a quick and dirty patch to be able to run the script "contrail_veth_port.py".
I have attached it.