flaky test: ovn-controller - check sbdb connection

Bug #2057784 reported by Frode Nordahl
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
ovn (Ubuntu)
New
Undecided
Unassigned

Bug Description

As per a recent debian/sid buildlog [0] for the ovn package, the "ovn-controller - check sbdb connection" appears to be flaky.

0: https://buildd.debian.org/status/fetch.php?pkg=ovn&arch=amd64&ver=24.03.1-1&stamp=1710275203&raw=0

Log excerpt:
# -*- compilation -*-
760. ovn-controller.at:342: testing ovn-controller - check sbdb connection -- parallelization=yes -- ovn_monitor_all=yes ...
creating ovn-sb database
ovsdb-server -vjsonrpc --remote=punix:/<<PKGBUILDDIR>>/tests/testsuite.dir/760/ovn-sb/ovn-sb.sock --remote=db:OVN_Southbound,SB_Global,connections --private-key=/<<PKGBUILDDIR>>/tests/testpki-test-privkey.pem --certificate=/<<PKGBUILDDIR>>/tests/testpki-test-cert.pem --ca-cert=/<<PKGBUILDDIR>>/tests/testpki-cacert.pem /<<PKGBUILDDIR>>/tests/testsuite.dir/760/ovn-sb/ovn-sb.db -vconsole:off --detach --no-chdir --pidfile --log-file
creating ovn-nb database
ovsdb-server -vjsonrpc --remote=punix:/<<PKGBUILDDIR>>/tests/testsuite.dir/760/ovn-nb/ovn-nb.sock /<<PKGBUILDDIR>>/tests/testsuite.dir/760/ovn-nb/ovn-nb.db -vconsole:off --detach --no-chdir --pidfile --log-file
starting northd
ovn-northd --n-threads=4 -vjsonrpc --ovnnb-db=unix:/<<PKGBUILDDIR>>/tests/testsuite.dir/760/ovn-nb/ovn-nb.sock --ovnsb-db=unix:/<<PKGBUILDDIR>>/tests/testsuite.dir/760/ovn-sb/ovn-sb.sock -vconsole:off --detach --no-chdir --pidfile --log-file
2024-03-12T20:26:33Z|00001|ovn_northd|INFO|Using 4 threads
5c883d3f-56fd-4781-a117-8665a3e4a0d8
ovn-macros.at:256: waiting until TCP_PORT=`sed -n 's/.*0:.*: listening on port \([0-9]*\)$/\1/p' "$d/ovn-sb/ovsdb-server.log"` && test X != X"$TCP_PORT"...
ovn-macros.at:256: wait succeeded immediately
adding simulator 'main'
ovsdb-server --remote=punix:/<<PKGBUILDDIR>>/tests/testsuite.dir/760/main/db.sock -vconsole:off --detach --no-chdir --pidfile --log-file
ovs-vswitchd --enable-dummy=system -vvconn -vofproto_dpif -vunixctl -vconsole:off --detach --no-chdir --pidfile --log-file
adding simulator 'hv'
ovsdb-server --remote=punix:/<<PKGBUILDDIR>>/tests/testsuite.dir/760/hv/db.sock -vconsole:off --detach --no-chdir --pidfile --log-file
ovs-vswitchd --enable-dummy=system -vvconn -vofproto_dpif -vunixctl -vconsole:off --detach --no-chdir --pidfile --log-file
ovn-controller --enable-dummy-vif-plug -vconsole:off --detach --no-chdir --pidfile --log-file
ovs-vsctl: no key "ovn-encap-type-hv" in Open_vSwitch record "." column external_ids
ovs-vsctl: no key "ovn-encap-ip-hv" in Open_vSwitch record "." column external_ids

Waiting until ip in sb Encap with chassis_name=hv type=geneve is 192.168.0.1...
ovn-macros.at:584: waiting until
      found=$(ovn-${db}ctl --bare --columns $column find $table $a $b $c $d $e)
      found=$(for d in $found; do echo $d; done | sort)
      test "$expected" = "$found"
    ...
ovn-macros.at:584: wait succeeded immediately

Waiting until ip in sb Encap with chassis_name=hv type=vxlan is 192.168.0.1...
ovn-macros.at:584: waiting until
      found=$(ovn-${db}ctl --bare --columns $column find $table $a $b $c $d $e)
      found=$(for d in $found; do echo $d; done | sort)
      test "$expected" = "$found"
    ...
ovn-macros.at:584: wait succeeded immediately
ovn-controller.at:342: waiting until check_sbdb_connection connected...
ovn-controller.at:342: wait succeeded immediately
ovn-controller.at:342: waiting until check_sbdb_connection 'not connected'...
ovn-controller.at:342: wait succeeded immediately
hv: clean up sandbox
./ovn-controller.at:342: test -e $OVS_RUNDIR/ovn-controller.pid
./ovn-controller.at:342: ovs-appctl --timeout=10 -t ovn-controller exit
ovn-controller.at:342: waiting while kill -0 $TMPPID 2>/dev/null...
ovn-controller.at:342: wait succeeded quickly

hv: clean up vswitch
./ovn-controller.at:342: test -e $OVS_RUNDIR/ovs-vswitchd.pid
./ovn-controller.at:342: ovs-appctl --timeout=10 -t ovs-vswitchd exit --cleanup
ovn-controller.at:342: waiting while kill -0 $TMPPID 2>/dev/null...
ovn-controller.at:342: wait succeeded quickly
./ovn-controller.at:342: test -e $OVS_RUNDIR/ovsdb-server.pid
./ovn-controller.at:342: ovs-appctl --timeout=10 -t ovsdb-server exit
ovn-controller.at:342: waiting while kill -0 $TMPPID 2>/dev/null...
ovn-controller.at:342: wait succeeded immediately
./ovn-controller.at:342: check_logs "
        $error
        /connection failed (No such file or directory)/d
        /has no network name*/d
        /receive tunnel port not found*/d
        /Failed to locate tunnel to reach main chassis/d
    " $sbox
--- /dev/null 2024-03-10 22:14:04.000000000 +0000
+++ /<<PKGBUILDDIR>>/tests/testsuite.dir/at-groups/760/stdout 2024-03-12 20:26:34.223918134 +0000
@@ -0,0 +1 @@
+2024-03-12T20:26:33.909Z|00022|stream_tcp|ERR|tcp:192.168.0.10:6642: connect: Network is unreachable
760. ovn-controller.at:342: 760. ovn-controller - check sbdb connection -- parallelization=yes -- ovn_monitor_all=yes (ovn-controller.at:342): FAILED (ovn-controller.at:342)

# -*- compilation -*-
761. ovn-controller.at:342: testing ovn-controller - check sbdb connection -- parallelization=yes -- ovn_monitor_all=no ...
creating ovn-sb database
ovsdb-server -vjsonrpc --remote=punix:/<<PKGBUILDDIR>>/tests/testsuite.dir/761/ovn-sb/ovn-sb.sock --remote=db:OVN_Southbound,SB_Global,connections --private-key=/<<PKGBUILDDIR>>/tests/testpki-test-privkey.pem --certificate=/<<PKGBUILDDIR>>/tests/testpki-test-cert.pem --ca-cert=/<<PKGBUILDDIR>>/tests/testpki-cacert.pem /<<PKGBUILDDIR>>/tests/testsuite.dir/761/ovn-sb/ovn-sb.db -vconsole:off --detach --no-chdir --pidfile --log-file
creating ovn-nb database
ovsdb-server -vjsonrpc --remote=punix:/<<PKGBUILDDIR>>/tests/testsuite.dir/761/ovn-nb/ovn-nb.sock /<<PKGBUILDDIR>>/tests/testsuite.dir/761/ovn-nb/ovn-nb.db -vconsole:off --detach --no-chdir --pidfile --log-file
starting northd
ovn-northd --n-threads=4 -vjsonrpc --ovnnb-db=unix:/<<PKGBUILDDIR>>/tests/testsuite.dir/761/ovn-nb/ovn-nb.sock --ovnsb-db=unix:/<<PKGBUILDDIR>>/tests/testsuite.dir/761/ovn-sb/ovn-sb.sock -vconsole:off --detach --no-chdir --pidfile --log-file
2024-03-12T20:26:34Z|00001|ovn_northd|INFO|Using 4 threads
33b7af4d-31d5-4dcb-984f-e101499e33c6
ovn-macros.at:256: waiting until TCP_PORT=`sed -n 's/.*0:.*: listening on port \([0-9]*\)$/\1/p' "$d/ovn-sb/ovsdb-server.log"` && test X != X"$TCP_PORT"...
ovn-macros.at:256: wait succeeded immediately
adding simulator 'main'
ovsdb-server --remote=punix:/<<PKGBUILDDIR>>/tests/testsuite.dir/761/main/db.sock -vconsole:off --detach --no-chdir --pidfile --log-file
ovs-vswitchd --enable-dummy=system -vvconn -vofproto_dpif -vunixctl -vconsole:off --detach --no-chdir --pidfile --log-file
adding simulator 'hv'
ovsdb-server --remote=punix:/<<PKGBUILDDIR>>/tests/testsuite.dir/761/hv/db.sock -vconsole:off --detach --no-chdir --pidfile --log-file
ovs-vswitchd --enable-dummy=system -vvconn -vofproto_dpif -vunixctl -vconsole:off --detach --no-chdir --pidfile --log-file
ovn-controller --enable-dummy-vif-plug -vconsole:off --detach --no-chdir --pidfile --log-file
ovs-vsctl: no key "ovn-encap-type-hv" in Open_vSwitch record "." column external_ids
ovs-vsctl: no key "ovn-encap-ip-hv" in Open_vSwitch record "." column external_ids

Waiting until ip in sb Encap with chassis_name=hv type=geneve is 192.168.0.1...
ovn-macros.at:584: waiting until
      found=$(ovn-${db}ctl --bare --columns $column find $table $a $b $c $d $e)
      found=$(for d in $found; do echo $d; done | sort)
      test "$expected" = "$found"
    ...
ovn-macros.at:584: wait succeeded immediately

Waiting until ip in sb Encap with chassis_name=hv type=vxlan is 192.168.0.1...
ovn-macros.at:584: waiting until
      found=$(ovn-${db}ctl --bare --columns $column find $table $a $b $c $d $e)
      found=$(for d in $found; do echo $d; done | sort)
      test "$expected" = "$found"
    ...
ovn-macros.at:584: wait succeeded immediately
ovn-controller.at:342: waiting until check_sbdb_connection connected...
ovn-controller.at:342: wait succeeded immediately
ovn-controller.at:342: waiting until check_sbdb_connection 'not connected'...
ovn-controller.at:342: wait succeeded immediately
hv: clean up sandbox
./ovn-controller.at:342: test -e $OVS_RUNDIR/ovn-controller.pid
./ovn-controller.at:342: ovs-appctl --timeout=10 -t ovn-controller exit
ovn-controller.at:342: waiting while kill -0 $TMPPID 2>/dev/null...
ovn-controller.at:342: wait succeeded quickly

hv: clean up vswitch
./ovn-controller.at:342: test -e $OVS_RUNDIR/ovs-vswitchd.pid
./ovn-controller.at:342: ovs-appctl --timeout=10 -t ovs-vswitchd exit --cleanup
ovn-controller.at:342: waiting while kill -0 $TMPPID 2>/dev/null...
ovn-controller.at:342: wait succeeded quickly
./ovn-controller.at:342: test -e $OVS_RUNDIR/ovsdb-server.pid
./ovn-controller.at:342: ovs-appctl --timeout=10 -t ovsdb-server exit
ovn-controller.at:342: waiting while kill -0 $TMPPID 2>/dev/null...
ovn-controller.at:342: wait succeeded quickly
./ovn-controller.at:342: check_logs "
        $error
        /connection failed (No such file or directory)/d
        /has no network name*/d
        /receive tunnel port not found*/d
        /Failed to locate tunnel to reach main chassis/d
    " $sbox
--- /dev/null 2024-03-10 22:14:04.000000000 +0000
+++ /<<PKGBUILDDIR>>/tests/testsuite.dir/at-groups/761/stdout 2024-03-12 20:26:35.491913980 +0000
@@ -0,0 +1 @@
+2024-03-12T20:26:35.074Z|00022|stream_tcp|ERR|tcp:192.168.0.10:6642: connect: Network is unreachable
761. ovn-controller.at:342: 761. ovn-controller - check sbdb connection -- parallelization=yes -- ovn_monitor_all=no (ovn-controller.at:342): FAILED (ovn-controller.at:342)

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.