Comment 2 for bug 2016002

Revision history for this message
Edward Hope-Morley (hopem) wrote :

Ok so a node reboot did fix this. I assume that the following means it tried to connect to one of its peers and couldn't so timed out and the service exited:

2023-04-12T11:31:47.869Z|00686|reconnect|INFO|ssl:10.5.1.42:6644: connecting...
2023-04-12T11:31:47.869Z|00687|reconnect|INFO|ssl:10.5.1.42:6644: connection attempt failed (Connection refused)
2023-04-12T11:31:47.869Z|00688|reconnect|INFO|ssl:10.5.1.42:6644: waiting 2 seconds before reconnect
2023-04-12T11:31:49.869Z|00689|reconnect|INFO|ssl:10.5.1.42:6644: connecting...
2023-04-12T11:31:49.871Z|00690|reconnect|INFO|ssl:10.5.1.42:6644: connection attempt failed (Connection refused)
2023-04-12T11:31:49.871Z|00691|reconnect|INFO|ssl:10.5.1.42:6644: waiting 4 seconds before reconnect

systemd journal has:

Apr 12 11:31:46 juju-34a1ff-ovntest-10 ovsdb-server[45917]: ovs|00682|raft|INFO|received leadership transfer from 3f54 in term 1
Apr 12 11:31:46 juju-34a1ff-ovntest-10 ovsdb-server[45917]: ovs|00683|raft|INFO|term 2: starting election
Apr 12 11:31:46 juju-34a1ff-ovntest-10 ovsdb-server[45917]: ovs|00684|reconnect|INFO|ssl:10.5.1.42:6644: connection closed by peer
Apr 12 11:31:46 juju-34a1ff-ovntest-10 ovsdb-server[45917]: ovs|00685|raft|INFO|term 2: elected leader by 2+ of 3 servers
Apr 12 11:31:47 juju-34a1ff-ovntest-10 ovsdb-server[45917]: ovs|00686|reconnect|INFO|ssl:10.5.1.42:6644: connecting...
Apr 12 11:31:47 juju-34a1ff-ovntest-10 ovsdb-server[45917]: ovs|00687|reconnect|INFO|ssl:10.5.1.42:6644: connection attempt failed (Connection refused)
Apr 12 11:31:47 juju-34a1ff-ovntest-10 ovsdb-server[45917]: ovs|00688|reconnect|INFO|ssl:10.5.1.42:6644: waiting 2 seconds before reconnect
Apr 12 11:31:48 juju-34a1ff-ovntest-10 systemd[1]: ovn-ovsdb-server-sb.service: Current command vanished from the unit file, execution of the command list won't be resumed.
Apr 12 11:31:49 juju-34a1ff-ovntest-10 ovsdb-server[45917]: ovs|00689|reconnect|INFO|ssl:10.5.1.42:6644: connecting...
Apr 12 11:31:49 juju-34a1ff-ovntest-10 ovsdb-server[45917]: ovs|00690|reconnect|INFO|ssl:10.5.1.42:6644: connection attempt failed (Connection refused)
Apr 12 11:31:49 juju-34a1ff-ovntest-10 ovsdb-server[45917]: ovs|00691|reconnect|INFO|ssl:10.5.1.42:6644: waiting 4 seconds before reconnect
Apr 12 11:31:51 juju-34a1ff-ovntest-10 systemd[1]: ovn-ovsdb-server-sb.service: Found left-over process 45917 (ovsdb-server) in control group while starting unit. Ignoring.
Apr 12 11:31:51 juju-34a1ff-ovntest-10 systemd[1]: This usually indicates unclean termination of a previous run, or service implementation deficiencies.
Apr 12 11:31:51 juju-34a1ff-ovntest-10 systemd[1]: Started Open vSwitch database server for OVN Southbound database.
Apr 12 11:31:51 juju-34a1ff-ovntest-10 ovsdb-server[45917]: ovs|00001|fatal_signal(urcu1)|WARN|terminating with signal 15 (Terminated)
Apr 12 11:31:51 juju-34a1ff-ovntest-10 systemd[1]: ovn-ovsdb-server-sb.service: Succeeded.
Apr 12 11:55:24 juju-34a1ff-ovntest-10 systemd[1]: Started Open vSwitch database server for OVN Southbound database.