ovsdb post cluster failure/network partition recovery
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
charm-ovn-central | New | Undecided | Unassigned |
Bug Description
1) Deployed 3 ovn-central units, 2 of them on the same node, together with vault, which was left sealed
2) Rebooted the node hosting 2 ovn-central units
3) Once that node came back, rebooted the node hosting the remaining ovn-central unit
4) Unsealed vault with auto-generation of certs
5) The model settled in a split-brain state: ovn-central/2 and ovn-central/3 both report themselves as leader of ovnnb_db and ovnsb_db (see juju status below)
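The placement in step 1 matters because clustered OVSDB uses Raft, which needs a strict majority of servers to stay available. The following is a minimal sketch of that arithmetic, not part of the charm; the node names `node-a` and `node-b` are hypothetical stand-ins for the two nodes in the steps above.

```python
def quorum(cluster_size: int) -> int:
    """Smallest number of servers that forms a strict majority."""
    return cluster_size // 2 + 1


def has_quorum(cluster_size: int, servers_up: int) -> bool:
    """True if the servers still running can form a majority."""
    return servers_up >= quorum(cluster_size)


# Placement from step 1: two units on one node, one unit on the other.
# The node names are hypothetical.
units_per_node = {"node-a": 2, "node-b": 1}
total = sum(units_per_node.values())

for node, count in units_per_node.items():
    remaining = total - count
    print(f"{node} down: {remaining} of {total} units up, "
          f"quorum kept: {has_quorum(total, remaining)}")
```

With this layout, rebooting the node that hosts 2 units leaves 1 of 3 servers running, which is below the majority of 2, so the cluster loses quorum during step 2.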
https:/
This doc describes the acceptable failure conditions https:/
However, the charm is lacking actions/
$ juju status
Model Controller Cloud/Region Version SLA Timestamp
default maaslab-default maaslab/default 2.9.31 unsupported 16:20:44+03:00
App Version Status Scale Charm Channel Rev Exposed Message
mysql-innodb-
ovn-central 22.03.0 active 3 ovn-central 22.03/stable 31 no Unit is ready (northd: active)
vault 1.7.9 active 1 vault 1.7/stable 68 no Unit is ready (active: true, mlock: enabled)
vault-mysql-router 8.0.29 active 1 mysql-router 8.0/stable 30 no Unit is ready
Unit Workload Agent Machine Public address Ports Message
mysql-innodb-
mysql-innodb-
mysql-innodb-
ovn-central/1 active idle 1/lxd/0 10.10.20.13 6641/tcp,6642/tcp Unit is ready (northd: active)
ovn-central/2* active idle 1/lxd/1 10.10.20.14 6641/tcp,6642/tcp Unit is ready (leader: ovnnb_db, ovnsb_db)
ovn-central/3 active idle 0/lxd/2 10.10.20.17 6641/tcp,6642/tcp Unit is ready (leader: ovnnb_db, ovnsb_db)
vault/0* active idle 2 10.10.20.9 8200/tcp Unit is ready (active: true, mlock: enabled)
vault-
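In the status output above, the split-brain shows up as two units whose message claims leadership of the same databases. A rough sketch of how that could be flagged automatically is shown here. It assumes the plain-text `juju status` unit lines and the `(leader: ...)` message format seen in this report; the function name `find_leader_conflicts` is hypothetical. The workload message is only what the charm reports, so ovsdb-server's own `cluster/status` output remains the better source of truth.

```python
import re
from collections import defaultdict

# Unit name at the start of the line, then the "(leader: ...)" part of the message.
UNIT_LINE = re.compile(r"^(?P<unit>[\w-]+/\d+)\*?\s.*\(leader: (?P<dbs>[^)]*)\)")


def find_leader_conflicts(status_text: str) -> dict[str, list[str]]:
    """Return {database: [units]} for each database claimed by more than one unit."""
    claims = defaultdict(list)
    for line in status_text.splitlines():
        match = UNIT_LINE.match(line.strip())
        if not match:
            continue
        # Pick out names such as ovnnb_db and ovnsb_db from the leader list.
        for db in re.findall(r"\w+_db", match.group("dbs")):
            claims[db].append(match.group("unit"))
    return {db: units for db, units in claims.items() if len(units) > 1}


# Unit lines copied from the status output in this report.
sample = """\
ovn-central/1 active idle 1/lxd/0 10.10.20.13 6641/tcp,6642/tcp Unit is ready (northd: active)
ovn-central/2* active idle 1/lxd/1 10.10.20.14 6641/tcp,6642/tcp Unit is ready (leader: ovnnb_db, ovnsb_db)
ovn-central/3 active idle 0/lxd/2 10.10.20.17 6641/tcp,6642/tcp Unit is ready (leader: ovnnb_db, ovnsb_db)
"""

conflicts = find_leader_conflicts(sample)
for db, units in conflicts.items():
    print(f"{db}: claimed by {', '.join(units)}")
```

For the lines above, both `ovnnb_db` and `ovnsb_db` are reported as claimed by ovn-central/2 and ovn-central/3. An empty result means no two units claim the same database in their status message.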