OVN services does not recovered on OVN master node after killing ovsdb-server services[nb/sb] or ovn-northd service

Bug #1731934 reported by Eran Kuris
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
tripleo
Expired
Undecided
Unassigned

Bug Description

Description of problem:
Auto recover does not work on OVN master node after killing ovnDB services or north-d service.
When running kill -9 of ovsdb-server [ovnnb_db.pid/ovnsb_db.pid] or ovn-northd
The expected behavior is one of the slaves nodes will take the OVN master role.
Pacemaker should detect that the services are down and move the role to the slave node.
After debugging with dev its looks like that recovering script on the master is not called.

Version-Release number of selected component (if applicable):
Pike

How reproducible:
100%

Steps to Reproduce:
1.deploy HA setup with OVN
2.Kill -9 ovn-northd service / ovsdb-server on Master node
3.verify that one of the slave node change the status to be "Master"

Revision history for this message
Eran Kuris (ekuris) wrote :
Revision history for this message
Lucas Alvares Gomes (lucasagomes) wrote :

The way these services are deployed is not part of the ML2 driver scope. This bug needs to be open against the deployment tool (e.g TripleO).

Changed in networking-ovn:
status: New → Invalid
Revision history for this message
Eran Kuris (ekuris) wrote :

so I am changing the project to tripleo and set it as "new"

affects: networking-ovn → tripleo
Changed in tripleo:
status: Invalid → New
Revision history for this message
Cédric Jeanneret (cjeanner) wrote :

Is it still alive?

Revision history for this message
Eran Kuris (ekuris) wrote :

no, I don't have a live setup with the issue

Changed in tripleo:
status: New → Invalid
status: Invalid → Incomplete
Revision history for this message
Launchpad Janitor (janitor) wrote :

[Expired for tripleo because there has been no activity for 60 days.]

Changed in tripleo:
status: Incomplete → Expired
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.