Detect network failures

Bug #2058806 reported by Aymen Frikha
This bug affects 1 person
Affects: OpenStack Masakari Charm
Status: New
Importance: Undecided
Assigned to: Unassigned

Bug Description

The current implementation of Masakari does not seem able to detect failures on two important networks:

- Overlay Geneve network: if the network interfaces are down, or a switch that connects a compute node to the overlay Geneve network is down, the VM is stuck and does not migrate to another compute node.
- Storage network: if the storage network interfaces are down, or a switch that connects a compute node to the storage network is down, the VM is stuck and does not migrate to another compute node.

summary: - detect network failure
+ Detect network failures
Aymen Frikha (aym-frikha) wrote :

+ ~field-high

Aymen Frikha (aym-frikha) wrote :

After reading some documentation, I found that if Masakari were based on Consul agents rather than on Pacemaker/Corosync, we would be able to detect those failures: https://docs.openstack.org/masakari-monitors/latest/hostmonitor.html
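
To illustrate the idea, here is a rough sketch of how per-network health could be mapped to a recovery decision. It is not the masakari-monitors implementation: the network names, the `ACTION_MATRIX` table, the `"recovery"` action label, and the `decide_actions` helper are all hypothetical, and the health values are assumed to come from some external check (for example a Consul agent on each network).

```python
from typing import Dict, List, Tuple

# Hypothetical network names, in a fixed order used as the lookup key.
NETWORKS: Tuple[str, ...] = ("management", "tenant", "storage")

# Hypothetical decision table: health per network (in NETWORKS order) -> actions.
ACTION_MATRIX: Dict[Tuple[str, ...], List[str]] = {
    ("up", "up", "up"): [],              # everything healthy, nothing to do
    ("down", "up", "up"): [],            # management only: VMs keep their data paths
    ("up", "down", "up"): ["recovery"],  # overlay (tenant) network lost
    ("up", "up", "down"): ["recovery"],  # storage network lost
}


def decide_actions(health: Dict[str, str]) -> List[str]:
    """Return the actions for a host given its per-network health.

    Assumption: a network with no reported health is treated as "down",
    and any combination missing from the table triggers recovery.
    """
    key = tuple(health.get(name, "down") for name in NETWORKS)
    return ACTION_MATRIX.get(key, ["recovery"])


if __name__ == "__main__":
    # Storage network unreachable while the other networks are fine.
    print(decide_actions({"management": "up", "tenant": "up", "storage": "down"}))
```

With a table like this, a host whose management network is still reachable would nonetheless be flagged for recovery when its overlay or storage network goes down, which is the case described in this bug.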
