Comment 10 for bug 1375625

Assaf Muller (amuller) wrote :

> not being able to rely on a robust HA solution

I disagree.

What's being described here is a very specific scenario. If the NIC to the tenant networks goes down, all of the routers on that node lose connection to their VMs. Router replicas on other nodes will become active, but the originals remain active as well and still have connections to the external network via another NIC, so floating IPs (FIPs) end up duplicated. So this is a scenario of one NIC failing on a node while the other NIC on the same node keeps working. L3 HA does not cover this scenario, while it does cover others. Luckily, most HA solutions (and every HA solution I've encountered myself) use a tool like Pacemaker. In the RDO HA architecture, for example, Pacemaker is configured to fence a node with a dead NIC, which would resolve this specific error scenario. I don't see this as a high priority bug.