Comment 4 for bug 1747764

Revision history for this message
Andres Rodriguez (andreserl) wrote : Re: rack controller HA fails during a network partition

I see in the logs this:

2018-02-06 21:14:26 provisioningserver.rpc.clusterservice: [info] Region not available: User timeout caused connection failure. (While requesting RPC info at b'http://[::ffff:10.245.32.102]/MAAS/rpc/').
2018-02-06 21:14:57 provisioningserver.rpc.clusterservice: [info] Region not available: User timeout caused connection failure. (While requesting RPC info at b'http://[::ffff:10.245.32.102]/MAAS/rpc/').
2018-02-06 21:15:25 twisted.internet.defer: [critical] Unhandled error in Deferred:
2018-02-06 21:15:25 twisted.internet.defer: [critical]

Based on your comment that should be the time you started the partition. So a few questions, when the rack disconnected:

 - seems that dhcpd continued running, did this change at all afterwards ? or it just never stopped ?
 - did /var/lib/maas/dhcpd.conf seems to not have been "deleted" or updated to reflect no connection, did this remained to be the same over time ? did it ever get removed?