Comment 27 for bug 1728111

Revision history for this message
Tim Penhey (thumper) wrote :

I think a key thing to note here is the term "guarantee". I think I may have been taking too hard a line with guarantee.

The key thing to think about here is that the leader "shouldn't" change under normal circumstances. So the situations that are causing a leadership change should be the exceptional circumstances.

To be clear, as long as the agents are able to communicate, the leadership shouldn't change.

All the sharp edge cases are at the exceptional edge though. Why would communication drop?
 * net splits - I'm still not clear on what causes a net split
 * hardware failures
 * severly overloaded servers - we should work out how to be more aware of this, perhaps the number of running api calls.