Comment 2 for bug 1494682

Revision history for this message
Sudhakar Gariganti (sudhakar-gariganti) wrote :

From a functionality point of view, I agree it is LOW. But if we see from the scale point of view, this does impact significantly.

A single random RPC timeout@scale will put the l3 agent in indefinite cycle and has terrible impact on the DB and controller operations, which will eventually degrade the performance of other agents as well.

At just a scale of less than 1000 networks, it was taking multiples of hours for the cloud to get back into shape. Imagine the situation at a higher scale.

Agree its late in the cycle, but if there is chance, I feel its good to have this for Liberty.