RabbitMQ should use net_ticktime
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
tripleo |
Fix Released
|
Medium
|
John Eckersberg |
Bug Description
Currently, we override RABBITMQ_
This is leftover from long ago when oslo.messaging did not have proper heartbeat support. Without AMQP heartbeats, it was not possible to detect quickly when a client lost connection, so the TCP timeout was implemented.
Heartbeat support has been available in oslo.messaging for many releases now, so this is no longer required to detect dead clients and can be removed.
However, as a somewhat-intended side effect, setting the TCP timeouts also affects dead peer detection on the connections between RabbitMQ nodes in a clustered environment. Normally these timeouts are configured via the erlang net_ticktime mechanism, see https:/
Using net_ticktime is preferrable to TCP timeouts. It is the standard practice, it is simpler, and also in testing seems to detect failures faster (see
https:/
Changed in tripleo: | |
assignee: | nobody → John Eckersberg (jeckersb) |
status: | New → In Progress |
Changed in tripleo: | |
milestone: | none → queens-1 |
importance: | Undecided → Medium |
Changed in tripleo: | |
milestone: | queens-1 → queens-2 |
Fix proposed to branch: master /review. openstack. org/503788
Review: https:/