Blocked publishers due to rabbitmq disk/memory alarm cause indefinitely frozen RPC calls with zero errors
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
oslo.messaging | Fix Released | Medium | Mehdi Abaakouk | | |
Bug Description
From the rabbit server log:
=INFO REPORT==== 12-May-
Disk free space insufficient. Free bytes:974610432 Limit:1000000000
=WARNING REPORT==== 12-May-
disk resource limit alarm set on node rabbit@localhost.
*******
*** Publishers will be blocked until this alarm clears ***
*******
If a disk or RAM rabbitmq alarm goes off:
https:/
All connections will be blocked:
[stack@localhost ~]$ sudo rabbitmqctl list_connections
Listing connections ...
stackrabbit 10.0.0.200 42587 blocking
stackrabbit 10.0.0.200 42588 blocking
...
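Since nothing on the OpenStack side reports the condition, one way to notice it is to scan the `rabbitmqctl list_connections` output for connections in the `blocking` or `blocked` state. The following is a minimal sketch, assuming the four whitespace-separated columns shown above (user, peer host, peer port, state); the function name `find_blocked_connections` is hypothetical and not part of any OpenStack or RabbitMQ tool.

```python
def find_blocked_connections(listing):
    """Return (user, host, port, state) tuples for blocked connections.

    Assumes each data line has exactly four whitespace-separated fields,
    as in the sample output above. Header and filler lines are skipped.
    """
    blocked = []
    for line in listing.splitlines():
        fields = line.split()
        if len(fields) != 4:
            # e.g. "Listing connections ..." or "..."
            continue
        user, host, port, state = fields
        if port.isdigit() and state in ("blocking", "blocked"):
            blocked.append((user, host, int(port), state))
    return blocked


if __name__ == "__main__":
    sample = (
        "Listing connections ...\n"
        "stackrabbit 10.0.0.200 42587 blocking\n"
        "stackrabbit 10.0.0.200 42588 blocking\n"
        "...\n"
    )
    for conn in find_blocked_connections(sample):
        print(conn)
```

The listing text could come from running the command shown above and capturing its standard output; how that is wired into monitoring is left open here.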
The user experience in oslo.messaging here is very poor: there is no indication whatsoever that anything is wrong. RPC calls have no timeouts; they just freeze indefinitely. No error appears in any log of any OpenStack service. I'm not familiar with the rabbit driver backend, but I'm hoping it is possible to get an error out of it somehow.
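To illustrate the kind of behavior being asked for, here is a minimal sketch of bounding a potentially blocking call with a timeout so that the caller gets an error instead of freezing. It uses only the Python standard library; `call_with_timeout` and `PublishTimeout` are hypothetical names, and this is not the change that was made in oslo.messaging.

```python
import queue
import threading


class PublishTimeout(Exception):
    """Raised when the wrapped call does not finish in time."""


def call_with_timeout(func, timeout, *args, **kwargs):
    """Run func in a daemon thread and wait at most `timeout` seconds.

    The worker thread is not cancelled on timeout; it is only abandoned,
    so this surfaces the problem to the caller but does not unblock the
    underlying connection.
    """
    result = queue.Queue(maxsize=1)

    def runner():
        try:
            result.put((True, func(*args, **kwargs)))
        except Exception as exc:  # hand the failure back to the caller
            result.put((False, exc))

    threading.Thread(target=runner, daemon=True).start()
    try:
        ok, value = result.get(timeout=timeout)
    except queue.Empty:
        raise PublishTimeout(
            "call did not complete within %.1f seconds; "
            "the broker may be blocking publishers" % timeout
        )
    if ok:
        return value
    raise value
```

A caller would wrap the publish or RPC call in `call_with_timeout(...)` and log or re-raise `PublishTimeout`, which at least leaves a trace in the service log.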
description: | updated |
Changed in oslo.messaging: | |
importance: | Undecided → Medium |
Changed in oslo.messaging: | |
milestone: | none → 1.15.0 |
status: | Fix Committed → Fix Released |
Fix proposed to branch: master
Review: https://review.openstack.org/185851