oslo.messaging delays reconnect trying to close old channel
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Mirantis OpenStack | Fix Released | Medium | Dmitry Mescheryakov | | |
Bug Description
Version: 9.x
Steps to reproduce:
1. Deploy a MOS env with 3 controllers and 1 compute node
2. Download this file and save it as simulator.py: http://
It is a modified copy of the upstream simulator; if you are curious, diff it against https:/
3. Go to the compute node and apply this patch: http://
4. In that console, set the following variable:
RABBIT_
Populate user, pass and node_x_ip using the following parameters from /etc/nova/
5. Open another console on the controller whose IP comes first in the RABBIT_URL list.
6. Return to the console opened in step #3.
7. There, run:
python simulator.py --url $RABBIT_URL rpc-client -m 2 -w 10 --is-cast true
With that command, the simulator sends 2 messages (-m) with a 10-second interval between them (-w) in 'cast' mode, so you don't need an RPC server.
8. Wait for the simulator to send the first message. It has been sent once the following lines appear in the console:
2017-05-10 11:21:53,661 DEBUG oslo_messaging.
9. Once you see these lines, quickly (you have 10 seconds) switch to the controller console opened in step #5 and execute:
iptables -I OUTPUT 1 -p tcp --sport 5673 -j DROP
That drops outgoing AMQP traffic from that node (packets with source port 5673), so clients stop receiving replies from it.
10. Observe the following lines next:
2017-05-10 11:22:03,677 DEBUG oslo_messaging.
2017-05-10 11:23:03,701 ERROR oslo.messaging.
...
2017-05-10 11:23:09,735 ERROR oslo.messaging.
2017-05-10 11:23:48,365 INFO oslo.messaging.
...
Note that about 40 seconds pass between the last 'server is unreachable' complaint and the reconnect. That is an artificial delay caused by a bug in the code.
To remove the iptables rule set in step #9, execute on the controller:
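The size of the gap can be checked directly from the two timestamps quoted in step 10. A minimal Python sketch, assuming the `YYYY-MM-DD HH:MM:SS,mmm` layout seen in the log lines above; the helper name `gap_seconds` is made up for illustration and is not part of oslo.messaging or the simulator:

```python
from datetime import datetime

# Timestamp layout used by the log lines quoted in this report.
LOG_TS_FORMAT = "%Y-%m-%d %H:%M:%S,%f"


def gap_seconds(earlier: str, later: str) -> float:
    """Return the number of seconds between two log timestamps."""
    start = datetime.strptime(earlier, LOG_TS_FORMAT)
    end = datetime.strptime(later, LOG_TS_FORMAT)
    return (end - start).total_seconds()


# Last ERROR line and the following INFO (reconnect) line from step 10.
last_error = "2017-05-10 11:23:09,735"
reconnect = "2017-05-10 11:23:48,365"
print(round(gap_seconds(last_error, reconnect), 2))
```

For the two timestamps above this gives roughly 38.6 seconds, which matches the delay described in the note.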
iptables -D OUTPUT -p tcp --sport 5673 -j DROP
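When repeating the reproduction several times, it is easy to forget the cleanup. The sketch below wraps the two iptables commands from this report in a Python context manager so the rule is always removed again. The names `block_cmd`, `unblock_cmd` and `amqp_blocked` are hypothetical helpers, not part of any tool mentioned here, and the code assumes it runs as root on the controller:

```python
import subprocess
from contextlib import contextmanager

# Rule from this report: drop outgoing TCP packets with source port 5673.
RULE = ["-p", "tcp", "--sport", "5673", "-j", "DROP"]


def block_cmd():
    """Build: iptables -I OUTPUT 1 -p tcp --sport 5673 -j DROP"""
    return ["iptables", "-I", "OUTPUT", "1"] + RULE


def unblock_cmd():
    """Build: iptables -D OUTPUT -p tcp --sport 5673 -j DROP"""
    return ["iptables", "-D", "OUTPUT"] + RULE


@contextmanager
def amqp_blocked(run=subprocess.check_call):
    """Insert the DROP rule on entry and delete it on exit, even on errors.

    `run` is injectable so the helper can be exercised without root.
    """
    run(block_cmd())
    try:
        yield
    finally:
        run(unblock_cmd())

# Example (as root on the controller):
#     import time
#     with amqp_blocked():
#         time.sleep(120)  # keep the rule in place while watching the client log
```

Passing a different `run` callable, for example `print`, shows the commands without executing them.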
This bug is very similar to https:/
Changed in mos: | |
importance: | Undecided → Medium |
status: | New → Confirmed |
assignee: | nobody → Dmitry Mescheryakov (dmitrymex) |
milestone: | none → 9.x-updates |
description: | updated |
tags: | added: area-oslo customer-found |
description: | updated |
description: | updated |
summary: |
- oslo.messaging delays reconnection trying to close old channel
+ oslo.messaging delays reconnect trying to close old channel |
Changed in mos: | |
milestone: | 9.x-updates → 9.2-mu-3 |
The fix is on review here: https://review.fuel-infra.org/#/c/35213/