Pacemaker never restarts RabbitMQ if rabbitmqctl times out
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Fix Committed
|
High
|
Dmitry Mescheryakov | ||
Mitaka |
Fix Released
|
High
|
Dmitry Mescheryakov |
Bug Description
Version: 9.1
Steps to reproduce:
1. Install environment consisting of 3 controllers.
2. On one of the controllers make 'rabbitmqctl cluster_status' command run for more than 1 minute. You can do that for instance by applying the following patch to /usr/sbin/
Another alternative is to execute the following command on one of the controllers:
rabbitmqctl eval 'sys:suspend(
That will make cluster_status hang on all controllers simultaneously until RabbitMQ is restarted.
Expected result:
After some time (up to 5 minutes) Pacemaker should restart the RabbitMQ on the controller where rabbitmqctl was corrupted.
Actual result:
RabbitMQ is not restarted, not even in days.
Reproducibility: 100%
Details:
Look into /var/log/
After some time you will see the following entries appear in the log:
2016-08-
At some point the counter will go over the limit (3), but nothing will change - Pacemaker will not restart the resource even though OCF script clearly reports error. In pacemaker.log it can be seen that Pacemaker sends pre-stop notifications about that node. If one adds logging of such notifications to the RabbitMQ OCF script, it can be seen that they do reach the nodes. Such pre-stop notifications are sent with some period 3-5 minutes). But nothing else happens, Pacemaker does not initiate the stop action itself.
Changed in fuel: | |
importance: | Undecided → High |
assignee: | nobody → Dmitry Mescheryakov (dmitrymex) |
milestone: | none → 9.1 |
status: | New → Confirmed |
tags: | added: area-library |
summary: | + Pacemaker never restarts RabbitMQ |
summary: |
- Pacemaker never restarts RabbitMQ + Pacemaker never restarts RabbitMQ if rabbitmqctl times out |
description: | updated |
no longer affects: | fuel/newton |
Changed in fuel: | |
milestone: | 9.2 → 10.0 |
tags: | added: on-verification |
tags: | removed: on-verification |
Fix proposed to branch: master /review. openstack. org/366058
Review: https:/