Full RabbitMQ cluster failure when the master of rabbit multi-state resource goes down
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Fix Released
|
High
|
Bogdan Dobrelya | ||
5.1.x |
Won't Fix
|
High
|
Denis Meltsaykin | ||
6.0.x |
Won't Fix
|
High
|
Denis Meltsaykin | ||
6.1.x |
Fix Released
|
High
|
Bogdan Dobrelya |
Bug Description
Multinode with HA with 3 controllers
Steps to reproduce
1) find the master of rabbitmq multistate-resource (crm status)
2) On the other two controllers start
watch "rabbitmqctl eval 'rabbit_
to see whats happening with RabbitMQ cluster and rabbit application. You should see the line
{rabbit,
3) Turn off the primary controller and see whats happening with other two controllers
Because of currrent HA implementation you will see that rabbitmqctl could not connect to the node
something like that
Error: unable to connect to node 'rabbit@node-10': nodedown
or
line with rabbitmq application could disappear
The normal behaviour I believe should be
No disconnection on two others controllers and no full cluster failover when primary controller goes down
Also in the
https:/
noted
3) destroy the master, node-3
Expected: new master election, failover with full downtime - no nodes can process AMQP connections
I think It should be improved
[root@fuel ~]# fuel --f
DEPRECATION WARNING: file /etc/fuel/
api: '1.0'
astute_sha: 4a117a1ca6bdcc3
auth_required: true
build_id: 2015-03-23_15-29-20
build_number: '218'
feature_groups:
- mirantis
fuellib_sha: a0265ae47bb2307
fuelmain_sha: a05ab877af31924
nailgun_sha: 7c100f47450ea1a
ostf_sha: a4cf5f218c6aea9
production: docker
python-
release: '6.1'
release_versions:
2014.2-6.1:
VERSION:
api: '1.0'
astute_sha: 4a117a1ca6bdcc3
build_id: 2015-03-23_15-29-20
build_number: '218'
feature_
- mirantis
fuellib_sha: a0265ae47bb2307
fuelmain_sha: a05ab877af31924
nailgun_sha: 7c100f47450ea1a
ostf_sha: a4cf5f218c6aea9
production: docker
python-
release: '6.1'
summary: |
- Entire RabbitMQ cluster downtime when primary controller goes down + Full RabbitMQ cluster failure when primary controller goes down |
no longer affects: | fuel/6.1.x |
summary: |
- Full RabbitMQ cluster failure when primary controller goes down + Full RabbitMQ cluster failure when the master of rabbit multi-state + resource goes down |
description: | updated |
Changed in fuel: | |
status: | Won't Fix → Confirmed |
milestone: | 6.1 → 5.1.1-updates |
milestone: | 5.1.1-updates → 7.0 |
Changed in fuel: | |
assignee: | Bogdan Dobrelya (bogdando) → Vladimir Kuklin (vkuklin) |
tags: | added: on-verification |
related bug - https:/ /bugs.launchpad .net/fuel/ +bug/1436343