Pacemaker restarts multiple nodes when reassigning maser

Bug #1786316 reported by Anthony Landry
8
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Fuel for OpenStack
Invalid
Undecided
Anthony Landry

Bug Description

Detailed bug description:
If the master rabbimq node is isolated and is not restarted by pacemaker, when it attempts to rejoin the cluster multiple nodes are restarted instead of just one.

Reproducibility:
 Yes, in an isolated environment

Workaround:
 manually modified cib.xml to contain the following bit of code:

<nvpair id="p_rabbitmq-server-meta_attributes-requires" name="requires" value="nothing"/>

Impact:
Monitoring services that require the use of persistent messages can lose the occasional message due to the rabbit resets from a split-brain environment.

Revision history for this message
Anthony Landry (alandry) wrote :

sla2 for 9.0-updates

Anthony Landry (alandry)
affects: mos → fuel
Changed in fuel:
assignee: nobody → Oleksiy Molchanov (omolchanov)
milestone: none → 9.2-mu-8
status: New → In Progress
Revision history for this message
Anthony Landry (alandry) wrote :

Assigned to me as I'm working on a workaround with the customer

Changed in fuel:
assignee: Oleksiy Molchanov (omolchanov) → Anthony Landry (alandry)
Revision history for this message
Anthony Landry (alandry) wrote :

Addendum to this bug, this is a feature, and is intended working order for RMQ, it is requested to be changed in the pacemaker script to accommodate durable queues for monitoring purposes with the queue-sync command. However, this makes RMQ unstable and prone to crashing in MOS, therefore it is not recommended.

Revision history for this message
Denis Meltsaykin (dmeltsaykin) wrote :

Anthony, I'm not sure we want/can change this option since it was designed that way intentionally. How do you think can we close the bug report as Invalid?

Revision history for this message
Anthony Landry (alandry) wrote : Re: [Bug 1786316] Re: Pacemaker restarts multiple nodes when reassigning maser

The customer hasn't responded to anything, was going to close out the
ticket today after I closed the salesforce

On Fri, Aug 31, 2018 at 5:11 AM Denis Meltsaykin <email address hidden>
wrote:

> Anthony, I'm not sure we want/can change this option since it was
> designed that way intentionally. How do you think can we close the bug
> report as Invalid?
>
> --
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/1786316
>
> Title:
> Pacemaker restarts multiple nodes when reassigning maser
>
> Status in Fuel for OpenStack:
> In Progress
>
> Bug description:
> Detailed bug description:
> If the master rabbimq node is isolated and is not restarted by
> pacemaker, when it attempts to rejoin the cluster multiple nodes are
> restarted instead of just one.
>
> Reproducibility:
> Yes, in an isolated environment
>
> Workaround:
> manually modified cib.xml to contain the following bit of code:
>
> <nvpair id="p_rabbitmq-server-meta_attributes-requires"
> name="requires" value="nothing"/>
>
> Impact:
> Monitoring services that require the use of persistent messages can lose
> the occasional message due to the rabbit resets from a split-brain
> environment.
>
> To manage notifications about this bug go to:
> https://bugs.launchpad.net/fuel/+bug/1786316/+subscriptions
>

Revision history for this message
Denis Meltsaykin (dmeltsaykin) wrote :

Closed as Invalid.

Changed in fuel:
milestone: 9.2-mu-8 → 9.x-updates
status: In Progress → Invalid
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.