Rabbit OCF monitor check levels were not configured

Bug #1462899 reported by Bogdan Dobrelya
This bug affects 1 person
Affects            | Status        | Importance | Assigned to      | Milestone
Fuel for OpenStack | Fix Committed | High       | Bogdan Dobrelya  |
5.1.x              | Won't Fix     | High       | Denis Meltsaykin |
6.0.x              | Won't Fix     | High       | Denis Meltsaykin |

Bug Description

The code
https://github.com/stackforge/fuel-library/blob/master/deployment/puppet/pacemaker_wrappers/manifests/rabbitmq.pp#L53-L57
should configure an additional monitor action for the rabbit primitive, but it did not:

primitive p_rabbitmq-server ocf:fuel:rabbitmq-server \
        op promote interval=0 timeout=120 \
        op stop interval=0 timeout=120 \
        op start interval=0 timeout=360 \
        op notify interval=0 timeout=180 \
        op demote interval=0 timeout=120 \
        op monitor interval=27 role=Master timeout=60 \
        op monitor interval=30 timeout=60 \
        params command_timeout="-s KILL" debug=true erlang_cookie=EOKOWXQREETZSHFNTPEY node_port=5673 \
        meta failure-timeout=360s migration-threshold=INFINITY

As a result, the bug https://bugs.launchpad.net/fuel/+bug/1458828 has arisen again, and the rabbit@node-1 app remained stopped permanently.

ISO '496'

Tags: 6.1rc2
Changed in fuel:
milestone: none → 6.1
importance: Undecided → High
status: New → Confirmed
assignee: nobody → Bogdan Dobrelya (bogdando)
description: updated
Bogdan Dobrelya (bogdando) wrote :

Puppet log from node-1 and CIB dump

OpenStack Infra (hudson-openstack) wrote : Fix proposed to fuel-library (master)

Fix proposed to branch: master
Review: https://review.openstack.org/189228

Changed in fuel:
status: Confirmed → In Progress
Bogdan Dobrelya (bogdando) wrote :

How to test:
Check the rabbitmq resource created in the CIB. It should look like this:
primitive p_rabbitmq-server ocf:fuel:rabbitmq-server \
        op monitor role=Master timeout=60 interval=27 \
...
        op monitor timeout=60 interval=30 \
...
        op monitor timeout=60 role=Slave interval=103 OCF_CHECK_LEVEL=30 \
...

There must be 3 monitors configured, including the role=Slave monitor with OCF_CHECK_LEVEL=30.
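
The check above can also be scripted. The following is a minimal sketch, not part of the fix: it assumes the resource definition has been saved as text in the crm shell syntax quoted in this report (for example, from `crm configure show`). The helper names `monitor_ops` and `has_slave_deep_check` are hypothetical, and the sample input is the primitive from the bug description.

```python
# Sketch: count the monitor operations of a primitive in crm shell text.
# Assumption: the input uses the same line-continuation layout as the
# snippets quoted in this report (one "op ..." per line, ending with "\").

REPORTED = r"""
primitive p_rabbitmq-server ocf:fuel:rabbitmq-server \
        op promote interval=0 timeout=120 \
        op stop interval=0 timeout=120 \
        op start interval=0 timeout=360 \
        op notify interval=0 timeout=180 \
        op demote interval=0 timeout=120 \
        op monitor interval=27 role=Master timeout=60 \
        op monitor interval=30 timeout=60 \
        params command_timeout="-s KILL" debug=true node_port=5673 \
        meta failure-timeout=360s migration-threshold=INFINITY
"""


def monitor_ops(crm_text, primitive="p_rabbitmq-server"):
    """Return one dict of key=value attributes per `op monitor` line."""
    ops = []
    in_primitive = False
    for raw in crm_text.splitlines():
        stripped = raw.strip()
        continued = stripped.endswith("\\")
        words = stripped.rstrip("\\").split()
        if words[:1] == ["primitive"]:
            # A new primitive starts; track it only if the name matches.
            in_primitive = len(words) > 1 and words[1] == primitive
        elif in_primitive and words[:2] == ["op", "monitor"]:
            ops.append(dict(w.split("=", 1) for w in words[2:] if "=" in w))
        if not continued:
            # A line without a trailing backslash ends the definition.
            in_primitive = False
    return ops


def has_slave_deep_check(ops):
    """True if a role=Slave monitor with OCF_CHECK_LEVEL=30 is present."""
    return any(
        op.get("role") == "Slave" and op.get("OCF_CHECK_LEVEL") == "30"
        for op in ops
    )


if __name__ == "__main__":
    ops = monitor_ops(REPORTED)
    print(len(ops), "monitor ops; slave level-30 check:", has_slave_deep_check(ops))
```

For a correctly configured resource, `monitor_ops` should return three entries and `has_slave_deep_check` should return True. For the configuration from the bug description it finds only the two plain monitors.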

OpenStack Infra (hudson-openstack) wrote : Fix merged to fuel-library (master)

Reviewed: https://review.openstack.org/189228
Committed: https://git.openstack.org/cgit/stackforge/fuel-library/commit/?id=263a2dd5d87de569819740873c0086cafc7e0651
Submitter: Jenkins
Branch: master

commit 263a2dd5d87de569819740873c0086cafc7e0651
Author: Bogdan Dobrelya <email address hidden>
Date: Mon Jun 8 10:48:48 2015 +0200

    Make second level of Rabbit OCF monitor only at slaves

    W/o this fix, monitor check level 30 is not always
    added into the CIB for RabbitMQ pacemaker resource.

    The w/a is to define second monitor only for slaves.

    Closes-bug: #1462899

    Change-Id: I678b4af19e846a2a7f6c4ce829a1a8b5a4391876
    Signed-off-by: Bogdan Dobrelya <email address hidden>

Changed in fuel:
status: In Progress → Fix Committed
OpenStack Infra (hudson-openstack) wrote : Fix proposed to fuel-library (stable/6.1)

Fix proposed to branch: stable/6.1
Review: https://review.openstack.org/189400

OpenStack Infra (hudson-openstack) wrote : Fix merged to fuel-library (stable/6.1)

Reviewed: https://review.openstack.org/189400
Committed: https://git.openstack.org/cgit/stackforge/fuel-library/commit/?id=9b20ea04b15fe160ce9cb542d6649471be467f99
Submitter: Jenkins
Branch: stable/6.1

commit 9b20ea04b15fe160ce9cb542d6649471be467f99
Author: Bogdan Dobrelya <email address hidden>
Date: Mon Jun 8 10:48:48 2015 +0200

    Make second level of Rabbit OCF monitor only at slaves

    W/o this fix, monitor check level 30 is not always
    added into the CIB for RabbitMQ pacemaker resource.

    The w/a is to define second monitor only for slaves.

    Closes-bug: #1462899

    Change-Id: I678b4af19e846a2a7f6c4ce829a1a8b5a4391876
    Signed-off-by: Bogdan Dobrelya <email address hidden>

tags: added: 6.1rc2
Denis Meltsaykin (dmeltsaykin) wrote :

Setting this to Won't Fix for 5.1.1-updates and 6.0-updates, as such a complex change cannot be delivered within the scope of a Maintenance Update. Also, a possible solution, backporting the RabbitMQ OCF script, is covered in detail in the Operations Guide in the official product documentation.
