OpenStack RabbitMQ Server Charm

Nagios check plugin check_rabbitmq_queues doesn't output the busiest queues when specifying wildcards in argument

Bug #1939084 reported by Julien Thieffry on 2021-08-06

This bug affects 1 person

Affects		Status	Importance	Assigned to	Milestone
	OpenStack RabbitMQ Server Charm	Fix Released	Undecided	Unassigned	OpenStack RabbitMQ Server Charm 21.10

Bug Description

This behavior has been observed in:
OS-Version: Ubuntu 18.04.5
Bootstack series: queens-bionic
Rabbitmq-server charm revision: 110

When wildcards for vhost and queue are specified in queue_thresholds (such as \* \* 100 200), whenever warnings or errors arise in the output of the check_rabbitmq_queues.py script (and Nagios dashboard), output is similar to: "CRITICAL: * in * has XXXX messages". While this properly reports a problem, information on which vhost/queues combination is being busy is lost as this value is being consolidated and reported under the wildcard.

This might be the expected behavior, but whenever such scenario happens, we often need to investigate which queues in particular are being busy. Since we use the default configuration option (wildcard on vhost and queue), current output from Nagios offers little help in identifying where the issue is coming from.

As a result, it would be helpful to specify a charm configuration parameter that allows to display an arbitrary number of the busiest queues in the script output whenever a warning or critical code is issued.

Tags:

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2021-08-06: Fix proposed to charm-rabbitmq-server (master)

Fix proposed to branch: master
Review: https://review.opendev.org/c/openstack/charm-rabbitmq-server/+/803669

Changed in charm-rabbitmq-server:
status:	New → In Progress

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2021-09-01: Fix merged to charm-rabbitmq-server (master)

Reviewed: https://review.opendev.org/c/openstack/charm-rabbitmq-server/+/803669
Committed: https://opendev.org/openstack/charm-rabbitmq-server/commit/242167b6bad190dd7aa2b885272906551d390c8a
Submitter: "Zuul (22348)"
Branch: master

commit 242167b6bad190dd7aa2b885272906551d390c8a
Author: Julien Thieffry <email address hidden>
Date: Fri Aug 6 02:16:54 2021 +0000

Display busiest queues in check_queues NRPE plugin

    When invoking the check_rabbitmq_queues script with wildcards for vhost
    and/or queue parameters, script output does not reflect precisely which
    queues are having a high number of oustanding messages as information is
    consolidated under the wildcard.

    This change fixes this behaviour by adding a new charm configuration
    parameter which allows the user to specify the number of busiest queues,
    n, to display should the check_rabbitmq_queues script reports any
    warnings or errors. The default, n=0, keeps the current script output.
    This option is applicable regardless of the vhost:queue combination but
    is specifically relevant when wildcards are passed as arguments.

Implementation displays the first n items in the stats list re-organized
in decreasing message count order.

Closes-Bug: #1939084
Change-Id: I5a32cb6bf37bd2a0f30861eace3c0e6cb5c2559d

Changed in charm-rabbitmq-server:
status:	In Progress → Fix Committed

Alex Kavanagh (ajkavanagh) on 2021-10-11

Changed in charm-rabbitmq-server:
milestone:	none → 21.10

Alex Kavanagh (ajkavanagh) on 2021-10-22

Changed in charm-rabbitmq-server:
status:	Fix Committed → Fix Released

Report a bug

This report contains Public information

Everyone can see this information.

You are

Subscribing...

Edit bug mail

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.