OpenStack HA Cluster Charm

cleanup action leaves failcounts behind

Bug #1915093 reported by Xav Paice on 2021-02-09

This bug affects 1 person

Affects                      Status    Importance  Assigned to  Milestone
OpenStack HA Cluster Charm   Triaged   Medium      Unassigned

Bug Description

On some, but not all, occasions when a resource's failcount is >1, the cleanup action does not clear all of the failcounts, and we end up having to run something like the following to clean up the resources:

for i in $(jsft | grep hacluster | awk '/ubuntu/ {print $1}'); do juju run --application "$i" 'sudo crm_resource --list -Q -l | xargs -L1 sudo crm_failcount -D -r'; done

Note that `crm_failcount -D -r` must be run on each unit of an application to be effective, whereas `crm_resource -C` is cluster-wide.

I suggest we add `crm_failcount -D` to the action, run once for each resource on the unit. To ensure the whole cluster is clear, we might still need to run the action on all units, but at least we could do so via actions rather than 'juju run' or ssh (which requires different permissions for some models of managed services).
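
As a rough illustration of the suggestion above, the per-unit step could look something like the sketch below. This is not the charm's actual action code. The function name clear_unit_failcounts and the CRM_RESOURCE / CRM_FAILCOUNT override variables are hypothetical, introduced only so the loop can be dry-run. The crm_resource and crm_failcount flags are the ones from the workaround above, and the sketch assumes it already runs as root on the unit (hence no sudo).

```shell
#!/bin/sh
# Hypothetical helper, not taken from the charm: clear the failcount of every
# resource listed on this unit, using the same flags as the manual workaround.
# CRM_RESOURCE and CRM_FAILCOUNT are assumed override hooks for dry runs;
# they default to the real Pacemaker tools.
clear_unit_failcounts() {
    "${CRM_RESOURCE:-crm_resource}" --list -Q -l | while IFS= read -r res; do
        # Skip blank lines in the resource listing.
        [ -n "$res" ] || continue
        "${CRM_FAILCOUNT:-crm_failcount}" -D -r "$res"
    done
}
```

Because the failcount deletion only takes effect on the unit where it is run, something like this would still have to be invoked on every hacluster unit, for example by running the action against each unit in turn.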

Alex Kavanagh (ajkavanagh) on 2021-02-26

Changed in charm-hacluster:
status:	New → Triaged
importance:	Undecided → Medium
