'pacemaker-status' validation reports incorrect status
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
tripleo |
Fix Released
|
High
|
Yadnesh Kulkarni |
Bug Description
Description
===========
'pacemaker-status' validation always reports PASSED regardless of the actual status of pacemaker service
Steps to reproduce
==================
Stop pacemaker service on ctrl node and run validation
~~~
[root@overcloud
● pacemaker.service - Pacemaker High Availability Cluster Manager
Loaded: loaded (/usr/lib/
Active: inactive (dead) since Thu 2021-08-19 09:30:30 UTC; 4min 2s ago
Docs: man:pacemakerd
https:/
Process: 675053 ExecStart=
Main PID: 675053 (code=exited, status=0/SUCCESS)
(undercloud) [stack@undercloud ~]$ openstack tripleo validator run --validation pacemaker-status --inventory ./overcloud-
+------
| UUID | Validations | Status | Host_Group | Status_by_Host | Unreachable_Hosts | Duration |
+------
| 31899bf0-
+------
(undercloud) [stack@undercloud ~]$
~~~
Expected result
===============
Validation should've been reported as failed
Actual result
=============
Validation was reported as passed, meaning pacemaker service is active
Environment
===========
master release
Logs & Configs
==============
Ansible reports the service as inactive in validation logs
~~~
~~~
In [1] there is no task to fail the playbook if the service is found inactive or failed.
Changed in tripleo: | |
status: | New → Triaged |
importance: | Undecided → High |
Changed in tripleo: | |
assignee: | nobody → Yadnesh Kulkarni (ykulkarn) |
Looks legit issue to me:-
heat-admin@ overcloud- controller- 1 ~]$ sudo pcs status
Error: error running crm_mon, is pacemaker running?
crm_mon: Error: cluster is not available on this node
Even after stopping pacemaker on 1 controller, status still shows passed
(undercloud) [zuul@undercloud ~]$ openstack tripleo validator run --validation pacemaker-status ------- ------- ------- ------- ----+-- ------- ------- --+---- ----+-- ------- ---+--- ------- ------- ------- ------- ------- ------- ------- ------- ------- ------+ ------- ------- -----+- ------- -----+ ------- ------- ------- ------- ----+-- ------- ------- --+---- ----+-- ------- ---+--- ------- ------- ------- ------- ------- ------- ------- ------- ------- ------+ ------- ------- -----+- ------- -----+ 8e1c-48ec- aa61-185cfd8fed a6 | pacemaker-status | PASSED | Controller | overcloud- controller- 0, overcloud- controller- 1, overcloud- controller- 2 | | 0:00:02.900 | ------- ------- ------- ------- ----+-- ------- ------- --+---- ----+-- ------- ---+--- ------- ------- ------- ------- ------- ------- ------- ------- ------- ------+ ------- ------- -----+- ------- -----+
Running Validations without Overcloud settings.
+------
| UUID | Validations | Status | Host_Group | Status_by_Host | Unreachable_Hosts | Duration |
+------
| 60d0b82a-
+------