Comment 1 for bug 1891801

Revision history for this message
Rafael David Tinoco (rafaeldtinoco) wrote :

Hello Christoph,

Why do you think that ? Could you be more prolix about this change and what are the pros and cons of making this change ? An example on how you're setting up your cluster using sbd and watchdog would be also good to corroborate your request.

From:

https://wiki.clusterlabs.org/wiki/Using_SBD_with_Pacemaker

I have:

"""Ensure that the sbd daemon is running on a node before starting the cluster services. The best approach is generally to enable it to start at boot. (The cluster can't manage the sbd daemon as a cluster resource.) There are two flavors of SBD, sbd for cluster nodes, and sbd_remote for Pacemaker Remote nodes. Here we use sbd as an example, but for Pacemaker Remote nodes, replace sbd with sbd_remote:"""

Note: sbd has to start before corosync and pacemaker. It would be good to have watchdog already working, so you're probably right... but that change should be done in sbd.service and not watchdog (as watchdog is a "generic" service that serves other purposes than pacemaker/fence-agents).

and

"""With watchdog-only SBD, the cluster must have true quorum. Thus, it can only be used in a cluster with three or more nodes, or a two-node cluster with external quorum (such as corosync using qdevice with a third node).
Configure the basic setup on every node as described above.
Select a recovery interval (in seconds) that is greater than SBD_WATCHDOG_TIMEOUT in /etc/sysconfig/sbd."""

I assume the ordering has something to do with unfencing (from fence_sbd + watchdog setup) but I also know that those type of unfencing (like fence_mpath and fence_iscsi) are not supported "automatically" (meaning that any time there is a cluster split, a manual intervention is required).

Looking forward to reading more about your request.

Thanks

-rafaeldtinoco