mariadbcheck.socket: Failed to create listening socket
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack-Ansible |
Fix Released
|
High
|
Dmitriy Rabotyagov |
Bug Description
Hello,
After rebooting any of the LXC containers of the Galera cluster, HAProxy reports the galera service backend DOWN and it never comes back UP again without manual intervention.
Upon further investigation, we noticed that journalctl -xe --unit=
===
-- Boot 9d7a4cb973fb45d
Jan 21 19:44:51 inf2-mia-
Jan 21 19:44:51 inf2-mia-
Jan 21 19:44:51 inf2-mia-
Jan 21 19:44:51 inf2-mia-
░░ Subject: Unit failed
░░ Defined-By: systemd
░░ Support: https:/
░░
░░ The unit mariadbcheck.socket has entered the 'failed' state with result 'resources'.
Jan 21 19:44:51 inf2-mia-
░░ Subject: A start job for unit mariadbcheck.socket has failed
░░ Defined-By: systemd
░░ Support: https:/
░░
░░ A start job for unit mariadbcheck.socket has finished with a failure.
░░
░░ The job identifier is 54 and the job result is failed.
===
WORKAROUND: Log into the affected galera container and manually perform a "systemctl restart mariadbcheck.
===
Steps to reproduce:
1-Login to the galera LXC container.
2-Perform a reboot on the container
Environment variables:
-Debian 11 on all hosts.
-Openstack-Ansible version: stable/zed 26.1.0.dev45
-3x infra nodes and 2x compute hosts.
-HAProxy
-KeepAlived
===
Any suggestions would be appreciated.
Thank you.
Hi, Roger.
Can you kindly check if that might be related and fixed by https:/ /bugs.launchpad .net/openstack- ansible/ +bug/2002653 ?
The fix has been applied for stable/zed just couple of days ago and would require rerunning of bootstrap- ansible. sh script.