Unable to start Etcd

Bug #1945641 reported by OpenInfra
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
StarlingX
Invalid
Low
Unassigned

Bug Description

Installed StarlingX AIO Simplex 5.0.1

Bootstrapped using ansible-playbook.
Bootstrap failed with following errors

"msg": "Unable to start service etcd: Job for etcd.service failed because the control process exited with error code. See \"systemctl status etcd.service\" and \"journalctl -xe\" for details.\n"

"msg": "Unable to restart service etcd: Job for etcd.service failed because the control process exited with error code. See \"systemctl status etcd.service\" and \"journalctl -xe\" for details.\n"

Revision history for this message
OpenInfra (openinfra) wrote :

###
### StarlingX
### Release 21.05
###

OS="centos"
SW_VERSION="21.05"
BUILD_TARGET="Host Installer"
BUILD_TYPE="Formal"
BUILD_ID="r/stx.5.0"

JOB="STX_5.0_build_layer_flock"
<email address hidden>"
BUILD_NUMBER="42"
BUILD_HOST="starlingx_mirror"
BUILD_DATE="2021-09-03 18:36:34 +0000"

FLOCK_OS="centos"
FLOCK_JOB="STX_5.0_build_layer_flock"
<email address hidden>"
FLOCK_BUILD_NUMBER="42"
FLOCK_BUILD_HOST="starlingx_mirror"
FLOCK_BUILD_DATE="2021-09-03 18:36:34 +0000"

DISTRO_OS="centos"
DISTRO_JOB="STX_5.0_build_layer_distro"
<email address hidden>"
DISTRO_BUILD_NUMBER="42"
DISTRO_BUILD_HOST="starlingx_mirror"
DISTRO_BUILD_DATE="2021-08-26 14:23:33 +0000"

COMPILER_OS="centos"
COMPILER_JOB="STX_5.0_build_layer_compiler"
<email address hidden>"
COMPILER_BUILD_NUMBER="35"
COMPILER_BUILD_HOST="starlingx_mirror"
COMPILER_BUILD_DATE="2021-05-14 19:53:00 +0000"

Revision history for this message
OpenInfra (openinfra) wrote :
Download full text (4.1 KiB)

task path: /usr/share/ansible/stx-ansible/playbooks/roles/bootstrap/persist-config/tasks/shutdown_services.yml:57
<localhost> ESTABLISH LOCAL CONNECTION FOR USER: root
<localhost> EXEC /bin/sh -c '( umask 77 && mkdir -p "` echo /tmp/.ansible-${USER}/tmp/ansible-tmp-1633005257.94-14230922024113 `" && echo ansible-tmp-1633005257.94-14230922024113="` echo /tmp/.ansible-${USER}/tmp/ansible-tmp-1633005257.94-14230922024113 `" ) && sleep 0'
Using module file /usr/lib/python2.7/site-packages/ansible/modules/system/systemd.py
<localhost> PUT /root/.ansible/tmp/ansible-local-25214X6j1Zd/tmpKN3xlB TO /tmp/.ansible-root/tmp/ansible-tmp-1633005257.94-14230922024113/AnsiballZ_systemd.py
<localhost> EXEC /bin/sh -c 'chmod u+x /tmp/.ansible-root/tmp/ansible-tmp-1633005257.94-14230922024113/ /tmp/.ansible-root/tmp/ansible-tmp-1633005257.94-14230922024113/AnsiballZ_systemd.py && sleep 0'
<localhost> EXEC /bin/sh -c '/usr/bin/python /tmp/.ansible-root/tmp/ansible-tmp-1633005257.94-14230922024113/AnsiballZ_systemd.py && sleep 0'
<localhost> EXEC /bin/sh -c 'rm -f -r /tmp/.ansible-root/tmp/ansible-tmp-1633005257.94-14230922024113/ > /dev/null 2>&1 && sleep 0'
fatal: [localhost]: FAILED! => {
    "changed": false,
    "invocation": {
        "module_args": {
            "daemon_reload": false,
            "enabled": null,
            "force": null,
            "masked": null,
            "name": "etcd",
            "no_block": false,
            "scope": null,
            "state": "restarted",
            "user": null
        }
    },
    "msg": "Unable to start service etcd: Job for etcd.service failed because the control process exited with error code. See \"systemctl status etcd.service\" and \"journalctl -xe\" for details.\n"
}
Read vars_file 'vars/common/main.yml'
Read vars_file 'host_vars/bootstrap/default.yml'

TASK [bootstrap/persist-config : Etcd failed to restart, try one more time] **********************************************************************************************************************************************************************************
task path: /usr/share/ansible/stx-ansible/playbooks/roles/bootstrap/persist-config/tasks/shutdown_services.yml:63
<localhost> ESTABLISH LOCAL CONNECTION FOR USER: root
<localhost> EXEC /bin/sh -c '( umask 77 && mkdir -p "` echo /tmp/.ansible-${USER}/tmp/ansible-tmp-1633005258.16-279060645036411 `" && echo ansible-tmp-1633005258.16-279060645036411="` echo /tmp/.ansible-${USER}/tmp/ansible-tmp-1633005258.16-279060645036411 `" ) && sleep 0'
Using module file /usr/lib/python2.7/site-packages/ansible/modules/system/systemd.py
<localhost> PUT /root/.ansible/tmp/ansible-local-25214X6j1Zd/tmpqrXDaF TO /tmp/.ansible-root/tmp/ansible-tmp-1633005258.16-279060645036411/AnsiballZ_systemd.py
<localhost> EXEC /bin/sh -c 'chmod u+x /tmp/.ansible-root/tmp/ansible-tmp-1633005258.16-279060645036411/ /tmp/.ansible-root/tmp/ansible-tmp-1633005258.16-279060645036411/AnsiballZ_systemd.py && sleep 0'
<localhost> EXEC /bin/sh -c '/usr/bin/python /tmp/.ansible-root/tmp/ansible-tmp-1633005258.16-279060645036411/AnsiballZ_systemd.py && sleep 0'
<localhost> EXEC /bin/sh -c 'rm -f -r /tmp/.ansible-root/tmp/ansible-tmp-...

Read more...

Revision history for this message
OpenInfra (openinfra) wrote :

systemctl status etcd.service
● etcd.service - Etcd Server
   Loaded: loaded (/usr/lib/systemd/system/etcd.service; disabled; vendor preset: disabled)
   Active: failed (Result: start-limit) since Thu 2021-09-30 12:34:19 UTC; 12min ago
  Process: 28444 ExecStart=/bin/bash -c GOMAXPROCS=$(nproc) /usr/bin/etcd --name="${ETCD_NAME}" --data-dir="${ETCD_DATA_DIR}" --listen-client-urls="${ETCD_LISTEN_CLIENT_URLS}" (code=exited, status=1/FAILURE)
 Main PID: 28444 (code=exited, status=1/FAILURE)

Revision history for this message
OpenInfra (openinfra) wrote :
Download full text (3.7 KiB)

controller-0:~$ systemctl status etcd.service
● etcd.service - Etcd Server
   Loaded: loaded (/usr/lib/systemd/system/etcd.service; disabled; vendor preset: disabled)
   Active: failed (Result: start-limit) since Mon 2021-10-04 06:44:35 UTC; 8min ago
  Process: 114537 ExecStart=/bin/bash -c GOMAXPROCS=$(nproc) /usr/bin/etcd --name="${ETCD_NAME}" --data-dir="${ETCD_DATA_DIR}" --listen-client-urls="${ETCD_LISTEN_CLIENT_URLS}" (code=exited, status=1/FAILURE)
 Main PID: 114537 (code=exited, status=1/FAILURE)

controller-0:~$ sudo journalctl -xe
--
-- Unit dracut-pre-udev.service has finished shutting down.
Sep 30 11:59:20 localhost systemd[1]: Stopped dracut cmdline hook.
-- Subject: Unit dracut-cmdline.service has finished shutting down
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
--
-- Unit dracut-cmdline.service has finished shutting down.
Sep 30 11:59:20 localhost systemd[1]: Stopped Create Static Device Nodes in /dev.
-- Subject: Unit systemd-tmpfiles-setup-dev.service has finished shutting down
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
--
-- Unit systemd-tmpfiles-setup-dev.service has finished shutting down.
Sep 30 11:59:20 localhost systemd[1]: Stopped Create list of required static device nodes for the current kernel.
-- Subject: Unit kmod-static-nodes.service has finished shutting down
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
--
-- Unit kmod-static-nodes.service has finished shutting down.
Sep 30 11:59:20 localhost systemd[1]: Closed udev Kernel Socket.
-- Subject: Unit systemd-udevd-kernel.socket has finished shutting down
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
--
-- Unit systemd-udevd-kernel.socket has finished shutting down.
Sep 30 11:59:20 localhost systemd[1]: Closed udev Control Socket.
-- Subject: Unit systemd-udevd-control.socket has finished shutting down
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
--
-- Unit systemd-udevd-control.socket has finished shutting down.
Sep 30 11:59:20 localhost systemd[1]: Starting Cleanup udevd DB...
-- Subject: Unit initrd-udevadm-cleanup-db.service has begun start-up
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
--
-- Unit initrd-udevadm-cleanup-db.service has begun starting up.
Sep 30 11:59:20 localhost systemd[1]: Started Cleanup udevd DB.
-- Subject: Unit initrd-udevadm-cleanup-db.service has finished start-up
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
--
-- Unit initrd-udevadm-cleanup-db.service has finished starting up.
--
-- The start-up result is done.
Sep 30 11:59:20 localhost systemd[1]: Reached target Switch Root.
-- Subject: Unit initrd-switch-root.target has finished start-up
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
--
-- Unit initrd-switch-root.target has finished starting up.
--
-- The start-up result is done.
Sep 30 11:59:20 localhost systemd[1]: Starting Swit...

Read more...

Ghada Khalil (gkhalil)
tags: added: stx.containers
Revision history for this message
OpenInfra (openinfra) wrote :

Freshly install stx: 5.0.1 but could not reproduce this issue.
Please note that now I don''t have the previous setup (no logs, etc)

Revision history for this message
Ghada Khalil (gkhalil) wrote :

Closing as the issue is not reproducible and there are no logs

tags: added: stx.5.0
Changed in starlingx:
importance: Undecided → Low
status: New → Invalid
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.