Brief Description
-----------------
After initial unlocking controller-1, controller-1 reboot and manifests were applied successfully. However, a few seconds after worker manifest applied, another reboot occurred unexpectedly. After the second reboot completed, controller-1 was enabled and unlocked. The mtcClient log on controller-1 showed that the sysinv goenabled check didn't pass after the first reboot.
Severity
--------
Major
Steps to Reproduce
------------------
1. bootstrap and unlock controller-0
2. install controller-1
3. configure controller-1
4. host-unlock controller-1
Expected Behavior
------------------
controller-1 reboots once after unlock
Actual Behavior
----------------
controller-1 reboots twice after unlock
Reproducibility
---------------
Reproducible
System Configuration
--------------------
AIO-DX
Lab-name: IP_5_6, vbox
Branch/Pull Time/Commit
-----------------------
BUILD_DATE="2019-08-26 21:00:20 -0400"
Last Pass
---------
Don't know
Timestamp/Logs
--------------
2019-08-27T07:28:21.642 [112079.00194] controller-0 mtcAgent --- nodeBase.cpp ( 675) log_adminAction : Info : controller-1 Unlock Action
controller-1:
daemon.log
2019-08-27T07:30:48.293 controller-1 controller_config[12976]: info Applying puppet controller manifest...
...
2019-08-27T07:36:19.828 controller-1 worker_config[106371]: info Applying puppet worker manifest...
2019-08-27T07:36:19.829 controller-1 systemd[1]: info Started Service Management API Unit.
2019-08-27T07:36:19.859 controller-1 systemd[1]: info Starting Service Management Event Recorder Unit...
...
2019-08-27T07:36:35.388 controller-1 systemd[1]: info Stopped target RPC Port Mapper.
2019-08-27T07:36:35.409 controller-1 systemd[1]: info Stopping Name Service Cache Daemon...
2019-08-27T07:36:35.416 controller-1 systemd[1]: info Stopping Authorization Manager...
2019-08-27T07:36:35.421 controller-1 systemd[1]: info Stopped target rpc_pipefs.target.
...
Rebooting ...
...
2019-08-27T07:41:17.015 controller-1 systemd[1]: info Started Service Management Unit.
2019-08-27T07:41:17.023 controller-1 systemd[1]: info Started Service Management Shutdown Unit.
2019-08-27T07:41:17.038 controller-1 systemd[1]: info Starting Service Management API Unit...
2019-08-27T07:41:17.077 controller-1 sm-api[98463]: info Starting sm-api: OK
2019-08-27T07:41:17.078 controller-1 systemd[1]: info Started Service Management API Unit.
2019-08-27T07:41:17.084 controller-1 worker_config[98137]: info /etc/init.d/worker_config: Running puppet manifest apply
2019-08-27T07:41:17.098 controller-1 systemd[1]: info Starting Service Management Event Recorder Unit...
2019-08-27T07:41:17.103 controller-1 worker_config[98137]: info Applying puppet worker manifest...
...
mtcClient.log
2019-08-27T07:36:19.532 [106386.00028] controller-1 mtcClient --- mtcNodeComp.cpp (1690) run_goenabled_scripts : Info : GoEnabled Scripts : Main-Function Context
2019-08-27T07:36:19.532 [106386.00029] controller-1 mtcClient --- mtcNodeComp.cpp (1408) _launch_all_scripts : Info : Sorted Test File List: 7
2019-08-27T07:36:19.532 [106386.00030] controller-1 mtcClient --- mtcNodeComp.cpp (1415) _launch_all_scripts : Info : ... /etc/goenabled.d/config_goenabled_check.sh start
2019-08-27T07:36:19.532 [106386.00031] controller-1 mtcClient --- mtcNodeComp.cpp (1415) _launch_all_scripts : Info : ... /etc/goenabled.d/inventory_goenabled_check.sh start
2019-08-27T07:36:19.532 [106386.00032] controller-1 mtcClient --- mtcNodeComp.cpp (1415) _launch_all_scripts : Info : ... /etc/goenabled.d/mlx4_core_goenabled.sh start
2019-08-27T07:36:19.532 [106386.00033] controller-1 mtcClient --- mtcNodeComp.cpp (1415) _launch_all_scripts : Info : ... /etc/goenabled.d/patch_check_goenabled.sh start
2019-08-27T07:36:19.532 [106386.00034] controller-1 mtcClient --- mtcNodeComp.cpp (1415) _launch_all_scripts : Info : ... /etc/goenabled.d/sysinv_goenabled_check.sh start
2019-08-27T07:36:19.532 [106386.00035] controller-1 mtcClient --- mtcNodeComp.cpp (1415) _launch_all_scripts : Info : ... /etc/goenabled.d/virt-support-goenabled.sh start
2019-08-27T07:36:19.532 [106386.00036] controller-1 mtcClient --- mtcNodeComp.cpp (1415) _launch_all_scripts : Info : ... /etc/goenabled.d/worker-goenabled.sh start
2019-08-27T07:36:19.534 [106386.00037] controller-1 mtcClient --- mtcNodeComp.cpp (1179) daemon_service_run : Info : main goenable tests started
2019-08-27T07:36:19.536 [106386.00038] controller-1 mtcClient --- mtcNodeComp.cpp (1787) daemon_sigchld_hdlr : Info : PASSED: /etc/goenabled.d/config_goenabled_check.sh (0.002 secs)
2019-08-27T07:36:19.536 [106386.00039] controller-1 mtcClient --- mtcNodeComp.cpp (1787) daemon_sigchld_hdlr : Info : PASSED: /etc/goenabled.d/mlx4_core_goenabled.sh (0.002 secs)
2019-08-27T07:36:19.536 [106386.00040] controller-1 mtcClient --- mtcNodeComp.cpp (1787) daemon_sigchld_hdlr : Info : PASSED: /etc/goenabled.d/patch_check_goenabled.sh (0.002 secs)
2019-08-27T07:36:19.537 [106386.00041] controller-1 mtcClient --- mtcNodeComp.cpp (1787) daemon_sigchld_hdlr : Info : PASSED: /etc/goenabled.d/worker-goenabled.sh (0.002 secs)
2019-08-27T07:36:20.014 [106386.00042] controller-1 mtcClient --- mtcNodeComp.cpp (1787) daemon_sigchld_hdlr : Info : PASSED: /etc/goenabled.d/virt-support-goenabled.sh (0.480 secs)
2019-08-27T07:36:20.483 [106386.00043] controller-1 mtcClient msg mtcCompMsg.cpp ( 196) mtc_service_command : Info : GoEnabled In-Progress
2019-08-27T07:36:20.483 [106386.00044] controller-1 mtcClient msg mtcCompMsg.cpp ( 410) mtc_service_command : Info : goEnabled main req reply send (Mgmnt)
2019-08-27T07:36:36.976 [106386.00045] controller-1 mtcClient sig daemon_signal.cpp ( 172) daemon_signal_hdlr : Info : Received SIGTERM
...
2019-08-27T07:41:16.799 [98218.00028] controller-1 mtcClient --- mtcNodeComp.cpp (1690) run_goenabled_scripts : Info : GoEnabled Scripts : Main-Function Context
2019-08-27T07:41:16.799 [98218.00029] controller-1 mtcClient --- mtcNodeComp.cpp (1408) _launch_all_scripts : Info : Sorted Test File List: 7
2019-08-27T07:41:16.799 [98218.00030] controller-1 mtcClient --- mtcNodeComp.cpp (1415) _launch_all_scripts : Info : ... /etc/goenabled.d/config_goenabled_check.sh start
2019-08-27T07:41:16.799 [98218.00031] controller-1 mtcClient --- mtcNodeComp.cpp (1415) _launch_all_scripts : Info : ... /etc/goenabled.d/inventory_goenabled_check.sh start
2019-08-27T07:41:16.799 [98218.00032] controller-1 mtcClient --- mtcNodeComp.cpp (1415) _launch_all_scripts : Info : ... /etc/goenabled.d/mlx4_core_goenabled.sh start
2019-08-27T07:41:16.799 [98218.00033] controller-1 mtcClient --- mtcNodeComp.cpp (1415) _launch_all_scripts : Info : ... /etc/goenabled.d/patch_check_goenabled.sh start
2019-08-27T07:41:16.799 [98218.00034] controller-1 mtcClient --- mtcNodeComp.cpp (1415) _launch_all_scripts : Info : ... /etc/goenabled.d/sysinv_goenabled_check.sh start
2019-08-27T07:41:16.799 [98218.00035] controller-1 mtcClient --- mtcNodeComp.cpp (1415) _launch_all_scripts : Info : ... /etc/goenabled.d/virt-support-goenabled.sh start
2019-08-27T07:41:16.799 [98218.00036] controller-1 mtcClient --- mtcNodeComp.cpp (1415) _launch_all_scripts : Info : ... /etc/goenabled.d/worker-goenabled.sh start
2019-08-27T07:41:16.801 [98218.00037] controller-1 mtcClient --- mtcNodeComp.cpp (1179) daemon_service_run : Info : main goenable tests started
2019-08-27T07:41:16.802 [98218.00038] controller-1 mtcClient --- mtcNodeComp.cpp (1787) daemon_sigchld_hdlr : Info : PASSED: /etc/goenabled.d/config_goenabled_check.sh (0.002 secs)
2019-08-27T07:41:16.803 [98218.00039] controller-1 mtcClient --- mtcNodeComp.cpp (1787) daemon_sigchld_hdlr : Info : PASSED: /etc/goenabled.d/mlx4_core_goenabled.sh (0.002 secs)
2019-08-27T07:41:16.803 [98218.00040] controller-1 mtcClient --- mtcNodeComp.cpp (1787) daemon_sigchld_hdlr : Info : PASSED: /etc/goenabled.d/patch_check_goenabled.sh (0.002 secs)
2019-08-27T07:41:16.803 [98218.00041] controller-1 mtcClient --- mtcNodeComp.cpp (1787) daemon_sigchld_hdlr : Info : PASSED: /etc/goenabled.d/worker-goenabled.sh (0.002 secs)
2019-08-27T07:41:17.264 [98218.00042] controller-1 mtcClient --- mtcNodeComp.cpp (1787) daemon_sigchld_hdlr : Info : PASSED: /etc/goenabled.d/virt-support-goenabled.sh (0.463 secs)
2019-08-27T07:41:47.750 [98218.00043] controller-1 mtcClient com nodeUtil.cpp (1202) get_node_health : Info : controller-1 is Healthy (0)
2019-08-27T07:42:02.846 [98218.00044] controller-1 mtcClient --- mtcNodeComp.cpp (1787) daemon_sigchld_hdlr : Info : PASSED: /etc/goenabled.d/inventory_goenabled_check.sh (46.045 secs)
2019-08-27T07:42:02.846 [98218.00045] controller-1 mtcClient --- mtcNodeComp.cpp (1787) daemon_sigchld_hdlr : Info : PASSED: /etc/goenabled.d/sysinv_goenabled_check.sh (46.046 secs)
2019-08-27T07:42:02.897 [98218.00046] controller-1 mtcClient --- mtcNodeComp.cpp ( 888) _manage_goenabled_tests : Info : GoEnabled Testing Complete ; all tests passed
Test Activity
-------------
Other
Also see this happened on compute nodes on a low-latency standard lab IP_1_4