Master controller reboots due to kernel panic after ansible play
| Affects | Status | Importance | Assigned to | Milestone |
|---|---|---|---|---|
| StarlingX | Fix Released | High | Alexander Kozyrev | |
Bug Description
Brief Description
-----------------
After executing the Ansible playbook on a fresh install, the system restarts automatically. When it comes back up it is unstable, and setup cannot continue.
Severity
--------
Critical: Setup cannot be done.
Steps to Reproduce
------------------
Follow the wiki procedure to do a fresh install of StarlingX.
After the system boots, transfer localhost.yml and execute the Ansible playbook. Shortly afterwards, the system reboots automatically, without any other command being run.
Once the system is back online, setup cannot continue because the system complains that it is not the active controller, probably because the MGMT/OAM configuration was never applied (the reboot interrupted it).
Expected Behavior
------------------
The main controller should not need to reboot until the unlock is done (after some pre-configuration).
Actual Behavior
----------------
Main controller reboots with incomplete setup.
Reproducibility
---------------
100% on Simplex, Duplex, Standard, Standard External Storage
System Configuration
-------
Simplex, Duplex, Standard, Standard External Storage
Branch/Pull Time/Commit
-------
20190602T233000Z
Last Pass
---------
20190530T152953Z - Sanity execution passed with this ISO. With the 05/31 and 06/01 images, the setup also failed.
Timestamp/Logs
--------------
Full collect of a Standard (2+2) configuration attached.
controller-0:~$ sudo grep Error /var/log/
sudo: ldap_sasl_bind_s(): Can't contact LDAP server
Password:
2019-06-
2019-06-
controller-0:~$
No relevant errors found in dmesg or kern.log.
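For anyone repeating the log check, here is a minimal sketch of scanning log text for kernel panic markers. The marker strings are an assumption: they are common Linux kernel message prefixes, not strings taken from this report. The function names `find_panic_lines` and `scan_file` are hypothetical helpers for illustration.

```python
import re
from typing import Iterable, List, Tuple

# Assumed markers: common Linux kernel message prefixes, not taken from this report.
PANIC_PATTERN = re.compile(r"Kernel panic|Oops:|BUG:|Call Trace")


def find_panic_lines(lines: Iterable[str]) -> List[Tuple[int, str]]:
    """Return (line_number, text) for every line matching a panic marker."""
    hits = []
    for number, line in enumerate(lines, start=1):
        if PANIC_PATTERN.search(line):
            hits.append((number, line.rstrip("\n")))
    return hits


def scan_file(path: str) -> List[Tuple[int, str]]:
    """Scan one log file; undecodable bytes are replaced rather than raising."""
    with open(path, errors="replace") as handle:
        return find_panic_lines(handle)
```

An empty result does not rule out a panic: the kernel may halt before the message reaches the on-disk log, so the serial or BMC console is often a better place to look.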
fm alarm-list and ceph -s are unresponsive; kubectl also returns the following error:
controller-0:~$ kubectl get pods --all-namespaces
Unable to connect to the server: dial tcp 192.168.206.2:6443: connect: network is unreachable
controller-0:~$
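The kubectl failure above can be separated from kubectl itself by probing the same TCP endpoint directly. This is a minimal sketch using only the Python standard library; `probe_tcp` is a hypothetical helper name, and the 3 second default timeout is an arbitrary choice.

```python
import errno
import socket


def probe_tcp(host: str, port: int, timeout: float = 3.0) -> str:
    """Return 'open' if a TCP connection succeeds, else a short failure reason."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except socket.timeout:
        return "timeout"
    except OSError as exc:
        # ENETUNREACH is the errno behind "network is unreachable".
        if exc.errno == errno.ENETUNREACH:
            return "network unreachable"
        if exc.errno == errno.ECONNREFUSED:
            return "connection refused"
        return f"error: {exc}"
```

Calling `probe_tcp("192.168.206.2", 6443)` on the controller targets the address and port from the error message. A "network unreachable" result would point at missing interface or route configuration rather than at the Kubernetes API server process.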
Test Activity
-------------
Sanity
| tags: |
added: stx.distro.other removed: stx.config |
| summary: |
- Master controller reboots after ansible play + Master controller reboots due to kernel panic after ansible play |
| Changed in starlingx: | |
| status: | Triaged → In Progress |

Marking as release gating; issue is blocking sanity and appears to have been introduced in the 2019-05-31 load.