AIO Simplex controller degraded
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX | Invalid | Critical | Don Penney | | |
Bug Description
Brief Description
-----------------
During the initial setup of a Simplex configuration, the controller remains in degraded status after the unlock.
Severity
--------
Critical
Steps to Reproduce
------------------
Follow the documentation to complete a Simplex install.
Expected Behavior
------------------
After the unlock, controller-0 exits the degraded condition after a few minutes.
Actual Behavior
----------------
After the unlock, controller-0 remains in degraded status.
Reproducibility
---------------
100% reproducible on baremetal.
System Configuration
-------
Simplex, baremetal.
Branch/Pull Time/Commit
-------
20200115T023003Z
Last Pass
---------
Passed on build from Jan/11.
Timestamp/Logs
--------------
Some outputs: http://
Test Activity
-------------
Sanity
Changed in starlingx: | |
importance: | Undecided → Critical |
tags: | added: stx.4.0 |
description: | updated |
Changed in starlingx: | |
assignee: | nobody → Don Penney (dpenney) |
Changed in starlingx: | |
status: | New → Invalid |
How are you doing this installation? From the anaconda logs in the collected logs, it looks like you may be using the net_smallsystem_ks.cfg file as the basis for a remote installation kickstart. The net_* kickstart files are for installation of nodes from the active controller, not for the initial installation of controller-0. The initial installation kickstart (smallsystem_ks.cfg on the ISO) provides some extra steps for the installation of the first controller, including software repo setup/mirroring.
For installation from a network server, there is a pxeboot_setup.sh utility in the ISO and pxeboot_*.cfg kickstart templates that have the extra steps needed for the installation of the first controller.
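The naming convention described above can be summarized in a small Python sketch. The helper names `kickstart_role` and `suitable_for_first_controller` are hypothetical and not part of StarlingX; the mapping covers only the file names mentioned in this comment, so any other kickstart on the ISO is reported as unknown.

```python
# Hypothetical helper: maps a kickstart file name to its intended use,
# based only on the naming described in this comment thread.
def kickstart_role(filename):
    if filename.startswith("net_"):
        # net_* kickstarts install nodes from the active controller.
        return "node install from the active controller"
    if filename.startswith("pxeboot_") and filename.endswith(".cfg"):
        # pxeboot_*.cfg templates include the first-controller steps.
        return "initial controller-0 install from a network server"
    if filename == "smallsystem_ks.cfg":
        # The ISO kickstart for the initial AIO controller install.
        return "initial controller-0 install from the ISO"
    return "unknown"


def suitable_for_first_controller(filename):
    """True only for kickstarts described as initial controller-0 installs."""
    return kickstart_role(filename).startswith("initial controller-0")


if __name__ == "__main__":
    # "pxeboot_example.cfg" is an illustrative name matching pxeboot_*.cfg.
    for name in ("net_smallsystem_ks.cfg", "smallsystem_ks.cfg", "pxeboot_example.cfg"):
        print(name, "->", kickstart_role(name))
```

Under this reading, net_smallsystem_ks.cfg is flagged as unsuitable for the first controller, which matches the diagnosis in this comment.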
The patch-agent is failing because it cannot find the required software groups, an indication that the software repos were not properly mirrored as part of the post-installation setup that is in the kickstarts of the initial controller install:
2020-01-15T12:57:56: sw-patch-agent[1053928]: base.py(415): WARNING: Failed to synchronize cache for repo 'platform-base', ignoring this repo.
2020-01-15T12:57:56: sw-patch-agent[1053928]: patch_agent.py(493): ERROR: Could not find software group: updates-controller-worker
2020-01-15T12:57:56: sw-patch-agent[1053928]: patch_functions.py(68): ERROR: Uncaught exception
Traceback (most recent call last):
  File "/usr/sbin/sw-patch-agent", line 15, in <module>
    main()
  File "/usr/lib64/python2.7/site-packages/cgcs_patch/patch_agent.py", line 870, in main
    pa.query()
  File "/usr/lib64/python2.7/site-packages/cgcs_patch/patch_agent.py", line 495, in query
    for pkg in pkggrp.packages_iter():
AttributeError: 'NoneType' object has no attribute 'packages_iter'
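The traceback pattern can be illustrated with a minimal Python sketch. This is not the cgcs_patch code: `FakeGroup`, `find_group`, `unguarded_query`, and `guarded_query` are hypothetical stand-ins, and the assumption is only that a group lookup returns None when the group is missing from the repo metadata.

```python
# Illustrative stand-ins only; none of these names come from cgcs_patch.
class FakeGroup:
    """Minimal object exposing packages_iter(), like the group in the traceback."""

    def __init__(self, packages):
        self._packages = list(packages)

    def packages_iter(self):
        return iter(self._packages)


def find_group(groups, name):
    # Assumption: a missing group yields None rather than raising.
    return groups.get(name)


def unguarded_query(groups, name):
    pkggrp = find_group(groups, name)
    # With pkggrp == None this raises AttributeError, as in the log above.
    return [pkg for pkg in pkggrp.packages_iter()]


def guarded_query(groups, name):
    pkggrp = find_group(groups, name)
    if pkggrp is None:
        # Fail with a message that points at the likely cause.
        raise LookupError("software group not found: %s (repo not mirrored?)" % name)
    return [pkg for pkg in pkggrp.packages_iter()]


if __name__ == "__main__":
    empty_repo = {}  # models a repo whose group metadata was never set up
    try:
        unguarded_query(empty_repo, "updates-controller-worker")
    except AttributeError as exc:
        print("unguarded:", exc)
    try:
        guarded_query(empty_repo, "updates-controller-worker")
    except LookupError as exc:
        print("guarded:", exc)
```

Either way the agent cannot proceed without the group; the guard only makes the root cause (missing repo content) easier to see than the AttributeError does.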
You can see these additional pieces for the first controller installation in the following kickstart sections:
https://opendev.org/starlingx/metal/src/branch/master/bsp-files/kickstarts/post_usb_controller.cfg
https://opendev.org/starlingx/metal/src/branch/master/bsp-files/kickstarts/post_pxeboot_controller.cfg
I downloaded the 20200115T023003Z ISO and verified installation works as expected.