cert-manager fails to apply after controller-0 is rebooted
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
Medium
|
Dan Voiculeasa |
Bug Description
Brief Description
-----------------
Scenario for this issue: STX.5.0 Release
In an AIO-DX reboot stand-by controller. When it comes up it becomes active(don't know what triggered this). Sysinv comes up, sees that alarms to reapply the apps were present, starts to apply the apps. In ~20 seconds from the startup, while cert-manager is still applying, sysinv is killed/
Severity
--------
Major: requires manual intervention to reapply the app
Steps to Reproduce
------------------
Read the brief description, it describes the scenario and claims there are 2 unknowns on how to reproduce.
The steps would be 'sudo reboot' on stand-by controller but there are some conditions I don't know how to achieve.
For what we care we can emulate by raising app reapply alarms + sysinv conductor restarts.
Expected Behavior
------------------
First: status of 'cert-manager' should be reset to 'uploaded' not 'apply-failed'.
Second: We may want to keep the apps auto-managed. For example 'cert-manager' should not need manual intervention to get out of 'uploaded' state. Currently we have limitations on some apps, which have auto-apply feature disabled.
Actual Behavior
----------------
cert-manager has status 'apply-failed'
Reproducibility
---------------
Seen once
System Configuration
-------
AIO-DX, but can affect any multi-node.
Branch/Pull Time/Commit
-------
STX.5.0
Last Pass
---------
N/A
Timestamp/Logs
--------------
Test Activity
-------------
Production
Workaround
----------
Manually apply cert-manager
Changed in starlingx: | |
assignee: | nobody → Dan Voiculeasa (dvoicule) |
Changed in starlingx: | |
importance: | Undecided → Medium |
tags: | added: stx.8.0 stx.apps |
Probably cert-manager and nginx-ingress- controller auto-apply feature is disabled because they are manually applied during bootstrap. We could enable auto-apply feature for these apps checking if it is not during bootstrap and other possible conditions (the framework already accounts for restore and upgrades).
platform-integ-apps has the auto-apply feature enabled.
Don't know about other apps, will assume they don't have it enabled.