kubernetes certificates renewal failure on controller-1
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
High
|
Reinildes Oliveira |
Bug Description
Brief Description
-------
Alarm 250.003, kubernetes certificates renewal failed is raised for controller-1 on multiple labs at the exact same time. This is causing multiple failures over many automated suites.
Severity
-------
Major: System is usable but degraded
Steps to Reproduce
-------
Install master debian build 2023-07-27_18-00-35
wait for k8s certificates to expire on controller-1
Expected Behavior
-------
all k8s certificates should get renewed without issues
Actual Behavior
-------
k8s certificates are fail to be renewed on controller-1
Reproducibility
-------
Seen once
System Configuration
-------
Issue seen on:
dx, multi-node
Load info (eg: 2022-03-
all mentioned labs shared same load
SW_VERSION="23.09"
BUILD_TARGET="Host Installer"
BUILD_TYPE="Formal"
BUILD_ID=
Last Pass
-------
Not applicable. Alarm was not seen during previous weekend regression
Alarms
-------
[sysadmin@
+------
| Alarm ID | Reason Text | Entity ID | Severity | Time Stamp |
+------
| 250.001 | controller-0 Configuration is out-of-date. (applied: 58060d78-56be- | host=controller-0 | major | 2023-07-31T19 |
| | 46e1-9044-
| | | | | |
| 250.003 | Kubernetes certificates renewal failed. | host=controller-1 | major | 2023-07-31T00 |
| | | | | :10:01.454623 |
| | | | | |
+------
Test Activity
-------
Regression Testing
Workaround
-------
No workaround known so far
Changed in starlingx: | |
assignee: | nobody → Reinildes Oliveira (rjosemat) |
Changed in starlingx: | |
importance: | Undecided → High |
tags: | added: stx.9.0 stx.fault stx.security |
Fix proposed to branch: master /review. opendev. org/c/starlingx /config/ +/890442
Review: https:/