sw-manager kube-upgrade-strategy orchestration failed due to wait-alarms-clear timeout
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
Medium
|
Igor Pires Soares |
Bug Description
Brief Description
-----------------
sw-manager kube-upgrade-
Note: It's probably timing / intermittent issue, as I did orchestration several rounds/labs, this is the only time to hit such issue.
Severity
-----------------
Major
Steps to Reproduce
-----------------
sw-manager kube-upgrade-
sw-manager kube-upgrade-
Expected Behavior
-----------------
sw-manager kube-upgrade-
Actual Behavior
-----------------
sw-manager kube-upgrade-
Reproducibility
-----------------
Not sure
System Configuration
-----------------
multi-node
Timestamp/Logs
-----------------
[sysadmin@
Strategy Kubernetes Upgrade Strategy:
strategy-uuid: d38b6d46-
controller-
storage-
worker-
default-
alarm-
current-phase: abort
current-
state: aborted
apply-result: timed-out
apply-reason:
abort-result: success
abort-reason:
[sysadmin@
+----+-
| id | hostname | personality | target_version | control_
+----+-
| 1 | controller-0 | controller | v1.24.4 | v1.24.4 | v1.24.4 | upgraded-kubelet |
| 2 | compute-0 | worker | v1.23.1 | N/A | v1.23.1 | None |
| 3 | controller-1 | controller | v1.24.4 | v1.24.4 | v1.24.4 | upgraded-kubelet |
+----+-
[sysadmin@
...
result: success
reason:
result: success
reason:
result: success
reason:
result: timed-out
reason:
stage-id: 6
stage-name: kube-upgrade-
timeout: 3736 seconds
result: initial
reason:
steps:
...
[sysadmin@
+------
| Alarm ID | Reason Text | Entity ID | Severity | Time Stamp |
+------
| 750.006 | A configuration change requires a reapply of the oidc-auth-apps | k8s_application= | warning | 2022-12-20T21 |
| | application. | oidc-auth-apps | | :35:04.047739 |
| | | | | |
| 750.006 | A configuration change requires a reapply of the platform-integ-apps | k8s_application= | warning | 2022-12-20T21 |
| | application. | platform-integ-apps | | :35:02.837708 |
| | | | | |
| 750.006 | A configuration change requires a reapply of the cert-manager application | k8s_application= | warning | 2022-12-20T21 |
| | . | cert-manager | | :35:01.490398 |
| | | | | |
| 100.114 | NTP address 91.207.136.55 is not a valid or a reachable NTP server. | host=controller
| | | .207.136.55 | | :34:47.116198 |
| | | | | |
| 900.007 | Kubernetes upgrade in progress. | host=controller | minor | 2022-12-20T20 |
| | | | | :55:00.596354 |
| | | | | |
+------
Test Activity
-----------------
Feature Testing
Changed in starlingx: | |
status: | New → In Progress |
Changed in starlingx: | |
importance: | Undecided → Low |
importance: | Low → Medium |
tags: | added: stx.nfv |
Changed in starlingx: | |
assignee: | nobody → Igor Pires Soares (ipiresso) |
tags: | added: stx.9.0 |
Fix proposed to branch: master /review. opendev. org/c/starlingx /cert-manager- armada- app/+/871748
Review: https:/