2023-01-18 15:30:52 |
Marcelo de Castro Loebens |
bug |
|
|
added bug |
2023-01-18 15:31:00 |
Marcelo de Castro Loebens |
starlingx: assignee |
|
Marcelo de Castro Loebens (mdecastr) |
|
2023-01-18 15:31:08 |
Marcelo de Castro Loebens |
starlingx: status |
New |
In Progress |
|
2023-01-18 16:13:36 |
Ghada Khalil |
summary |
False alarm 750.005 (Application Update In Progress) is raised for cert-manager after upgrade |
False alarm 750.005 (Application Update In Progress) is raised for application after upgrade |
|
2023-01-18 16:13:44 |
Ghada Khalil |
summary |
False alarm 750.005 (Application Update In Progress) is raised for application after upgrade |
False alarm 750.005 (Application Update In Progress) is raised after upgrade |
|
2023-01-18 16:22:43 |
Ghada Khalil |
starlingx: importance |
Undecided |
Low |
|
2023-01-18 16:22:54 |
Ghada Khalil |
tags |
|
stx.fault |
|
2023-01-18 16:24:22 |
Ghada Khalil |
description |
Brief Description
-----------------
Alarm 750.005 | Application Update In Progress is raised post upgrade to 22.12 and remains active/raised after 1 day of upgrade completion.
[sysadmin@controller-0 ~(keystone_admin)]$ fm alarm-list
+----------+-----------------------------------------------------------------------------------------------------------+-------------------------------+----------+---------------------+
| Alarm ID | Reason Text | Entity ID | Severity | Time Stamp |
+----------+-----------------------------------------------------------------------------------------------------------+-------------------------------+----------+---------------------+
| 500.200 | Certificate namespace=kube-system, certificate=oidc-auth-apps-certificate is expiring soon on 2022-12-18, | namespace=kube-system. | major | 2022-12-18T15:53:09 |
| | 16:52:26 | certificate=oidc-auth-apps- | | .172366 |
| | | certificate | | |
| | | | | |
| 200.015 | controller-0 has one or more board management controller sensor group read failures | host=controller-0.sensorgroup | major | 2022-12-18T00:47:18 |
| | | =server fans | | .403421 |
| | | | | |
| 200.015 | controller-0 has one or more board management controller sensor group read failures | host=controller-0.sensorgroup | major | 2022-12-18T00:47:18 |
| | | =server temperature | | .323105 |
| | | | | |
| 200.015 | controller-0 has one or more board management controller sensor group read failures | host=controller-0.sensorgroup | major | 2022-12-18T00:47:18 |
| | | =server power | | .284404 |
| | | | | |
| 200.015 | controller-0 has one or more board management controller sensor group read failures | host=controller-0.sensorgroup | major | 2022-12-18T00:47:18 |
| | | =server voltage | | .241260 |
| | | | | |
| 750.005 | Application Update In Progress | k8s_application=cert-manager | warning | 2022-12-18T00:31:57 |
| | | | | .282863 |
| | | | | |
+----------+-----------------------------------------------------------------------------------------------------------+-------------------------------+----------+---------------------+
Severity
--------
Minor
Steps to Reproduce
------------------
Execute upgrade from 22.06 to 22.12. Issue is intermittent.
Expected Behavior
------------------
Alarm should be remove after the k8s app update, as part of the platform upgrade.
Actual Behavior
----------------
Alarm 750.005 is raised sometimes.
Reproducibility
---------------
Intermittent. Seen 4 times.
System Configuration
--------------------
Simplex
Branch/Pull Time/Commit
-----------------------
NA
Last Pass
---------
NA
Timestamp/Logs
--------------
NA
Test Activity
-------------
Developer Testing
Workaround
----------
Manually remove the alarm
fm alarm-list --uuid
fm alarm-delete <alarm_uuid> |
Brief Description
-----------------
Alarm 750.005 | Application Update In Progress is raised post upgrade to 22.12 and remains active/raised after 1 day of upgrade completion.
[sysadmin@controller-0 ~(keystone_admin)]$ fm alarm-list
+----------+-----------------------------------------------------------------------------------------------------------+-------------------------------+----------+---------------------+
| Alarm ID | Reason Text | Entity ID | Severity | Time Stamp |
+----------+-----------------------------------------------------------------------------------------------------------+-------------------------------+----------+---------------------+
| 500.200 | Certificate namespace=kube-system, certificate=oidc-auth-apps-certificate is expiring soon on 2022-12-18, | namespace=kube-system. | major | 2022-12-18T15:53:09 |
| | 16:52:26 | certificate=oidc-auth-apps- | | .172366 |
| | | certificate | | |
| | | | | |
| 200.015 | controller-0 has one or more board management controller sensor group read failures | host=controller-0.sensorgroup | major | 2022-12-18T00:47:18 |
| | | =server fans | | .403421 |
| | | | | |
| 200.015 | controller-0 has one or more board management controller sensor group read failures | host=controller-0.sensorgroup | major | 2022-12-18T00:47:18 |
| | | =server temperature | | .323105 |
| | | | | |
| 200.015 | controller-0 has one or more board management controller sensor group read failures | host=controller-0.sensorgroup | major | 2022-12-18T00:47:18 |
| | | =server power | | .284404 |
| | | | | |
| 200.015 | controller-0 has one or more board management controller sensor group read failures | host=controller-0.sensorgroup | major | 2022-12-18T00:47:18 |
| | | =server voltage | | .241260 |
| | | | | |
| 750.005 | Application Update In Progress | k8s_application=cert-manager | warning | 2022-12-18T00:31:57 |
| | | | | .282863 |
| | | | | |
+----------+-----------------------------------------------------------------------------------------------------------+-------------------------------+----------+---------------------+
Severity
--------
Minor
Steps to Reproduce
------------------
Execute upgrade. Issue is intermittent.
Expected Behavior
------------------
Alarm should be removed after the k8s app update, as part of the platform upgrade.
Actual Behavior
----------------
Alarm 750.005 is raised sometimes.
Reproducibility
---------------
Intermittent. Seen 4 times.
System Configuration
--------------------
Simplex
Branch/Pull Time/Commit
-----------------------
NA
Last Pass
---------
NA
Timestamp/Logs
--------------
NA
Test Activity
-------------
Developer Testing
Workaround
----------
Manually remove the alarm
fm alarm-list --uuid
fm alarm-delete <alarm_uuid> |
|
2023-01-27 23:48:04 |
Ghada Khalil |
tags |
stx.fault |
stx.8.0 stx.fault stx.update |
|
2023-01-27 23:48:20 |
Ghada Khalil |
starlingx: status |
In Progress |
Fix Released |
|