False alarm 750.005 (Application Update In Progress) is raised after upgrade

Bug #2003228 reported by Marcelo de Castro Loebens
This bug affects 1 person
Affects    Status        Importance  Assigned to                Milestone
StarlingX  Fix Released  Low         Marcelo de Castro Loebens

Bug Description

Brief Description
-----------------
Alarm 750.005 (Application Update In Progress) is raised after an upgrade to 22.12 and is still active one day after the upgrade completed.

[sysadmin@controller-0 ~(keystone_admin)]$ fm alarm-list
+----------+-----------------------------------------------------------------------------------------------------------+-------------------------------+----------+---------------------+
| Alarm ID | Reason Text                                                                                               | Entity ID                     | Severity | Time Stamp          |
+----------+-----------------------------------------------------------------------------------------------------------+-------------------------------+----------+---------------------+
| 500.200  | Certificate namespace=kube-system, certificate=oidc-auth-apps-certificate is expiring soon on 2022-12-18, | namespace=kube-system.        | major    | 2022-12-18T15:53:09 |
|          | 16:52:26                                                                                                  | certificate=oidc-auth-apps-   |          | .172366             |
|          |                                                                                                           | certificate                   |          |                     |
|          |                                                                                                           |                               |          |                     |
| 200.015  | controller-0 has one or more board management controller sensor group read failures                       | host=controller-0.sensorgroup | major    | 2022-12-18T00:47:18 |
|          |                                                                                                           | =server fans                  |          | .403421             |
|          |                                                                                                           |                               |          |                     |
| 200.015  | controller-0 has one or more board management controller sensor group read failures                       | host=controller-0.sensorgroup | major    | 2022-12-18T00:47:18 |
|          |                                                                                                           | =server temperature           |          | .323105             |
|          |                                                                                                           |                               |          |                     |
| 200.015  | controller-0 has one or more board management controller sensor group read failures                       | host=controller-0.sensorgroup | major    | 2022-12-18T00:47:18 |
|          |                                                                                                           | =server power                 |          | .284404             |
|          |                                                                                                           |                               |          |                     |
| 200.015  | controller-0 has one or more board management controller sensor group read failures                       | host=controller-0.sensorgroup | major    | 2022-12-18T00:47:18 |
|          |                                                                                                           | =server voltage               |          | .241260             |
|          |                                                                                                           |                               |          |                     |
| 750.005  | Application Update In Progress                                                                            | k8s_application=cert-manager  | warning  | 2022-12-18T00:31:57 |
|          |                                                                                                           |                               |          | .282863             |
|          |                                                                                                           |                               |          |                     |
+----------+-----------------------------------------------------------------------------------------------------------+-------------------------------+----------+---------------------+

Severity
--------
Minor

Steps to Reproduce
------------------
Execute a platform upgrade. The issue is intermittent.

Expected Behavior
-----------------
The alarm should be cleared once the k8s application update finishes as part of the platform upgrade.

Actual Behavior
---------------
Alarm 750.005 intermittently stays raised after the upgrade has completed.

Reproducibility
---------------
Intermittent. Seen 4 times.

System Configuration
--------------------
Simplex

Branch/Pull Time/Commit
-----------------------
NA

Last Pass
---------
NA

Timestamp/Logs
--------------
NA

Test Activity
-------------
Developer Testing

Workaround
----------
Manually remove the stale alarm: list the active alarms with their UUIDs, then delete the 750.005 entry by its UUID.
fm alarm-list --uuid
fm alarm-delete <alarm_uuid>
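
The workaround can be scripted when it has to be repeated on several systems. The following is a minimal sketch, not part of the fix: it parses the table text printed by fm alarm-list --uuid and builds the matching fm alarm-delete commands without running them. It assumes the output carries "UUID" and "Alarm ID" column headers and uses the same wrapped-row layout as the table in the Brief Description; the function names are hypothetical.

```python
def parse_fm_table(text):
    """Parse table text such as `fm alarm-list` output into a list of dicts.

    Rows whose first column is empty are treated as continuations of the
    previous row (wrapped cells); fully blank spacer rows are skipped.
    """
    header = None
    rows = []
    for line in text.splitlines():
        line = line.strip()
        # Skip +----+ borders, shell prompts, and anything that is not a row.
        if not (line.startswith("|") and line.endswith("|")):
            continue
        cells = [cell.strip() for cell in line[1:-1].split("|")]
        if header is None:
            header = cells
            continue
        if len(cells) != len(header) or not any(cells):
            continue
        if cells[0] == "" and rows:
            # Wrapped cell: append the fragment with no separator. This suits
            # entity IDs and timestamps but may drop a space in wrapped prose.
            for name, cell in zip(header, cells):
                rows[-1][name] += cell
            continue
        rows.append(dict(zip(header, cells)))
    return rows


def find_alarms(text, alarm_id="750.005"):
    """Return the parsed rows whose Alarm ID matches alarm_id."""
    return [row for row in parse_fm_table(text) if row.get("Alarm ID") == alarm_id]


def delete_commands(text, alarm_id="750.005"):
    """Build one `fm alarm-delete <uuid>` command per matching alarm."""
    return [
        "fm alarm-delete " + row["UUID"]
        for row in find_alarms(text, alarm_id)
        if row.get("UUID")  # assumed column name; present only with --uuid
    ]
```

Feed the captured output of fm alarm-list --uuid to delete_commands() and review the returned commands before running any of them. An Application Update In Progress alarm can be legitimate while an update is actually running, so only delete it once the application update is known to have finished.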

Changed in starlingx:
assignee: nobody → Marcelo de Castro Loebens (mdecastr)
status: New → In Progress
Ghada Khalil (gkhalil)
summary: - False alarm 750.005 (Application Update In Progress) is raised for cert-manager after upgrade
+ False alarm 750.005 (Application Update In Progress) is raised for application after upgrade
summary: - False alarm 750.005 (Application Update In Progress) is raised for application after upgrade
+ False alarm 750.005 (Application Update In Progress) is raised after upgrade
Ghada Khalil (gkhalil)
Changed in starlingx:
importance: Undecided → Low
tags: added: stx.fault
description: updated
Ghada Khalil (gkhalil) wrote :

Fixed By: https://review.opendev.org/c/starlingx/config/+/870669
For some reason, the LP was not updated automatically.

tags: added: stx.8.0 stx.update
Changed in starlingx:
status: In Progress → Fix Released