"dcmanager fw-update-strategy apply" updates a subcloud failed, the subcloud can never be updated again
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
High
|
Eric MacDonald |
Bug Description
Brief Description
-----------------
"dcmanager fw-update-strategy apply" updates a subcloud failed, the subcloud can never be updated again.
Tried on both subcloud2 and subcloud4, same result.
Remove label and image on SystemController, next round update still doesn't work.
Remove label and image on both SystemController and the subcloud, lock/unlock the subcloud, next round update still doesn't work.
Severity
--------
Major
Steps to Reproduce
------------------
precondition: subcloud2 FPGA has flashed root-key image
"dcmanager fw-update-strategy apply" to update subcloud2 FPGA with an unsigned image
Expected Behavior
------------------
The subcloud2 FPGA updates failed, the fw strategy failed quickly. And "dcmanager fw-update-strategy show" shows failed with "finishing fw update: Not all images applied successfully". (observed on 2020-07-
After remove label and image, doing another round of update should work. (observed on 2020-07-
Actual Behavior
----------------
The subcloud2 FPGA updates failed, the fw strategy failed until an hour. And "dcmanager fw-update-strategy show" shows failed with "applying fw update strategy: Timeout applying firmware strategy."
After remove label and image, doing another round of update always fails with "creating fw update strategy: VIM strategy unexpected build state: applying".
Reproducibility
---------------
yes
System Configuration
-------
Lab-name: DC-3
Branch/Pull Time/Commit
-------
2020-08-07_20-00-00
Last Pass
---------
2020-07-31_20-00-00
Timestamp/Logs
--------------
### first round update
[sysadmin@
+------
| cloud | stage | state | details | started_at | finished_at |
+------
| subcloud2 | 2 | failed | applying fw update strategy: Timeout applying firmware strategy. | 2020-08-08 13:57:43.558955 | 2020-08-08 14:58:42.410606 |
+------
[sysadmin@
+------
| uuid | bitstream_type | pci_vendor | pci_device | bitstream_id | key_signature | revoke_key_id | name | description | image_version | applied | applied_labels |
+------
| 2960a627-
+------
[sysadmin@
+------
| hostname | PCI | Device image uuid | status | Update start | updated_at |
| | device | | | time | |
| | address | | | | |
+------
| controller-0 | 0000:b2: | 2960a627-
| | 00.0 | | | 58:14.621556+00 | |
| | | | | :00 | |
| | | | | | |
+------
[sysadmin@
+------
| Alarm | Reason Text | Entity ID | Severity | Time Stamp |
| ID | | | | |
+------
| 900. | Firmware update auto-apply inprogress | orchestration=
| 301 | | | | 3:58:14. |
| | | | | 298101 |
| | | | | |
| 900. | Device image update operation in progress | system=49038d7a- | minor | 2020-08-08T1 |
| 006 | | 59c7-462b-
| | | | | 679773 |
### After remove label and image on SystemController, second round update
[sysadmin@
+------
| cloud | stage | state | details | started_at | finished_at |
+------
| subcloud2 | 2 | failed | creating fw update strategy: VIM strategy unexpected build state: applying | 2020-08-08 17:13:31.668916 | 2020-08-08 17:13:52.868613 |
+------
[sysadmin@
+------
| uuid | bitstream_type | pci_vendor | pci_device | bitstream_id | key_signature | revoke_key_id | name | description | image_version | applied | applied_labels |
+------
| 2960a627-
| 35980675-
+------
[sysadmin@
+------
| hostname | PCI device | Device image uuid | status | Update start time | updated_at |
| | address | | | | |
+------
| controller-0 | 0000:b2:00.0 | 35980675-
+------
[sysadmin@
+------
| Alarm | Reason Text | Entity ID | Severity | Management Affecting | Time Stamp |
| ID | | | | | |
+------
| 900. | Firmware update auto-apply inprogress | orchestration=
| 301 | | | | | 17:22:00. |
| | | | | | 750332 |
| | | | | | |
| 900. | Device image update operation in | system=49038d7a- | minor | True | 2020-08-08T |
| 006 | progress | 59c7-462b-
| | | | | | 644574 |
Test Activity
-------------
Functional Testing
summary: |
- "dcmanager fw-update-strategy apply" update a subcloud failed, the + "dcmanager fw-update-strategy apply" updates a subcloud failed, the subcloud can never be updated again |
description: | updated |
description: | updated |
description: | updated |
tags: | added: stx.5.0 |
Changed in starlingx: | |
assignee: | nobody → Eric MacDonald (rocksolidmtce) |
Changed in starlingx: | |
importance: | Undecided → High |
status: | New → Triaged |
tags: | added: stx.nfv |
log: /files. starlingx. kube.cengn. ca/launchpad/ 1890915
https:/