DC restore subcloud group to previous release: Some subclouds were installed with active release instead of inactive release
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
Low
|
Victor Romano |
Bug Description
Brief Description
DC subcloud restore group to previous release: Some subclouds was installed to active release instead of inactive release
CMD:
dcmanager subcloud-backup restore --group restore-group --with-install --release 21.12
Severity
Minor
Note: The reproducibility rate is low and we can run the restore operation again once it fails. And so far we've seen it only for sushy subclouds
Steps to Reproduce
Deploy SystemController and subclouds with 21.12
Upgrade SystemController
apply network restrictions
backup subclouds
upgrade subclouds
restore subclouds back to 21.12
Expected Behavior
The subclouds should be restored to 21.12 load
Actual Behavior
some subclouds installed with 22.12 load and the backup failed as the 22.12 backup was not available
Reproducibility
2 out of 93 subclouds
System Configuration
dc
Load info (eg: 2022-03-
cat /etc/build.info
SW_VERSION="22.12"
BUILD_TARGET="Host Installer"
BUILD_TYPE="Formal"
BUILD_ID=
SRC_BUILD_
BUILD_BY="jenkins"
BUILD_NUMBER="50"
BUILD_HOST=
BUILD_DATE=
Last Pass
intermittent issue
Timestamp/Logs
logs:
TASK [subcloud-
Thursday 23 November 2023 22:56:14 +0000 (0:00:00.460) 0:00:07.635 *****
fatal: [subcloud3015]: FAILED! => changed=false
msg: There must be one platform backup file in /opt/dc-
Alarms
no alarms
Test Activity
Regression Testing
Workaround
re-run restore operation again
Changed in starlingx: | |
status: | New → In Progress |
Changed in starlingx: | |
importance: | Undecided → Low |
tags: | added: stx.9.0 stx.distcloud |
Changed in starlingx: | |
assignee: | nobody → Victor Romano (vgluzrom) |
Reviewed: https:/ /review. opendev. org/c/starlingx /ansible- playbooks/ +/901864 /opendev. org/starlingx/ ansible- playbooks/ commit/ e7167fdc7382b2e 0f88ee29d8520a3 d8bfb6198c
Committed: https:/
Submitter: "Zuul (22348)"
Branch: master
commit e7167fdc7382b2e 0f88ee29d8520a3 d8bfb6198c
Author: Victor Romano <email address hidden>
Date: Fri Nov 24 14:39:18 2023 -0300
Enforce password change when installing via rvmc
When installing a subcloud via rvmc, it's possible that the BMC password_ change was created, failing the
reports a successful operation when booting with new install media
without actually installing the subcloud. To prevent this, a new
variable called enforce_
installation if there is an error during initial password change
(observed if the system wasn't correctly reinstalled).
Test plan:
changing the password before the playbook and verify the
installation fails as expected.
successfully.
operation completed successfully.
- PASS: Perform a subcloud add with install forcing a failure by
- PASS: Perform a normal subcloud add with install and verify
the operation completed successfully.
- PASS: Deploy a standalone SX and verify the bootstrap completed
- PASS: Upgrade a subcloud from stx6 to stx8 and verify the
Closes-Bug: 2044564
Change-Id: I80bee246dedfdf 9688507c3529d7d 080992da08b
Signed-off-by: Victor Romano <email address hidden>