Report subcloud rehoming playbook failures as errors in 'dcmanager subcloud errors subcloud#'
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
StarlingX | Fix Released | Low | Fabrizio Perez |
Bug Description
Brief Description
-----------------
Currently, dcmanager can already extract some ansible execution failures from playbooks and report them as errors to the user when 'dcmanager subcloud errors subcloud#' is run.
Failures in the rehome playbook are not included at the moment. The intent of this story is to have ansible failures in the rehoming playbook listed as well.
For instance:
Ansible playbook error:
[sysadmin@
TASK [common/
Saturday 26 September 2026 22:27:31 +0000 (0:00:00.039) 0:09:16.953 ****
skipping: [subcloud1]
TASK [common/
Saturday 26 September 2026 22:27:31 +0000 (0:00:00.066) 0:09:17.020 ****
fatal: [subcloud1]: FAILED! => changed=false
msg: |2-
Docker Registry certificate is expired. Manual action required! On the subcloud, please update the expired certificates with `system certificate-
TASK [common/
Saturday 26 September 2026 22:27:31 +0000 (0:00:00.059) 0:09:17.080 ****
skipping: [subcloud1]
PLAY RECAP *******
subcloud1 : ok=77 changed=49 unreachable=0 failed=1 skipped=36 rescued=0 ignored=0
Saturday 26 September 2026 22:27:31 +0000 (0:00:00.031) 0:09:17.112 ****
=======
common/
common/
common/
common/
common/
common/
common/
common/
common/
common/
common/
common/prepare-env : stat -------
common/
common/
common/
common/
common/
common/
common/
common/
[sysadmin@
dcmanager should then show the subcloud error as below:
[sysadmin@
FAILED bootstrapping playbook of (subcloud1).
detail: fatal: [subcloud1]: FAILED! => changed=false
msg: |2-
Docker Registry certificate is expired. Manual action required! On the subcloud, please update the expired certificates with `system certificate-
For bootstrap failures, please delete and re-add the subcloud after the cause of failure has been resolved.
This feature was suggested by Peters, Matt in a demo meeting.
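As an illustration of the kind of extraction described above, here is a minimal Python sketch that pulls the `fatal: [...]: FAILED!` block out of ansible playbook output. It is not the dcmanager implementation: the function name `extract_ansible_failures`, the regular expressions, and the rule that a failure block ends at the next `TASK [`, `PLAY [` or `PLAY RECAP` line are all assumptions made for this example.

```python
import re

# Hypothetical helper for illustration only; not taken from dcmanager.
# Assumption: a failure starts at a "fatal:" or "failed:" line for a host.
FAILURE_START_RE = re.compile(r"^(fatal|failed): \[[^\]]+\]: FAILED!")
# Assumption: the failure detail ends at the next task, play, or recap header.
BOUNDARY_RE = re.compile(r"^(TASK \[|PLAY \[|PLAY RECAP)")


def extract_ansible_failures(log_text):
    """Return the failure blocks found in ansible playbook output."""
    failures = []
    current = None  # lines of the failure block being collected, if any
    for line in log_text.splitlines():
        if FAILURE_START_RE.match(line):
            # A new failure begins; close any block still open.
            if current is not None:
                failures.append("\n".join(current))
            current = [line]
        elif current is not None:
            if BOUNDARY_RE.match(line):
                # Reached the next section header; the block is complete.
                failures.append("\n".join(current))
                current = None
            else:
                current.append(line)
    if current is not None:
        failures.append("\n".join(current))
    return failures


if __name__ == "__main__":
    # Made-up log lines shaped like the output quoted in this report.
    sample_log = "\n".join([
        "TASK [example : check something]",
        "fatal: [subcloud1]: FAILED! => changed=false",
        "  msg: |2-",
        "    Docker Registry certificate is expired.",
        "PLAY RECAP *******",
    ])
    for block in extract_ansible_failures(sample_log):
        print(block)
```

The extracted block could then be stored and returned as the `detail:` text of the subcloud error; how dcmanager actually stores and formats it is not shown in this report.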
Severity
--------
<Minor: System/Feature is usable with minor issue>
Steps to Reproduce
------------------
Force a failure in subcloud rehoming.
Check output of dcmanager subcloud errors <subcloud>
Expected Behavior
------------------
The command displays the error.
Actual Behavior
----------------
The command doesn't display the error.
Reproducibility
---------------
100%
System Configuration
-------
Any
Branch/Pull Time/Commit
-------
Last Pass
---------
Timestamp/Logs
--------------
Test Activity
-------------
Demo
Workaround
----------
Changed in starlingx:
status: New → In Progress
Changed in starlingx:
importance: Undecided → Low
tags: added: stx.9.0 stx.distcloud
Changed in starlingx:
assignee: nobody → Fabrizio Perez (fperezwindriver)
Reviewed: https://review.opendev.org/c/starlingx/distcloud/+/904363
Committed: https://opendev.org/starlingx/distcloud/commit/04c8b51b4003986d6b979ccb1b20e62ae4cbd802
Submitter: "Zuul (22348)"
Branch: master
commit 04c8b51b4003986d6b979ccb1b20e62ae4cbd802
Author: fperez <email address hidden>
Date: Tue Dec 26 20:39:49 2023 -0300
Report rehoming playbook failures
This commit extends ansible error catching for rehoming
subcloud operation.
Test plan:
PASS: Intentionally force a failure in the rehoming playbook.
Verify that the error is displayed correctly
Closes-bug: 2047645
Change-Id: I4571e04247bdcf273f5de860ae5032597b173ed2
Signed-off-by: fperez <email address hidden>