Subcloud install fails using the Redfish Virtual Media service with RVMC pod in pending state
Affects | Status | Importance | Assigned to | Milestone |
---|---|---|---|---|
StarlingX | Fix Released | Medium | Enzo Candotti | |
Bug Description
Subcloud (SC) installation fails with error:
TASK [Fail if rvmc-subcloud20
Monday 04 April 2022 17:19:40 +0000 (0:00:00.026) 0:01:02.579 **********
fatal: [subcloud2001 -> localhost]: FAILED! => changed=false
msg: Redfish Virtual Media Controller failed to start the install
Type Reason Age From Message
---- ------ ---- ---- -------
Warning FailedScheduling 43s (x13 over 14m) default-scheduler 0/8 nodes are available: 2 node(s) had taint {node-role.
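The FailedScheduling event above means the RVMC pod never left the Pending state. As a minimal sketch, assuming the pod's JSON was obtained with `kubectl get pod <pod-name> -n <namespace> -o json` (the full pod name and namespace are not shown in this report), the scheduler's reason can be read from the `PodScheduled` condition. The function name `unschedulable_reason` and the sample data are hypothetical, for illustration only:

```python
import json


def unschedulable_reason(pod):
    """Return the scheduler message if the pod is Pending and unschedulable, else None."""
    status = pod.get("status", {})
    if status.get("phase") != "Pending":
        return None
    for cond in status.get("conditions", []):
        # The scheduler records a failed placement as PodScheduled=False / Unschedulable.
        if (cond.get("type") == "PodScheduled"
                and cond.get("status") == "False"
                and cond.get("reason") == "Unschedulable"):
            return cond.get("message", "")
    return None


# Illustrative pod document; the message text is a placeholder, not taken from the logs.
sample_pod = json.loads("""
{
  "status": {
    "phase": "Pending",
    "conditions": [
      {
        "type": "PodScheduled",
        "status": "False",
        "reason": "Unschedulable",
        "message": "0/1 nodes are available (illustrative message)"
      }
    ]
  }
}
""")

print(unschedulable_reason(sample_pod))
```

A `None` result would suggest the pod is not blocked by scheduling and that the failure lies elsewhere.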
Steps to Reproduce
After completing a STD DC-system-
Expected Behavior
SC can be installed using the remote install option.
Actual Behavior
Installation fails with error:
TASK [Get rvmc_namespace] *******
Monday 04 April 2022 17:08:59 +0000 (0:00:00.015) 0:00:00.355 **********
changed: [subcloud1 -> localhost]
TASK [Ensure rvmc_namespace is created] *******
Monday 04 April 2022 17:08:59 +0000 (0:00:00.157) 0:00:00.513 **********
changed: [subcloud1 -> localhost]
TASK [Get default registry key] *******
Monday 04 April 2022 17:09:00 +0000 (0:00:00.157) 0:00:00.670 **********
changed: [subcloud1 -> localhost]
TASK [Copy default-
Monday 04 April 2022 17:09:00 +0000 (0:00:00.174) 0:00:00.844 **********
changed: [subcloud1 -> localhost]
TASK [Create Redfish Virtual Media Controller resource file] *******************
Monday 04 April 2022 17:09:00 +0000 (0:00:00.279) 0:00:01.124 **********
changed: [subcloud1 -> localhost]
TASK [Activate Redfish Virtual Media Controller] *******
Monday 04 April 2022 17:09:00 +0000 (0:00:00.347) 0:00:01.471 **********
changed: [subcloud1 -> localhost]
TASK [Get the pod name that created by Redfish Virtual Media Controller batch job] ***
Monday 04 April 2022 17:09:01 +0000 (0:00:00.276) 0:00:01.748 **********
changed: [subcloud1 -> localhost]
TASK [set_fact] *******
Monday 04 April 2022 17:09:01 +0000 (0:00:00.164) 0:00:01.913 **********
ok: [subcloud1 -> localhost]
TASK [Wait for 60 seconds for rvmc-subcloud1-
Monday 04 April 2022 17:09:01 +0000 (0:00:00.047) 0:00:01.960 **********
changed: [subcloud1 -> localhost]
TASK [Save Redfish Virtual Media Controller logs if rvmc-subcloud1-
Monday 04 April 2022 17:10:01 +0000 (0:01:00.186) 0:01:02.147 **********
changed: [subcloud1 -> localhost]
TASK [debug] *******
Monday 04 April 2022 17:10:01 +0000 (0:00:00.183) 0:01:02.331 **********
ok: [subcloud1 -> localhost] =>
msg: ''
TASK [Fail if rvmc-subcloud1-
Monday 04 April 2022 17:10:01 +0000 (0:00:00.026) 0:01:02.357 **********
fatal: [subcloud1 -> localhost]: FAILED! => changed=false
msg: Redfish Virtual Media Controller failed to start the install
Reproducibility
100% on DC with Standard system controllers.
System Configuration
DC with Standard system controllers; remote install tested on 2 subclouds.
SW_VERSION="22.02"
Last Pass
New test scenario.
Timestamp/Logs
TASK [Fail if rvmc-subcloud1-
Monday 04 April 2022 17:10:01 +0000 (0:00:00.026) 0:01:02.357 **********
fatal: [subcloud1 -> localhost]: FAILED! => changed=false
msg: Redfish Virtual Media Controller failed to start the install
Alarms
-
Test Activity
Developer Testing
Workaround
Delete taint on master node:
kubectl taint nodes controller-0 node-role.
After removing this taint, the subcloud installation must be re-applied.
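Before re-applying the installation, it may help to confirm which nodes still carry taints that can block scheduling. The following is a rough sketch, assuming the input is the parsed output of `kubectl get nodes -o json`. The function name `blocking_taints` and the taint key `example.com/placeholder` are hypothetical and are not the key from this report. The sketch ignores pod tolerations, so a listed taint does not necessarily block every pod:

```python
def blocking_taints(node_list):
    """Map node name -> taint keys whose effect is NoSchedule or NoExecute."""
    result = {}
    for node in node_list.get("items", []):
        name = node["metadata"]["name"]
        # spec.taints is absent (or null) on untainted nodes.
        taints = node.get("spec", {}).get("taints") or []
        keys = [t["key"] for t in taints
                if t.get("effect") in ("NoSchedule", "NoExecute")]
        if keys:
            result[name] = keys
    return result


# Illustrative node list; the taint key is a placeholder.
sample_nodes = {
    "items": [
        {
            "metadata": {"name": "controller-0"},
            "spec": {"taints": [
                {"key": "example.com/placeholder", "effect": "NoSchedule"}
            ]},
        },
        {"metadata": {"name": "worker-0"}, "spec": {}},
    ]
}

print(blocking_taints(sample_nodes))
```

To run it against a live system, the node list could be loaded with `json.load(sys.stdin)` from a piped `kubectl get nodes -o json`. An empty result for the controller nodes would indicate that the taint has been removed.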
Changed in starlingx:
assignee: nobody → Enzo Candotti (ecandotti)
Changed in starlingx:
importance: Undecided → Medium
tags: added: stx.7.0 stx.config stx.distcloud
Fix proposed to branch: master
Review: https://review.opendev.org/c/starlingx/ansible-playbooks/+/836987