Improve maintenance power/reset control command retry handling
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
Low
|
Eric MacDonald |
Bug Description
Brief Description
-----------------
platform management BMC power on/off and reset command retry handling is not working as expected.
Issue 1: host-reset is skipping the graceful altogether and always issuing immediate and performing only 4 rather than 5 retries.
Issue 2: host-power-off algorithm timing leads to 2 retries being used up for a single retry which is then leading to only 5 retries being done. Even the first successful try leads to a logged retry.
Need to improve on and drive consistency into the maintenance power on/off
and reset handling in terms of retries and use of graceful and immediate commands.
Severity
--------
Minor. Issue only exists in BMC command failure handling cases which also still works but does not behave ideally.
Steps to Reproduce
------------------
system host-power-off <host>
system host-reset <host>
Expected Behavior
------------------
retries and command type are as expected.
Actual Behavior
----------------
retries and command type are not as expected.
Reproducibility
---------------
100% reproducible
System Configuration
-------
Any BMC provisioned host
Branch/Pull Time/Commit
-------
Any load prior to the close of this bug report
Last Pass
---------
test escape
Timestamp/Logs
--------------
not necessary. issue is understood
Test Activity
-------------
Normal Use
Workaround
----------
None
Changed in starlingx: | |
importance: | Undecided → Low |
tags: | added: stx.9.0 stx.metal |
Changed in starlingx: | |
assignee: | nobody → Eric MacDonald (rocksolidmtce) |
Fix proposed to branch: master /review. opendev. org/c/starlingx /metal/ +/892051
Review: https:/