Power off host operation reports completed even if host remains powered on
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX | Fix Released | Low | Eric MacDonald | | |
Bug Description
The maintenance Power Off FSM does not produce a failed response if a BMC accepts the power off request but does not actually power off the host. This was seen on system WP 8-12 compute-1 (WP11) when the BMC got into a bad state, accepted the power off request, and then did not proceed with the power off operation.
The FSM needs to be enhanced with the following case states immediately following the MTC_POWEROFF_
MTC_
MTC_
MTC_
This is day-one behavior that was never observed before and was never covered by fault insertion testing.
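The missing verification step can be sketched as follows. This is a minimal illustration in Python, not the maintenance code itself: the stage names, `power_off_fsm`, `send_power_off`, `query_power_on`, and `max_polls` are all hypothetical, and the actual MTC_ stage names are not reproduced here. The idea is only that an accepted request moves the FSM to a verify stage, and the operation fails if the host is still reported as powered on after a bounded number of power status queries.

```python
from enum import Enum, auto


class Stage(Enum):
    # Hypothetical stage names; the real MTC_ stages are not reproduced here.
    REQUEST = auto()
    VERIFY = auto()
    DONE = auto()
    FAIL = auto()


def power_off_fsm(send_power_off, query_power_on, max_polls=5):
    """Run a power-off sequence that verifies the host is actually off.

    send_power_off() -> bool : True if the BMC accepted the request.
    query_power_on() -> bool : True if the BMC reports the host still on.
    Returns Stage.DONE only when the host is confirmed off, else Stage.FAIL.
    """
    stage = Stage.REQUEST
    polls = 0
    while stage not in (Stage.DONE, Stage.FAIL):
        if stage is Stage.REQUEST:
            # An accepted request is not proof of power off; go verify it.
            stage = Stage.VERIFY if send_power_off() else Stage.FAIL
        elif stage is Stage.VERIFY:
            if not query_power_on():
                stage = Stage.DONE
            else:
                polls += 1
                if polls >= max_polls:
                    # BMC accepted the request but the host never powered off.
                    stage = Stage.FAIL
    return stage
```

A real implementation would presumably wait between power status queries rather than polling back to back; the sketch omits the delay to stay short.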
Severity
--------
Minor with the following reasoning.
1. Server is already out of service if it's being powered off, so there is no immediate service-affecting impact.
2. Could be considered a double fault scenario.
3. Host does not mistakenly appear powered off while it is not. Instead it just bounces back as online.
Steps to Reproduce
------------------
Difficult without tricking the code by provisioning host 'A' with BMC info from host 'B' and then executing a power off for host 'A' that will result in host 'B' being powered off while host 'A' remains powered on and online.
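The misprovisioning trick can be modelled in a few lines to make the resulting state explicit. This is a toy simulation under stated assumptions, not a test against real hardware or the maintenance code: `FakeBmc`, `reproduce`, and the host names are hypothetical.

```python
class FakeBmc:
    """Hypothetical stand-in for a BMC; it powers off the host it controls."""

    def __init__(self, controls_host):
        self.controls_host = controls_host

    def power_off(self, power_state):
        power_state[self.controls_host] = False
        return True  # the request is accepted


def reproduce():
    # Both hosts start powered on.
    power_state = {"A": True, "B": True}
    # Deliberate misprovisioning: host A's BMC entry points at host B's BMC.
    provisioned_bmc = {
        "A": FakeBmc(controls_host="B"),
        "B": FakeBmc(controls_host="B"),
    }
    # Power off host 'A' through the BMC it was provisioned with.
    accepted = provisioned_bmc["A"].power_off(power_state)
    return accepted, power_state
```

From the point of view of host 'A', the request was accepted but 'A' is still powered on, which is the condition the Power Off FSM fails to detect.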
Expected Behavior
------------------
Power off command reports completed and host is powered off.
Actual Behavior
----------------
Power off command reports completed and host is not powered off.
Reproducibility
---------------
Reproducible 100%
System Configuration
-------
Any host with BMC support.
Branch/Pull Time/Commit
-------
Any
Last Pass
---------
Never. Requires faulty BMC that does not power off but accepts power off command.
Timestamp/Logs
--------------
2020-02-
2020-02-
2020-02-
Test Activity
-------------
Observed while debugging why a BMC did not power off its host while it accepted the power off command.
Workaround
----------
Not required.
Changed in starlingx: | |
assignee: | nobody → Eric MacDonald (rocksolidmtce) |
summary: |
- Power off host reports completed even if host remains powered on
+ Power off host operation reports completed even if host remains powered on |
Changed in starlingx: | |
status: | In Progress → Fix Released |
Low / not gating - it doesn't seem worthwhile to try to work around a faulty BMC by adding complexity to the stx software.
Suggest you close this as Won't Fix.