After uncontrolled swact /var/run/bmc/redfishtool/ doesn't have hwmond sensor data

Bug #1853471 reported by Anujeyan Manokeran on 2019-11-21
This bug affects 1 person
Affects: StarlingX
Importance: High
Assigned to: Eric MacDonald

Bug Description

Brief Description
-----------------
After an uncontrolled swact, /var/run/bmc/redfishtool/ doesn't have hwmond sensor data; collection was switched to /var/run/bmc/ipmitool/ on the new active controller. This was observed after a hard reboot of the active controller (controller-1). The logs and listings below show the data collection on controller-0 for both ipmi and redfish. Prior to the swact, sensor data was captured in /var/run/bmc/redfishtool/ on controller-1 when it was active.

2019-11-20T20:17:19.000 controller-1 -sh: info HISTORY: PID=3506364 UID=42425 sudo reboot

2019-11-20T20:18:31.026 [3406288.00359] controller-0 mtcAgent |-| mtcNodeHdlrs.cpp (6507) bmc_handler : Info : controller-0 bmc is accessible using redfish
2019-11-20T20:18:31.072 [3406288.00368] controller-0 mtcAgent |-| mtcNodeHdlrs.cpp (6507) bmc_handler : Info : compute-0 bmc is accessible using redfish
2019-11-20T20:18:31.103 [3406288.00377] controller-0 mtcAgent |-| mtcNodeHdlrs.cpp (6507) bmc_handler : Info : compute-2 bmc is accessible using redfish
2019-11-20T20:18:31.214 [3406288.00386] controller-0 mtcAgent |-| mtcNodeHdlrs.cpp (6507) bmc_handler : Info : compute-1 bmc is accessible using redfish
2019-11-20T20:19:51.210 [3406288.00401] controller-0 mtcAgent |-| mtcNodeHdlrs.cpp (6507) bmc_handler : Info : controller-1 bmc is accessible using redfish
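
For triage, the protocol mtcAgent reports per host can be pulled out of log lines like the ones above. The following is only a rough sketch: the function name `bmc_protocols` and the regular expression are illustrative assumptions based on the message format in this excerpt, not part of mtcAgent, and it assumes each message sits on one unbroken line.

```python
import re

# Assumed pattern, derived from the bmc_handler messages quoted in this report.
BMC_LINE = re.compile(
    r"bmc_handler\s*:\s*Info\s*:\s*(\S+) bmc is accessible using (\w+)"
)


def bmc_protocols(log_lines):
    """Map each host name to the last BMC protocol reported for it."""
    protocols = {}
    for line in log_lines:
        match = BMC_LINE.search(line)
        if match:
            host, protocol = match.groups()
            protocols[host] = protocol
    return protocols
```

Any host whose value is not `redfish` in the returned mapping would be worth a closer look on a lab provisioned for redfish.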

ls -lrt /var/run/bmc/redfishtool/
total 40
-rw-r--r-- 1 root root 1022 Nov 20 20:18 mtcAgent_controller-0_root_query
-rw-r--r-- 1 root root 1022 Nov 20 20:18 mtcAgent_compute-2_root_query
-rw-r--r-- 1 root root 1022 Nov 20 20:18 mtcAgent_compute-0_root_query
-rw-r--r-- 1 root root 1022 Nov 20 20:18 mtcAgent_compute-1_root_query
-rw-r--r-- 1 root root 1022 Nov 20 20:19 mtcAgent_controller-1_root_query
-rw-r--r-- 1 root root 3659 Nov 21 14:54 mtcAgent_controller-1_bmc_info
-rw-r--r-- 1 root root 3659 Nov 21 14:55 mtcAgent_controller-0_bmc_info
-rw-r--r-- 1 root root 3660 Nov 21 14:55 mtcAgent_compute-1_bmc_info
-rw-r--r-- 1 root root 3660 Nov 21 14:55 mtcAgent_compute-2_bmc_info
-rw-r--r-- 1 root root 3660 Nov 21 14:55 mtcAgent_compute-0_bmc_info
controller-0:~$ ls -lrt /var/run/bmc/ipmitool/
total 60
-rw-r--r-- 1 root root 9672 Nov 21 14:54 hwmond_controller-0_sensor_data
-rw-r--r-- 1 root root 9796 Nov 21 14:54 hwmond_compute-0_sensor_data
-rw-r--r-- 1 root root 9796 Nov 21 14:55 hwmond_compute-1_sensor_data
-rw-r--r-- 1 root root 9796 Nov 21 14:55 hwmond_compute-2_sensor_data
-rw-r--r-- 1 root root 9672 Nov 21 14:56 hwmond_controller-1_sensor_data

Severity
--------
Major

Steps to Reproduce
------------------
1. Set up the lab to use the BMC redfishtool.
2. Verify BMC sensor data collection using redfishtool, e.g. check the mtcAgent log and list the files in /var/run/bmc/redfishtool.
3. Reboot the active controller.
4. Verify that sensor data collection on the new active controller is still under /var/run/bmc/redfishtool.
TC-name: Uncontrolled swact and verify bmc sensor data collection tool
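
The directory check in steps 2 and 4 could be scripted along these lines. This is a minimal sketch, not the test case's actual implementation: the helper names are hypothetical, and the `hwmond_<host>_sensor_data` file naming is assumed from the ipmitool listing in this report.

```python
import os


def sensor_data_hosts(filenames):
    """Return the hosts that have an hwmond sensor data file in a listing."""
    # Naming convention assumed from the listing in this report.
    prefix, suffix = "hwmond_", "_sensor_data"
    return sorted(
        name[len(prefix):-len(suffix)]
        for name in filenames
        if name.startswith(prefix) and name.endswith(suffix)
    )


def sensor_data_by_tool(base="/var/run/bmc", tools=("redfishtool", "ipmitool")):
    """List, per tool directory, the hosts with hwmond sensor data files."""
    result = {}
    for tool in tools:
        path = os.path.join(base, tool)
        # A missing directory is treated as holding no sensor data.
        names = os.listdir(path) if os.path.isdir(path) else []
        result[tool] = sensor_data_hosts(names)
    return result
```

Run on the new active controller after the swact, the expected result would list every host under `redfishtool` and none under `ipmitool`; the failure described in this report is the reverse.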

Expected Behavior
------------------
All sensor data is collected under the redfish directory /var/run/bmc/redfishtool.

Actual Behavior
----------------
After the swact, sensor data was not collected in /var/run/bmc/redfishtool on the new active controller; it was written to /var/run/bmc/ipmitool instead.

Reproducibility
---------------
Reproducible 100%

System Configuration
--------------------
AIO-DX + worker node IPv6 config
Lab-name:
WCP-8-12

Branch/Pull Time/Commit
-----------------------
2019-11-18_20-00-00

Last Pass
---------
Never tested

Timestamp/Logs
--------------
2019-11-20T20:17:19.000

Test Activity
-------------
Feature Testing

Eric MacDonald (rocksolidmtce) wrote :

After a swact the Hardware Monitor on the new side is selecting ipmi even though mtcAgent selects redfish.

Changed in starlingx:
assignee: nobody → Eric MacDonald (rocksolidmtce)
Ghada Khalil (gkhalil) on 2019-11-21
Changed in starlingx:
importance: Undecided → High
status: New → Triaged
tags: added: stx.3.0 stx.metal
Ghada Khalil (gkhalil) wrote :

Marking as stx.3.0 / high priority given this is related to the redfish feature which is an stx.3.0 deliverable

Eric MacDonald (rocksolidmtce) wrote :

Issue is understood and a fix is being worked on.

Fix proposed to branch: master
Review: https://review.opendev.org/697309

Changed in starlingx:
status: Triaged → In Progress