Comment 0 for bug 2076173

Revision history for this message
Gabriel B. Sant'Anna (baioc) wrote :

ipmitool sel does not correctly display the sensor's name if its owner is set to lun1.
Upstream bug: https://codeberg.org/IPMITool/ipmitool/issues/8

We were asked to track this in order to enable new hardware from a partner.
It was reported in v1.8.19 (used in Noble), but likely affects previous versions as well.

Steps to reproduce, copied verbatim from the upstream report:
1. Using AMI/BMC to generate a sensor error event. The sensor belong to LUN1
```
GPU1_MEM | 10h | ok | 11.1 | Uncorrectable ECC
GPU2_MEM | 11h | ok | 11.2 | Uncorrectable ECC
GPU3_MEM | 12h | ok | 11.3 | Uncorrectable ECC
GPU4_MEM | 13h | ok | 11.4 | Uncorrectable ECC
GPU5_MEM | 14h | ok | 11.5 | Uncorrectable ECC
GPU6_MEM | 15h | ok | 11.6 | Uncorrectable ECC
GPU7_MEM | 16h | ok | 11.7 | Uncorrectable ECC
GPU8_MEM | 17h | ok | 11.8 | Uncorrectable ECC
```
2. Run `ipmitool sel elist`
3. Observe the abnormal reply:
```
c5 | 2023/08/02 | 17时17分24秒 CST | Memory | Uncorrectable ECC | Asserted
c6 | 2023/08/02 | 17时18分29秒 CST | Memory | Uncorrectable ECC | Asserted
c7 | 2023/08/02 | 17时18分29秒 CST | Memory | Uncorrectable ECC | Asserted
c8 | 2023/08/02 | 17时18分30秒 CST | Memory | Uncorrectable ECC | Asserted
c9 | 2023/08/02 | 17时18分30秒 CST | Memory | Uncorrectable ECC | Asserted
ca | 2023/08/02 | 17时19分34秒 CST | Memory | Uncorrectable ECC | Asserted
cb | 2023/08/02 | 17时19分34秒 CST | Memory INTEGRAL_DIMM | Uncorrectable ECC | Asserted
cc | 2023/08/02 | 17时19分34秒 CST | Memory | Uncorrectable ECC | Asserted
```
SensorName is empty or wrong.(Expexct GPU1_MEM)

A fix was proposed upstream but is yet to be merged: https://codeberg.org/IPMITool/ipmitool/pulls/39