ras-mc-ctl doesn't provide BDF for PCIe errors
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
rasdaemon (Debian) |
Fix Released
|
Unknown
|
|||
rasdaemon (Ubuntu) |
Fix Released
|
Undecided
|
dann frazier | ||
Focal |
Fix Released
|
Undecided
|
dann frazier |
Bug Description
[Impact]
rasdaemon provides ras-mc-ctl, a script for querying the rasdaemon database. When displaying PCIe AER events from the
database, it doesn't provide any information to identify the associated PCIe device. Knowing that some hardware is reporting errors, but not knowing what hardware that is, isn't terribly helpful.
This information is already stored in the database (has been since 0.6.5 in focal), so we just need to update ras-mc-ctl to display it as well.
[Test Case]
- Trigger an AER event (how to do so appears to be pretty platform-specific).
- Check for the Bus/device/function info in the output of ras-mc-ctl.
[Fix]
https:/
[Regression Risk]
The change here adds additional content to the output of ras-mc-ctl. Instead of something like this:
PCIe AER events:
1 2020-04-16 22:09:48 +0000 Corrected error: Receiver Error
2 2020-04-16 22:23:24 +0000 Corrected error: Receiver Error
You'll now see something like this:
PCIe AER events:
1 2020-04-16 22:09:48 +0000 0000:0b:00.0 Corrected error: Receiver Error
2 2020-04-16 22:23:24 +0000 0000:0b:00.0 Corrected error: Receiver Error
As with any such unstructured output, it's possible that a user has some code to parse the output that would be confused by the additional content.
Changed in rasdaemon (Ubuntu): | |
status: | New → In Progress |
assignee: | nobody → dann frazier (dannf) |
description: | updated |
description: | updated |
Changed in rasdaemon (Ubuntu Focal): | |
status: | New → In Progress |
assignee: | nobody → dann frazier (dannf) |
description: | updated |
Changed in rasdaemon (Debian): | |
status: | Unknown → Confirmed |
Changed in rasdaemon (Debian): | |
status: | Confirmed → Fix Released |
This bug was fixed in the package rasdaemon - 0.6.5-2ubuntu1
---------------
rasdaemon (0.6.5-2ubuntu1) groovy; urgency=medium
* d/p/ras- mc-ctl- PCIe-AER- display- PCIe-dev- name.patch: fix-the- wrong-declaring -of-sruct- ras_event. patch:
ras-mc-ctl: Display bus/device/function of the PCIe device
corresponding to an AER event. LP: #1888423.
* d/p/rasdaemon-
Fix FTBFS w/ gcc-10.
-- dann frazier <email address hidden> Tue, 21 Jul 2020 19:57:27 +0000