[Artful/Zesty] ACPI APEI error handling bug fixes
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
Fix Released
|
Critical
|
Manoj Iyer | ||
Zesty |
Won't Fix
|
Critical
|
Manoj Iyer | ||
Artful |
Fix Released
|
Critical
|
Manoj Iyer |
Bug Description
[Impact]
Error records which have multiple errors in them will incorrectly report all errors after the first one. This results in garbage non-standard error trace events to be generated, and for AER and MC errors there will be no kernel action to help recover from these errors in the AER and EDAC drivers.
[Fix]
Patches in Linus tree fixes this issue:
aaf2c2fb0f51 ACPI / APEI: clear error status before acknowledging the error
c4335fdd3822 ACPI: APEI: fix the wrong iteration of generic error status block
[Testing]
Insert a e1000 pcie card into the system, run the following command that should generate PCIe correctable errors, you will see only the first error in each GHES report go to the AER driver rather than all errors from the GHES reports.
$ sudo setpci -s 0002:00:00.0 0x70c.l=
Where "0002:00:00.0" being the root hub for the card.
Used JTAG to trigger multiple concurrent errors, and observed that all errors were parsed, instead of just the first one. As mentioned in comment #3. So, the poster of comment #3 will do the verification once the patch lands in -proposed.
[Regression Potential]
The two patches to ACPI APEI driver was cleanly cherry picked from linus's tree and applied to Artful and Zesty. The patches were tested on QDF2400 platform where it was found to issue and don't introduce any regressions.
CVE References
Changed in linux (Ubuntu): | |
assignee: | Manoj Iyer (manjo) → Canonical Kernel Team (canonical-kernel-team) |
Changed in linux (Ubuntu): | |
status: | Incomplete → In Progress |
Changed in linux (Ubuntu Zesty): | |
status: | New → In Progress |
Changed in linux (Ubuntu Artful): | |
status: | New → In Progress |
Changed in linux (Ubuntu Zesty): | |
importance: | Undecided → Critical |
Changed in linux (Ubuntu Artful): | |
importance: | Undecided → Critical |
Changed in linux (Ubuntu Zesty): | |
assignee: | nobody → Manoj Iyer (manjo) |
Changed in linux (Ubuntu Artful): | |
assignee: | nobody → Manoj Iyer (manjo) |
Changed in linux (Ubuntu): | |
assignee: | Canonical Kernel Team (canonical-kernel-team) → Manoj Iyer (manjo) |
description: | updated |
Changed in linux (Ubuntu Artful): | |
status: | In Progress → Fix Committed |
Changed in linux (Ubuntu Zesty): | |
status: | In Progress → Won't Fix |
Changed in linux (Ubuntu): | |
status: | In Progress → Fix Committed |
Changed in linux (Ubuntu): | |
status: | Fix Committed → Fix Released |
tags: | added: cscc |
This bug is missing log files that will aid in diagnosing the problem. While running an Ubuntu kernel (not a mainline or third-party kernel) please enter the following command in a terminal window:
apport-collect 1732990
and then change the status of the bug to 'Confirmed'.
If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.
This change has been made by an automated script, maintained by the Ubuntu Kernel Team.