Boot/Installation crash of Ubuntu-16.04.3 HWE kernel on R940
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
dellserver |
Fix Released
|
Undecided
|
Unassigned | ||
linux (Ubuntu) |
Fix Released
|
Medium
|
Unassigned | ||
Zesty |
Fix Released
|
Medium
|
Unassigned |
Bug Description
== SRU Justification ==
Kernel crashes when installation of Ubuntu-16.04.3 with HWE (ISO).
Same observation while booting to 4.10.0-28 HWE kerenl of Ubuntu-16.04.3
and 4.10.0-33 HWE as well.
Seen only with 4.10 HWE kernels of Ubuntu-16.04.3. 4.4 kernels of
Ubuntu-16.04.3 works fine. Daily builds of Ubuntu Server 17.10 works fine.
Reducing the core count to <26 cores helps here. Boot & installation of
HWE-kernel works fine.
This bug was introduced by commit:
dc6db24d2476 ("x86/acpi: Set persistent cpuid <-> nodeid mapping when booting")
It is resolved by reverting commit dc6db24d2476, which was done in mainline by
commit c962cff17df as of v4.11-rc3.
There are three additiona commits introduced by the same patch author when
commit c962cff17df was submitted. However, it was confirmed that only the single
revert is needed to fix this particular bug. Upstream thread:
https:/
== Fix ==
commit c962cff17dfa11f
Author: Dou Liyang <email address hidden>
Date: Fri Mar 3 16:02:23 2017 +0800
Revert "x86/acpi: Set persistent cpuid <-> nodeid mapping when booting"
== Regression Potential ==
This is reverting a commit that introduced a bug. This commit has also
been reverted upstream.
== Test Case ==
A test kernel was built with this patch and tested by the original bug reporter.
The bug reporter states the test kernel resolved the bug.
Setup:
Dell PowerEdge R940 having 2 socket populated with 28 cores.
Impact:
This is a boot and installation failure for R940 users having 2 socket x 28 cores.
Requesting an SRU for the fix.
Steps:
1. Setup a Dell PowerEdge R940 with 28 cores CPUs on 2 sockets.
2. Start installation of Ubuntu-16.04.3 with HWE kernel.
3. Observe the screen remains blank. Console logs indicate kernel crash.
Additional Info:
* Seen only with 4.10 HWE kernels of Ubuntu-16.04.3.
4.4 kernels of Ubuntu-16.04.3 works fine. Daily builds of Ubuntu Server 17.10 works fine.
* Reducing the core count to <26 cores helps here. Boot & installation of HWE-kernel works fine.
* Attaching the console log and acpidump from the setup.
* Patch causing this failure: https:/
x86/acpi: Set persistent cpuid <-> nodeid mapping when booting
* Fix patch series: https:/
Revert "x86/acpi: Set persistent cpuid <-> nodeid mapping when booting"
https:/
Revert"x86/acpi: Enable MADT APIs to return disabled apicids"
https:/
acpi/processor: Implement DEVICE operator for processor enumeration
https:/
acpi/processor: Check for duplicate processor ids at hotplug time
https:/
CVE References
tags: | added: kernel-da-key zesty |
Changed in linux (Ubuntu Zesty): | |
status: | In Progress → Fix Released |
status: | Fix Released → Fix Committed |
tags: |
added: verification-done-zesty removed: kernel-da-key verification-needed-zesty zesty |
Changed in dellserver: | |
status: | In Progress → Fix Released |
Changed in linux (Ubuntu): | |
status: | In Progress → Fix Released |
tags: | added: cscc |
acpidump from the server