Comment 0 for bug 1719697

Revision history for this message
Sujith Pandel (sujithpandel) wrote :

Description:
Kernel crashes when installation of Ubuntu-16.04.3 with HWE (ISO).
Same observation while booting to 4.10.0-28 HWE kerenl of Ubuntu-16.04.3 and 4.10.0-33 HWE as well.

Setup:
Dell PowerEdge R940 having 2 socket populated with 28 cores.

Impact:
This is a boot and installation failure for R940 users having 2 socket x 28 cores.
Requesting an SRU for the fix.

Steps:
1. Setup a Dell PowerEdge R940 with 28 cores CPUs on 2 sockets.
2. Start installation of Ubuntu-16.04.3 with HWE kernel.
3. Observe the screen remains blank. Console logs indicate kernel crash.

Additional Info:
* Seen only with 4.10 HWE kernels of Ubuntu-16.04.3.
  4.4 kernels of Ubuntu-16.04.3 works fine. Daily builds of Ubuntu Server 17.10 works fine.

* Reducing the core count to <26 cores helps here. Boot & installation of HWE-kernel works fine.

* Attaching the console log and acpidump from the setup.

* Patch causing this failure: https://github.com/torvalds/linux/commit/dc6db24d2476cd09c0ecf2b8d80313539f737a89
x86/acpi: Set persistent cpuid <-> nodeid mapping when booting

* Fix patch series: https://lkml.org/lkml/2017/2/20/66

Revert "x86/acpi: Set persistent cpuid <-> nodeid mapping when booting"
https://github.com/torvalds/linux/commit/c962cff17dfa11f4a8227ac16de2b28aea3312e4#diff-6bd7ef719bca1a2a56b9ebf4bd0bd88d

Revert"x86/acpi: Enable MADT APIs to return disabled apicids"
https://github.com/torvalds/linux/commit/09c3f2bd5c7e5f18687663acb6adc6b167484ca5

acpi/processor: Implement DEVICE operator for processor enumeration
https://github.com/torvalds/linux/commit/8c8cb30f49b86333d8e036e1945cf1a78c03577e

acpi/processor: Check for duplicate processor ids at hotplug time
https://github.com/torvalds/linux/commit/a77d6cd968497792e072b74dff45b891ba778ddb