Activity log for bug #1719697

Date Who What changed Old value New value Message
2017-09-26 17:28:34 Sujith Pandel bug added bug
2017-09-26 17:28:34 Sujith Pandel attachment added Console-logs https://bugs.launchpad.net/bugs/1719697/+attachment/4957375/+files/console-logs-ubuntu-r940-28cores-2socket-crash.txt
2017-09-26 17:32:00 Sujith Pandel attachment added acpidump https://bugs.launchpad.net/dellserver/+bug/1719697/+attachment/4957377/+files/acpidump
2017-09-26 18:07:34 Narinder Gupta bug added subscriber Leann Ogasawara
2017-10-17 18:28:09 Joseph Salisbury bug added subscriber Joseph Salisbury
2017-10-17 18:28:23 Joseph Salisbury tags kernel-da-key zesty
2017-10-23 18:05:35 Joseph Salisbury dellserver: status New In Progress
2017-10-23 18:05:40 Joseph Salisbury dellserver: assignee Joseph Salisbury (jsalisbury)
2017-10-27 19:20:00 Joseph Salisbury bug task added linux (Ubuntu)
2017-10-27 19:20:09 Joseph Salisbury linux (Ubuntu): status New In Progress
2017-10-27 19:20:13 Joseph Salisbury linux (Ubuntu): importance Undecided Medium
2017-10-27 19:20:16 Joseph Salisbury linux (Ubuntu): assignee Joseph Salisbury (jsalisbury)
2017-10-27 19:20:25 Joseph Salisbury nominated for series Ubuntu Zesty
2017-10-27 19:20:25 Joseph Salisbury bug task added linux (Ubuntu Zesty)
2017-10-27 19:20:32 Joseph Salisbury linux (Ubuntu Zesty): status New In Progress
2017-10-27 19:20:35 Joseph Salisbury linux (Ubuntu Zesty): importance Undecided Medium
2017-10-27 19:20:39 Joseph Salisbury linux (Ubuntu Zesty): assignee Joseph Salisbury (jsalisbury)
2017-10-27 19:22:15 Joseph Salisbury description Description: Kernel crashes when installation of Ubuntu-16.04.3 with HWE (ISO). Same observation while booting to 4.10.0-28 HWE kerenl of Ubuntu-16.04.3 and 4.10.0-33 HWE as well. Setup: Dell PowerEdge R940 having 2 socket populated with 28 cores. Impact: This is a boot and installation failure for R940 users having 2 socket x 28 cores. Requesting an SRU for the fix. Steps: 1. Setup a Dell PowerEdge R940 with 28 cores CPUs on 2 sockets. 2. Start installation of Ubuntu-16.04.3 with HWE kernel. 3. Observe the screen remains blank. Console logs indicate kernel crash. Additional Info: * Seen only with 4.10 HWE kernels of Ubuntu-16.04.3. 4.4 kernels of Ubuntu-16.04.3 works fine. Daily builds of Ubuntu Server 17.10 works fine. * Reducing the core count to <26 cores helps here. Boot & installation of HWE-kernel works fine. * Attaching the console log and acpidump from the setup. * Patch causing this failure: https://github.com/torvalds/linux/commit/dc6db24d2476cd09c0ecf2b8d80313539f737a89 x86/acpi: Set persistent cpuid <-> nodeid mapping when booting * Fix patch series: https://lkml.org/lkml/2017/2/20/66 Revert "x86/acpi: Set persistent cpuid <-> nodeid mapping when booting" https://github.com/torvalds/linux/commit/c962cff17dfa11f4a8227ac16de2b28aea3312e4#diff-6bd7ef719bca1a2a56b9ebf4bd0bd88d Revert"x86/acpi: Enable MADT APIs to return disabled apicids" https://github.com/torvalds/linux/commit/09c3f2bd5c7e5f18687663acb6adc6b167484ca5 acpi/processor: Implement DEVICE operator for processor enumeration https://github.com/torvalds/linux/commit/8c8cb30f49b86333d8e036e1945cf1a78c03577e acpi/processor: Check for duplicate processor ids at hotplug time https://github.com/torvalds/linux/commit/a77d6cd968497792e072b74dff45b891ba778ddb == SRU Justification == Kernel crashes when installation of Ubuntu-16.04.3 with HWE (ISO). Same observation while booting to 4.10.0-28 HWE kerenl of Ubuntu-16.04.3 and 4.10.0-33 HWE as well. Seen only with 4.10 HWE kernels of Ubuntu-16.04.3. 4.4 kernels of Ubuntu-16.04.3 works fine. Daily builds of Ubuntu Server 17.10 works fine. Reducing the core count to <26 cores helps here. Boot & installation of HWE-kernel works fine. This bug was introduced by commit: dc6db24d2476 ("x86/acpi: Set persistent cpuid <-> nodeid mapping when booting") It is resolved by reverting commit dc6db24d2476, which was done in mainline by commit c962cff17df as of v4.11-rc3. There are three additiona commits introduced by the same patch author when commit c962cff17df was submitted. However, it was confirmed that only the single revert is needed to fix this particular bug. Upstream thread: https://lkml.org/lkml/2017/2/20/66 == Fix == commit c962cff17dfa11f4a8227ac16de2b28aea3312e4 Author: Dou Liyang <douly.fnst@cn.fujitsu.com> Date: Fri Mar 3 16:02:23 2017 +0800 Revert "x86/acpi: Set persistent cpuid <-> nodeid mapping when booting" == Regression Potential == This is reverting a commit that introduced a bug. This commit has also been reverted upstream. == Test Case == A test kernel was built with this patch and tested by the original bug reporter. The bug reporter states the test kernel resolved the bug. Setup: Dell PowerEdge R940 having 2 socket populated with 28 cores. Impact: This is a boot and installation failure for R940 users having 2 socket x 28 cores. Requesting an SRU for the fix. Steps: 1. Setup a Dell PowerEdge R940 with 28 cores CPUs on 2 sockets. 2. Start installation of Ubuntu-16.04.3 with HWE kernel. 3. Observe the screen remains blank. Console logs indicate kernel crash. Additional Info: * Seen only with 4.10 HWE kernels of Ubuntu-16.04.3.   4.4 kernels of Ubuntu-16.04.3 works fine. Daily builds of Ubuntu Server 17.10 works fine. * Reducing the core count to <26 cores helps here. Boot & installation of HWE-kernel works fine. * Attaching the console log and acpidump from the setup. * Patch causing this failure: https://github.com/torvalds/linux/commit/dc6db24d2476cd09c0ecf2b8d80313539f737a89 x86/acpi: Set persistent cpuid <-> nodeid mapping when booting * Fix patch series: https://lkml.org/lkml/2017/2/20/66 Revert "x86/acpi: Set persistent cpuid <-> nodeid mapping when booting" https://github.com/torvalds/linux/commit/c962cff17dfa11f4a8227ac16de2b28aea3312e4#diff-6bd7ef719bca1a2a56b9ebf4bd0bd88d Revert"x86/acpi: Enable MADT APIs to return disabled apicids" https://github.com/torvalds/linux/commit/09c3f2bd5c7e5f18687663acb6adc6b167484ca5 acpi/processor: Implement DEVICE operator for processor enumeration https://github.com/torvalds/linux/commit/8c8cb30f49b86333d8e036e1945cf1a78c03577e acpi/processor: Check for duplicate processor ids at hotplug time https://github.com/torvalds/linux/commit/a77d6cd968497792e072b74dff45b891ba778ddb
2017-11-21 16:19:35 Khaled El Mously linux (Ubuntu Zesty): status In Progress Fix Released
2017-11-21 16:19:58 Khaled El Mously linux (Ubuntu Zesty): status Fix Released Fix Committed
2017-11-28 17:48:08 Khaled El Mously tags kernel-da-key zesty kernel-da-key verification-needed-zesty zesty
2017-12-04 16:42:41 Sujith Pandel tags kernel-da-key verification-needed-zesty zesty verification-done-zesty
2017-12-07 17:43:17 Launchpad Janitor linux (Ubuntu Zesty): status Fix Committed Fix Released
2017-12-07 17:43:17 Launchpad Janitor cve linked 2017-1000405
2017-12-07 17:43:17 Launchpad Janitor cve linked 2017-12146
2017-12-07 17:43:17 Launchpad Janitor cve linked 2017-16939
2017-12-07 17:43:18 Launchpad Janitor linux (Ubuntu Zesty): status Fix Committed Fix Released
2018-06-25 16:11:54 Michael Reed dellserver: status In Progress Fix Released
2018-06-25 16:31:17 Leann Ogasawara removed subscriber Leann Ogasawara
2019-01-23 01:12:13 Joseph Salisbury linux (Ubuntu): status In Progress Fix Released
2019-07-24 21:07:14 Brad Figg tags verification-done-zesty cscc verification-done-zesty