Groovy 5.8 kernel hangs on boot on CPUs with eLLC

Bug #1903397 reported by John Smith on 2020-11-07
22
This bug affects 2 people
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Undecided
Unassigned
Groovy
High
Kamal Mostafa
linux-hwe-5.8 (Ubuntu)
Undecided
Unassigned
Focal
High
Unassigned

Bug Description

5.8 kernel hangs on boot on my PC with Intel Core i7-5775C.
Bug is already fixed upstream:
https://gitlab.freedesktop.org/drm/intel/-/issues/2381
https://bugzilla.kernel.org/show_bug.cgi?id=208737
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?h=v5.10-rc1&id=1664ffee760a5d98952318fdd9b198fae396d660
Please backport the fix.
Thanks!

[Impact]

 * Bug causes boot failure on common hardware.

[Test Case]

 * Test kernel validated by submitter (see Comment #6).

[Regression Potential]

 * Low: Simple cherry-pick from upstream.

CVE References

This bug is missing log files that will aid in diagnosing the problem. While running an Ubuntu kernel (not a mainline or third-party kernel) please enter the following command in a terminal window:

apport-collect 1903397

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
tags: added: groovy
John Smith (jksmith) on 2020-11-07
Changed in linux (Ubuntu):
status: Incomplete → Confirmed
John Smith (jksmith) wrote :

The fix is backported to 5.9 kernel.
https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/commit/?h=v5.9.8&id=7bc462243188e05cc976587bc9942e7c05241140
5.8 is EOL.
Will the fix be backported to Groovy 5.8 kernel?

tags: added: kernel-fixed-upstream
John Smith (jksmith) wrote :

Is there a way to push fix into groovy kernel branch myself? This bug makes system unbootable on plenty of CPUs with eDRAM.

Changed in linux (Ubuntu Groovy):
status: New → In Progress
assignee: nobody → Kamal Mostafa (kamalmostafa)
description: updated
Kamal Mostafa (kamalmostafa) wrote :

Thanks for reporting this, John. We'll get the fix into our groovy build pipeline right away.

In the meantime, please try this test kernel which includes it to verify that it resolves the problem (and let us know here):
https://kernel.ubuntu.com/~kamal/lp1903397/amd64/

Kamal Mostafa (kamalmostafa) wrote :
Changed in linux (Ubuntu Groovy):
importance: Undecided → High
John Smith (jksmith) wrote :

I was able to boot. Everything seems ok.
Thank you!

description: updated
Changed in linux (Ubuntu Groovy):
status: In Progress → Fix Committed
gruberm (gruberm) wrote :

Will this land in Focal's HWE package as well?
Just wondering because only Groovy is listed.

Kamal Mostafa (kamalmostafa) wrote :

@gruberm, Yes, this patch will make it into the focal/hwe-5.8 also.

no longer affects: linux-hwe-5.8 (Ubuntu Groovy)
no longer affects: linux (Ubuntu Focal)
Changed in linux-hwe-5.8 (Ubuntu Focal):
importance: Undecided → High
status: New → In Progress

This bug is awaiting verification that the kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-groovy' to 'verification-done-groovy'. If the problem still exists, change the tag 'verification-needed-groovy' to 'verification-failed-groovy'.

If verification is not done by 5 working days from today, this fix will be dropped from the source code, and this bug will be closed.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you!

tags: added: verification-needed-groovy
John Smith (jksmith) wrote :

The kernel in -proposed solves the problem. Should I change the tag to 'verification-done-groovy'?

John Smith (jksmith) on 2020-11-27
tags: added: verification-done-groovy
removed: verification-needed-groovy
Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux-hwe-5.8 (Ubuntu):
status: New → Confirmed
gruberm (gruberm) wrote :

In case -proposed in Focal's HWE needs also verification:
could someone with a 5675C or 5775C test 5.8.0-31.33~20.04.1 with the corresponding patch as well?
I've unfortunately no access to my Broadwell machine for the next 5-7 days.

Kamal Mostafa (kamalmostafa) wrote :

Thanks for the verification-done-groovy @jksmith.

FYI @gruberm, the Focal HWE kernel won't need formal verification. The patch will appear in the Focal HWE kernel version 5.8.0-32.34~20.04.1 once that gets generated from the groovy -32.34 krenel later this week. (Note that the -31.33~20.04.1 does *not* yet contain the patch).

Launchpad Janitor (janitor) wrote :
Download full text (50.5 KiB)

This bug was fixed in the package linux - 5.8.0-31.33

---------------
linux (5.8.0-31.33) groovy; urgency=medium

  * groovy/linux: 5.8.0-31.33 -proposed tracker (LP: #1905299)

  * Groovy 5.8 kernel hangs on boot on CPUs with eLLC (LP: #1903397)
    - drm/i915: Mark ininitial fb obj as WT on eLLC machines to avoid rcu lockup
      during fbdev init

  * CVE-2020-4788
    - selftests/powerpc: rfi_flush: disable entry flush if present
    - powerpc/64s: flush L1D on kernel entry
    - powerpc/64s: flush L1D after user accesses
    - selftests/powerpc: entry flush test

linux (5.8.0-30.32) groovy; urgency=medium

  * groovy/linux: 5.8.0-30.32 -proposed tracker (LP: #1903194)

  * Update kernel packaging to support forward porting kernels (LP: #1902957)
    - [Debian] Update for leader included in BACKPORT_SUFFIX

  * Avoid double newline when running insertchanges (LP: #1903293)
    - [Packaging] insertchanges: avoid double newline

  * EFI: Fails when BootCurrent entry does not exist (LP: #1899993)
    - efivarfs: Replace invalid slashes with exclamation marks in dentries.

  * raid10: Block discard is very slow, causing severe delays for mkfs and
    fstrim operations (LP: #1896578)
    - md: add md_submit_discard_bio() for submitting discard bio
    - md/raid10: extend r10bio devs to raid disks
    - md/raid10: pull codes that wait for blocked dev into one function
    - md/raid10: improve raid10 discard request
    - md/raid10: improve discard request for far layout
    - dm raid: fix discard limits for raid1 and raid10
    - dm raid: remove unnecessary discard limits for raid10

  * Bionic: btrfs: kernel BUG at /build/linux-
    eTBZpZ/linux-4.15.0/fs/btrfs/ctree.c:3233! (LP: #1902254)
    - btrfs: extent_io: do extra check for extent buffer read write functions
    - btrfs: extent-tree: kill BUG_ON() in __btrfs_free_extent()
    - btrfs: extent-tree: kill the BUG_ON() in insert_inline_extent_backref()
    - btrfs: ctree: check key order before merging tree blocks

  * Tiger Lake PMC core driver fixes (LP: #1899883)
    - platform/x86: intel_pmc_core: update TGL's LPM0 reg bit map name
    - platform/x86: intel_pmc_core: fix bound check in pmc_core_mphy_pg_show()
    - platform/x86: pmc_core: Use descriptive names for LPM registers
    - platform/x86: intel_pmc_core: Fix TigerLake power gating status map
    - platform/x86: intel_pmc_core: Fix the slp_s0 counter displayed value

  * drm/i915/dp_mst - System would hang during the boot up. (LP: #1902469)
    - Revert "UBUNTU: SAUCE: drm/i915/display: Fix null deref in
      intel_psr_atomic_check()"
    - drm/i915: Fix encoder lookup during PSR atomic check

  * Undetected Data corruption in MPI workloads that use VSX for reductions on
    POWER9 DD2.1 systems (LP: #1902694)
    - powerpc: Fix undetected data corruption with P9N DD2.1 VSX CI load emulation
    - selftests/powerpc: Make alignment handler test P9N DD2.1 vector CI load
      workaround

  * [20.04 FEAT] Support/enhancement of NVMe IPL (LP: #1902179)
    - s390/ipl: support NVMe IPL kernel parameters

  * uvcvideo: add mapping for HEVC payloads (LP: #1895803)
    - media: uvcvideo: Add mapping for HEVC payloads

  * risc-v 5.8 ...

Changed in linux (Ubuntu Groovy):
status: Fix Committed → Fix Released
Launchpad Janitor (janitor) wrote :
Download full text (51.0 KiB)

This bug was fixed in the package linux-hwe-5.8 - 5.8.0-31.33~20.04.1

---------------
linux-hwe-5.8 (5.8.0-31.33~20.04.1) focal; urgency=medium

  * focal/linux-hwe-5.8: 5.8.0-31.33~20.04.1 -proposed tracker (LP: #1905311)

  [ Ubuntu: 5.8.0-31.33 ]

  * groovy/linux: 5.8.0-31.33 -proposed tracker (LP: #1905299)
  * Groovy 5.8 kernel hangs on boot on CPUs with eLLC (LP: #1903397)
    - drm/i915: Mark ininitial fb obj as WT on eLLC machines to avoid rcu lockup
      during fbdev init
  * CVE-2020-4788
    - selftests/powerpc: rfi_flush: disable entry flush if present
    - powerpc/64s: flush L1D on kernel entry
    - powerpc/64s: flush L1D after user accesses
    - selftests/powerpc: entry flush test

linux-hwe-5.8 (5.8.0-30.32~20.04.1) focal; urgency=medium

  * focal/linux-hwe-5.8: 5.8.0-30.32~20.04.1 -proposed tracker (LP: #1903193)

  * Groovy update: v5.8.15 upstream stable release (LP: #1902130)
    - [Packaging] [hwe-5.8] module ocelot_board rename

  * Packaging resync (LP: #1786013)
    - [Packaging] update update.conf

  [ Ubuntu: 5.8.0-30.32 ]

  * groovy/linux: 5.8.0-30.32 -proposed tracker (LP: #1903194)
  * Update kernel packaging to support forward porting kernels (LP: #1902957)
    - [Debian] Update for leader included in BACKPORT_SUFFIX
  * Avoid double newline when running insertchanges (LP: #1903293)
    - [Packaging] insertchanges: avoid double newline
  * EFI: Fails when BootCurrent entry does not exist (LP: #1899993)
    - efivarfs: Replace invalid slashes with exclamation marks in dentries.
  * raid10: Block discard is very slow, causing severe delays for mkfs and
    fstrim operations (LP: #1896578)
    - md: add md_submit_discard_bio() for submitting discard bio
    - md/raid10: extend r10bio devs to raid disks
    - md/raid10: pull codes that wait for blocked dev into one function
    - md/raid10: improve raid10 discard request
    - md/raid10: improve discard request for far layout
    - dm raid: fix discard limits for raid1 and raid10
    - dm raid: remove unnecessary discard limits for raid10
  * Bionic: btrfs: kernel BUG at /build/linux-
    eTBZpZ/linux-4.15.0/fs/btrfs/ctree.c:3233! (LP: #1902254)
    - btrfs: extent_io: do extra check for extent buffer read write functions
    - btrfs: extent-tree: kill BUG_ON() in __btrfs_free_extent()
    - btrfs: extent-tree: kill the BUG_ON() in insert_inline_extent_backref()
    - btrfs: ctree: check key order before merging tree blocks
  * Tiger Lake PMC core driver fixes (LP: #1899883)
    - platform/x86: intel_pmc_core: update TGL's LPM0 reg bit map name
    - platform/x86: intel_pmc_core: fix bound check in pmc_core_mphy_pg_show()
    - platform/x86: pmc_core: Use descriptive names for LPM registers
    - platform/x86: intel_pmc_core: Fix TigerLake power gating status map
    - platform/x86: intel_pmc_core: Fix the slp_s0 counter displayed value
  * drm/i915/dp_mst - System would hang during the boot up. (LP: #1902469)
    - Revert "UBUNTU: SAUCE: drm/i915/display: Fix null deref in
      intel_psr_atomic_check()"
    - drm/i915: Fix encoder lookup during PSR atomic check
  * Undetected Data corruption in MPI workloads that use VSX for reductions on
    POWER9 DD2.1 sy...

Changed in linux-hwe-5.8 (Ubuntu Focal):
status: In Progress → Fix Released
Launchpad Janitor (janitor) wrote :

This bug was fixed in the package linux - 5.8.0-36.40+21.04.1

---------------
linux (5.8.0-36.40+21.04.1) hirsute; urgency=medium

  * Packaging resync (LP: #1786013)
    - update dkms package versions

  [ Ubuntu: 5.8.0-36.40 ]

  * debian/scripts/file-downloader does not handle positive failures correctly
    (LP: #1878897)
    - [Packaging] file-downloader not handling positive failures correctly

  [ Ubuntu: 5.8.0-35.39 ]

  * Packaging resync (LP: #1786013)
    - update dkms package versions
  * CVE-2021-1052 // CVE-2021-1053
    - [Packaging] NVIDIA -- Add the NVIDIA 460 driver

 -- Kleber Sacilotto de Souza <email address hidden> Thu, 07 Jan 2021 11:57:30 +0100

Changed in linux (Ubuntu):
status: Confirmed → Fix Released
To post a comment you must log in.
This report contains Public information  Edit
Everyone can see this information.

Duplicates of this bug

Other bug subscribers