[amdgpu] Several stability bugs in pre-5.17 kernels, e.g. on AMD RX 6600 XT

Bug #1987413 reported by Dennis Gnad
This bug affects 1 person
Affects                  Status     Importance  Assigned to  Milestone
linux (Ubuntu)           Confirmed  Undecided   Unassigned
linux-hwe-5.15 (Ubuntu)  Confirmed  Undecided   Unassigned

Bug Description

Since updating to kernel 5.15 on Ubuntu 20.04 LTS, I have had several stability issues with my AMD RX 6600 XT graphics adapter. I upgraded to 22.04 LTS hoping to get more recent kernels, but there are no HWE kernels and the GA kernel is still 5.15, whereas I would expect it to be at least at kernel.org's current stable release, 5.19.3 as of now.

These are the bugs that seem to affect me and other users of more recent AMD GPUs:
https://gitlab.freedesktop.org/drm/amd/-/issues/1819
https://gitlab.freedesktop.org/drm/amd/-/issues/1871
https://gitlab.freedesktop.org/drm/amd/-/issues/1887

In those reports, people say that kernel 5.17 or later fixes the problems, which I can confirm: I switched to the Ubuntu OEM kernel (5.17.0-1015-oem) and have had a 100% stable system so far.
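
For anyone who wants to check quickly whether a given kernel is on the affected side of that 5.17 boundary, here is a minimal sketch. The function names are made up for illustration, and the (5, 17) threshold is only what the linked reports and my own experience suggest, not an official cut-off.

```python
import re


def parse_release(release: str) -> tuple:
    """Return (major, minor) from a release string such as '5.15.0-47-generic'."""
    match = re.match(r"(\d+)\.(\d+)", release)
    if match is None:
        raise ValueError(f"unrecognised kernel release: {release!r}")
    return int(match.group(1)), int(match.group(2))


def kernel_at_least(release: str, minimum: tuple = (5, 17)) -> bool:
    """True if the release is at or above the minimum (major, minor) version.

    The default minimum is an assumption taken from the reports, where
    5.17 or later is said to fix the amdgpu issues.
    """
    return parse_release(release) >= minimum


# The two kernels mentioned in this report:
print(kernel_at_least("5.15.0-47-generic"))  # False
print(kernel_at_least("5.17.0-1015-oem"))    # True
```

On a running system the release string can be taken from `uname -r` or, in Python, from `platform.release()`.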

So I guess these problems will be solved by the next HWE kernel, which is expected to be added to the Ubuntu 22.04 archive in October. However, current Ubuntu 22.04 or 20.04 HWE users have no supported way to get a stable system with various amdgpu adapters.
---
ProblemType: Bug
ApportVersion: 2.20.11-0ubuntu82.1
Architecture: amd64
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC3: dennis 2163 F.... pulseaudio
 /dev/snd/controlC2: dennis 2163 F.... pulseaudio
 /dev/snd/controlC1: dennis 2163 F.... pulseaudio
CasperMD5CheckResult: unknown
CurrentDesktop: ubuntu:GNOME
DistroRelease: Ubuntu 22.04
InstallationDate: Installed on 2020-05-01 (844 days ago)
InstallationMedia: Ubuntu 20.04 LTS "Focal Fossa" - Release amd64 (20200423)
IwConfig:
 lo no wireless extensions.

 enp37s0 no wireless extensions.
MachineType: Micro-Star International Co., Ltd. MS-7B87
Package: linux-hwe-5.15
ProcFB: 0 amdgpudrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-5.15.0-47-generic root=UUID=6eaf2615-5b03-4ec9-a9df-bee7f682670a ro quiet splash vt.handoff=7
ProcVersionSignature: Ubuntu 5.15.0-47.51-generic 5.15.46
RelatedPackageVersions:
 linux-restricted-modules-5.15.0-47-generic N/A
 linux-backports-modules-5.15.0-47-generic N/A
 linux-firmware 20220329.git681281e4-0ubuntu3.4
RfKill:
 0: hci0: Bluetooth
  Soft blocked: no
  Hard blocked: no
Tags: wayland-session jammy
Uname: Linux 5.15.0-47-generic x86_64
UpgradeStatus: Upgraded to jammy on 2022-08-19 (4 days ago)
UserGroups: adm cdrom dip lpadmin lxd plugdev sambashare sudo wireshark
_MarkForUpload: True
dmi.bios.date: 06/11/2020
dmi.bios.release: 5.14
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 1.C0
dmi.board.asset.tag: To be filled by O.E.M.
dmi.board.name: B450M GAMING PLUS (MS-7B87)
dmi.board.vendor: Micro-Star International Co., Ltd.
dmi.board.version: 1.0
dmi.chassis.asset.tag: To be filled by O.E.M.
dmi.chassis.type: 3
dmi.chassis.vendor: Micro-Star International Co., Ltd.
dmi.chassis.version: 1.0
dmi.modalias: dmi:bvnAmericanMegatrendsInc.:bvr1.C0:bd06/11/2020:br5.14:svnMicro-StarInternationalCo.,Ltd.:pnMS-7B87:pvr1.0:rvnMicro-StarInternationalCo.,Ltd.:rnB450MGAMINGPLUS(MS-7B87):rvr1.0:cvnMicro-StarInternationalCo.,Ltd.:ct3:cvr1.0:skuTobefilledbyO.E.M.:
dmi.product.family: To be filled by O.E.M.
dmi.product.name: MS-7B87
dmi.product.sku: To be filled by O.E.M.
dmi.product.version: 1.0
dmi.sys.vendor: Micro-Star International Co., Ltd.
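
The ProcVersionSignature line above carries both the Ubuntu package release and the upstream base version, which is useful when comparing against the upstream reports. Below is a minimal sketch that splits it apart. The field names are hypothetical, and the three-part layout is assumed from the single line in this report, so flavours containing a hyphen would not split correctly.

```python
from typing import NamedTuple


class VersionSignature(NamedTuple):
    distro: str           # e.g. "Ubuntu"
    package_release: str  # e.g. "5.15.0-47.51"
    flavour: str          # e.g. "generic"
    upstream: str         # e.g. "5.15.46"


def parse_version_signature(signature: str) -> VersionSignature:
    """Split a line such as 'Ubuntu 5.15.0-47.51-generic 5.15.46'.

    Assumes exactly three whitespace-separated fields, with the flavour
    being whatever follows the last hyphen of the middle field.
    """
    distro, ubuntu_release, upstream = signature.split()
    package_release, _, flavour = ubuntu_release.rpartition("-")
    return VersionSignature(distro, package_release, flavour, upstream)


sig = parse_version_signature("Ubuntu 5.15.0-47.51-generic 5.15.46")
print(sig.upstream)  # 5.15.46
```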

Luís Infante da Câmara (luis220413) wrote :
description: updated

Ubuntu Kernel Bot (ubuntu-kernel-bot) wrote : Missing required logs.

This bug is missing log files that will aid in diagnosing the problem. While running an Ubuntu kernel (not a mainline or third-party kernel) please enter the following command in a terminal window:

apport-collect 1987413

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete

Dennis Gnad (bluesceada-1) wrote :
Changed in linux-hwe-5.15 (Ubuntu):
status: New → Incomplete

Dennis Gnad (bluesceada-1) wrote : AlsaInfo.txt

apport information

tags: added: apport-collected jammy wayland-session
description: updated

Dennis Gnad (bluesceada-1) wrote : CRDA.txt

apport information

Dennis Gnad (bluesceada-1) wrote : CurrentDmesg.txt

apport information

Dennis Gnad (bluesceada-1) wrote : Lspci.txt

apport information

Dennis Gnad (bluesceada-1) wrote : Lspci-vt.txt

apport information

Dennis Gnad (bluesceada-1) wrote : Lsusb.txt

apport information

Dennis Gnad (bluesceada-1) wrote : Lsusb-t.txt

apport information

Dennis Gnad (bluesceada-1) wrote : Lsusb-v.txt

apport information

Dennis Gnad (bluesceada-1) wrote : PaInfo.txt

apport information

Dennis Gnad (bluesceada-1) wrote : ProcCpuinfo.txt

apport information

Dennis Gnad (bluesceada-1) wrote : ProcCpuinfoMinimal.txt

apport information

Dennis Gnad (bluesceada-1) wrote : ProcEnviron.txt

apport information

Dennis Gnad (bluesceada-1) wrote : ProcInterrupts.txt

apport information

Dennis Gnad (bluesceada-1) wrote : ProcModules.txt

apport information

Dennis Gnad (bluesceada-1) wrote : PulseList.txt

apport information

Dennis Gnad (bluesceada-1) wrote : UdevDb.txt

apport information

Dennis Gnad (bluesceada-1) wrote : WifiSyslog.txt

apport information

Dennis Gnad (bluesceada-1) wrote : acpidump.txt

apport information

Luís Infante da Câmara (luis220413) wrote :

Since then, many HWE kernels have been published; the current HWE kernel is 6.2.0-31.31~22.04.1. Can you test with this kernel?

Changed in linux (Ubuntu):
status: Incomplete → Confirmed
Changed in linux-hwe-5.15 (Ubuntu):
status: Incomplete → Confirmed

Dennis Gnad (bluesceada-1) wrote :

Sorry, I overlooked that last message. I actually had a working system from around 6.2.0 to 6.5.0-17. It seems that a new problem was introduced around 6.5.0-18 that now causes intermittent freezes. See the new bug: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2055818
