amdgpu crash

Bug #2064529 reported by Brett Holman
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
New
Undecided
Unassigned
Noble
New
Undecided
Unassigned

Bug Description

graphics crashes, requires reboot

this has happened a few times recently - typically during periods of activity but not always.

See the journal log from the last boot below:

Apr 30 21:07:15 arc kernel: snd_hda_intel 0000:1f:00.1: Unable to change power state from D3hot to D0, device inaccessible
Apr 30 21:07:16 arc kernel: snd_hda_intel 0000:1f:00.1: CORB reset timeout#2, CORBRP = 65535
Apr 30 21:07:25 arc kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_low timeout, signaled seq=155368, emitted seq=155370
Apr 30 21:07:25 arc kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process spotify pid 11047 thread spotify:cs0 pid 11105
Apr 30 21:07:25 arc kernel: amdgpu 0000:1f:00.0: amdgpu: GPU reset begin!
Apr 30 21:07:33 arc kernel: amdgpu 0000:1f:00.0: amdgpu: failed to write reg 28b4 wait reg 28c6
Apr 30 21:07:34 arc kernel: amdgpu: [powerplay] Failed message: 0x24, input parameter: 0x0, error code: 0xffffffff
Apr 30 21:07:34 arc kernel: amdgpu: [powerplay] Failed message: 0x9, input parameter: 0xf4, error code: 0xffffffff
Apr 30 21:07:34 arc kernel: amdgpu: [powerplay] Failed message: 0xa, input parameter: 0x103000, error code: 0xffffffff
Apr 30 21:07:34 arc kernel: amdgpu: [powerplay] Failed message: 0xe, input parameter: 0x0, error code: 0xffffffff
Apr 30 21:07:34 arc kernel: amdgpu: [powerplay] Failed message: 0x42, input parameter: 0x1, error code: 0xffffffff
Apr 30 21:07:34 arc kernel: amdgpu: [powerplay] Failed message: 0x24, input parameter: 0x0, error code: 0xffffffff
Apr 30 21:07:54 arc kernel: [drm:atom_op_jump [amdgpu]] *ERROR* atombios stuck in loop for more than 20secs aborting
Apr 30 21:07:54 arc kernel: [drm:amdgpu_atom_execute_table_locked [amdgpu]] *ERROR* atombios stuck executing E18E (len 824, WS 0, PS 0) @ 0xE30E
Apr 30 21:07:54 arc kernel: [drm:amdgpu_atom_execute_table_locked [amdgpu]] *ERROR* atombios stuck executing E048 (len 326, WS 0, PS 0) @ 0xE138
Apr 30 21:07:54 arc kernel: amdgpu 0000:1f:00.0: [drm] *ERROR* dce110_link_encoder_disable_output: Failed to execute VBIOS command table!
Apr 30 21:08:14 arc kernel: [drm:atom_op_jump [amdgpu]] *ERROR* atombios stuck in loop for more than 20secs aborting
Apr 30 21:08:14 arc kernel: [drm:amdgpu_atom_execute_table_locked [amdgpu]] *ERROR* atombios stuck executing C30A (len 62, WS 0, PS 0) @ 0xC326
Apr 30 21:08:34 arc kernel: [drm:atom_op_jump [amdgpu]] *ERROR* atombios stuck in loop for more than 20secs aborting
Apr 30 21:08:34 arc kernel: [drm:amdgpu_atom_execute_table_locked [amdgpu]] *ERROR* atombios stuck executing B802 (len 1359, WS 12, PS 8) @ 0xBB17
Apr 30 21:08:34 arc kernel: amdgpu 0000:1f:00.0: [drm] REG_WAIT timeout 10us * 3000 tries - dce110_stream_encoder_dp_blank line:936
Apr 30 21:08:54 arc kernel: [drm:atom_op_jump [amdgpu]] *ERROR* atombios stuck in loop for more than 20secs aborting
Apr 30 21:08:54 arc kernel: [drm:amdgpu_atom_execute_table_locked [amdgpu]] *ERROR* atombios stuck executing E18E (len 824, WS 0, PS 0) @ 0xE30E
Apr 30 21:08:54 arc kernel: [drm:amdgpu_atom_execute_table_locked [amdgpu]] *ERROR* atombios stuck executing E048 (len 326, WS 0, PS 0) @ 0xE138
Apr 30 21:08:54 arc kernel: amdgpu 0000:1f:00.0: [drm] *ERROR* dce110_link_encoder_disable_output: Failed to execute VBIOS command table!
Apr 30 21:09:14 arc kernel: [drm:atom_op_jump [amdgpu]] *ERROR* atombios stuck in loop for more than 20secs aborting
Apr 30 21:09:14 arc kernel: [drm:amdgpu_atom_execute_table_locked [amdgpu]] *ERROR* atombios stuck executing C30A (len 62, WS 0, PS 0) @ 0xC326
Apr 30 21:09:14 arc kernel: amdgpu 0000:1f:00.0: [drm] REG_WAIT timeout 10us * 3000 tries - dce110_stream_encoder_dp_blank line:936

ProblemType: Bug
DistroRelease: Ubuntu 24.04
Package: linux-image-6.8.0-31-generic 6.8.0-31.31
ProcVersionSignature: Ubuntu 6.8.0-31.31-generic 6.8.1
Uname: Linux 6.8.0-31-generic x86_64
NonfreeKernelModules: zfs
ApportVersion: 2.28.1-0ubuntu2
Architecture: amd64
CRDA: N/A
CasperMD5CheckResult: pass
CurrentDesktop: ubuntu:GNOME
Date: Wed May 1 11:09:48 2024
InstallationDate: Installed on 2021-10-20 (924 days ago)
InstallationMedia: Ubuntu 21.04 "Hirsute Hippo" - Release amd64 (20210420)
MachineType: Micro-Star International Co., Ltd. MS-7B79
ProcFB: 0 amdgpudrmfb
ProcKernelCmdLine: BOOT_IMAGE=/vmlinuz-6.8.0-31-generic root=/dev/mapper/vgubuntu-root ro quiet splash vt.handoff=7
RelatedPackageVersions:
 linux-restricted-modules-6.8.0-31-generic N/A
 linux-backports-modules-6.8.0-31-generic N/A
 linux-firmware 20240318.git3b128b60-0ubuntu2
RfKill:
 0: hci0: Bluetooth
  Soft blocked: no
  Hard blocked: no
SourcePackage: linux
UpgradeStatus: Upgraded to noble on 2024-01-30 (92 days ago)
dmi.bios.date: 06/28/2018
dmi.bios.release: 5.13
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: A.40
dmi.board.asset.tag: To be filled by O.E.M.
dmi.board.name: X470 GAMING PLUS (MS-7B79)
dmi.board.vendor: Micro-Star International Co., Ltd.
dmi.board.version: 2.0
dmi.chassis.asset.tag: To be filled by O.E.M.
dmi.chassis.type: 3
dmi.chassis.vendor: Micro-Star International Co., Ltd.
dmi.chassis.version: 2.0
dmi.modalias: dmi:bvnAmericanMegatrendsInc.:bvrA.40:bd06/28/2018:br5.13:svnMicro-StarInternationalCo.,Ltd.:pnMS-7B79:pvr2.0:rvnMicro-StarInternationalCo.,Ltd.:rnX470GAMINGPLUS(MS-7B79):rvr2.0:cvnMicro-StarInternationalCo.,Ltd.:ct3:cvr2.0:skuTobefilledbyO.E.M.:
dmi.product.family: To be filled by O.E.M.
dmi.product.name: MS-7B79
dmi.product.sku: To be filled by O.E.M.
dmi.product.version: 2.0
dmi.sys.vendor: Micro-Star International Co., Ltd.

Revision history for this message
Brett Holman (holmanb) wrote :
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.