drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring vcn_dec timeout

Bug #2018537 reported by GLenn
12
This bug affects 2 people
Affects Status Importance Assigned to Milestone
linux-hwe-5.19 (Ubuntu)
Confirmed
Undecided
Unassigned

Bug Description

Linux laptop2 5.19.0-41-generic #42~22.04.1-Ubuntu SMP PREEMPT_DYNAMIC Tue Apr 18 17:40:00 UTC 2 x86_64 x86_64 x86_64 GNU/Linux

There is a constant issue with the video acceleration in amdgpu.
In conjunction with firefox and drm protected video I could trigger a display freeze very reliably.
Disney+ start page would trigger this, screen would go blank shortly, returning with partial mouse response of applications.

Just some days ago the new kernel upgrade was applied.
With this, the graphics driver even crashes when playing unprotected videos like on youtube.
This applies to wayland and xorg mode.

Crash log:
[43929.632425] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring vcn_dec timeout, signaled seq=40254, emitted seq=40256
[43929.632814] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process RDD Process pid 17220 thread firefox:cs0 pid 31188
[43929.633160] amdgpu 0000:03:00.0: amdgpu: GPU reset begin!
[43930.143971] [drm] Register(0) [mmUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002
[43930.346077] [drm] Register(0) [mmUVD_RBC_RB_RPTR] failed to reach value 0x00000120 != 0x000000a0
[43930.542345] [drm] Register(0) [mmUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002
[43930.551906] [drm] free PSP TMR buffer
[43930.580095] CPU: 0 PID: 51067 Comm: kworker/u32:2 Not tainted 5.19.0-41-generic #42~22.04.1-Ubuntu
[43930.580103] Hardware name: LENOVO 82KT/LNVNB161216, BIOS GLCN41WW 09/13/2021
[43930.580106] Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
[43930.580121] Call Trace:
[43930.580124] <TASK>
[43930.580128] show_stack+0x52/0x69
[43930.580135] dump_stack_lvl+0x49/0x6d
[43930.580143] dump_stack+0x10/0x18
[43930.580149] amdgpu_do_asic_reset+0x2b/0x441 [amdgpu]
[43930.580186] amdgpu_device_gpu_recover_imp.cold+0x4f6/0x805 [amdgpu]
[43930.580186] amdgpu_job_timedout+0x15e/0x190 [amdgpu]
[43930.580186] ? finish_task_switch.isra.0+0x84/0x290
[43930.580186] drm_sched_job_timedout+0x6d/0x120 [gpu_sched]
[43930.580186] process_one_work+0x21f/0x400
[43930.580186] worker_thread+0x50/0x3f0
[43930.580186] ? rescuer_thread+0x3a0/0x3a0
[43930.580186] kthread+0xee/0x120
[43930.580186] ? kthread_complete_and_exit+0x20/0x20
[43930.580186] ret_from_fork+0x22/0x30
[43930.580186] </TASK>
[43930.581703] amdgpu 0000:03:00.0: amdgpu: MODE2 reset
[43930.582168] amdgpu 0000:03:00.0: amdgpu: GPU reset succeeded, trying to resume
[43930.582364] [drm] PCIE GART of 1024M enabled.
[43930.582366] [drm] PTB located at 0x000000F400900000
[43930.582382] [drm] PSP is resuming...
[43930.602253] [drm] reserve 0x400000 from 0xf41fb00000 for PSP TMR
[43930.921623] amdgpu 0000:03:00.0: amdgpu: RAS: optional ras ta ucode is not available
[43930.933021] amdgpu 0000:03:00.0: amdgpu: RAP: optional rap ta ucode is not available
[43930.933027] amdgpu 0000:03:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
[43930.933033] amdgpu 0000:03:00.0: amdgpu: SMU is resuming...
[43930.933772] amdgpu 0000:03:00.0: amdgpu: SMU is resumed successfully!
[43930.934335] [drm] DMUB hardware initialized: version=0x0101001F
[43931.267844] [drm] kiq ring mec 2 pipe 1 q 0
[43931.270626] [drm] VCN decode and encode initialized successfully(under DPG Mode).
[43931.270689] [drm] JPEG decode initialized successfully.
[43931.270695] amdgpu 0000:03:00.0: amdgpu: ring gfx uses VM inv eng 0 on hub 0
[43931.270701] amdgpu 0000:03:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[43931.270704] amdgpu 0000:03:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[43931.270707] amdgpu 0000:03:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
[43931.270710] amdgpu 0000:03:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
[43931.270712] amdgpu 0000:03:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
[43931.270714] amdgpu 0000:03:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
[43931.270717] amdgpu 0000:03:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
[43931.270719] amdgpu 0000:03:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
[43931.270722] amdgpu 0000:03:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
[43931.270725] amdgpu 0000:03:00.0: amdgpu: ring sdma0 uses VM inv eng 0 on hub 1
[43931.270728] amdgpu 0000:03:00.0: amdgpu: ring vcn_dec uses VM inv eng 1 on hub 1
[43931.270730] amdgpu 0000:03:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 4 on hub 1
[43931.270733] amdgpu 0000:03:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 5 on hub 1
[43931.270735] amdgpu 0000:03:00.0: amdgpu: ring jpeg_dec uses VM inv eng 6 on hub 1
[43931.274645] amdgpu 0000:03:00.0: amdgpu: recover vram bo from shadow start
[43931.274648] amdgpu 0000:03:00.0: amdgpu: recover vram bo from shadow done
[43931.274654] [drm] Skip scheduling IBs!
[43931.274670] amdgpu 0000:03:00.0: amdgpu: GPU reset(2) succeeded!
[43931.293293] [drm] Skip scheduling IBs!
[43931.293997] [drm] Skip scheduling IBs!
[43931.294319] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
[43931.294830] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
[43931.324720] [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!

ProblemType: Bug
DistroRelease: Ubuntu 22.04
Package: xorg 1:7.7+23ubuntu2
ProcVersionSignature: Ubuntu 5.19.0-41.42~22.04.1-generic 5.19.17
Uname: Linux 5.19.0-41-generic x86_64
ApportVersion: 2.20.11-0ubuntu82.4
Architecture: amd64
BootLog: Error: [Errno 13] Permission denied: '/var/log/boot.log'
CasperMD5CheckResult: pass
CompositorRunning: None
CurrentDesktop: ubuntu:GNOME
Date: Thu May 4 21:03:40 2023
DistUpgraded: Fresh install
DistroCodename: jammy
DistroVariant: ubuntu
ExtraDebuggingInterest: Yes
GpuHangFrequency: Several times a week
GpuHangReproducibility: Yes, I can easily reproduce it
GpuHangStarted: Within the last few days
GraphicsCard:
 Advanced Micro Devices, Inc. [AMD/ATI] Lucienne [1002:164c] (rev c3) (prog-if 00 [VGA controller])
   Subsystem: Lenovo Lucienne [17aa:3f95]
InstallationDate: Installed on 2022-09-07 (239 days ago)
InstallationMedia: Ubuntu 22.04.1 LTS "Jammy Jellyfish" - Release amd64 (20220809.1)
MachineType: LENOVO 82KT
ProcKernelCmdLine: BOOT_IMAGE=/vmlinuz-5.19.0-41-generic root=UUID=162e05ae-69e3-42b8-ae11-a01b1fd0a06f ro quiet splash vt.handoff=7
SourcePackage: xorg
Symptom: display
Title: Xorg freeze
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 09/13/2021
dmi.bios.release: 1.41
dmi.bios.vendor: LENOVO
dmi.bios.version: GLCN41WW
dmi.board.asset.tag: No Asset Tag
dmi.board.name: LNVNB161216
dmi.board.vendor: LENOVO
dmi.board.version: SDK0T76473WIN
dmi.chassis.asset.tag: No Asset Tag
dmi.chassis.type: 10
dmi.chassis.vendor: LENOVO
dmi.chassis.version: IdeaPad 3 14ALC6
dmi.ec.firmware.release: 1.41
dmi.modalias: dmi:bvnLENOVO:bvrGLCN41WW:bd09/13/2021:br1.41:efr1.41:svnLENOVO:pn82KT:pvrIdeaPad314ALC6:rvnLENOVO:rnLNVNB161216:rvrSDK0T76473WIN:cvnLENOVO:ct10:cvrIdeaPad314ALC6:skuLENOVO_MT_82KT_BU_idea_FM_IdeaPad314ALC6:
dmi.product.family: IdeaPad 3 14ALC6
dmi.product.name: 82KT
dmi.product.sku: LENOVO_MT_82KT_BU_idea_FM_IdeaPad 3 14ALC6
dmi.product.version: IdeaPad 3 14ALC6
dmi.sys.vendor: LENOVO
version.compiz: compiz N/A
version.libdrm2: libdrm2 2.4.113-2~ubuntu0.22.04.1
version.libgl1-mesa-dri: libgl1-mesa-dri 22.2.5-0ubuntu0.1~22.04.1
version.libgl1-mesa-glx: libgl1-mesa-glx N/A
version.xserver-xorg-core: xserver-xorg-core 2:21.1.4-2ubuntu1.7~22.04.1
version.xserver-xorg-input-evdev: xserver-xorg-input-evdev N/A
version.xserver-xorg-video-ati: xserver-xorg-video-ati 1:19.1.0-2ubuntu1
version.xserver-xorg-video-intel: xserver-xorg-video-intel 2:2.99.917+git20210115-1
version.xserver-xorg-video-nouveau: xserver-xorg-video-nouveau 1:1.0.17-2build1

Revision history for this message
GLenn (glenn64) wrote :
summary: - Xorg freeze
+ drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring vcn_dec timeout
affects: xorg (Ubuntu) → linux-hwe-5.19 (Ubuntu)
tags: added: amdgpu
Revision history for this message
Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux-hwe-5.19 (Ubuntu):
status: New → Confirmed
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.