amdgpu: page fault in page starting at address 0x0000000000000000 from client 0x1b (UTCL2)

Bug #2110038 reported by Julian Andres Klode
This bug affects 1 person
Affects        Status  Importance  Assigned to  Milestone
mesa (Ubuntu)  New     Undecided   Unassigned

Bug Description

Similar to bug 2032386, but with NULL addresses in the GPU page fault. It is triggered by Google Meet in Firefox (the deb from Mozilla). The regression started around March.

ProblemType: Bug
DistroRelease: Ubuntu 25.10
Package: libdrm-amdgpu1 2.4.124-2
ProcVersionSignature: Ubuntu 6.14.0-15.15-generic 6.14.0
Uname: Linux 6.14.0-15-generic x86_64
NonfreeKernelModules: zfs
ApportVersion: 2.32.0-0ubuntu5
Architecture: amd64
BootLog: Error: [Errno 13] Permission denied: '/var/log/boot.log'
CasperMD5CheckResult: pass
CompositorRunning: None
CurrentDesktop: GNOME
Date: Tue May 6 13:49:31 2025
DistUpgraded: Fresh install
DistroCodename: questing
DistroVariant: ubuntu
GraphicsCard:
 Advanced Micro Devices, Inc. [AMD/ATI] Rembrandt [Radeon 680M] [1002:1681] (rev d1) (prog-if 00 [VGA controller])
   Subsystem: Lenovo Device [17aa:50b6]
InstallationDate: Installed on 2022-11-26 (892 days ago)
InstallationMedia: Ubuntu 23.04 "Lunar Lobster" - Alpha amd64 (20221126)
MachineType: LENOVO 21CF004PGE
ProcKernelCmdLine: BOOT_IMAGE=/vmlinuz-6.14.0-15-generic root=/dev/mapper/ubuntu-root ro rootflags=subvol=@ quiet splash amdgpu.dcdebugmask=0x10 zswap.enabled=1 zswap.compressor=zstd zswap.max_pool_percent=20 zswap.zpool=zsmalloc vt.handoff=7
RebootRequiredPkgs: Error: path contained symlinks.
SourcePackage: libdrm
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 05/29/2024
dmi.bios.release: 1.53
dmi.bios.vendor: LENOVO
dmi.bios.version: R23ET77W (1.53 )
dmi.board.asset.tag: Not Available
dmi.board.name: 21CF004PGE
dmi.board.vendor: LENOVO
dmi.board.version: SDK0T76538 WIN
dmi.chassis.asset.tag: No Asset Information
dmi.chassis.type: 10
dmi.chassis.vendor: LENOVO
dmi.chassis.version: None
dmi.ec.firmware.release: 1.32
dmi.modalias: dmi:bvnLENOVO:bvrR23ET77W(1.53):bd05/29/2024:br1.53:efr1.32:svnLENOVO:pn21CF004PGE:pvrThinkPadT14Gen3:rvnLENOVO:rn21CF004PGE:rvrSDK0T76538WIN:cvnLENOVO:ct10:cvrNone:skuLENOVO_MT_21CF_BU_Think_FM_ThinkPadT14Gen3:
dmi.product.family: ThinkPad T14 Gen 3
dmi.product.name: 21CF004PGE
dmi.product.sku: LENOVO_MT_21CF_BU_Think_FM_ThinkPad T14 Gen 3
dmi.product.version: ThinkPad T14 Gen 3
dmi.sys.vendor: LENOVO
version.compiz: compiz N/A
version.libdrm2: libdrm2 2.4.124-2
version.libgl1-mesa-dri: libgl1-mesa-dri 25.0.3-1ubuntu2
version.libgl1-mesa-glx: libgl1-mesa-glx N/A
version.xserver-xorg-core: xserver-xorg-core 2:21.1.16-1ubuntu1
version.xserver-xorg-input-evdev: xserver-xorg-input-evdev N/A
version.xserver-xorg-video-ati: xserver-xorg-video-ati 1:22.0.0-1build1
version.xserver-xorg-video-intel: xserver-xorg-video-intel 2:2.99.917+git20210115-1build1
version.xserver-xorg-video-nouveau: xserver-xorg-video-nouveau 1:1.0.18-1

Julian Andres Klode (juliank) wrote :

This Firefox has VA-API enabled. The problem appears to be an interaction between multiple VA-API streams (the attendees' videos) and possibly the OpenGL rendering around them.

I have now restarted Firefox with AMD_DEBUG=nooptvariant, which seems to make it more stable. Before that, it was hanging the GPU multiple times per minute.
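
For anyone wanting to reproduce the workaround, here is a minimal sketch of starting a program with that variable set. The helper name `env_with_amd_debug` is made up for illustration; the only thing taken from this report is the `AMD_DEBUG=nooptvariant` setting itself.

```python
import os


def env_with_amd_debug(value, base=None):
    """Return a copy of an environment with AMD_DEBUG set to `value`.

    `base` defaults to the current process environment; the original
    mapping is never modified.
    """
    env = dict(os.environ if base is None else base)
    env["AMD_DEBUG"] = value
    return env


# Hypothetical usage (not executed here, and assumes `firefox` is on PATH):
#   import subprocess
#   subprocess.Popen(["firefox"], env=env_with_amd_debug("nooptvariant"))
```

Building the environment separately from the launch keeps the setting scoped to the one process, so other GL applications are not affected.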

Julian Andres Klode (juliank) wrote :

Funnily enough, opening another window then crashed the entire GPU hard.

Mai 06 14:12:02 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: ring gfx_0.1.0 timeout, signaled seq=10238003, emitted seq=10238004
Mai 06 14:12:02 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: Process information: process gnome-shell pid 4814 thread gnome-shel:cs0 pid 4844
Mai 06 14:12:02 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: Starting gfx_0.1.0 ring reset
Mai 06 14:12:02 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: Ring gfx_0.1.0 reset failure
Mai 06 14:12:02 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: GPU reset begin!
Mai 06 14:12:02 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: MODE2 reset
Mai 06 14:12:02 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: GPU reset succeeded, trying to resume
Mai 06 14:12:02 jak-t14-g3 kernel: [drm] PCIE GART of 1024M enabled (table at 0x000000F43FC00000).
Mai 06 14:12:02 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: PSP is resuming...
Mai 06 14:12:02 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: reserve 0xa00000 from 0xf43e000000 for PSP TMR
Mai 06 14:12:02 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: RAS: optional ras ta ucode is not available
Mai 06 14:12:02 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: RAP: optional rap ta ucode is not available
Mai 06 14:12:02 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
Mai 06 14:12:02 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: SMU is resuming...
Mai 06 14:12:02 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: SMU is resumed successfully!
Mai 06 14:12:02 jak-t14-g3 kernel: [drm] kiq ring mec 2 pipe 1 q 0
Mai 06 14:12:02 jak-t14-g3 kernel: [drm] DMUB hardware initialized: version=0x04000045
Mai 06 14:12:03 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
Mai 06 14:12:03 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: ring gfx_0.1.0 uses VM inv eng 1 on hub 0
Mai 06 14:12:03 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 4 on hub 0
Mai 06 14:12:03 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 5 on hub 0
Mai 06 14:12:03 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
Mai 06 14:12:03 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
Mai 06 14:12:03 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
Mai 06 14:12:03 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
Mai 06 14:12:03 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
Mai 06 14:12:03 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
Mai 06 14:12:03 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: ring kiq_0.2.1.0 uses VM inv eng 12 on hub 0
Mai 06 14:12:03 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: ring sdma0 uses VM inv eng 13 on hub 0
Mai 06 14:12:03 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 8
Mai 06 14:12:03 jak-t14-g3 kernel: amdgpu...
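
To count how often this happens over a longer journal, the ring timeout lines can be picked out with a small script. This is only a sketch: the two regular expressions are written against the message format visible in the excerpt above and may not match other kernel versions, and the function name is made up for illustration.

```python
import re

# Patterns derived from the log excerpt in this report (assumed format).
TIMEOUT_RE = re.compile(
    r"ring (?P<ring>\S+) timeout, "
    r"signaled seq=(?P<signaled>\d+), emitted seq=(?P<emitted>\d+)"
)
PROCESS_RE = re.compile(
    r"Process information: process (?P<name>\S+) pid (?P<pid>\d+)"
)


def summarize_ring_timeouts(lines):
    """Return one dict per ring timeout, with the process logged after it."""
    events = []
    for line in lines:
        match = TIMEOUT_RE.search(line)
        if match:
            events.append({
                "ring": match["ring"],
                "signaled": int(match["signaled"]),
                "emitted": int(match["emitted"]),
                "process": None,
                "pid": None,
            })
            continue
        match = PROCESS_RE.search(line)
        # Attach process details to the most recent timeout, if still unset.
        if match and events and events[-1]["process"] is None:
            events[-1]["process"] = match["name"]
            events[-1]["pid"] = int(match["pid"])
    return events


sample = [
    "Mai 06 14:12:02 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: "
    "ring gfx_0.1.0 timeout, signaled seq=10238003, emitted seq=10238004",
    "Mai 06 14:12:02 jak-t14-g3 kernel: amdgpu 0000:04:00.0: amdgpu: "
    "Process information: process gnome-shell pid 4814 "
    "thread gnome-shel:cs0 pid 4844",
]

for event in summarize_ring_timeouts(sample):
    print(event)
```

Fed with the kernel journal, this gives a quick list of which ring timed out and which process the kernel attributed it to.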
