amdgpu: [gfxhub] page fault
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
linux (Ubuntu) | New | Undecided | Unassigned |
Noble | Incomplete | Undecided | Unassigned |
Bug Description
At random intervals, my screens freeze and no longer respond to any input.
I am running Linux 6.8.0-38-generic on an AMD Ryzen 9 7900 CPU with an RX 6750 XT GPU.
libdrm-
libdrm-
xserver-
Relevant part of `journalctl -b -1 -e`:
```
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:1 pasid:32771, for process Xorg pid 16675 thread Xorg:cs0 pid 16896)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x0000800104b41000 from client 0x1b (UTCL2)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: GCVM_L2_
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x1
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x3
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: RW: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:1 pasid:32771, for process Xorg pid 16675 thread Xorg:cs0 pid 16896)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x0000800104b43000 from client 0x1b (UTCL2)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: GCVM_L2_
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x1
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x3
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: RW: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:1 pasid:32771, for process Xorg pid 16675 thread Xorg:cs0 pid 16896)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x0000800104b49000 from client 0x1b (UTCL2)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: GCVM_L2_
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x1
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x3
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: RW: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:1 pasid:32771, for process Xorg pid 16675 thread Xorg:cs0 pid 16896)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x0000800104b48000 from client 0x1b (UTCL2)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: GCVM_L2_
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x1
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x3
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: RW: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:1 pasid:32771, for process Xorg pid 16675 thread Xorg:cs0 pid 16896)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x0000800104b49000 from client 0x1b (UTCL2)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: GCVM_L2_
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x1
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x3
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: RW: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:1 pasid:32771, for process Xorg pid 16675 thread Xorg:cs0 pid 16896)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x0000800104b40000 from client 0x1b (UTCL2)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: GCVM_L2_
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x1
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x3
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: RW: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:1 pasid:32771, for process Xorg pid 16675 thread Xorg:cs0 pid 16896)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x0000800104b42000 from client 0x1b (UTCL2)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: GCVM_L2_
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x1
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x3
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: RW: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:1 pasid:32771, for process Xorg pid 16675 thread Xorg:cs0 pid 16896)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x0000800104b7e000 from client 0x1b (UTCL2)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: GCVM_L2_
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x1
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x3
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: RW: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:1 pasid:32771, for process Xorg pid 16675 thread Xorg:cs0 pid 16896)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x0000800104b40000 from client 0x1b (UTCL2)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: GCVM_L2_
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x1
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x3
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: RW: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:1 pasid:32771, for process Xorg pid 16675 thread Xorg:cs0 pid 16896)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x0000800104b42000 from client 0x1b (UTCL2)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: GCVM_L2_
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: TCP (0x8)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x1
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x3
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: RW: 0x0
[...]
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff kernel: [drm:amdgpu_
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:13 Q58-sff /usr/libexec/
jul 23 11:15:23 Q58-sff systemd[1]: systemd-
jul 23 11:15:23 Q58-sff /usr/libexec/
jul 23 11:15:24 Q58-sff /usr/libexec/
jul 23 11:15:24 Q58-sff /usr/libexec/
```
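The fault records in the excerpt above are highly repetitive, so a small script can make the pattern easier to see. The following is a minimal sketch, assuming the log lines keep the exact layout shown in this report. The function name `summarize_faults` and the three regular expressions are illustrative only and are not part of any amdgpu tooling; the sample lines are copied from the excerpt.

```python
import re
from collections import Counter

# Header line naming the hub, ring, and faulting process (layout assumed from this report).
HEAD_RE = re.compile(
    r"\[(\w+)\] page fault \(src_id:(\d+) ring:(\d+) vmid:(\d+) "
    r"pasid:(\d+), for process (\S+) pid (\d+)"
)
# Line giving the start address of the faulting page.
ADDR_RE = re.compile(
    r"in page starting at address (0x[0-9a-fA-F]+) from client (0x[0-9a-fA-F]+)"
)
# Flag lines such as "PERMISSION_FAULTS: 0x3" or "RW: 0x0".
FLAG_RE = re.compile(r"amdgpu: ([A-Z_]+): (0x[0-9a-fA-F]+)\s*$")


def summarize_faults(lines):
    """Count faulting pages, faulting processes, and flag values."""
    pages, processes, flags = Counter(), Counter(), Counter()
    for line in lines:
        m = HEAD_RE.search(line)
        if m:
            processes[(m.group(6), int(m.group(7)))] += 1
            continue
        m = ADDR_RE.search(line)
        if m:
            pages[int(m.group(1), 16)] += 1
            continue
        m = FLAG_RE.search(line)
        if m:
            flags[(m.group(1), int(m.group(2), 16))] += 1
    return pages, processes, flags


# A few lines copied from the journal excerpt in this report.
SAMPLE = """\
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:1 pasid:32771, for process Xorg pid 16675 thread Xorg:cs0 pid 16896)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x0000800104b41000 from client 0x1b (UTCL2)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x3
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: RW: 0x0
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:1 pasid:32771, for process Xorg pid 16675 thread Xorg:cs0 pid 16896)
jul 23 11:15:03 Q58-sff kernel: amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x0000800104b43000 from client 0x1b (UTCL2)
"""

pages, processes, flags = summarize_faults(SAMPLE.splitlines())
for addr, count in sorted(pages.items()):
    print(f"page {addr:#018x}: {count} fault(s)")
for (name, pid), count in processes.items():
    print(f"process {name} (pid {pid}): {count} fault(s)")
for (flag, value), count in sorted(flags.items()):
    print(f"{flag}={value:#x}: seen {count} time(s)")
```

To run this over a full boot, the saved output of `journalctl -b -1 -k` could be passed in line by line instead of `SAMPLE`. In the excerpt above, every fault comes from the same Xorg process and falls in a narrow address range, and a summary like this would show whether that also holds for later freezes.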
Because this happens at random, I am not sure how far it can be investigated. If it becomes more frequent, I will try a mainline kernel and investigate further.
ProblemType: Bug
DistroRelease: Ubuntu 24.04
Package: linux-image-
ProcVersionSign
Uname: Linux 6.8.0-38-generic x86_64
ApportVersion: 2.28.1-0ubuntu3
Architecture: amd64
CRDA: N/A
CasperMD5CheckR
CurrentDesktop: i3
Date: Tue Jul 23 11:54:35 2024
InstallationDate: Installed on 2023-09-01 (325 days ago)
InstallationMedia: Ubuntu 22.04.3 LTS "Jammy Jellyfish" - Release amd64 (20230807.2)
MachineType: ASUS System Product Name
ProcFB: 0 amdgpudrmfb
ProcKernelCmdLine: BOOT_IMAGE=
PulseList: Error: command ['pacmd', 'list'] failed with exit code 1: No PulseAudio daemon running, or not running as session daemon.
RelatedPackageV
linux-
linux-
linux-firmware 20240318.
SourcePackage: linux
StagingDrivers: adis16240 most_i2c fb_bd663474 vt6656_stage adis16203 fb_hx8340bn gb_firmware fb_sh1106 fb_ili9481 vt6655_stage ad9834 fb_ssd1289 fb_ssd1306 gdmtty fb_tinylcd r8712u pi433 sm750fb fb_ili9325 dvb_ttpci fb_upd161704 fb_agm1264k_fl fb_hx8353d gb_audio_codec gb_audio_module ad9832 fb_ssd1351 fb_uc1701 fb_ili9163 fb_hx8357d fb_ssd1331 fb_seps525 fb_ssd1325 gb_audio_gb fb_tls8204 gb_spilib gb_audio_apbridgea r8723bs vme_tsi148 fbtft adt7316 fb_ra8875 fb_uc1611 gb_audio_manager ks7010 adt7316_i2c fb_ili9340 fb_s6d02a1 fb_s6d1121 fb_ili9320 fb_st7735r fb_ssd1305 gdmulte fb_pcd8544 rtllib adt7316_spi fb_st7789v fb_ili9486 ad7816 fb_ili9341 ad5933 fb_hx8347d prism2_usb rts5208 r8192e_pci
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 08/25/2023
dmi.bios.release: 16.54
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 1654
dmi.board.
dmi.board.name: ROG STRIX B650E-I GAMING WIFI
dmi.board.vendor: ASUSTeK COMPUTER INC.
dmi.board.version: Rev 1.xx
dmi.chassis.
dmi.chassis.type: 3
dmi.chassis.vendor: Default string
dmi.chassis.
dmi.modalias: dmi:bvnAmerican
dmi.product.family: To be filled by O.E.M.
dmi.product.name: System Product Name
dmi.product.sku: SKU
dmi.product.
dmi.sys.vendor: ASUS
Changed in linux (Ubuntu Noble): status: New → Incomplete
It is unclear whether amdgpu recovers and Xorg then crashes, which would make this an Xorg bug, or whether amdgpu itself ends up in a bad state.