[amdgpu] Xorg freeze

Bug #1888193 reported by Maksym Wezdecki
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Confirmed
Undecided
Unassigned
xserver-xorg-video-amdgpu (Ubuntu)
New
Undecided
Unassigned

Bug Description

I have Radeon Pro WX 7100. It hangs twice a day.

I narrow-downed this issue to the fan speed lock. After reboot, it always runs on 18%, no matter of the load. If I install Radeon Profile and force fan to 100% and then turn auto fan configuration, then fan reacts on the load.

ProblemType: Bug
DistroRelease: Ubuntu 18.04
Package: xorg 1:7.7+19ubuntu7.1
ProcVersionSignature: Ubuntu 5.3.0-62.56~18.04.1-generic 5.3.18
Uname: Linux 5.3.0-62-generic x86_64
ApportVersion: 2.20.9-0ubuntu7.15
Architecture: amd64
BootLog: Error: [Errno 13] Permission denied: '/var/log/boot.log'
CompositorRunning: None
CurrentDesktop: ubuntu:GNOME
Date: Mon Jul 20 09:15:03 2020
DistUpgraded: Fresh install
DistroCodename: bionic
DistroVariant: ubuntu
ExtraDebuggingInterest: Yes
GpuHangFrequency: Very infrequently
GraphicsCard:
 Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere [Radeon Pro WX 7100] [1002:67c4] (prog-if 00 [VGA controller])
   Subsystem: Advanced Micro Devices, Inc. [AMD/ATI] Ellesmere [Radeon Pro WX 7100] [1002:0b0d]
InstallationDate: Installed on 2020-06-29 (20 days ago)
InstallationMedia: Ubuntu 18.04.3 LTS "Bionic Beaver" - Release amd64 (20190805)
MachineType: System manufacturer System Product Name
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-5.3.0-62-generic root=UUID=2b68d049-e85f-47ab-97d9-5632f225b71b ro quiet splash vt.handoff=1
SourcePackage: xorg
Symptom: display
Title: Xorg freeze
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 06/17/2020
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 7803
dmi.board.asset.tag: Default string
dmi.board.name: ROG CROSSHAIR VI HERO (WI-FI AC)
dmi.board.vendor: ASUSTeK COMPUTER INC.
dmi.board.version: Rev 1.xx
dmi.chassis.asset.tag: Default string
dmi.chassis.type: 3
dmi.chassis.vendor: Default string
dmi.chassis.version: Default string
dmi.modalias: dmi:bvnAmericanMegatrendsInc.:bvr7803:bd06/17/2020:svnSystemmanufacturer:pnSystemProductName:pvrSystemVersion:rvnASUSTeKCOMPUTERINC.:rnROGCROSSHAIRVIHERO(WI-FIAC):rvrRev1.xx:cvnDefaultstring:ct3:cvrDefaultstring:
dmi.product.family: To be filled by O.E.M.
dmi.product.name: System Product Name
dmi.product.sku: SKU
dmi.product.version: System Version
dmi.sys.vendor: System manufacturer
version.compiz: compiz N/A
version.libdrm2: libdrm2 2.4.101-2~18.04.1
version.libgl1-mesa-dri: libgl1-mesa-dri 20.0.8-0ubuntu1~18.04.1
version.libgl1-mesa-glx: libgl1-mesa-glx 20.0.8-0ubuntu1~18.04.1
version.xserver-xorg-core: xserver-xorg-core N/A
version.xserver-xorg-input-evdev: xserver-xorg-input-evdev N/A
version.xserver-xorg-video-ati: xserver-xorg-video-ati N/A
version.xserver-xorg-video-intel: xserver-xorg-video-intel N/A
version.xserver-xorg-video-nouveau: xserver-xorg-video-nouveau N/A

Revision history for this message
Maksym Wezdecki (maksym-wezdecki-c) wrote :
Revision history for this message
Daniel van Vugt (vanvugt) wrote :

This looks like a hardware or driver problem with the Radeon Pro WX 7100. Your logs show two issues:

1. Xorg logs that the monitor is being repeatedly redetected. So it sounds like the monitor is not plugged in properly. Maybe try a new monitor cable.

2. The kernel is logging more serious issues:

[ 107.569355] amdgpu 0000:0a:00.0: GPU fault detected: 146 0x0c38480c for process mksReplay pid 2764 thread main-mks pid 2772
[ 107.569358] amdgpu 0000:0a:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0030C387
[ 107.569359] amdgpu 0000:0a:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0804800C
[ 107.569360] amdgpu 0000:0a:00.0: VM fault (0x0c, vmid 4, pasid 32785) at page 3195783, read from 'TC4' (0x54433400) (72)
[ 107.569365] amdgpu 0000:0a:00.0: GPU fault detected: 146 0x0c38880c for process mksReplay pid 2764 thread main-mks pid 2772
[ 107.569366] amdgpu 0000:0a:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0030C38E
[ 107.569367] amdgpu 0000:0a:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0808400C
[ 107.569368] amdgpu 0000:0a:00.0: VM fault (0x0c, vmid 4, pasid 32785) at page 3195790, read from 'TC7' (0x54433700) (132)
[ 107.597584] amdgpu 0000:0a:00.0: GPU fault detected: 146 0x0c38480c for process mksReplay pid 2764 thread main-mks pid 2772
[ 107.597587] amdgpu 0000:0a:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0030C387
[ 107.597588] amdgpu 0000:0a:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E04800C
[ 107.597590] amdgpu 0000:0a:00.0: VM fault (0x0c, vmid 7, pasid 32785) at page 3195783, read from 'TC4' (0x54433400) (72)
[ 107.597595] amdgpu 0000:0a:00.0: GPU fault detected: 146 0x0c38880c for process mksReplay pid 2764 thread main-mks pid 2772
[ 107.597596] amdgpu 0000:0a:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0030C38D
[ 107.597597] amdgpu 0000:0a:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E04400C
[ 107.597598] amdgpu 0000:0a:00.0: VM fault (0x0c, vmid 7, pasid 32785) at page 3195789, read from 'TC5' (0x54433500) (68)
[

tags: added: amdgpu
summary: - Xorg freeze
+ [amdgpu] Xorg freeze
affects: xorg (Ubuntu) → xserver-xorg-video-amdgpu (Ubuntu)
Revision history for this message
Ubuntu Kernel Bot (ubuntu-kernel-bot) wrote : Status changed to Confirmed

This change was made by a bot.

Changed in linux (Ubuntu):
status: New → Confirmed
Revision history for this message
Maksym Wezdecki (maksym-wezdecki-c) wrote : Re: [Bug 1888193] Re: Xorg freeze
Download full text (5.9 KiB)

Issues in dmesg in kernel are my fault - I'm developing app, which faults
GPU.

Have you read my description of the narrowed down issue?

pon., 20 lip 2020 o 09:50 Daniel van Vugt <email address hidden>
napisał(a):

> This looks like a hardware or driver problem with the Radeon Pro WX
> 7100. Your logs show two issues:
>
> 1. Xorg logs that the monitor is being repeatedly redetected. So it
> sounds like the monitor is not plugged in properly. Maybe try a new
> monitor cable.
>
> 2. The kernel is logging more serious issues:
>
> [ 107.569355] amdgpu 0000:0a:00.0: GPU fault detected: 146 0x0c38480c for
> process mksReplay pid 2764 thread main-mks pid 2772
> [ 107.569358] amdgpu 0000:0a:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR
> 0x0030C387
> [ 107.569359] amdgpu 0000:0a:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS
> 0x0804800C
> [ 107.569360] amdgpu 0000:0a:00.0: VM fault (0x0c, vmid 4, pasid 32785)
> at page 3195783, read from 'TC4' (0x54433400) (72)
> [ 107.569365] amdgpu 0000:0a:00.0: GPU fault detected: 146 0x0c38880c for
> process mksReplay pid 2764 thread main-mks pid 2772
> [ 107.569366] amdgpu 0000:0a:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR
> 0x0030C38E
> [ 107.569367] amdgpu 0000:0a:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS
> 0x0808400C
> [ 107.569368] amdgpu 0000:0a:00.0: VM fault (0x0c, vmid 4, pasid 32785)
> at page 3195790, read from 'TC7' (0x54433700) (132)
> [ 107.597584] amdgpu 0000:0a:00.0: GPU fault detected: 146 0x0c38480c for
> process mksReplay pid 2764 thread main-mks pid 2772
> [ 107.597587] amdgpu 0000:0a:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR
> 0x0030C387
> [ 107.597588] amdgpu 0000:0a:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS
> 0x0E04800C
> [ 107.597590] amdgpu 0000:0a:00.0: VM fault (0x0c, vmid 7, pasid 32785)
> at page 3195783, read from 'TC4' (0x54433400) (72)
> [ 107.597595] amdgpu 0000:0a:00.0: GPU fault detected: 146 0x0c38880c for
> process mksReplay pid 2764 thread main-mks pid 2772
> [ 107.597596] amdgpu 0000:0a:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR
> 0x0030C38D
> [ 107.597597] amdgpu 0000:0a:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS
> 0x0E04400C
> [ 107.597598] amdgpu 0000:0a:00.0: VM fault (0x0c, vmid 7, pasid 32785)
> at page 3195789, read from 'TC5' (0x54433500) (68)
> [
>
> ** Tags added: amdgpu
>
> ** Summary changed:
>
> - Xorg freeze
> + [amdgpu] Xorg freeze
>
> ** Package changed: xorg (Ubuntu) => xserver-xorg-video-amdgpu (Ubuntu)
>
> ** Also affects: linux (Ubuntu)
> Importance: Undecided
> Status: New
>
> --
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/1888193
>
> Title:
> [amdgpu] Xorg freeze
>
> Status in linux package in Ubuntu:
> New
> Status in xserver-xorg-video-amdgpu package in Ubuntu:
> New
>
> Bug description:
> I have Radeon Pro WX 7100. It hangs twice a day.
>
> I narrow-downed this issue to the fan speed lock. After reboot, it
> always runs on 18%, no matter of the load. If I install Radeon Profile
> and force fan to 100% and then turn auto fan configuration, then fan
> reacts on the load.
>
> ProblemType: Bug
> DistroRelease: Ubuntu 18.04
> ...

Read more...

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.