amdgpu crash in dp_set_panel_mode from perform_link_training_with_retries

Bug #1956525 reported by Jof Thibaut
This bug affects 1 person
Affects: linux-hwe-5.11 (Ubuntu)
Status: Confirmed
Importance: Undecided
Assigned to: Unassigned
Milestone:

Bug Description

From time to time, after waking the screen on Ubuntu Budgie, the refresh rate drops from 120 Hz to 48 Hz and the screen then flashes.

When this happens, dmesg shows the following output:

[12947.570681] WARNING: CPU: 6 PID: 891 at drivers/gpu/drm/amd/amdgpu/../display/dc/core/dc_link_dp.c:4282 dp_set_panel_mode+0xbb/0xd0 [amdgpu]
[12947.571031] Modules linked in: ses enclosure scsi_transport_sas uas usb_storage btrfs blake2b_generic xor raid6_pq ufs qnx4 hfsplus hfs minix ntfs msdos jfs xfs libcrc32c nls_iso8859_1 nvidia_uvm(POE) nvidia_drm(POE) nvidia_modeset(POE) intel_rapl_msr intel_rapl_common nvidia(POE) edac_mce_amd snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi snd_hda_intel snd_intel_dspcfg soundwire_intel soundwire_generic_allocation soundwire_cadence snd_hda_codec snd_hda_core snd_hwdep soundwire_bus snd_soc_core snd_compress ac97_bus snd_pcm_dmaengine snd_pcm snd_seq_midi snd_seq_midi_event snd_rawmidi kvm_amd amdgpu snd_seq snd_seq_device iommu_v2 gpu_sched snd_timer drm_ttm_helper ttm kvm snd drm_kms_helper crct10dif_pclmul ghash_clmulni_intel cec rc_core joydev asus_nb_wmi hid_multitouch efi_pstore i2c_algo_bit wmi_bmof aesni_intel fb_sys_fops crypto_simd syscopyarea cryptd sysfillrect glue_helper k10temp soundcore sysimgblt input_leds ccp rapl serio_raw asus_wireless
[12947.571130] mac_hid sch_fq_codel msr parport_pc ppdev lp drm parport ip_tables x_tables autofs4 hid_logitech_hidpp hid_logitech_dj usbhid hid_generic mfd_aaeon asus_wmi sparse_keymap crc32_pclmul nvme r8169 ahci i2c_hid xhci_pci xhci_pci_renesas nvme_core libahci i2c_piix4 realtek wmi video hid
[12947.571172] CPU: 6 PID: 891 Comm: Xorg Tainted: P OE 5.11.0-44-generic #48~20.04.2-Ubuntu
[12947.571178] Hardware name: ASUSTeK COMPUTER INC. TUF Gaming FX505DT_FX505DT/FX505DT, BIOS FX505DT.316 01/28/2021
[12947.571181] RIP: 0010:dp_set_panel_mode+0xbb/0xd0 [amdgpu]
[12947.571538] Code: fe b9 01 00 00 00 48 8d 55 e7 48 89 df 44 09 e0 be 0a 01 00 00 44 89 45 dc 88 45 e7 e8 8e b8 ff ff 44 8b 45 dc 83 f8 01 74 a1 <0f> 0b eb 9d e8 7c 99 3b d5 66 66 2e 0f 1f 84 00 00 00 00 00 90 0f
[12947.571543] RSP: 0018:ffffa3d881a17660 EFLAGS: 00010282
[12947.571548] RAX: 00000000ffffffff RBX: ffff9841d0ab9800 RCX: 000000000000010a
[12947.571551] RDX: ffff9841c3a0eba0 RSI: ffffffffc079367b RDI: 0000000000000100
[12947.571554] RBP: ffffa3d881a17688 R08: 0000000000000001 R09: 00000000fffffffb
[12947.571556] R10: 0000000000000001 R11: 0000000000000000 R12: 0000000000000001
[12947.571559] R13: ffffa3d881a176e8 R14: 0000000000000001 R15: ffff9841d0ab9800
[12947.571562] FS: 00007f49db6fda40(0000) GS:ffff9844cf380000(0000) knlGS:0000000000000000
[12947.571572] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[12947.571575] CR2: 00007f6f24003f88 CR3: 000000010a030000 CR4: 00000000003506e0
[12947.571579] Call Trace:
[12947.571586] perform_link_training_with_retries+0xa7/0x140 [amdgpu]
[12947.571920] enable_link_dp+0xe2/0x230 [amdgpu]
[12947.572255] core_link_enable_stream+0x699/0x830 [amdgpu]
[12947.572588] dce110_apply_ctx_to_hw+0x5c1/0x600 [amdgpu]
[12947.572923] ? dcn10_verify_allow_pstate_change_high+0x22/0x350 [amdgpu]
[12947.573275] ? dcn10_wait_for_mpcc_disconnect+0x43/0x140 [amdgpu]
[12947.573626] dc_commit_state_no_check+0x28f/0xa00 [amdgpu]
[12947.573961] dc_commit_state+0x96/0xb0 [amdgpu]
[12947.574295] amdgpu_dm_atomic_commit_tail+0x56d/0x1510 [amdgpu]
[12947.574643] ? ktime_get_raw+0x34/0x90
[12947.574652] ? ttm_bo_mem_compat+0x28/0x60 [ttm]
[12947.574663] ? ttm_bo_validate+0x4c/0x160 [ttm]
[12947.574676] ? dm_plane_helper_prepare_fb+0x1b4/0x270 [amdgpu]
[12947.575021] ? _cond_resched+0x19/0x30
[12947.575029] ? wait_for_completion_timeout+0x3a/0x100
[12947.575033] ? _cond_resched+0x19/0x30
[12947.575039] commit_tail+0x99/0x130 [drm_kms_helper]
[12947.575068] drm_atomic_helper_commit+0x123/0x150 [drm_kms_helper]
[12947.575096] drm_atomic_commit+0x4a/0x50 [drm]
[12947.575149] drm_atomic_helper_set_config+0x7c/0xc0 [drm_kms_helper]
[12947.575177] drm_mode_setcrtc+0x20d/0x7e0 [drm]
[12947.575223] ? drm_mode_getcrtc+0x190/0x190 [drm]
[12947.575267] drm_ioctl_kernel+0xae/0xf0 [drm]
[12947.575308] drm_ioctl+0x257/0x410 [drm]
[12947.575349] ? drm_mode_getcrtc+0x190/0x190 [drm]
[12947.575395] amdgpu_drm_ioctl+0x4e/0x80 [amdgpu]
[12947.575665] __x64_sys_ioctl+0x91/0xc0
[12947.575675] do_syscall_64+0x38/0x90
[12947.575683] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[12947.575691] RIP: 0033:0x7f49dba5d50b
[12947.575696] Code: 0f 1e fa 48 8b 05 85 39 0d 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 0f 1f 44 00 00 f3 0f 1e fa b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 55 39 0d 00 f7 d8 64 89 01 48
[12947.575701] RSP: 002b:00007fffbb15c298 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
[12947.575705] RAX: ffffffffffffffda RBX: 00007fffbb15c2d0 RCX: 00007f49dba5d50b
[12947.575708] RDX: 00007fffbb15c2d0 RSI: 00000000c06864a2 RDI: 000000000000000e
[12947.575710] RBP: 00000000c06864a2 R08: 0000000000000000 R09: 00005642faffaf80
[12947.575712] R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
[12947.575715] R13: 000000000000000e R14: 00005642f92c76e0 R15: 0000000000000000
[12947.575719] ---[ end trace de0be81b276e808a ]---
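
For anyone triaging a trace like the one above, the following is a minimal Python sketch of how the WARNING header and the call-trace frames could be pulled out of dmesg text. The regular expressions, the function names `parse_warning` and `parse_frames`, and the shortened `SAMPLE` are illustrative assumptions, not part of any kernel or Ubuntu tool. Frames prefixed with `?` are ones the kernel found on the stack but could not confirm as part of the call chain, so the sketch flags them as unreliable.

```python
import re

# A few lines copied from the trace above (module list and registers omitted).
SAMPLE = """\
[12947.570681] WARNING: CPU: 6 PID: 891 at drivers/gpu/drm/amd/amdgpu/../display/dc/core/dc_link_dp.c:4282 dp_set_panel_mode+0xbb/0xd0 [amdgpu]
[12947.571579] Call Trace:
[12947.571586] perform_link_training_with_retries+0xa7/0x140 [amdgpu]
[12947.571920] enable_link_dp+0xe2/0x230 [amdgpu]
[12947.572923] ? dcn10_verify_allow_pstate_change_high+0x22/0x350 [amdgpu]
[12947.575665] __x64_sys_ioctl+0x91/0xc0
[12947.575719] ---[ end trace de0be81b276e808a ]---
"""

# Hypothetical patterns written for this log layout only.
WARNING_RE = re.compile(
    r"WARNING: CPU: (?P<cpu>\d+) PID: (?P<pid>\d+) at "
    r"(?P<file>\S+):(?P<line>\d+) (?P<func>\w+)\+0x[0-9a-f]+/0x[0-9a-f]+"
)
FRAME_RE = re.compile(
    r"^\[\s*\d+\.\d+\]\s+(?P<unreliable>\? )?(?P<func>[\w.]+)"
    r"\+0x[0-9a-f]+/0x[0-9a-f]+(?: \[(?P<module>\w+)\])?\s*$"
)


def parse_warning(text):
    """Return CPU, PID, source file, line and function of the first WARNING."""
    match = WARNING_RE.search(text)
    return match.groupdict() if match else None


def parse_frames(text):
    """Return (function, module, reliable) tuples between 'Call Trace:' and the end marker."""
    frames = []
    in_trace = False
    for line in text.splitlines():
        if "Call Trace:" in line:
            in_trace = True
            continue
        if "---[ end trace" in line:
            break
        if not in_trace:
            continue
        match = FRAME_RE.match(line)
        if match:
            reliable = match["unreliable"] is None
            frames.append((match["func"], match["module"], reliable))
    return frames


if __name__ == "__main__":
    print(parse_warning(SAMPLE))
    for func, module, reliable in parse_frames(SAMPLE):
        print(func, module or "(core kernel)", "" if reliable else "(unreliable)")
```

Keeping only the reliable frames gives the chain named in the bug title: `perform_link_training_with_retries` calls into `dp_set_panel_mode`, where the warning at `dc_link_dp.c:4282` fires.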

ProblemType: Bug
DistroRelease: Ubuntu 20.04
Package: xorg 1:7.7+19ubuntu14
ProcVersionSignature: Ubuntu 5.11.0-44.48~20.04.2-generic 5.11.22
Uname: Linux 5.11.0-44-generic x86_64
NonfreeKernelModules: nvidia_modeset nvidia
.proc.driver.nvidia.capabilities.gpu0: Error: path was not a regular file.
.proc.driver.nvidia.capabilities.mig: Error: path was not a regular file.
.proc.driver.nvidia.gpus.0000.01.00.0: Error: path was not a regular file.
.proc.driver.nvidia.registry: Binary: ""
.proc.driver.nvidia.suspend: suspend hibernate resume
.proc.driver.nvidia.suspend_depth: default modeset uvm
.proc.driver.nvidia.version:
 NVRM version: NVIDIA UNIX x86_64 Kernel Module 495.46 Wed Oct 27 16:31:33 UTC 2021
 GCC version: gcc version 9.3.0 (Ubuntu 9.3.0-17ubuntu1~20.04)
ApportVersion: 2.20.11-0ubuntu27.21
Architecture: amd64
BootLog: Error: [Errno 13] Permission denied: '/var/log/boot.log'
CasperMD5CheckResult: skip
CompositorRunning: None
CurrentDesktop: Budgie:GNOME
Date: Wed Jan 5 21:26:45 2022
DistUpgraded: Fresh install
DistroCodename: focal
DistroVariant: ubuntu
DkmsStatus:
 nvidia, 495.46, 5.11.0-43-generic, x86_64: installed
 nvidia, 495.46, 5.11.0-44-generic, x86_64: installed
ExtraDebuggingInterest: Yes
GraphicsCard:
 NVIDIA Corporation TU117M [GeForce GTX 1650 Mobile / Max-Q] [10de:1f91] (rev a1) (prog-if 00 [VGA controller])
   Subsystem: ASUSTeK Computer Inc. TU117M [GeForce GTX 1650 Mobile / Max-Q] [1043:109f]
 Advanced Micro Devices, Inc. [AMD/ATI] Picasso [1002:15d8] (rev c2) (prog-if 00 [VGA controller])
   Subsystem: ASUSTeK Computer Inc. Picasso [1043:18f1]
InstallationDate: Installed on 2022-01-04 (1 days ago)
InstallationMedia: Ubuntu-Budgie 20.04.3 LTS "Focal Fossa" - Release amd64 (20210819)
Lsusb:
 Bus 004 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
 Bus 003 Device 002: ID 046d:c52f Logitech, Inc. Unifying Receiver
 Bus 003 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
 Bus 002 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
 Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
MachineType: ASUSTeK COMPUTER INC. TUF Gaming FX505DT_FX505DT
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-5.11.0-44-generic root=UUID=24438010-da63-4033-9574-1003c41ebf21 ro quiet splash vt.handoff=7
SourcePackage: xorg
Symptom: display
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 01/28/2021
dmi.bios.release: 5.14
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: FX505DT.316
dmi.board.asset.tag: ATN12345678901234567
dmi.board.name: FX505DT
dmi.board.vendor: ASUSTeK COMPUTER INC.
dmi.board.version: 1.0
dmi.chassis.asset.tag: No Asset Tag
dmi.chassis.type: 10
dmi.chassis.vendor: ASUSTeK COMPUTER INC.
dmi.chassis.version: 1.0
dmi.ec.firmware.release: 16.0
dmi.modalias: dmi:bvnAmericanMegatrendsInc.:bvrFX505DT.316:bd01/28/2021:br5.14:efr16.0:svnASUSTeKCOMPUTERINC.:pnTUFGamingFX505DT_FX505DT:pvr1.0:sku:rvnASUSTeKCOMPUTERINC.:rnFX505DT:rvr1.0:cvnASUSTeKCOMPUTERINC.:ct10:cvr1.0:
dmi.product.family: TUF Gaming
dmi.product.name: TUF Gaming FX505DT_FX505DT
dmi.product.version: 1.0
dmi.sys.vendor: ASUSTeK COMPUTER INC.
version.compiz: compiz N/A
version.libdrm2: libdrm2 2.4.105-3~20.04.2
version.libgl1-mesa-dri: libgl1-mesa-dri 21.0.3-0ubuntu0.3~20.04.5
version.libgl1-mesa-glx: libgl1-mesa-glx N/A
version.nvidia-graphics-drivers: nvidia-graphics-drivers-* N/A
version.xserver-xorg-core: xserver-xorg-core 2:1.20.13-1ubuntu1~20.04.2
version.xserver-xorg-input-evdev: xserver-xorg-input-evdev N/A
version.xserver-xorg-video-ati: xserver-xorg-video-ati 1:19.1.0-1
version.xserver-xorg-video-intel: xserver-xorg-video-intel 2:2.99.917+git20200226-1
version.xserver-xorg-video-nouveau: xserver-xorg-video-nouveau 1:1.0.16-1

Revision history for this message
Jof Thibaut (jt33) wrote :
summary: - Bug with refresh rate
+ amdgpu crash in perform_link_training_with_retries
summary: - amdgpu crash in perform_link_training_with_retries
+ amdgpu crash in dp_set_panel_mode from
+ perform_link_training_with_retries
affects: xorg (Ubuntu) → linux (Ubuntu)
tags: added: amdgpu
Ubuntu Kernel Bot (ubuntu-kernel-bot) wrote : Status changed to Confirmed

This change was made by a bot.

Changed in linux (Ubuntu):
status: New → Confirmed
tags: added: hirsute
affects: linux (Ubuntu) → linux-hwe-5.11 (Ubuntu)
Jof Thibaut (jt33) wrote :

After updating the kernel to version 5.13.0-27, the problem no longer appears. Thank you very much for backporting this kernel.
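
To check whether a machine is already on a kernel at least as new as the one reported good here, a small Python sketch like the following could compare Ubuntu kernel release strings. The helper names `parse_ubuntu_kernel` and `at_least_reported_good` are hypothetical, and the 5.13.0-27 threshold comes only from the comment above; the report does not identify which change resolved the warning, so a newer version number is a hint rather than a guarantee.

```python
import platform
import re


def parse_ubuntu_kernel(release):
    """Turn a release such as '5.11.0-44-generic' into (5, 11, 0, 44)."""
    match = re.match(r"(\d+)\.(\d+)\.(\d+)-(\d+)", release)
    if match is None:
        raise ValueError(f"unrecognised kernel release: {release!r}")
    return tuple(int(part) for part in match.groups())


# Version the reporter says no longer shows the problem (from the comment above).
REPORTED_GOOD = parse_ubuntu_kernel("5.13.0-27")


def at_least_reported_good(release):
    """True if the release is 5.13.0-27 or newer by numeric comparison."""
    return parse_ubuntu_kernel(release) >= REPORTED_GOOD


if __name__ == "__main__":
    running = platform.release()  # same string as `uname -r`
    try:
        print(running, at_least_reported_good(running))
    except ValueError as err:
        # Non-Ubuntu kernels may use a different release format.
        print(err)
```

Comparing integer tuples rather than raw strings avoids the pitfall where "5.9" would sort after "5.11" lexically.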
