Laptop screen blinked, no tty works and 200 dmesg errors per second

Bug #2033967 reported by Leonardo Müller
This bug affects 1 person
Affects                  Status        Importance  Assigned to  Milestone
Mesa                     Fix Released  Unknown
linux-hwe-6.2 (Ubuntu)   Invalid       Undecided   Unassigned
  Jammy                  Invalid       Undecided   Unassigned
  Lunar                  Invalid       Undecided   Unassigned
  Mantic                 Invalid       Undecided   Unassigned
mesa (Ubuntu)            Fix Released  Undecided   Unassigned
  Jammy                  New           Undecided   Unassigned
  Lunar                  Won't Fix     Undecided   Unassigned
  Mantic                 Fix Released  Undecided   Unassigned

Bug Description

I was using my laptop when the screen suddenly blinked. After that, I could still move the mouse pointer, but nothing responded to clicks or keyboard input. Switching to a TTY showed a black screen, but the text cursor was not blinking and nothing happened.

I accessed the laptop via SSH and reloaded xfwm4 with:

screen env DISPLAY=:0 xfwm4 --vblank=xpresent --replace

After that, the graphical interface started working again. However, no TTY is accessible and dmesg logs about 200 errors per second. These are the errors that repeat endlessly:

[43959.444682] amdgpu 0000:06:00.0: amdgpu: couldn't schedule ib on ring <vcn_dec>
[43959.444688] [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22)
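
For anyone who wants to measure the error rate on their own machine, here is a minimal sketch that counts matching dmesg lines per second of uptime. The function name `errors_per_second` and the default search string are my own choices for illustration; the two sample lines are the ones quoted above. On a live system the input would be saved dmesg output instead of the sample list.

```python
import re
from collections import Counter

# Matches the dmesg timestamp prefix, e.g. "[43959.444682] ".
TIMESTAMP = re.compile(r"^\[\s*(\d+)\.(\d+)\]\s")


def errors_per_second(lines, needle="Error scheduling IBs"):
    """Count lines containing `needle`, grouped by whole second of uptime."""
    counts = Counter()
    for line in lines:
        match = TIMESTAMP.match(line)
        if match and needle in line:
            counts[int(match.group(1))] += 1
    return counts


# The two lines from this report; a real run would read many more.
sample = [
    "[43959.444682] amdgpu 0000:06:00.0: amdgpu: couldn't schedule ib on ring <vcn_dec>",
    "[43959.444688] [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22)",
]

for second, count in sorted(errors_per_second(sample).items()):
    print(f"uptime {second}s: {count} matching line(s)")
```

Each failed submission appears to log two lines (the ring message and the IB error), so counting only one of them avoids doubling the rate.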

And while I was writing this report, another problem happened:

[44566.016330] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring vcn_enc0 timeout, signaled seq=2, emitted seq=6
[44566.016553] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process obs pid 38687 thread obs:cs0 pid 38694
[44566.016748] amdgpu 0000:06:00.0: amdgpu: GPU reset begin!
[44566.317039] [drm] Register(0) [mmUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002
[44566.525640] [drm] Register(0) [mmUVD_RB_RPTR] failed to reach value 0x00000100 != 0x00000000
[44566.728577] [drm] Register(0) [mmUVD_POWER_STATUS] failed to reach value 0x00000001 != 0x00000002
[44566.731204] ------------[ cut here ]------------
[44566.731205] WARNING: CPU: 15 PID: 29333 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:600 amdgpu_irq_put+0xa4/0xc0 [amdgpu]
[44566.731429] Modules linked in: tls wireguard curve25519_x86_64 libchacha20poly1305 chacha_x86_64 poly1305_x86_64 libcurve25519_generic libchacha ip6_udp_tunnel udp_tunnel nvme_fabrics veth bridge stp llc zfs(PO) zunicode(PO) zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) rfcomm snd_seq_dummy snd_hrtimer vhost_vsock vmw_vsock_virtio_transport_common vhost vhost_iotlb vsock cmac algif_hash algif_skcipher af_alg bnep zram binfmt_misc nls_iso8859_1 nft_masq snd_sof_amd_rembrandt snd_sof_amd_renoir snd_hda_codec_realtek snd_sof_amd_acp snd_hda_codec_generic snd_sof_pci ledtrig_audio snd_sof_xtensa_dsp snd_hda_codec_hdmi snd_sof snd_hda_intel snd_sof_utils snd_intel_dspcfg snd_soc_core snd_intel_sdw_acpi snd_compress nft_limit snd_hda_codec intel_rapl_msr ac97_bus intel_rapl_common snd_hda_core snd_pcm_dmaengine snd_hwdep edac_mce_amd snd_pci_ps btusb snd_seq_midi btrtl snd_rpl_pci_acp6x kvm_amd snd_seq_midi_event btbcm snd_acp_pci btintel btmtk snd_rawmidi uvcvideo
[44566.731465] snd_pci_acp6x nf_log_syslog rtw89_8852ae kvm videobuf2_vmalloc bluetooth snd_pcm videobuf2_memops snd_seq rtw89_8852a irqbypass snd_pci_acp5x videobuf2_v4l2 ecdh_generic snd_seq_device snd_rn_pci_acp3x joydev nft_log input_leds rapl videobuf2_common serio_raw snd_timer wmi_bmof hid_multitouch ecc snd_acp_config rtw89_pci snd_soc_acpi snd k10temp rtw89_core snd_pci_acp3x soundcore ideapad_laptop ccp sparse_keymap platform_profile mac_hid nft_ct nvidia_uvm(PO) nft_chain_nat nf_nat sch_fq_codel nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 cuse msr parport_pc nf_tables ppdev lp nfnetlink parport ramoops pstore_blk reed_solomon pstore_zone efi_pstore autofs4 btrfs blake2b_generic dm_crypt raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear overlay v4l2loopback(O) videodev mc virt_wifi virtio_net net_failover failover virtio_gpu virtio_dma_buf drm_shmem_helper z3fold rndis_host cdc_ether usbnet bfq ath10k_pci
[44566.731504] ath10k_core ath mac80211 libarc4 cfg80211 usb_storage nbd mii nvidia_drm(PO) nvidia_modeset(PO) usbhid nvidia(PO) amdgpu iommu_v2 drm_buddy gpu_sched i2c_algo_bit drm_ttm_helper ttm drm_display_helper cec rc_core drm_kms_helper crct10dif_pclmul syscopyarea crc32_pclmul polyval_clmulni sysfillrect polyval_generic hid_generic nvme sysimgblt ghash_clmulni_intel drm sha512_ssse3 aesni_intel i2c_hid_acpi crypto_simd xhci_pci nvme_core cryptd r8169 video i2c_piix4 i2c_hid xhci_pci_renesas nvme_common realtek wmi hid
[44566.731529] CPU: 15 PID: 29333 Comm: kworker/u32:1 Tainted: P W O 6.2.0-31-generic #31~22.04.1-Ubuntu
[44566.731531] Hardware name: LENOVO 82MJ/LNVNB161216, BIOS H3CN36WW(V2.06) 09/30/2022
[44566.731533] Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
[44566.731539] RIP: 0010:amdgpu_irq_put+0xa4/0xc0 [amdgpu]
[44566.731702] Code: 31 f6 31 ff c3 cc cc cc cc 44 89 ea 4c 89 e6 4c 89 f7 e8 8f fc ff ff 5b 41 5c 41 5d 41 5e 5d 31 d2 31 f6 31 ff c3 cc cc cc cc <0f> 0b b8 ea ff ff ff eb c3 b8 ea ff ff ff eb bc b8 fe ff ff ff eb
[44566.731704] RSP: 0018:ffffb149228cfbf8 EFLAGS: 00010246
[44566.731705] RAX: 0000000000000000 RBX: 0000000000000001 RCX: 0000000000000000
[44566.731706] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[44566.731707] RBP: ffffb149228cfc18 R08: 0000000000000000 R09: 0000000000000000
[44566.731708] R10: 0000000000000000 R11: 0000000000000000 R12: ffff960784e30370
[44566.731709] R13: 0000000000000000 R14: ffff960784e20000 R15: ffff960784e20000
[44566.731709] FS: 0000000000000000(0000) GS:ffff9612001c0000(0000) knlGS:0000000000000000
[44566.731711] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[44566.731712] CR2: 00007fb3ad1690ec CR3: 0000000214852000 CR4: 0000000000750ee0
[44566.731713] PKRU: 55555554
[44566.731714] Call Trace:
[44566.731715] <TASK>
[44566.731718] sdma_v4_0_hw_fini+0x41/0xc0 [amdgpu]
[44566.731888] sdma_v4_0_suspend+0x2c/0x60 [amdgpu]
[44566.732047] amdgpu_device_ip_suspend_phase2+0x25d/0x490 [amdgpu]
[44566.732184] amdgpu_device_ip_suspend+0x41/0x80 [amdgpu]
[44566.732336] amdgpu_device_pre_asic_reset+0xd6/0x4a0 [amdgpu]
[44566.732490] amdgpu_device_gpu_recover+0x49f/0xa20 [amdgpu]
[44566.732663] amdgpu_job_timedout+0x13a/0x200 [amdgpu]
[44566.732857] drm_sched_job_timedout+0x6d/0x120 [gpu_sched]
[44566.732862] process_one_work+0x21f/0x440
[44566.732865] worker_thread+0x50/0x3f0
[44566.732867] ? __pfx_worker_thread+0x10/0x10
[44566.732869] kthread+0xee/0x120
[44566.732872] ? __pfx_kthread+0x10/0x10
[44566.732874] ret_from_fork+0x2c/0x50
[44566.732877] </TASK>
[44566.732878] ---[ end trace 0000000000000000 ]---
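
The first two lines of the log above already say which ring timed out and which process submitted the job. A rough sketch for pulling those fields out of a saved log follows. The regular expressions are written against the exact wording of these two lines and are an assumption about the message format in general; process names containing spaces would not match.

```python
import re

# Patterns modelled on the two amdgpu_job_timedout lines in this report.
RING_TIMEOUT = re.compile(
    r"ring (?P<ring>\S+) timeout, "
    r"signaled seq=(?P<signaled>\d+), emitted seq=(?P<emitted>\d+)"
)
PROCESS_INFO = re.compile(
    r"Process information: process (?P<process>\S+) pid (?P<pid>\d+) "
    r"thread (?P<thread>\S+) pid (?P<tid>\d+)"
)


def summarize_timeout(lines):
    """Collect ring and process fields from amdgpu timeout messages."""
    summary = {}
    for line in lines:
        for pattern in (RING_TIMEOUT, PROCESS_INFO):
            match = pattern.search(line)
            if match:
                summary.update(match.groupdict())
    return summary


log = [
    "[44566.016330] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring vcn_enc0 timeout, signaled seq=2, emitted seq=6",
    "[44566.016553] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process obs pid 38687 thread obs:cs0 pid 38694",
]

print(summarize_timeout(log))
```

For this report the fields point at the VCN encode ring (`vcn_enc0`) and the `obs` process, which matches video encoding being in use when the reset happened.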

Additional information:

$ lspci
00:00.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne Root Complex
00:00.2 IOMMU: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne IOMMU
00:01.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir PCIe Dummy Host Bridge
00:01.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Renoir PCIe GPP Bridge
00:01.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne PCIe GPP Bridge
00:01.3 PCI bridge: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne PCIe GPP Bridge
00:02.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir PCIe Dummy Host Bridge
00:02.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne PCIe GPP Bridge
00:02.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne PCIe GPP Bridge
00:08.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Renoir PCIe Dummy Host Bridge
00:08.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Renoir Internal PCIe GPP Bridge to Bus
00:14.0 SMBus: Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller (rev 51)
00:14.3 ISA bridge: Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge (rev 51)
00:18.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 0
00:18.1 Host bridge: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 1
00:18.2 Host bridge: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 2
00:18.3 Host bridge: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 3
00:18.4 Host bridge: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 4
00:18.5 Host bridge: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 5
00:18.6 Host bridge: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 6
00:18.7 Host bridge: Advanced Micro Devices, Inc. [AMD] Cezanne Data Fabric; Function 7
01:00.0 3D controller: NVIDIA Corporation TU117M [GeForce GTX 1650 Mobile / Max-Q] (rev a1)
02:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller (rev 15)
03:00.0 Network controller: Realtek Semiconductor Co., Ltd. RTL8852AE 802.11ax PCIe Wireless Network Adapter
04:00.0 Non-Volatile memory controller: Samsung Electronics Co Ltd NVMe SSD Controller SM981/PM981/PM983
05:00.0 Non-Volatile memory controller: Sandisk Corp Device 5008 (rev 01)
06:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Cezanne (rev c5)
06:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Renoir Radeon High Definition Audio Controller
06:00.2 Encryption controller: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 10h-1fh) Platform Security Processor
06:00.3 USB controller: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne USB 3.1
06:00.4 USB controller: Advanced Micro Devices, Inc. [AMD] Renoir/Cezanne USB 3.1
06:00.5 Multimedia controller: Advanced Micro Devices, Inc. [AMD] Raven/Raven2/FireFlight/Renoir Audio Processor (rev 01)
06:00.6 Audio device: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 10h-1fh) HD Audio Controller

$ glxinfo -B
name of display: :0.0
display: :0 screen: 0
direct rendering: Yes
Extended renderer info (GLX_MESA_query_renderer):
    Vendor: AMD (0x1002)
    Device: RENOIR (renoir, LLVM 15.0.7, DRM 3.49, 6.2.0-31-generic) (0x1638)
    Version: 23.0.4
    Accelerated: yes
    Video memory: 2048MB
    Unified memory: no
    Preferred profile: core (0x1)
    Max core profile version: 4.6
    Max compat profile version: 4.6
    Max GLES1 profile version: 1.1
    Max GLES[23] profile version: 3.2
Memory info (GL_ATI_meminfo):
    VBO free memory - total: 802 MB, largest block: 802 MB
    VBO free aux. memory - total: 22801 MB, largest block: 22801 MB
    Texture free memory - total: 802 MB, largest block: 802 MB
    Texture free aux. memory - total: 22801 MB, largest block: 22801 MB
    Renderbuffer free memory - total: 802 MB, largest block: 802 MB
    Renderbuffer free aux. memory - total: 22801 MB, largest block: 22801 MB
Memory info (GL_NVX_gpu_memory_info):
    Dedicated video memory: 2048 MB
    Total available memory: 25048 MB
    Currently available dedicated video memory: 802 MB
OpenGL vendor string: AMD
OpenGL renderer string: RENOIR (renoir, LLVM 15.0.7, DRM 3.49, 6.2.0-31-generic)
OpenGL core profile version string: 4.6 (Core Profile) Mesa 23.0.4-0ubuntu1~22.04.1
OpenGL core profile shading language version string: 4.60
OpenGL core profile context flags: (none)
OpenGL core profile profile mask: core profile

OpenGL version string: 4.6 (Compatibility Profile) Mesa 23.0.4-0ubuntu1~22.04.1
OpenGL shading language version string: 4.60
OpenGL context flags: (none)
OpenGL profile mask: compatibility profile

OpenGL ES profile version string: OpenGL ES 3.2 Mesa 23.0.4-0ubuntu1~22.04.1
OpenGL ES profile shading language version string: OpenGL ES GLSL ES 3.20

ProblemType: Bug
DistroRelease: Ubuntu 22.04
Package: linux-image-6.2.0-31-generic 6.2.0-31.31~22.04.1
ProcVersionSignature: Ubuntu 6.2.0-31.31~22.04.1-generic 6.2.15
Uname: Linux 6.2.0-31-generic x86_64
NonfreeKernelModules: zfs zunicode zavl icp zcommon znvpair nvidia_modeset nvidia
ApportVersion: 2.20.11-0ubuntu82.5
Architecture: amd64
CasperMD5CheckResult: unknown
CurrentDesktop: XFCE
Date: Sat Sep 2 20:10:00 2023
InstallationDate: Installed on 2017-06-13 (2272 days ago)
InstallationMedia: Xubuntu 17.04 "Zesty Zapus" - Release amd64 (20170412)
SourcePackage: linux-signed-hwe-6.2
UpgradeStatus: Upgraded to jammy on 2019-12-22 (1350 days ago)
modified.conffile..etc.cron.daily.apport: [deleted]

Leonardo Müller (leozinho29-eu) wrote :
Juerg Haefliger (juergh)
affects: linux-signed-hwe-6.2 (Ubuntu) → linux-hwe-6.2 (Ubuntu)
Mario Limonciello (superm1) wrote :

I believe this likely needs the VCN fixes from https://gitlab.freedesktop.org/mesa/mesa/-/issues/9728 backported.

Changed in linux-hwe-6.2 (Ubuntu):
status: New → Invalid
Changed in mesa:
status: Unknown → Fix Released
Oibaf (oibaf) wrote :

The patch fixing the linked upstream issue is "radeonsi/vcn: fix the incorrect dt_size".

It is in Mesa main (which will become 23.3) and in 23.1.7, which is already in Ubuntu Mantic.
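
To check an installed Mesa against the releases named in this comment, something like the sketch below could be used. It is only an illustration: `mesa_version` and `has_vcn_fix` are hypothetical helper names, the check looks at upstream version numbers only, and it returns None for any series this comment does not mention (for example 23.0.x or 23.2.x). A distribution package may also carry the patch as a backport, which this cannot detect.

```python
import re


def mesa_version(text):
    """Extract (major, minor, patch) from a string such as glxinfo output."""
    match = re.search(r"Mesa (\d+)\.(\d+)\.(\d+)", text)
    return tuple(int(part) for part in match.groups()) if match else None


def has_vcn_fix(version):
    """True or False where the comment settles it, None where it does not."""
    series = version[:2]
    if series == (23, 1):
        return version >= (23, 1, 7)  # fix stated to be in 23.1.7
    if series >= (23, 3):
        return True  # fix is in main, which becomes 23.3
    return None  # series not covered by the comment


# Version string taken from the glxinfo output in the bug description.
reported = mesa_version(
    "OpenGL core profile version string: 4.6 (Core Profile) Mesa 23.0.4-0ubuntu1~22.04.1"
)
print(reported, has_vcn_fix(reported))
```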

Changed in linux-hwe-6.2 (Ubuntu Lunar):
status: New → Invalid
Changed in linux-hwe-6.2 (Ubuntu Jammy):
status: New → Invalid
Changed in mesa (Ubuntu Mantic):
status: New → Fix Released
Brian Murray (brian-murray) wrote :

Ubuntu 23.04 (Lunar Lobster) has reached end of life, so this bug will not be fixed for that specific release.

Changed in mesa (Ubuntu Lunar):
status: New → Won't Fix