Comment 0 for bug 1959525

Revision history for this message
Jonas Gamao (yamiyukisenpai) wrote :

## No output at boot
The dmesg I attached is from when nothing showed at boot.

Journal output:
$ journalctl -r --grep amdgpu
-- Journal begins at Thu 2022-01-27 05:36:17 EST, ends at Sun 2022-01-30 15:58:40 EST. --
Jan 30 15:50:24 Y4M1-II kernel: amdgpu: probe of 0000:0c:00.0 failed with error -110
Jan 30 15:50:24 Y4M1-II kernel: [drm] amdgpu: ttm finalized
Jan 30 15:50:24 Y4M1-II kernel: amdgpu_init+0x77/0x1000 [amdgpu]
Jan 30 15:50:24 Y4M1-II kernel: amdgpu_pci_probe+0x12a/0x1b0 [amdgpu]
Jan 30 15:50:24 Y4M1-II kernel: amdgpu_driver_load_kms.cold+0x46/0x83 [amdgpu]
Jan 30 15:50:24 Y4M1-II kernel: amdgpu_driver_unload_kms+0x43/0x70 [amdgpu]
Jan 30 15:50:24 Y4M1-II kernel: amdgpu_device_fini+0xc5/0x1e5 [amdgpu]
Jan 30 15:50:24 Y4M1-II kernel: amdgpu_device_ip_fini.isra.0+0x206/0x2cd [amdgpu]
Jan 30 15:50:24 Y4M1-II kernel: gmc_v10_0_sw_fini+0x33/0x40 [amdgpu]
Jan 30 15:50:24 Y4M1-II kernel: amdgpu_bo_fini+0x12/0x50 [amdgpu]
Jan 30 15:50:24 Y4M1-II kernel: amdgpu_ttm_fini+0xab/0x100 [amdgpu]
Jan 30 15:50:24 Y4M1-II kernel: amdgpu_vram_mgr_fini+0xe8/0x160 [amdgpu]
Jan 30 15:50:24 Y4M1-II kernel: Modules linked in: hid_generic usbhid hid amdgpu(+) iommu_v2 gpu_sched i2c_algo_bit drm_ttm_helper ttm drm_kms_helper crct10dif_pclmul syscopyarea crc32_pclmul sysfillrect ghash_clmulni_intel sysimgblt fb_sys_fops cec aesni_intel rc_core crypto_simd cryptd drm nvme ahci xhci_pci i2c_piix4 nvme_core igc libahci xhci_pci_renesas wmi
Jan 30 15:50:24 Y4M1-II kernel: [drm:psp_v11_0_ring_destroy [amdgpu]] *ERROR* Fail to stop psp ring
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: amdgpu: amdgpu: finishing device.
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: amdgpu: Fatal error during GPU init
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: amdgpu: amdgpu_device_ip_init failed
Jan 30 15:50:24 Y4M1-II kernel: [drm:amdgpu_device_ip_init [amdgpu]] *ERROR* hw_init of IP block <gfx_v10_0> failed -110
Jan 30 15:50:24 Y4M1-II kernel: [drm:amdgpu_gfx_enable_kcq.cold [amdgpu]] *ERROR* KCQ enable failed
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: amdgpu: SMU is initialized successfully!
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: amdgpu: use vbios provided pptable
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: amdgpu: SMU driver if version not matched
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: amdgpu: smu driver if version = 0x0000003d, smu fw if version = 0x00000040, smu fw version = 0x003a4700 (58.71.0)
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: amdgpu: Will use PSP to load VCN firmware
Jan 30 15:50:24 Y4M1-II kernel: [drm] amdgpu: 16368M of GTT memory ready.
Jan 30 15:50:24 Y4M1-II kernel: [drm] amdgpu: 16368M of VRAM memory ready
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: amdgpu: AGP: 267894784M 0x0000008400000000 - 0x0000FFFFFFFFFFFF
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: amdgpu: GART: 512M 0x0000000000000000 - 0x000000001FFFFFFF
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: amdgpu: VRAM: 16368M 0x0000008000000000 - 0x00000083FEFFFFFF (16368M used)
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: amdgpu: SRAM ECC is not presented.
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: amdgpu: MEM ECC is not presented.
Jan 30 15:50:24 Y4M1-II kernel: amdgpu: ATOM BIOS: 113-EXT800216-L05
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: amdgpu: Fetched VBIOS from VFCT
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: amdgpu: Trusted Memory Zone (TMZ) feature not supported
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: enabling device (0006 -> 0007)
Jan 30 15:50:24 Y4M1-II kernel: amdgpu 0000:0c:00.0: vgaarb: deactivate vga console
Jan 30 15:50:24 Y4M1-II kernel: fb0: switching to amdgpudrmfb from EFI VGA
Jan 30 15:50:24 Y4M1-II kernel: amdgpu: Topology: Add CPU node
Jan 30 15:50:24 Y4M1-II kernel: amdgpu: Virtual CRAT table created for CPU
Jan 30 15:50:24 Y4M1-II kernel: amdgpu: Ignoring ACPI CRAT on non-APU system
Jan 30 15:50:24 Y4M1-II kernel: [drm] amdgpu kernel modesetting enabled.
Jan 30 15:50:24 Y4M1-II kernel: Kernel command line: root=UUID=1f611ee1-b4c4-4178-8681-c1e1fe158f52 ro rootflags=subvol=@ amdgpu.ppfeaturemask=0xffffffff amd_iommu=on iommu=pt kvm_amd.npt=1 kvm_amd.avic=1 vfio-pci.ids=10de:1c03,10de:10f1 quiet splash vt.handoff=7 initrd=@\boot\initrd.img-5.13.0-27-generic
Jan 30 15:50:24 Y4M1-II kernel: Command line: root=UUID=1f611ee1-b4c4-4178-8681-c1e1fe158f52 ro rootflags=subvol=@ amdgpu.ppfeaturemask=0xffffffff amd_iommu=on iommu=pt kvm_amd.npt=1 kvm_amd.avic=1 vfio-pci.ids=10de:1c03,10de:10f1 quiet splash vt.handoff=7 initrd=@\boot\initrd.img-5.13.0-27-generic

##Re: when idle
After the display has been idle, and screen timeout occurs after a while (not exactly sure cuz it happens while I was asleep), the GPU "resets" and display us unable to recover.
To reproduce: I set the display timeout to 10 min (sleep is disabled). Anything close to that, and I can still recover the display.
If left unattended for longer, the display is unable to recover.

## System info:
* CPU: Ryzen 9 5900X
* GPU Asrock 6900XT Phantom Gaming D
    0c:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Navi 21 [Radeon RX 6800/6800 XT / 6900 XT] [1002:73bf] (rev c0)
* Displays: 1440p Asus TUF Gaming VG27AQ1A + 4K BenQ EW3270U
* RAM: G.SKILL RIPJAWS V 64GB (2x32GB)
* Linux Y4M1-II 5.13.0-27-generic #29-Ubuntu SMP Wed Jan 12 17:36:47 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux