amdgpu crash during bootup and instability during usage

Bug #1998852 reported by Lenno Nagel
This bug affects 1 person
Affects: linux (Ubuntu)
Status: Confirmed
Importance: Undecided
Assigned to: Unassigned
Milestone: (not set)

Bug Description

The following stack trace appears during boot. During normal use the system is rather unstable: I have had about one crash per hour today, after upgrading to Kinetic.

[ 3.165052] amdgpu 0000:08:00.0: amdgpu: RAS: optional ras ta ucode is not available
[ 3.176406] amdgpu 0000:08:00.0: amdgpu: RAP: optional rap ta ucode is not available
[ 3.176407] amdgpu 0000:08:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
[ 3.176794] amdgpu 0000:08:00.0: amdgpu: SMU is initialized successfully!
[ 3.177673] ------------[ cut here ]------------
[ 3.177674] WARNING: CPU: 6 PID: 286 at drivers/gpu/drm/amd/amdgpu/../display/dc/clk_mgr/dcn21/rn_clk_mgr_vbios_smu.c:98 rn_vbios_smu_send_msg_with_param+0x105/0x110 [amdgpu]
[ 3.177875] Modules linked in: fjes(+) amdgpu(+) iommu_v2 gpu_sched i2c_algo_bit drm_ttm_helper ttm drm_display_helper cec rc_core crct10dif_pclmul drm_kms_helper crc32_pclmul ghash_clmulni_intel syscopyarea aesni_intel sysfillrect sysimgblt fb_sys_fops crypto_simd cryptd sdhci_pci psmouse drm nvme cqhci mhi_pci_generic(+) r8169 nvme_core xhci_pci i2c_piix4 mhi sdhci xhci_pci_renesas realtek wmi video i2c_scmi
[ 3.177890] CPU: 6 PID: 286 Comm: systemd-udevd Not tainted 5.19.0-26-generic #27-Ubuntu
[ 3.177892] Hardware name: LENOVO 21A0000CMH/21A0000CMH, BIOS R1MET51W (1.21 ) 09/15/2022
[ 3.177892] RIP: 0010:rn_vbios_smu_send_msg_with_param+0x105/0x110 [amdgpu]
[ 3.178052] Code: be 93 62 01 00 e8 ab 1f e8 ff 5b 41 5c 41 5d 41 5e 5d 31 d2 31 c9 31 f6 31 ff c3 cc cc cc cc 3d fe 00 00 00 74 cf 0f 0b eb cb <0f> 0b e9 49 ff ff ff 0f 1f 40 00 0f 1f 44 00 00 55 31 d2 be 02 00
[ 3.178053] RSP: 0018:ffffbab080673498 EFLAGS: 00010202
[ 3.178055] RAX: 00000000000000fe RBX: 0000000000030d41 RCX: 0000000000000000
[ 3.178056] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[ 3.178056] RBP: ffffbab0806734b8 R08: 0000000000000003 R09: ffff946e63eeac00
[ 3.178057] R10: ffff946e5c6a6c00 R11: 0000000000000005 R12: ffff946e5c6a6c00
[ 3.178057] R13: 000000000000000d R14: 0000000000000001 R15: ffff946e5c6a6c00
[ 3.178058] FS: 00007feb74e9d8c0(0000) GS:ffff947451f80000(0000) knlGS:0000000000000000
[ 3.178059] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 3.178060] CR2: 000055ff990ed3ad CR3: 000000011bac2000 CR4: 0000000000750ee0
[ 3.178060] PKRU: 55555554
[ 3.178061] Call Trace:
[ 3.178062] <TASK>
[ 3.178063] rn_vbios_smu_enable_48mhz_tmdp_refclk_pwrdwn+0x17/0x30 [amdgpu]
[ 3.178209] rn_clk_mgr_construct+0x135/0xd10 [amdgpu]
[ 3.178353] ? amdgpu_device_rreg+0xa6/0x120 [amdgpu]
[ 3.178461] dc_clk_mgr_create+0x442/0x5e0 [amdgpu]
[ 3.178606] dc_construct+0x20f/0x500 [amdgpu]
[ 3.178755] ? kmalloc_order+0x6a/0xa0
[ 3.178758] ? kmem_cache_alloc_trace+0x189/0x310
[ 3.178761] dc_create+0x46/0x140 [amdgpu]
[ 3.178905] amdgpu_dm_init.isra.0+0x214/0x350 [amdgpu]
[ 3.179050] ? dev_printk_emit+0x61/0x8d
[ 3.179053] ? memcpy_erms+0x6/0x10
[ 3.179056] dm_hw_init+0x12/0x30 [amdgpu]
[ 3.179196] amdgpu_device_ip_init+0x5a2/0x7f7 [amdgpu]
[ 3.179353] amdgpu_device_init.cold+0x2ea/0x7ef [amdgpu]
[ 3.179501] amdgpu_driver_load_kms+0x1c/0x170 [amdgpu]
[ 3.179605] amdgpu_pci_probe+0x16d/0x3b0 [amdgpu]
[ 3.179702] local_pci_probe+0x47/0x90
[ 3.179705] pci_call_probe+0x55/0x190
[ 3.179707] pci_device_probe+0x84/0x120
[ 3.179708] really_probe+0x1df/0x3b0
[ 3.179711] __driver_probe_device+0x12c/0x1b0
[ 3.179712] driver_probe_device+0x24/0xd0
[ 3.179713] __driver_attach+0xe0/0x210
[ 3.179714] ? __device_attach_driver+0x130/0x130
[ 3.179716] bus_for_each_dev+0x90/0xe0
[ 3.179717] driver_attach+0x1e/0x30
[ 3.179718] bus_add_driver+0x187/0x230
[ 3.179719] driver_register+0x8f/0x100
[ 3.179720] __pci_register_driver+0x62/0x70
[ 3.179722] amdgpu_init+0x6a/0x1000 [amdgpu]
[ 3.179834] ? 0xffffffffc135d000
[ 3.179835] do_one_initcall+0x5e/0x240
[ 3.179838] do_init_module+0x50/0x210
[ 3.179840] load_module+0xb7d/0xcd0
[ 3.179841] ? kernel_read_file+0x255/0x2f0
[ 3.179844] __do_sys_finit_module+0xc4/0x140
[ 3.179845] ? __do_sys_finit_module+0xc4/0x140
[ 3.179846] __x64_sys_finit_module+0x18/0x30
[ 3.179847] do_syscall_64+0x5b/0x90
[ 3.179849] ? syscall_exit_to_user_mode+0x26/0x50
[ 3.179851] ? __x64_sys_read+0x19/0x30
[ 3.179853] ? do_syscall_64+0x67/0x90
[ 3.179853] ? exit_to_user_mode_prepare+0x30/0xb0
[ 3.179855] ? syscall_exit_to_user_mode+0x26/0x50
[ 3.179856] ? __x64_sys_mmap+0x33/0x70
[ 3.179858] ? do_syscall_64+0x67/0x90
[ 3.179859] entry_SYSCALL_64_after_hwframe+0x63/0xcd
[ 3.179861] RIP: 0033:0x7feb7557dc4d
[ 3.179862] Code: 5d c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 83 f1 0d 00 f7 d8 64 89 01 48
[ 3.179863] RSP: 002b:00007fff13c8aa88 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
[ 3.179864] RAX: ffffffffffffffda RBX: 000055ff9a5cc2a0 RCX: 00007feb7557dc4d
[ 3.179865] RDX: 0000000000000000 RSI: 00007feb756ff458 RDI: 0000000000000018
[ 3.179865] RBP: 00007feb756ff458 R08: 0000000000000000 R09: 00007fff13c8abb0
[ 3.179866] R10: 0000000000000018 R11: 0000000000000246 R12: 0000000000020000
[ 3.179866] R13: 000055ff9a5c92f0 R14: 0000000000000000 R15: 000055ff9a5e07f0
[ 3.179868] </TASK>
[ 3.179868] ---[ end trace 0000000000000000 ]---
[ 3.180617] [drm] Display Core initialized with v3.2.187!
[ 3.181265] [drm] DMUB hardware initialized: version=0x0101001F
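
For triage, the warning site and the reliable frames of the trace above can be pulled out mechanically. The following is a minimal Python sketch, assuming the log is pasted in the dmesg format shown in this report. The names `summarize_warning`, `WARNING_RE`, `FRAME_RE` and `SAMPLE` are illustrative and not part of any kernel or Ubuntu tool. Frames prefixed with `? ` are skipped, since the kernel marks those as unreliable stack guesses.

```python
import re

# Header line printed by the kernel for a WARN(): source file, line, function.
WARNING_RE = re.compile(
    r"WARNING: CPU: (?P<cpu>\d+) PID: (?P<pid>\d+) at "
    r"(?P<file>\S+):(?P<line>\d+) (?P<func>\w+)\+"
)
# A call-trace frame such as "dc_create+0x46/0x140 [amdgpu]".
# A leading "? " marks a frame the unwinder could not confirm.
FRAME_RE = re.compile(
    r"\]\s+(?P<guess>\? )?(?P<func>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?: \[(?P<module>\w+)\])?\s*$"
)

# Shortened excerpt of the log in this report, used only as demo input.
SAMPLE = """\
[ 3.177674] WARNING: CPU: 6 PID: 286 at drivers/gpu/drm/amd/amdgpu/../display/dc/clk_mgr/dcn21/rn_clk_mgr_vbios_smu.c:98 rn_vbios_smu_send_msg_with_param+0x105/0x110 [amdgpu]
[ 3.178061] Call Trace:
[ 3.178062] <TASK>
[ 3.178063] rn_vbios_smu_enable_48mhz_tmdp_refclk_pwrdwn+0x17/0x30 [amdgpu]
[ 3.178209] rn_clk_mgr_construct+0x135/0xd10 [amdgpu]
[ 3.178353] ? amdgpu_device_rreg+0xa6/0x120 [amdgpu]
[ 3.179702] local_pci_probe+0x47/0x90
[ 3.179868] </TASK>
"""


def summarize_warning(log_text):
    """Return (warning, frames) for the first WARNING found in log_text.

    warning is a dict of the header fields (or None if no WARNING is found);
    frames is a list of (function, module) tuples for the reliable frames.
    """
    warning = None
    frames = []
    in_trace = False
    for line in log_text.splitlines():
        if warning is None:
            match = WARNING_RE.search(line)
            if match:
                warning = match.groupdict()
            continue
        if "Call Trace:" in line:
            in_trace = True
            continue
        if "</TASK>" in line or "end trace" in line:
            break
        if in_trace:
            match = FRAME_RE.search(line)
            if match and not match.group("guess"):
                frames.append((match.group("func"), match.group("module")))
    return warning, frames


if __name__ == "__main__":
    warning, frames = summarize_warning(SAMPLE)
    print(f"{warning['func']} at {warning['file']}:{warning['line']}")
    for func, module in frames:
        print(f"  {func} [{module or 'kernel'}]")
```

Run against the full log, this would list the probe path from `amdgpu_pci_probe` down to `rn_vbios_smu_send_msg_with_param`, which is usually enough to search for matching upstream reports.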

ProblemType: Bug
DistroRelease: Ubuntu 22.10
Package: linux-image-5.19.0-26-generic 5.19.0-26.27
ProcVersionSignature: Ubuntu 5.19.0-26.27-generic 5.19.7
Uname: Linux 5.19.0-26-generic x86_64
NonfreeKernelModules: zfs zunicode zavl icp zcommon znvpair
ApportVersion: 2.23.1-0ubuntu3
Architecture: amd64
CasperMD5CheckResult: unknown
CurrentDesktop: KDE
Date: Mon Dec 5 21:36:59 2022
HibernationDevice: RESUME=
InstallationDate: Installed on 2020-05-08 (941 days ago)
InstallationMedia: Kubuntu 20.04 LTS "Focal Fossa" - Release amd64 (20200423)
MachineType: LENOVO 21A0000CMH
ProcFB: 0 amdgpudrmfb
ProcKernelCmdLine: BOOT_IMAGE=/vmlinuz-5.19.0-26-generic root=/dev/mapper/vgkubuntu-root ro iommu=pt initcall_blacklist=acpi_cpufreq_init fbcon=font:TER16x32 quiet splash vt.handoff=7
PulseList: Error: command ['pacmd', 'list'] failed with exit code 1: No PulseAudio daemon running, or not running as session daemon.
RelatedPackageVersions:
 linux-restricted-modules-5.19.0-26-generic N/A
 linux-backports-modules-5.19.0-26-generic N/A
 linux-firmware 20220923.gitf09bebf3-0ubuntu1.1
SourcePackage: linux
UpgradeStatus: Upgraded to kinetic on 2022-12-04 (1 days ago)
dmi.bios.date: 09/15/2022
dmi.bios.release: 1.21
dmi.bios.vendor: LENOVO
dmi.bios.version: R1MET51W (1.21 )
dmi.board.asset.tag: Not Available
dmi.board.name: 21A0000CMH
dmi.board.vendor: LENOVO
dmi.board.version: SDK0J40697 WIN
dmi.chassis.asset.tag: No Asset Information
dmi.chassis.type: 10
dmi.chassis.vendor: LENOVO
dmi.chassis.version: None
dmi.ec.firmware.release: 1.21
dmi.modalias: dmi:bvnLENOVO:bvrR1MET51W(1.21):bd09/15/2022:br1.21:efr1.21:svnLENOVO:pn21A0000CMH:pvrThinkPadP14sGen2a:rvnLENOVO:rn21A0000CMH:rvrSDK0J40697WIN:cvnLENOVO:ct10:cvrNone:skuLENOVO_MT_21A0_BU_Think_FM_ThinkPadP14sGen2a:
dmi.product.family: ThinkPad P14s Gen 2a
dmi.product.name: 21A0000CMH
dmi.product.sku: LENOVO_MT_21A0_BU_Think_FM_ThinkPad P14s Gen 2a
dmi.product.version: ThinkPad P14s Gen 2a
dmi.sys.vendor: LENOVO

Ubuntu Kernel Bot (ubuntu-kernel-bot) wrote : Status changed to Confirmed

This change was made by a bot.

Changed in linux (Ubuntu):
status: New → Confirmed