5.19.0-1004.4-generic crashes with installed radeon graphics card

Bug #2006588 reported by Heinrich Schuchardt
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
linux-riscv (Ubuntu)
New
Undecided
Unassigned

Bug Description

I tried to install https://cdimage.ubuntu.com/ubuntu-server/daily-live/current/lunar-live-server-riscv64.img.gz on an SiFive HiFive Unmatches board with a Radeon graphics card. I saw a crash:

[ 22.208410] [drm] Initialized radeon 2.50.0 20080528 for 0000:07:00.0 on minor 0
[ 44.013359] watchdog: BUG: soft lockup - CPU#2 stuck for 22s! [systemd-udevd:189]
[ 44.020117] Modules linked in: gpu_sched radeon i2c_algo_bit drm_ttm_helper ttm drm_display_helper cec rc_core drm_kms_helper rtc_da9063 syscopyarea sysfillrect da9063_regulator sysimgblt fb_sys_fops nvme drm mscc backlight macsec xhci_pci nvme_core xhci_pci_renesas macb i2c_ocores phylink
[ 44.045737] CPU: 2 PID: 189 Comm: systemd-udevd Not tainted 5.19.0-1004-generic #4-Ubuntu
[ 44.053885] Hardware name: SiFive HiFive Unmatched A00 (DT)
[ 44.059440] epc : apply_relocate_add+0x118/0x292
[ 44.064043] ra : apply_relocate_add+0x88/0x292
[ 44.068559] epc : ffffffff800088e4 ra : ffffffff80008854 sp : ffffffd89b55fc80
[ 44.075772] gp : ffffffff81e2e6c8 tp : ffffffd89bd1ce00 t0 : ffffffff02718510
[ 44.082981] t1 : ffffffff027ce160 t2 : 000000000001376b s0 : ffffffd89b55fd20
[ 44.090191] s1 : ffffffc895f8f2f8 a0 : 000000000001376b a1 : ffffffff025d4000
[ 44.097399] a2 : ffffffff027ce15c a3 : ffffffff027ce15c a4 : 000000000002cae4
[ 44.104609] a5 : 000000000003a641 a6 : 000000000001e8d6 a7 : 0000000000000000
[ 44.111818] s2 : ffffffc895f8e438 s3 : ffffffc895f8e538 s4 : 0000000000000017
[ 44.119028] s5 : fffffffffffff000 s6 : 0000000000000014 s7 : ffffffc895a3d1b0
[ 44.126236] s8 : ffffffff810014f0 s9 : 0000000000000033 s10: 000000000001e8d6
[ 44.133445] s11: 000000000001376b t3 : 0000000000000100 t4 : ffffffff80008374
[ 44.140655] t5 : 0000000000430560 t6 : ffffffff02e2c000
[ 44.145949] status: 0000000200000120 badaddr: 0000000000000000 cause: 8000000000000005
[ 44.153860] [<ffffffff800a9008>] load_module+0x826/0x9fc
[ 44.159152] [<ffffffff800a9410>] __do_sys_finit_module+0x98/0xfe
[ 44.165146] [<ffffffff800a949a>] sys_finit_module+0x24/0x30
[ 44.170704] [<ffffffff80003c62>] ret_from_syscall+0x0/0x2
[ 68.013359] watchdog: BUG: soft lockup - CPU#2 stuck for 45s! [systemd-udevd:189]
[ 68.020111] Modules linked in: gpu_sched radeon i2c_algo_bit drm_ttm_helper ttm drm_display_helper cec rc_core drm_kms_helper rtc_da9063 syscopyarea sysfillrect da9063_regulator sysimgblt fb_sys_fops nvme drm mscc backlight macsec xhci_pci nvme_core xhci_pci_renesas macb i2c_ocores phylink

Revision history for this message
Heinrich Schuchardt (xypron) wrote :
Revision history for this message
Heinrich Schuchardt (xypron) wrote :
Revision history for this message
Heinrich Schuchardt (xypron) wrote :

# lspci
00:00.0 PCI bridge: SiFive, Inc. FU740-C000 RISC-V SoC PCI Express x8 to AXI4 Bridge
01:00.0 PCI bridge: ASMedia Technology Inc. ASM2824 PCIe Gen3 Packet Switch (rev 01)
02:00.0 PCI bridge: ASMedia Technology Inc. ASM2824 PCIe Gen3 Packet Switch (rev 01)
02:02.0 PCI bridge: ASMedia Technology Inc. ASM2824 PCIe Gen3 Packet Switch (rev 01)
02:03.0 PCI bridge: ASMedia Technology Inc. ASM2824 PCIe Gen3 Packet Switch (rev 01)
02:04.0 PCI bridge: ASMedia Technology Inc. ASM2824 PCIe Gen3 Packet Switch (rev 01)
02:08.0 PCI bridge: ASMedia Technology Inc. ASM2824 PCIe Gen3 Packet Switch (rev 01)
04:00.0 USB controller: ASMedia Technology Inc. ASM1042A USB 3.0 Host Controller
06:00.0 Non-Volatile memory controller: Sandisk Corp PC SN520 NVMe SSD (rev 01)
07:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Caicos [Radeon HD 6450/7450/8450 / R5 230 OEM]
07:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Caicos HDMI Audio [Radeon HD 6450 / 7450/8450/8490 OEM / R5 230/235/235X OEM]

Revision history for this message
Heinrich Schuchardt (xypron) wrote :

Setting kernel.watchdog_thresh=60 in /etc/sysctl.d/99-watchdog_thresh.conf does not solve the problem. /etc/sysctl* is not included in the initrd and the issue occurs before the real root is mounted. 60 seconds is the maximum value.

Revision history for this message
Heinrich Schuchardt (xypron) wrote :

Adding sysctl.kernel.watchdog_thresh=60 to the kernel command line bypasses the problem.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.