Comment 3 for bug 2030479

Po-Hsu Lin (cypressyew) wrote:

Bah, it looks like this issue is quite random on this system. It reproduced with 5.15.0-1036-realtime on the second attempt.

[ 698.346847] ------------[ cut here ]------------
[ 698.346856] WARNING: CPU: 0 PID: 15 at kernel/sched/core.c:3106 set_task_cpu+0x168/0x214
[ 698.346880] Modules linked in: binfmt_misc nls_iso8859_1 ipmi_ssif hisi_hpre arm_spe_pmu hns_roce_hw_v2 ecdh_generic hisi_sec2 hisi_zip libcurve25519_generic hisi_qm ecc uacce authenc acpi_ipmi ipmi_si hisi_trng_v2 ipmi_devintf hisi_uncore_hha_pmu ipmi_msghandler hisi_uncore_ddrc_pmu hisi_uncore_l3c_pmu hisi_uncore_pmu cppc_cpufreq sch_fq_codel dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua ramoops reed_solomon pstore_blk pstore_zone efi_pstore ip_tables x_tables autofs4 btrfs blake2b_generic zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor xor_neon raid6_pq libcrc32c raid1 raid0 multipath linear mlx5_ib ib_uverbs ib_core hibmc_drm drm_vram_helper drm_ttm_helper ttm i2c_algo_bit drm_kms_helper mlx5_core syscopyarea sysfillrect sysimgblt fb_sys_fops cec ses enclosure realtek crct10dif_ce ghash_ce sha2_ce sha256_arm64 sha1_ce mlxfw hisi_sas_v3_hw rc_core hns3 psample hisi_sas_main hclge tls libsas drm xhci_pci hnae3 xhci_pci_renesas ahci
[ 698.346947] scsi_transport_sas spi_dw_mmio gpio_dwapb spi_dw aes_neon_bs aes_neon_blk aes_ce_blk crypto_simd cryptd aes_ce_cipher
[ 698.346958] CPU: 0 PID: 15 Comm: ksoftirqd/0 Not tainted 5.15.0-1036-realtime #39-Ubuntu
[ 698.346963] Hardware name: Huawei TaiShan 2280 V2/BC82AMDC, BIOS 2280-V2 CS V3.B160.01 01/15/2020
[ 698.346967] pstate: 604000c9 (nZCv daIF +PAN -UAO -TCO -DIT -SSBS BTYPE=--)
[ 698.346971] pc : set_task_cpu+0x168/0x214
[ 698.346973] lr : detach_tasks+0x138/0x390
[ 698.346978] sp : ffff80000860ba70
[ 698.346979] x29: ffff80000860ba70 x28: ffff0020e12abe80 x27: ffffaf19b897a918
[ 698.346982] x26: ffffaf19b83794c0 x25: ffffaf19b83794c0 x24: ffff203f7facbfa8
[ 698.346985] x23: 0000000000000001 x22: ffff203f7facb4c0 x21: ffffaf19b8977a18
[ 698.346988] x20: 0000000000000044 x19: ffff0020e12abe80 x18: 0000000000000000
[ 698.346990] x17: 0000000000000000 x16: ffffaf19b71a7860 x15: 0000000000000000
[ 698.346993] x14: 0000000000000000 x13: 0000000000000030 x12: ffffaf19b79e2b08
[ 698.346996] x11: ffffaf19b8977b50 x10: 0000000000000004 x9 : ffffaf19b6682338
[ 698.346999] x8 : 000b75952208d9a9 x7 : 0000000000e45932 x6 : 000000000000011f
[ 698.347001] x5 : 00000000ffffffe1 x4 : 0000000000000001 x3 : 000000000000b67e
[ 698.347004] x2 : 0000000000000000 x1 : ffffaf19b7b4c258 x0 : 0000000000000001
[ 698.347008] Call trace:
[ 698.347010] set_task_cpu+0x168/0x214
[ 698.347013] detach_tasks+0x138/0x390
[ 698.347015] load_balance+0x228/0x6c0
[ 698.347018] rebalance_domains+0x264/0x390
[ 698.347021] _nohz_idle_balance.constprop.0.isra.0+0x1b0/0x284
[ 698.347024] run_rebalance_domains+0x6c/0x7c
[ 698.347026] __do_softirq+0x110/0x390
[ 698.347029] run_ksoftirqd+0x5c/0xdc
[ 698.347034] smpboot_thread_fn+0x2dc/0x324
[ 698.347041] kthread+0x154/0x160
[ 698.347050] ret_from_fork+0x10/0x20
[ 698.347058] ---[ end trace 0000000000000002 ]---

Test took about 13 minutes to run in this case.
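
Since the reproduction is intermittent, it may help to check each run's dmesg output for the warning automatically. Below is a minimal sketch, assuming the log is available as plain text in the same format as the trace above. The function name `find_warnings` and the field names are made up for illustration and are not part of any existing test tooling.

```python
import re

# Matches a kernel WARNING header line in the format seen in the trace above, e.g.
# "[ 698.346856] WARNING: CPU: 0 PID: 15 at kernel/sched/core.c:3106 set_task_cpu+0x168/0x214"
WARNING_RE = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+WARNING: CPU: (?P<cpu>\d+) PID: (?P<pid>\d+) "
    r"at (?P<file>\S+):(?P<line>\d+) (?P<symbol>\S+)"
)


def find_warnings(log_text):
    """Return one dict per WARNING header line found in log_text (hypothetical helper)."""
    hits = []
    for m in WARNING_RE.finditer(log_text):
        hits.append(
            {
                "timestamp": float(m.group("ts")),
                "cpu": int(m.group("cpu")),
                "pid": int(m.group("pid")),
                "file": m.group("file"),
                "line": int(m.group("line")),
                "symbol": m.group("symbol"),
            }
        )
    return hits


# The header line from the trace in this comment, used as sample input.
sample = (
    "[ 698.346847] ------------[ cut here ]------------\n"
    "[ 698.346856] WARNING: CPU: 0 PID: 15 at kernel/sched/core.c:3106 "
    "set_task_cpu+0x168/0x214\n"
)

for hit in find_warnings(sample):
    print(hit["file"], hit["line"], hit["symbol"])
```

A repro loop could run the test, capture dmesg to a file, and pass its contents to this helper to decide whether the `set_task_cpu` warning fired on that attempt.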