Reliable crash in lowlatency kernel with LXD
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux-signed (Ubuntu) |
Confirmed
|
Undecided
|
Seth Forshee |
Bug Description
Hello, I am able to crash 5.4.0-21-lowlatency running LXD. This machine in an Intel NUC, part of a three-member LXD cluster. The other machines do not show this crash and they are not running the lowlatency kernel, but that might be incidental. I will swap out the generic kernel for a while to see if the behaviour continues there.
Here is the crash:
[ 3222.385724] ------------[ cut here ]------------
[ 3222.385732] WARNING: CPU: 1 PID: 59852 at kernel/
[ 3222.385733] Modules linked in: binfmt_misc veth nft_masq nft_chain_nat vxlan ip6_udp_tunnel udp_tunnel dummy bridge stp llc ebtable_filter ebtables ip6table_raw ip6table_mangle ip6table_nat ip6table_filter ip6_tables iptable_raw iptable_mangle iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_filter bpfilter nf_tables nfnetlink unix_diag ufs qnx4 hfsplus hfs minix ntfs msdos jfs xfs pps_ldisc zfs(PO) zunicode(PO) zavl(PO) icp(PO) nls_iso8859_1 zcommon(PO) znvpair(PO) spl(O) dm_multipath zlua(PO) scsi_dh_rdac scsi_dh_emc scsi_dh_alua snd_hda_
[ 3222.385779] async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear crct10dif_pclmul crc32_pclmul ghash_clmulni_intel i915 aesni_intel i2c_algo_bit crypto_simd cryptd glue_helper drm_kms_helper syscopyarea sysfillrect sysimgblt e1000e i2c_i801 fb_sys_fops ahci drm libahci lpc_ich video
[ 3222.385795] CPU: 1 PID: 59852 Comm: systemd-resolve Tainted: P O 5.4.0-21-lowlatency #25-Ubuntu
[ 3222.385796] Hardware name: /NUC5i5MYBE, BIOS MYBDWi5v.
[ 3222.385801] RIP: 0010:rcu_
[ 3222.385803] Code: 54 53 48 c7 c3 80 ba 02 00 65 48 03 1d 8b 57 ce 6d 0f 1f 44 00 00 41 8b 85 80 07 00 00 45 84 f6 0f 85 55 02 00 00 85 c0 7e 0c <0f> 0b 41 80 bd 84 07 00 00 00 74 33 4c 89 ef e8 36 fb ff ff 65 66
[ 3222.385804] RSP: 0018:ffffa7bf48
[ 3222.385806] RAX: 0000000000000001 RBX: ffff9817c5caba80 RCX: 0000000000000001
[ 3222.385807] RDX: 0000000000000000 RSI: ffffffff92ce6379 RDI: 0000000000000000
[ 3222.385808] RBP: ffffa7bf48e3bd20 R08: 0000000000000000 R09: 0000000000000000
[ 3222.385808] R10: 0000000000000000 R11: 0000000000000000 R12: ffff9817c5caad00
[ 3222.385809] R13: ffff9816c39ecd00 R14: 0000000000000000 R15: 000000000002ad00
[ 3222.385811] FS: 00007f9c0f53094
[ 3222.385812] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 3222.385812] CR2: 000055988bb4b080 CR3: 00000003b5e9a005 CR4: 00000000003606e0
[ 3222.385813] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 3222.385814] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 3222.385815] Call Trace:
[ 3222.385823] __schedule+
[ 3222.385826] ? ___sys_
[ 3222.385829] schedule+0x49/0xd0
[ 3222.385831] schedule_
[ 3222.385834] ? __seccomp_
[ 3222.385837] schedule_
[ 3222.385839] ep_poll+0x3c8/0x410
[ 3222.385843] ? wake_up_q+0x70/0x70
[ 3222.385845] do_epoll_
[ 3222.385847] __x64_sys_
[ 3222.385850] do_syscall_
[ 3222.385852] entry_SYSCALL_
[ 3222.385854] RIP: 0033:0x7f9c0f04eb77
[ 3222.385856] Code: 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 8d 05 91 ed 2c 00 41 89 ca 8b 00 85 c0 75 18 b8 e8 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 61 f3 c3 0f 1f 80 00 00 00 00 41 55 41 54 41
[ 3222.385857] RSP: 002b:00007ffc6f
[ 3222.385859] RAX: ffffffffffffffda RBX: 000055988d0ad3c0 RCX: 00007f9c0f04eb77
[ 3222.385859] RDX: 000000000000000e RSI: 00007ffc6fd86d90 RDI: 0000000000000004
[ 3222.385861] RBP: 00007ffc6fd86f40 R08: 00007ffc6fd86d90 R09: 000055988d0b645c
[ 3222.385861] R10: 00000000ffffffff R11: 0000000000000246 R12: 00000000000000cf
[ 3222.385862] R13: ffffffffffffffff R14: 00007ffc6fd86d90 R15: 0000000000000001
[ 3222.385865] ---[ end trace a8d20e83a2bda2fc ]---
[ 3282.386391] rcu: INFO: rcu_preempt detected stalls on CPUs/tasks:
[ 3282.386412] rcu: Tasks blocked on level-0 rcu_node (CPUs 0-3): P59852
[ 3282.386431] (detected by 0, t=60002 jiffies, g=454773, q=28443)
[ 3282.386435] systemd-resolve S 0 59852 59378 0x80000320
[ 3282.386441] Call Trace:
[ 3282.386457] __schedule+
[ 3282.386465] schedule+0x49/0xd0
[ 3282.386470] schedule_
[ 3282.386479] ? __seccomp_
[ 3282.386484] schedule_
[ 3282.386488] ep_poll+0x3c8/0x410
[ 3282.386496] ? wake_up_q+0x70/0x70
[ 3282.386501] do_epoll_
[ 3282.386505] __x64_sys_
[ 3282.386511] do_syscall_
[ 3282.386517] entry_SYSCALL_
[ 3282.386521] RIP: 0033:0x7f9c0f04eb77
[ 3282.386532] Code: Bad RIP value.
[ 3282.386535] RSP: 002b:00007ffc6f
[ 3282.386539] RAX: ffffffffffffffda RBX: 000055988d0ad3c0 RCX: 00007f9c0f04eb77
[ 3282.386541] RDX: 000000000000000e RSI: 00007ffc6fd86d90 RDI: 0000000000000004
[ 3282.386543] RBP: 00007ffc6fd86f40 R08: 00007ffc6fd86d90 R09: 000055988d0b645c
[ 3282.386545] R10: 00000000ffffffff R11: 0000000000000246 R12: 00000000000000cf
[ 3282.386547] R13: ffffffffffffffff R14: 00007ffc6fd86d90 R15: 0000000000000001
[ 3286.108450] rcu: INFO: rcu_preempt detected expedited stalls on CPUs/tasks: { P59852 } 63716 jiffies s: 4773 root: 0x0/T
[ 3286.108472] rcu: blocking rcu_node structures:
Just to confirm and still seeing this with today's -24 package.