Azure: multi-MSI patches break fio tests on NVMe
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux-azure (Ubuntu) |
Fix Released
|
Undecided
|
Unassigned | ||
Focal |
Fix Released
|
High
|
Tim Gardner | ||
Jammy |
Fix Released
|
Undecided
|
Unassigned |
Bug Description
SRU Justification
[Impact]
Recent additions to focal/azure (5.4.0-1087.92) to support multi-MSI on the Azure hypervisor break fio tests on NVMe disks by causing a hung thread. The instance type used to demonstrate the failure was Standard_L64s_v2.
The initial fix is to revert the 4 patches.
06ad0de407e7ab3
6f7265bdd616170
c6a7a02229c2acd
9dc3b7f1d164efd
[Test Plan]
Run the attached reproducer script.
[Where things could go wrong]
Focal will not have support for multi-MSI, but since this is such a new feature there is likely little consequence.
[Other]
SF: #00339521
CVE References
Changed in linux-azure (Ubuntu Focal): | |
status: | In Progress → Fix Committed |
Jul 22 19:47:31 selfprovisioned -rtg-bionic kernel: [ 382.545090] watchdog: BUG: soft lockup - CPU#1 stuck for 21s! [ksoftirqd/1:17] -rtg-bionic kernel: [ 390.001461] Modules linked in: nls_iso8859_1 xt_conntrack nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 hv_balloon xt_owner xt_tcpudp serio_raw iptable_security bpfilter joydev sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_ iscsi ip_tables x_tables autofs4 btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear mlx4_en mlx4_core crct10dif_pclmul crc32_pclmul ghash_clmulni_intel hid_generic aesni_intel crypto_simd hid_hyperv cryptd pata_acpi glue_helper hyperv_keyboard hyperv_fb hid hv_netvsc hv_utils -rtg-bionic kernel: [ 390.001461] CPU: 1 PID: 17 Comm: ksoftirqd/1 Not tainted 5.4.0-1088-azure #93 -rtg-bionic kernel: [ 390.001461] Hardware name: Microsoft Corporation Virtual Machine/Virtual Machine, BIOS 090008 12/07/2018 -rtg-bionic kernel: [ 390.001461] RIP: 0010:find_ next_and_ bit+0x1c/ 0x70 -rtg-bionic kernel: [ 390.001461] Code: 47 c2 c3 c3 66 2e 0f 1f 84 00 00 00 00 00 48 89 d0 48 39 ca 76 63 49 89 c8 49 c1 e8 06 4a 8b 14 c7 48 85 f6 74 04 4a 23 14 c6 <49> c7 c0 ff ff ff ff 49 d3 e0 48 83 e1 c0 49 21 d0 75 2d 48 83 c1 -rtg-bionic kernel: [ 390.001461] RSP: 0018:ffffb409cc 6fbac8 EFLAGS: 00000202 ORIG_RAX: ffffffffffffff12 -rtg-bionic kernel: [ 390.001461] RAX: 0000000000000040 RBX: ffffb409cc6fbb28 RCX: 0000000000000001 -rtg-bionic kernel: [ 390.001461] RDX: 0000000000000001 RSI: ffff89239f429050 RDI: ffff89239d4d44e0 -rtg-bionic kernel: [ 390.001461] RBP: ffffb409cc6fbad0 R08: 0000000000000000 R09: 0000000000005000 -rtg-bionic kernel: [ 390.001461] R10: 00000000ffffffff R11: 0000000000000002 R12: ffffb409cc6fbd28 -rtg-bionic kernel: [ 390.001461] R13: 0000000000000003 R14: ffff89239d4d44c0 R15: 0000000000000000 -rtg-bionic kernel: [ 390.001461] FS: 000000000000000 0(0000) GS:ffff89239fa4 0000(0000) knlGS:000000000 0000000 -rtg-bionic kernel: [ 390.001461] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 -rtg-bionic kernel: [ 390.001461] CR2: 00007efe0d000000 CR3: 000000106e436000 CR4: 00000000003506e0 -rtg-bionic kernel: [ 390.001461] Call Trace: -rtg-bionic kernel: [ 390.001461] ? cpumask_ next_and+ 0x1e/0x20 -rtg-bionic kernel: [ 390.001461] update_ sd_lb_stats+ 0x14a/0x7d0 -rtg-bionic kernel: [ 390.001461] find_busiest_ group+0x49/ 0x520 -rtg-bionic kernel: [ 390.001461] load_balance+ 0x16f/0xaf0 -rtg-bionic kernel: [ 390.00...
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned
Jul 22 19:47:31 selfprovisioned