Kernel trace in arch/x86/kvm/mmu/mmu.c:6362 during KVM live migration

Bug #2051114 reported by Christian Rohmann
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
linux-hwe-6.2 (Ubuntu)
New
Undecided
Unassigned

Bug Description

We observed a kernel trace on a KVM hypervisor servers during live migrating an instance:

```
[...]
Jan 23 10:58:53 fra-az1-comp-22 kernel: ------------[ cut here ]------------
Jan 23 10:58:53 fra-az1-comp-22 kernel: WARNING: CPU: 75 PID: 1082578 at arch/x86/kvm/mmu/mmu.c:6362 kvm_shadow_mmu_try_split_huge_pages+0x2e3/0x6d0 [kvm]
Jan 23 10:58:53 fra-az1-comp-22 kernel: Modules linked in: authenc echainiv esp4 cpuid xt_policy nf_conntrack_netlink xt_multiport xt_REDIRECT xt_nat xt_connmark xt_mark ebt_arp nft_meta_bridge ip6_tables xt_CT xt_mac xt_set xt_state ip_set_hash_net ip_set vhost_net vhost vhost_iotlb tap vxlan ip6_udp_tunnel udp_tunnel xt_comment xt_physdev veth xt_CHECKSUM xt_MASQUERADE xt_con>
Jan 23 10:58:53 fra-az1-comp-22 kernel: linear hid_generic cdc_ether usbhid usbnet hid mii ast i2c_algo_bit drm_shmem_helper drm_kms_helper syscopyarea crct10dif_pclmul sysfillrect crc32_pclmul polyval_clmulni polyval_generic sysimgblt raid1 ghash_clmulni_intel sha512_ssse3 aesni_intel crypto_simd cryptd ice ahci drm nvme i40e libahci xhci_pci i2c_piix4 xhci_pci_renesas nvme_c>
Jan 23 10:58:53 fra-az1-comp-22 kernel: CPU: 75 PID: 1082578 Comm: live_migration Not tainted 6.2.0-37-generic #38~22.04.1-Ubuntu
Jan 23 10:58:53 fra-az1-comp-22 kernel: Hardware name: ASUSTeK COMPUTER INC. RS720A-E11-RS24U/KMPP-D32 Series, BIOS 1501 08/23/2023
Jan 23 10:58:53 fra-az1-comp-22 kernel: RIP: 0010:kvm_shadow_mmu_try_split_huge_pages+0x2e3/0x6d0 [kvm]
Jan 23 10:58:53 fra-az1-comp-22 kernel: Code: 0a 04 00 00 48 81 c4 e0 00 00 00 5b 41 5c 41 5d 41 5e 41 5f 5d 31 c0 31 d2 31 c9 31 f6 31 ff 45 31 c0 45 31 c9 e9 3d 56 5e e5 <0f> 0b 48 8d bd 68 ff ff ff e8 8f 16 ff ff 49 89 c5 48 85 c0 0f 85
Jan 23 10:58:53 fra-az1-comp-22 kernel: RSP: 0018:ffffbe80b395bba0 EFLAGS: 00010246
Jan 23 10:58:53 fra-az1-comp-22 kernel: RAX: fffff4f620773340 RBX: 0000000000000001 RCX: 0000000000000000
Jan 23 10:58:53 fra-az1-comp-22 kernel: RDX: ffff9a2e5dccdfa8 RSI: 0000000000000000 RDI: 0000000000000000
Jan 23 10:58:53 fra-az1-comp-22 kernel: RBP: ffffbe80b395bca8 R08: 0000000000000001 R09: 00000000001d483f
Jan 23 10:58:53 fra-az1-comp-22 kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffffbe80af5c1000
Jan 23 10:58:53 fra-az1-comp-22 kernel: R13: ffff9a2dddccdfa8 R14: ffff99c51046d2b0 R15: 0000000000000000
Jan 23 10:58:53 fra-az1-comp-22 kernel: FS: 00007f8ec4a8a640(0000) GS:ffff9ab61eac0000(0000) knlGS:0000000000000000
Jan 23 10:58:53 fra-az1-comp-22 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 23 10:58:53 fra-az1-comp-22 kernel: CR2: 00007f9053dff000 CR3: 000000a83eaa8001 CR4: 0000000000770ee0
Jan 23 10:58:53 fra-az1-comp-22 kernel: PKRU: 55555554
Jan 23 10:58:53 fra-az1-comp-22 kernel: Call Trace:
Jan 23 10:58:53 fra-az1-comp-22 kernel: <TASK>
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? show_regs+0x72/0x90
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? kvm_shadow_mmu_try_split_huge_pages+0x2e3/0x6d0 [kvm]
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? __warn+0x8d/0x160
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? kvm_shadow_mmu_try_split_huge_pages+0x2e3/0x6d0 [kvm]
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? report_bug+0x1bb/0x1d0
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? handle_bug+0x46/0x90
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? exc_invalid_op+0x19/0x80
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? asm_exc_invalid_op+0x1b/0x20
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? kvm_shadow_mmu_try_split_huge_pages+0x2e3/0x6d0 [kvm]
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? srso_alias_return_thunk+0x5/0x7f
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? tdp_iter_start+0x5c/0xa0 [kvm]
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? srso_alias_return_thunk+0x5/0x7f
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? clear_dirty_pt_masked+0x155/0x1c0 [kvm]
Jan 23 10:58:53 fra-az1-comp-22 kernel: kvm_mmu_try_split_huge_pages+0x89/0xd0 [kvm]
Jan 23 10:58:53 fra-az1-comp-22 kernel: kvm_arch_mmu_enable_log_dirty_pt_masked+0x1e2/0x270 [kvm]
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? __check_object_size.part.0+0x72/0x150
Jan 23 10:58:53 fra-az1-comp-22 kernel: kvm_clear_dirty_log_protect+0x228/0x2f0 [kvm]
Jan 23 10:58:53 fra-az1-comp-22 kernel: kvm_vm_ioctl+0x171/0x8a0 [kvm]
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? __handle_mm_fault+0x3c0/0x720
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? srso_alias_return_thunk+0x5/0x7f
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? __fget_light+0xb5/0x160
Jan 23 10:58:53 fra-az1-comp-22 kernel: __x64_sys_ioctl+0x9d/0xe0
Jan 23 10:58:53 fra-az1-comp-22 kernel: do_syscall_64+0x5c/0x90
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? srso_alias_return_thunk+0x5/0x7f
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? do_user_addr_fault+0x1d0/0x640
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? srso_alias_return_thunk+0x5/0x7f
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? exit_to_user_mode_prepare+0x3b/0xd0
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? srso_alias_return_thunk+0x5/0x7f
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? irqentry_exit_to_user_mode+0x17/0x20
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? srso_alias_return_thunk+0x5/0x7f
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? irqentry_exit+0x43/0x50
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? srso_alias_return_thunk+0x5/0x7f
Jan 23 10:58:53 fra-az1-comp-22 kernel: ? exc_page_fault+0x92/0x1b0
Jan 23 10:58:53 fra-az1-comp-22 kernel: entry_SYSCALL_64_after_hwframe+0x73/0xdd
Jan 23 10:58:53 fra-az1-comp-22 kernel: RIP: 0033:0x7f93d811a94f
Jan 23 10:58:53 fra-az1-comp-22 kernel: Code: 00 48 89 44 24 18 31 c0 48 8d 44 24 60 c7 04 24 10 00 00 00 48 89 44 24 08 48 8d 44 24 20 48 89 44 24 10 b8 10 00 00 00 0f 05 <41> 89 c0 3d 00 f0 ff ff 77 1f 48 8b 44 24 18 64 48 2b 04 25 28 00
Jan 23 10:58:53 fra-az1-comp-22 kernel: RSP: 002b:00007f8ec4a89120 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Jan 23 10:58:53 fra-az1-comp-22 kernel: RAX: ffffffffffffffda RBX: 00000000c018aec0 RCX: 00007f93d811a94f
Jan 23 10:58:53 fra-az1-comp-22 kernel: RDX: 00007f8ec4a89300 RSI: ffffffffc018aec0 RDI: 0000000000000058
Jan 23 10:58:53 fra-az1-comp-22 kernel: RBP: 000055bbcc127370 R08: 0000000000040000 R09: 0000000000000000
Jan 23 10:58:53 fra-az1-comp-22 kernel: R10: 00000000000c0000 R11: 0000000000000246 R12: 00007f8ec4a89300
Jan 23 10:58:53 fra-az1-comp-22 kernel: R13: 000055bbcc128430 R14: 0000000000000001 R15: 00007f93d4410058
Jan 23 10:58:53 fra-az1-comp-22 kernel: </TASK>
Jan 23 10:58:53 fra-az1-comp-22 kernel: ---[ end trace 0000000000000000 ]---
[...]
```

The machine runs OpenStack Nova using libvirt (KVM) to run virtual machines. There was only this one trace and no obvious issues with the workload or the machine (before and after migration).

summary: - Kernel trace in arch/x86/kvm/mmu/mmu.c:6362 using KVM live migration
+ Kernel trace in arch/x86/kvm/mmu/mmu.c:6362 during KVM live migration
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.