unshare test in ubuntu_stress_smoke_tests cause "BUG: unable to handle page fault for address" on F-oem-5.14 with Intel node vought
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Stress-ng |
New
|
Undecided
|
Unassigned | ||
ubuntu-kernel-tests |
New
|
Undecided
|
Unassigned | ||
linux-oem-5.14 (Ubuntu) |
New
|
Undecided
|
Unassigned |
Bug Description
Issue found on Focal OEM-5.14.0-1025.27 with Intel node vought
The sut will stop responding after this, and the test will be killed in the end with the test timeout setting.
stress-ng test suite HEAD SHA1: 48be8ff
Mar 1 09:07:00 vought stress-ng: system: 'vought' Linux 5.14.0-1025-oem #27-Ubuntu SMP Thu Feb 24 09:13:19 UTC 2022 x86_64
Mar 1 09:07:00 vought stress-ng: memory (MB): total 359934.79, free 353583.48, shared 2.77, buffer 238.38, swap 9215.99, free swap 9214.70
Mar 1 09:07:00 vought stress-ng: info: [195104] setting to a 5 second run per stressor
Mar 1 09:07:00 vought stress-ng: info: [195104] dispatching hogs: 4 unshare
Mar 1 09:07:00 vought kernel: [ 1061.465476] BUG: unable to handle page fault for address: 0000000000001cc8
Mar 1 09:07:00 vought kernel: [ 1061.465554] #PF: supervisor read access in kernel mode
Mar 1 09:07:00 vought kernel: [ 1061.465596] #PF: error_code(0x0000) - not-present page
Mar 1 09:07:00 vought kernel: [ 1061.465637] PGD 0 P4D 0
Mar 1 09:07:00 vought kernel: [ 1061.465663] Oops: 0000 [#1] SMP NOPTI
Mar 1 09:07:00 vought kernel: [ 1061.465698] CPU: 85 PID: 196061 Comm: stress-ng Tainted: P O 5.14.0-1025-oem #27-Ubuntu
Mar 1 09:07:00 vought kernel: [ 1061.465771] Hardware name: Intel Corporation S2600WFD/S2600WFD, BIOS SE5C620.
Mar 1 09:07:00 vought kernel: [ 1061.465846] RIP: 0010:__
Mar 1 09:07:00 vought kernel: [ 1061.465895] Code: ff ff 84 c0 0f 85 1b 01 00 00 44 89 e0 48 8b 55 b0 8b 75 c4 c1 e8 0c 48 8b 7d a8 83 e0 01 88 45 c8 48 85 d2 0f 85 3e 01 00 00 <3b> 77 08 0f 82 35 01 00 00 48 89 7d b8 48 8b 07 44 89 e2 81 e2 00
Mar 1 09:07:00 vought kernel: [ 1061.466030] RSP: 0018:ffffb66fac
Mar 1 09:07:00 vought kernel: [ 1061.466074] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
Mar 1 09:07:00 vought kernel: [ 1061.466128] RDX: 0000000000000000 RSI: 0000000000000002 RDI: 0000000000001cc0
Mar 1 09:07:00 vought kernel: [ 1061.466182] RBP: ffffb66fac707ba8 R08: 0000000000000000 R09: 0000000000000000
Mar 1 09:07:00 vought kernel: [ 1061.466236] R10: 0000000000000002 R11: ffff94438d0bb730 R12: 0000000000052cc0
Mar 1 09:07:00 vought kernel: [ 1061.466290] R13: 0000000000000002 R14: 0000000000052cc0 R15: 0000000000000001
Mar 1 09:07:00 vought kernel: [ 1061.466343] FS: 00007fbf1cef538
Mar 1 09:07:00 vought kernel: [ 1061.466406] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 1 09:07:00 vought kernel: [ 1061.466452] CR2: 0000000000001cc8 CR3: 000000bd8869a002 CR4: 00000000007706e0
Mar 1 09:07:00 vought kernel: [ 1061.466508] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Mar 1 09:07:00 vought kernel: [ 1061.466562] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Mar 1 09:07:00 vought kernel: [ 1061.466616] PKRU: 55555554
Mar 1 09:07:00 vought kernel: [ 1061.466640] Call Trace:
Mar 1 09:07:00 vought kernel: [ 1061.466662] <TASK>
Mar 1 09:07:00 vought kernel: [ 1061.466687] kmalloc_
Mar 1 09:07:00 vought kernel: [ 1061.466729] __kmalloc_
Mar 1 09:07:00 vought kernel: [ 1061.466766] ? queue_delayed_
Mar 1 09:07:00 vought kernel: [ 1061.466810] kvmalloc_
Mar 1 09:07:00 vought kernel: [ 1061.466843] expand_
Mar 1 09:07:00 vought kernel: [ 1061.466884] prealloc_
Mar 1 09:07:00 vought kernel: [ 1061.468360] alloc_super+
Mar 1 09:07:00 vought kernel: [ 1061.469786] ? __fput_
Mar 1 09:07:00 vought kernel: [ 1061.471204] sget_fc+0x74/0x2e0
Mar 1 09:07:00 vought kernel: [ 1061.472564] ? compare_
Mar 1 09:07:00 vought kernel: [ 1061.473880] ? mqueue_
Mar 1 09:07:00 vought kernel: [ 1061.475139] vfs_get_
Mar 1 09:07:00 vought kernel: [ 1061.476351] get_tree_
Mar 1 09:07:00 vought kernel: [ 1061.477314] mqueue_
Mar 1 09:07:00 vought kernel: [ 1061.478060] vfs_get_
Mar 1 09:07:00 vought kernel: [ 1061.478791] fc_mount+0x13/0x50
Mar 1 09:07:00 vought kernel: [ 1061.479522] mq_create_
Mar 1 09:07:00 vought kernel: [ 1061.480260] mq_init_
Mar 1 09:07:00 vought kernel: [ 1061.481004] copy_ipcs+
Mar 1 09:07:00 vought kernel: [ 1061.481754] create_
Mar 1 09:07:00 vought kernel: [ 1061.482527] unshare_
Mar 1 09:07:00 vought kernel: [ 1061.483306] ksys_unshare+
Mar 1 09:07:00 vought kernel: [ 1061.484096] __x64_sys_
Mar 1 09:07:00 vought kernel: [ 1061.484895] do_syscall_
Mar 1 09:07:00 vought kernel: [ 1061.485680] entry_SYSCALL_
Mar 1 09:07:00 vought kernel: [ 1061.486466] RIP: 0033:0x7fbf1d03bf5b
Mar 1 09:07:00 vought kernel: [ 1061.487243] Code: 73 01 c3 48 8b 0d 35 7f 0c 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa b8 10 01 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 05 7f 0c 00 f7 d8 64 89 01 48
Mar 1 09:07:00 vought kernel: [ 1061.488376] RSP: 002b:00007ffc6c
Mar 1 09:07:00 vought kernel: [ 1061.488923] RAX: ffffffffffffffda RBX: 000000000000000b RCX: 00007fbf1d03bf5b
Mar 1 09:07:00 vought kernel: [ 1061.489459] RDX: 0000000000000004 RSI: 00005648fda918d7 RDI: 0000000008000000
Mar 1 09:07:00 vought kernel: [ 1061.490003] RBP: 00007ffc6ca677e0 R08: 0000000000000000 R09: 00007fbf1cf142d0
Mar 1 09:07:00 vought kernel: [ 1061.490513] R10: 0000000000000000 R11: 0000000000000246 R12: 00007ffc6ca677d8
Mar 1 09:07:00 vought kernel: [ 1061.491010] R13: 00007ffc6ca677d0 R14: 00007ffc6ca67940 R15: 0000000000000020
Mar 1 09:07:00 vought kernel: [ 1061.491499] </TASK>
Mar 1 09:07:00 vought kernel: [ 1061.491977] Modules linked in: unix_diag binfmt_misc snd_timer snd soundcore uhid userio hci_vhci bluetooth ecdh_generic ecc vhost_net tap vhost_vsock vmw_vsock_
Mar 1 09:07:00 vought kernel: [ 1061.492048] scsi_dh_rdac scsi_dh_emc scsi_dh_alua intel_rapl_msr intel_rapl_common isst_if_common dax_pmem_compat nd_pmem device_dax nd_btt dax_pmem_core skx_edac ipmi_ssif x86_pkg_
Mar 1 09:07:00 vought kernel: [ 1061.501754] CR2: 0000000000001cc8
Mar 1 09:07:00 vought kernel: [ 1061.502452] ---[ end trace 349f30fe5376f696 ]---
Mar 1 09:07:00 vought kernel: [ 1061.598697] RIP: 0010:__
Mar 1 09:07:00 vought kernel: [ 1061.599414] Code: ff ff 84 c0 0f 85 1b 01 00 00 44 89 e0 48 8b 55 b0 8b 75 c4 c1 e8 0c 48 8b 7d a8 83 e0 01 88 45 c8 48 85 d2 0f 85 3e 01 00 00 <3b> 77 08 0f 82 35 01 00 00 48 89 7d b8 48 8b 07 44 89 e2 81 e2 00
Mar 1 09:07:00 vought kernel: [ 1061.600790] RSP: 0018:ffffb66fac
Mar 1 09:07:00 vought kernel: [ 1061.601511] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
Mar 1 09:07:00 vought kernel: [ 1061.602282] RDX: 0000000000000000 RSI: 0000000000000002 RDI: 0000000000001cc0
Mar 1 09:07:00 vought kernel: [ 1061.602922] RBP: ffffb66fac707ba8 R08: 0000000000000000 R09: 0000000000000000
Mar 1 09:07:00 vought kernel: [ 1061.603547] R10: 0000000000000002 R11: ffff94438d0bb730 R12: 0000000000052cc0
Mar 1 09:07:00 vought kernel: [ 1061.604175] R13: 0000000000000002 R14: 0000000000052cc0 R15: 0000000000000001
Mar 1 09:07:00 vought kernel: [ 1061.604803] FS: 00007fbf1cef538
Mar 1 09:07:00 vought kernel: [ 1061.605528] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 1 09:07:00 vought kernel: [ 1061.606317] CR2: 0000000000001cc8 CR3: 000000bd8869a002 CR4: 00000000007706e0
Mar 1 09:07:00 vought kernel: [ 1061.606965] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Mar 1 09:07:00 vought kernel: [ 1061.607614] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Mar 1 09:07:00 vought kernel: [ 1061.608260] PKRU: 55555554
This should not be considered as a regression as we used to run this test on another node (spitfire).