unshare test in ubuntu_stress_smoke_tests cause "BUG: unable to handle page fault for address" on F-oem-5.14 with Intel node vought

Bug #1962551 reported by Po-Hsu Lin
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Stress-ng
New
Undecided
Unassigned
ubuntu-kernel-tests
New
Undecided
Unassigned
linux-oem-5.14 (Ubuntu)
New
Undecided
Unassigned

Bug Description

Issue found on Focal OEM-5.14.0-1025.27 with Intel node vought

The sut will stop responding after this, and the test will be killed in the end with the test timeout setting.

stress-ng test suite HEAD SHA1: 48be8ff

Mar 1 09:07:00 vought stress-ng: system: 'vought' Linux 5.14.0-1025-oem #27-Ubuntu SMP Thu Feb 24 09:13:19 UTC 2022 x86_64
Mar 1 09:07:00 vought stress-ng: memory (MB): total 359934.79, free 353583.48, shared 2.77, buffer 238.38, swap 9215.99, free swap 9214.70
Mar 1 09:07:00 vought stress-ng: info: [195104] setting to a 5 second run per stressor
Mar 1 09:07:00 vought stress-ng: info: [195104] dispatching hogs: 4 unshare
Mar 1 09:07:00 vought kernel: [ 1061.465476] BUG: unable to handle page fault for address: 0000000000001cc8
Mar 1 09:07:00 vought kernel: [ 1061.465554] #PF: supervisor read access in kernel mode
Mar 1 09:07:00 vought kernel: [ 1061.465596] #PF: error_code(0x0000) - not-present page
Mar 1 09:07:00 vought kernel: [ 1061.465637] PGD 0 P4D 0
Mar 1 09:07:00 vought kernel: [ 1061.465663] Oops: 0000 [#1] SMP NOPTI
Mar 1 09:07:00 vought kernel: [ 1061.465698] CPU: 85 PID: 196061 Comm: stress-ng Tainted: P O 5.14.0-1025-oem #27-Ubuntu
Mar 1 09:07:00 vought kernel: [ 1061.465771] Hardware name: Intel Corporation S2600WFD/S2600WFD, BIOS SE5C620.86B.0D.01.0395.022720191340 02/27/2019
Mar 1 09:07:00 vought kernel: [ 1061.465846] RIP: 0010:__alloc_pages+0x125/0x310
Mar 1 09:07:00 vought kernel: [ 1061.465895] Code: ff ff 84 c0 0f 85 1b 01 00 00 44 89 e0 48 8b 55 b0 8b 75 c4 c1 e8 0c 48 8b 7d a8 83 e0 01 88 45 c8 48 85 d2 0f 85 3e 01 00 00 <3b> 77 08 0f 82 35 01 00 00 48 89 7d b8 48 8b 07 44 89 e2 81 e2 00
Mar 1 09:07:00 vought kernel: [ 1061.466030] RSP: 0018:ffffb66fac707b50 EFLAGS: 00010246
Mar 1 09:07:00 vought kernel: [ 1061.466074] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
Mar 1 09:07:00 vought kernel: [ 1061.466128] RDX: 0000000000000000 RSI: 0000000000000002 RDI: 0000000000001cc0
Mar 1 09:07:00 vought kernel: [ 1061.466182] RBP: ffffb66fac707ba8 R08: 0000000000000000 R09: 0000000000000000
Mar 1 09:07:00 vought kernel: [ 1061.466236] R10: 0000000000000002 R11: ffff94438d0bb730 R12: 0000000000052cc0
Mar 1 09:07:00 vought kernel: [ 1061.466290] R13: 0000000000000002 R14: 0000000000052cc0 R15: 0000000000000001
Mar 1 09:07:00 vought kernel: [ 1061.466343] FS: 00007fbf1cef5380(0000) GS:ffff94438d080000(0000) knlGS:0000000000000000
Mar 1 09:07:00 vought kernel: [ 1061.466406] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 1 09:07:00 vought kernel: [ 1061.466452] CR2: 0000000000001cc8 CR3: 000000bd8869a002 CR4: 00000000007706e0
Mar 1 09:07:00 vought kernel: [ 1061.466508] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Mar 1 09:07:00 vought kernel: [ 1061.466562] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Mar 1 09:07:00 vought kernel: [ 1061.466616] PKRU: 55555554
Mar 1 09:07:00 vought kernel: [ 1061.466640] Call Trace:
Mar 1 09:07:00 vought kernel: [ 1061.466662] <TASK>
Mar 1 09:07:00 vought kernel: [ 1061.466687] kmalloc_large_node+0x45/0xb0
Mar 1 09:07:00 vought kernel: [ 1061.466729] __kmalloc_node+0x430/0x4f0
Mar 1 09:07:00 vought kernel: [ 1061.466766] ? queue_delayed_work_on+0x36/0x50
Mar 1 09:07:00 vought kernel: [ 1061.466810] kvmalloc_node+0x5c/0x90
Mar 1 09:07:00 vought kernel: [ 1061.466843] expand_shrinker_info+0xfa/0x230
Mar 1 09:07:00 vought kernel: [ 1061.466884] prealloc_shrinker+0xba/0x100
Mar 1 09:07:00 vought kernel: [ 1061.468360] alloc_super+0x2c3/0x340
Mar 1 09:07:00 vought kernel: [ 1061.469786] ? __fput_sync+0x30/0x30
Mar 1 09:07:00 vought kernel: [ 1061.471204] sget_fc+0x74/0x2e0
Mar 1 09:07:00 vought kernel: [ 1061.472564] ? compare_single+0x10/0x10
Mar 1 09:07:00 vought kernel: [ 1061.473880] ? mqueue_create+0x20/0x20
Mar 1 09:07:00 vought kernel: [ 1061.475139] vfs_get_super+0x3d/0x100
Mar 1 09:07:00 vought kernel: [ 1061.476351] get_tree_keyed+0x1d/0x20
Mar 1 09:07:00 vought kernel: [ 1061.477314] mqueue_get_tree+0x1c/0x20
Mar 1 09:07:00 vought kernel: [ 1061.478060] vfs_get_tree+0x2a/0xc0
Mar 1 09:07:00 vought kernel: [ 1061.478791] fc_mount+0x13/0x50
Mar 1 09:07:00 vought kernel: [ 1061.479522] mq_create_mount+0xd9/0x160
Mar 1 09:07:00 vought kernel: [ 1061.480260] mq_init_ns+0x3b/0x50
Mar 1 09:07:00 vought kernel: [ 1061.481004] copy_ipcs+0x138/0x230
Mar 1 09:07:00 vought kernel: [ 1061.481754] create_new_namespaces.isra.0+0x9a/0x2b0
Mar 1 09:07:00 vought kernel: [ 1061.482527] unshare_nsproxy_namespaces+0x61/0xb0
Mar 1 09:07:00 vought kernel: [ 1061.483306] ksys_unshare+0x1ea/0x3d0
Mar 1 09:07:00 vought kernel: [ 1061.484096] __x64_sys_unshare+0x12/0x20
Mar 1 09:07:00 vought kernel: [ 1061.484895] do_syscall_64+0x38/0xc0
Mar 1 09:07:00 vought kernel: [ 1061.485680] entry_SYSCALL_64_after_hwframe+0x44/0xae
Mar 1 09:07:00 vought kernel: [ 1061.486466] RIP: 0033:0x7fbf1d03bf5b
Mar 1 09:07:00 vought kernel: [ 1061.487243] Code: 73 01 c3 48 8b 0d 35 7f 0c 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa b8 10 01 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 05 7f 0c 00 f7 d8 64 89 01 48
Mar 1 09:07:00 vought kernel: [ 1061.488376] RSP: 002b:00007ffc6ca677a8 EFLAGS: 00000246 ORIG_RAX: 0000000000000110
Mar 1 09:07:00 vought kernel: [ 1061.488923] RAX: ffffffffffffffda RBX: 000000000000000b RCX: 00007fbf1d03bf5b
Mar 1 09:07:00 vought kernel: [ 1061.489459] RDX: 0000000000000004 RSI: 00005648fda918d7 RDI: 0000000008000000
Mar 1 09:07:00 vought kernel: [ 1061.490003] RBP: 00007ffc6ca677e0 R08: 0000000000000000 R09: 00007fbf1cf142d0
Mar 1 09:07:00 vought kernel: [ 1061.490513] R10: 0000000000000000 R11: 0000000000000246 R12: 00007ffc6ca677d8
Mar 1 09:07:00 vought kernel: [ 1061.491010] R13: 00007ffc6ca677d0 R14: 00007ffc6ca67940 R15: 0000000000000020
Mar 1 09:07:00 vought kernel: [ 1061.491499] </TASK>
Mar 1 09:07:00 vought kernel: [ 1061.491977] Modules linked in: unix_diag binfmt_misc snd_timer snd soundcore uhid userio hci_vhci bluetooth ecdh_generic ecc vhost_net tap vhost_vsock vmw_vsock_virtio_transport_common vhost vhost_iotlb vsock zfs(PO) zunicode(PO) zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) dccp_ipv4 dccp atm wp512 streebog_generic sm3_generic sha3_generic rmd160 poly1305_generic poly1305_x86_64 nhpoly1305_avx2 nhpoly1305_sse2 nhpoly1305 libpoly1305 michael_mic md4 cmac ccm algif_rng twofish_generic twofish_avx_x86_64 twofish_x86_64_3way twofish_x86_64 twofish_common sm4_generic serpent_avx2 serpent_avx_x86_64 serpent_sse2_x86_64 serpent_generic fcrypt des3_ede_x86_64 des_generic libdes cast6_avx_x86_64 cast6_generic cast5_avx_x86_64 cast5_generic cast_common camellia_generic camellia_aesni_avx2 camellia_aesni_avx_x86_64 camellia_x86_64 blowfish_generic blowfish_x86_64 blowfish_common algif_skcipher algif_hash aegis128 aegis128_aesni algif_aead af_alg nls_iso8859_1 dm_multipath
Mar 1 09:07:00 vought kernel: [ 1061.492048] scsi_dh_rdac scsi_dh_emc scsi_dh_alua intel_rapl_msr intel_rapl_common isst_if_common dax_pmem_compat nd_pmem device_dax nd_btt dax_pmem_core skx_edac ipmi_ssif x86_pkg_temp_thermal intel_powerclamp coretemp irdma ice kvm_intel ib_uverbs joydev input_leds ib_core kvm rapl intel_cstate efi_pstore mei_me mei intel_pch_thermal ioatdma dca acpi_ipmi ipmi_si ipmi_devintf ipmi_msghandler nfit mac_hid sch_fq_codel msr ip_tables x_tables autofs4 btrfs blake2b_generic zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear hid_generic usbhid hid ast drm_vram_helper i2c_algo_bit drm_ttm_helper ttm drm_kms_helper syscopyarea sysfillrect sysimgblt crct10dif_pclmul crc32_pclmul ghash_clmulni_intel fb_sys_fops aesni_intel cec crypto_simd rc_core cryptd i40e drm ahci lpc_ich libahci i2c_i801 xhci_pci i2c_smbus xhci_pci_renesas wmi
Mar 1 09:07:00 vought kernel: [ 1061.501754] CR2: 0000000000001cc8
Mar 1 09:07:00 vought kernel: [ 1061.502452] ---[ end trace 349f30fe5376f696 ]---
Mar 1 09:07:00 vought kernel: [ 1061.598697] RIP: 0010:__alloc_pages+0x125/0x310
Mar 1 09:07:00 vought kernel: [ 1061.599414] Code: ff ff 84 c0 0f 85 1b 01 00 00 44 89 e0 48 8b 55 b0 8b 75 c4 c1 e8 0c 48 8b 7d a8 83 e0 01 88 45 c8 48 85 d2 0f 85 3e 01 00 00 <3b> 77 08 0f 82 35 01 00 00 48 89 7d b8 48 8b 07 44 89 e2 81 e2 00
Mar 1 09:07:00 vought kernel: [ 1061.600790] RSP: 0018:ffffb66fac707b50 EFLAGS: 00010246
Mar 1 09:07:00 vought kernel: [ 1061.601511] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
Mar 1 09:07:00 vought kernel: [ 1061.602282] RDX: 0000000000000000 RSI: 0000000000000002 RDI: 0000000000001cc0
Mar 1 09:07:00 vought kernel: [ 1061.602922] RBP: ffffb66fac707ba8 R08: 0000000000000000 R09: 0000000000000000
Mar 1 09:07:00 vought kernel: [ 1061.603547] R10: 0000000000000002 R11: ffff94438d0bb730 R12: 0000000000052cc0
Mar 1 09:07:00 vought kernel: [ 1061.604175] R13: 0000000000000002 R14: 0000000000052cc0 R15: 0000000000000001
Mar 1 09:07:00 vought kernel: [ 1061.604803] FS: 00007fbf1cef5380(0000) GS:ffff94438d080000(0000) knlGS:0000000000000000
Mar 1 09:07:00 vought kernel: [ 1061.605528] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 1 09:07:00 vought kernel: [ 1061.606317] CR2: 0000000000001cc8 CR3: 000000bd8869a002 CR4: 00000000007706e0
Mar 1 09:07:00 vought kernel: [ 1061.606965] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Mar 1 09:07:00 vought kernel: [ 1061.607614] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Mar 1 09:07:00 vought kernel: [ 1061.608260] PKRU: 55555554

Revision history for this message
Po-Hsu Lin (cypressyew) wrote :

This should not be considered as a regression as we used to run this test on another node (spitfire).

tags: added: 5.14 oem sru-20220221 ubuntu-stress-smoke-test
description: updated
Revision history for this message
Po-Hsu Lin (cypressyew) wrote :

Please find attachment for syslog on node vought.

Revision history for this message
Po-Hsu Lin (cypressyew) wrote :

This can be found on 5.13 kernel as well, see bug 1959215

Revision history for this message
Colin Ian King (colin-king) wrote :

Looks like a kernel bug and not a stress-ng issue to me.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Bug attachments

Remote bug watches

Bug watches keep track of this bug in other bug trackers.