Kernel panic during checkbox stress_ng_test on Grace running noble 6.8 (arm64+largemem) kernel
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
Fix Committed
|
Undecided
|
Mitchell Augustin | ||
Noble |
Fix Committed
|
Undecided
|
Mitchell Augustin |
Bug Description
A kernel oops and panic occurred during 22.04 SoC certification on Gunyolk (Grace/Grace) with 6.8 kernel, arm64+largemem variant
Steps to reproduce:
Run (as root) the following commands:
add-apt-repository -y ppa:checkbox-
apt-add-repository -y ppa:firmware-
apt update
apt install -y canonical-
/usr/lib/
stress_ng_test caused a kernel panic after about 5 minutes. I have attached dmesg output from my reproducer to this report.
Initially, this was identified via a panic during the above test, which was running as part of a run of certify-soc-22.04.
Attached is a tarball containing:
- apport.
- reproduced-
- original-dmesg.txt: The dmesg output I captured when the stress_ng_test originally failed during the full cert suite run
Changed in linux (Ubuntu): | |
assignee: | nobody → Jose Ogando Justo (joseogando) |
Changed in linux (Ubuntu): | |
assignee: | Jose Ogando Justo (joseogando) → Mitchell Augustin (mitchellaugustin) |
status: | Fix Committed → In Progress |
Changed in linux (Ubuntu Noble): | |
status: | In Progress → Fix Committed |
tags: |
added: verification-done-noble-linux removed: verification-needed-noble-linux |
This is also reproducible on the latest mainline version (https:/ /kernel. ubuntu. com/mainline/ v6.8/arm64/, retrieved 20 Mar 2024 @ 5 PM):
20 Mar 22:54: Running stress-ng aiol stressor for 240 seconds... generic- 64k #202403131158 lock_irqsave+ 0x44/0x100 wake_up+ 0x68/0x758 lock_irqsave+ 0x44/0x100 wake_up+ 0x68/0x758 process+ 0x24/0x50
[ 354.451450] Unable to handle kernel paging request at virtual address 17be9b4aa3e187be
[ 354.459580] Mem abort info:
[ 354.462439] ESR = 0x0000000096000021
[ 354.466274] EC = 0x25: DABT (current EL), IL = 32 bits
[ 354.471703] SET = 0, FnV = 0
[ 354.474819] EA = 0, S1PTW = 0
[ 354.478024] FSC = 0x21: alignment fault
[ 354.482118] Data abort info:
[ 354.485056] ISV = 0, ISS = 0x00000021, ISS2 = 0x00000000
[ 354.490662] CM = 0, WnR = 0, TnD = 0, TagAccess = 0
[ 354.495823] GCS = 0, Overlay = 0, DirtyBit = 0, Xs = 0
[ 354.501251] [17be9b4aa3e187be] address between user and kernel address ranges
[ 354.508548] Internal error: Oops: 0000000096000021 [#1] SMP
[ 354.514245] Modules linked in: qrtr cfg80211 binfmt_misc nls_iso8859_1 input_leds dax_hmem cxl_acpi acpi_ipmi onboard_usb_hub nvidia_cspmu ipmi_ssif cxl_co
re ipmi_devintf arm_cspmu_module arm_smmuv3_pmu ipmi_msghandler uio_pdrv_genirq uio spi_nor cppc_cpufreq joydev mtd acpi_power_meter dm_multipath nvme_fabrics
efi_pstore nfnetlink dmi_sysfs ip_tables x_tables autofs4 btrfs blake2b_generic raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor
xor_neon raid6_pq libcrc32c raid1 raid0 hid_generic rndis_host usbhid cdc_ether hid usbnet uas usb_storage crct10dif_ce polyval_ce polyval_generic ghash_ce s
m4_ce_gcm sm4_ce_ccm sm4_ce sm4_ce_cipher sm4 sm3_ce sm3 nvme sha3_ce i2c_smbus ixgbe sha2_ce nvme_core ast sha256_arm64 xhci_pci sha1_ce xfrm_algo xhci_pci_r
enesas i2c_algo_bit nvme_auth mdio spi_tegra210_quad i2c_tegra aes_neon_bs aes_neon_blk aes_ce_blk aes_ce_cipher
[ 354.594676] CPU: 61 PID: 0 Comm: swapper/61 Kdump: loaded Not tainted 6.8.0-060800-
[ 354.604728] Hardware name: Supermicro MBD-G1SMH/G1SMH, BIOS 1.0c 12/28/2023
[ 354.611844] pstate: 034000c9 (nzcv daIF +PAN -UAO +TCO +DIT -SSBS BTYPE=--)
[ 354.618962] pc : _raw_spin_
[ 354.623863] lr : try_to_
[ 354.628053] sp : ffff8000807afaf0
[ 354.631436] x29: ffff8000807afaf0 x28: 0000000000040000 x27: 0000000000000000
[ 354.638731] x26: ffffa06103dc8a98 x25: ffff8000807afd98 x24: 0000000000000002
[ 354.646027] x23: ffff0000f8156840 x22: 17be9b4aa3e187be x21: 0000000000000000
[ 354.653323] x20: 0000000000000003 x19: 00000000000000c0 x18: ffff8000819a0098
[ 354.660619] x17: 0000000000000000 x16: 0000000000000000 x15: 0000ffffe97dca18
[ 354.667914] x14: 0000000000000000 x13: 0000000000000000 x12: 0000000000000000
[ 354.675208] x11: 0000000000000000 x10: 0000000000000000 x9 : ffffa06100ba6810
[ 354.682504] x8 : 0000000000000000 x7 : 0000004000000000 x6 : 0000000000009080
[ 354.689800] x5 : 0000c2fb0dc488b0 x4 : 0000000000000000 x3 : ffff0000894178c0
[ 354.697096] x2 : 0000000000000001 x1 : 0000000000000000 x0 : 17be9b4aa3e187be
[ 354.704391] Call trace:
[ 354.706886] _raw_spin_
[ 354.711426] try_to_
[ 354.715254] wake_up_
[ 354.719082] aio...