Kernel oopsed and null pointer dereference while running ubuntu_kernel_selftests on Eoan Power8
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
ubuntu-kernel-tests |
Invalid
|
Undecided
|
Unassigned | ||
linux (Ubuntu) |
Invalid
|
Undecided
|
Unassigned |
Bug Description
Issue found on P8 node modoc with Eoan (5.3.0-43.36)
(Note that this test has passed with P9 node baltar without any traces in syslog)
The ubuntu_
15:16:53 DEBUG| [stdout] ok 4 selftests: net: reuseport_dualstack
15:16:53 DEBUG| [stdout] # selftests: net: reuseaddr_conflict
15:16:53 DEBUG| [stdout] # Opening 127.0.0.1:9999
15:16:53 DEBUG| [stdout] # Opening INADDR_ANY:9999
15:16:53 DEBUG| [stdout] # bind: Address already in use
15:16:53 DEBUG| [stdout] # Opening in6addr_any:9999
15:16:53 DEBUG| [stdout] # Opening INADDR_ANY:9999
15:16:53 DEBUG| [stdout] # bind: Address already in use
15:16:53 DEBUG| [stdout] # Opening INADDR_ANY:9999 after closing ipv6 socket
15:16:53 DEBUG| [stdout] # bind: Address already in use
15:16:53 DEBUG| [stdout] # Successok 5 selftests: net: reuseaddr_conflict
15:16:53 DEBUG| [stdout] # selftests: net: tls
15:16:56 DEBUG| [stdout] # tls.c:967:
15:17:26 DEBUG| [stdout] # Alarm clock
15:45:20 INFO | Timer expired (1800 sec.), nuking pid 33351
(And test continues)
It looks like it's the selftests: net: tls that's causing this issue.
If you ssh to the node, the following trace could be found in dmesg:
Injecting error (-12) to MEM_GOING_OFFLINE
Injecting error (-12) to MEM_GOING_OFFLINE
Injecting error (-12) to MEM_GOING_OFFLINE
Oops: Exception in kernel mode, sig: 4 [#1]
LE PAGE_SIZE=64K MMU=Hash SMP NR_CPUS=2048 NUMA PowerNV
Modules linked in: tls binfmt_misc dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua ipmi_powernv uio_pdrv_genirq ipmi_devintf ipmi_msghandler uio powernv_rng ibmpowernv vmx_crypto leds_powernv powernv_op_panel sch_fq_codel ip_tables x_tables autofs4 ses enclosure scsi_transport_sas btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear crct10dif_vpmsum crc32c_vpmsum tg3 ipr [last unloaded: notifier_
CPU: 18 PID: 36045 Comm: tls Not tainted 5.3.0-43-generic #36-Ubuntu
NIP: c00800000a4d6a40 LR: c00800000a4d6a40 CTR: c000000000179270
REGS: c000000fc97837a0 TRAP: 0e40 Not tainted (5.3.0-43-generic)
MSR: 900000000288b033 <SF,HV,
CFAR: c00000000000dfc4 IRQMASK: 0
GPR00: c00800000a4d6a40 c000000fc9783a30 c0000000019d9000 0000000000000000
GPR04: c000000f5afc0000 0000000000000000 c000000fc97839b8 0000000000000000
GPR08: c000000f5afc0000 0000000000000000 0000000000000000 c000000ff9a13780
GPR12: 0000000088002462 c000000ffffeb380 0000000000000000 0000000000000000
GPR16: 0000000000000000 00000ac875961368 00000ac875960d38 00000ac875960d90
GPR20: 00007ffffc6b4ef0 c000000e3996dc48 0000000000000000 0000000000000000
GPR24: c0000000004a9d70 0000000000000000 000000000000ea60 0000000000000000
GPR28: c00c00000398c740 c00000000ab32e70 0000000000000000 c000000f7ecebd00
NIP [c00800000a4d6a40] tls_sw_
LR [c00800000a4d6a40] tls_sw_
Call Trace:
[c000000fc9783a30] [c00800000a4d6a40] tls_sw_
[c000000fc9783a80] [c000000000d7698c] inet_sendpage+
[c000000fc9783ad0] [c000000000c33fd8] kernel_
[c000000fc9783af0] [c000000000c34044] sock_sendpage+
[c000000fc9783b10] [c0000000004a9e8c] pipe_to_
[c000000fc9783b40] [c0000000004ab3d4] __splice_
[c000000fc9783ba0] [c0000000004ad994] splice_
[c000000fc9783c20] [c0000000004a9dbc] direct_
[c000000fc9783c40] [c0000000004aad14] splice_
[c000000fc9783cc0] [c0000000004aaee4] do_splice_
[c000000fc9783d30] [c0000000004565c4] do_sendfile+
[c000000fc9783dd0] [c000000000456b70] sys_sendfile64+
[c000000fc9783e20] [c00000000000b388] system_
Instruction dump:
00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
00000000 00000000 00000000 00000000 <00000000> 00000000 00000000 00000000
---[ end trace 8961ea39a6f2dd08 ]---
BUG: Kernel NULL pointer dereference at 0x00000000
Faulting instruction address: 0xc00000000020bd74
Oops: Kernel access of bad area, sig: 11 [#2]
LE PAGE_SIZE=64K MMU=Hash SMP NR_CPUS=2048 NUMA PowerNV
Modules linked in: tls binfmt_misc dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua ipmi_powernv uio_pdrv_genirq ipmi_devintf ipmi_msghandler uio powernv_rng ibmpowernv vmx_crypto leds_powernv powernv_op_panel sch_fq_codel ip_tables x_tables autofs4 ses enclosure scsi_transport_sas btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear crct10dif_vpmsum crc32c_vpmsum tg3 ipr [last unloaded: notifier_
CPU: 10 PID: 38095 Comm: modprobe Tainted: G D 5.3.0-43-generic #36-Ubuntu
NIP: c00000000020bd74 LR: c0000000007a24f4 CTR: c00000000020bd40
REGS: c000000fb0ad3580 TRAP: 0300 Tainted: G D (5.3.0-43-generic)
MSR: 900000000280b033 <SF,HV,
CFAR: c00000000000dfc4 DAR: 0000000000000000 DSISR: 40000000 IRQMASK: 0
GPR00: c0000000007a24f4 c000000fb0ad3810 c0000000019d9000 c00800000c3b9a39
GPR04: 0000000000000000 0000000000000003 0000000000000010 5f5f636667383032
GPR08: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
GPR12: c00000000020bd40 c000000fffff4780 c000000fb0ad3d70 0000000000000001
GPR16: 000001fde4b4eeb0 0000000000000000 000001fde4b2cfb8 0000000000000000
GPR20: 0000000000000000 c00800000c217370 c000000fb0ad3c30 c000000fb0ad3aa0
GPR24: 0000000000000002 c00800000c3b9a39 c00800000a4de188 0000000000000010
GPR28: c00000000020bd40 0000000000000001 c00800000a4de198 0000000000000001
NIP [c00000000020bd74] cmp_name+0x34/0x190
LR [c0000000007a24f4] bsearch+0x84/0x110
Call Trace:
[c000000fb0ad3810] [c000000fb0ad3d70] 0xc000000fb0ad3d70 (unreliable)
[c000000fb0ad3830] [c00000000004ecc8] apply_relocate_
[c000000fb0ad3890] [c00000000020c418] find_exported_
[c000000fb0ad3920] [c00000000020e308] each_symbol_
[c000000fb0ad3a40] [c00000000020e64c] find_symbol+
[c000000fb0ad3af0] [c000000000214218] load_module+
[c000000fb0ad3d00] [c000000000214b38] __do_sys_
[c000000fb0ad3e20] [c00000000000b388] system_
Instruction dump:
3842d2c0 7c0802a6 60000000 f821ffe1 78690520 e8840008 2c290fc0 40800140
78890520 2c290fc0 40800134 7ce01c28 <7d002428> 39200000 7cea43f8 7ce94bf8
---[ end trace 8961ea39a6f2dd09 ]---
Please find attachment for the complete syslog.
ProblemType: Bug
DistroRelease: Ubuntu 19.10
Package: linux-image-
ProcVersionSign
Uname: Linux 5.3.0-43-generic ppc64le
AlsaDevices:
total 0
crw-rw---- 1 root audio 116, 1 Mar 25 15:09 seq
crw-rw---- 1 root audio 116, 33 Mar 25 15:09 timer
AplayDevices: Error: [Errno 2] No such file or directory: 'aplay': 'aplay'
ApportVersion: 2.20.11-0ubuntu8.7
Architecture: ppc64el
ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord': 'arecord'
AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
CRDA: Error: command ['iw', 'reg', 'get'] failed with exit code 1: nl80211 not found.
Date: Wed Mar 25 15:40:27 2020
IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig': 'iwconfig'
Lsusb:
Bus 004 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
Bus 003 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
Bus 002 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
PciMultimedia:
ProcEnviron:
TERM=xterm-
PATH=(custom, no user)
LANG=C.UTF-8
SHELL=/bin/bash
ProcFB:
ProcKernelCmdLine: root=UUID=
ProcLoadAvg: 5.13 5.02 4.07 1/1493 37887
ProcLocks:
1: POSIX ADVISORY WRITE 3475 00:18:752 0 EOF
2: POSIX ADVISORY WRITE 3689 00:18:851 0 EOF
3: FLOCK ADVISORY WRITE 3656 00:18:820 0 EOF
ProcSwaps:
Filename Type Size Used Priority
/swap.img file 8388544 0 -2
ProcVersion: Linux version 5.3.0-43-generic (buildd@
RelatedPackageV
linux-
linux-
linux-firmware 1.183.5
RfKill: Error: [Errno 2] No such file or directory: 'rfkill': 'rfkill'
SourcePackage: linux
UpgradeStatus: No upgrade log present (probably fresh install)
VarLogDump_list: total 0
cpu_cores: Number of cores present = 20
cpu_coreson: Number of cores online = 20
cpu_dscr: DSCR is 0
cpu_freq:
min: 3.694 GHz (cpu 159)
max: 3.695 GHz (cpu 1)
avg: 3.695 GHz
cpu_runmode:
Could not retrieve current diagnostics mode,
No kernel interface to firmware
cpu_smt: SMT=8
This is what I saw on this Eoan P8 node modoc on the last cycle: /bugs.launchpad .net/ubuntu/ +source/ linux/+ bug/1867155
https:/
Which makes it failed to finish the test.