IPv6 TCP in reuseport_bpf_cpu from ubuntu_kernel_selftests/net crash P8 node entei (Oops: Exception in kernel mode, sig: 4 [#1])

Bug #1927076 reported by Po-Hsu Lin
This bug affects 2 people
Affects                           Status      Importance  Assigned to  Milestone
The Ubuntu-power-systems project  Incomplete  Undecided   Unassigned
ubuntu-kernel-tests               New         Undecided   Unassigned
linux (Ubuntu)                    Incomplete  Undecided   Unassigned
  Focal                           Confirmed   Undecided   Unassigned
  Hirsute                         Won't Fix   Undecided   Unassigned
  Impish                          Won't Fix   Undecided   Unassigned

Bug Description

It looks like our P8 node "entei" tends to fail the IPv6 TCP test from reuseport_bpf_cpu in ubuntu_kernel_selftests/net on 5.8 kernels:

 # send cpu 119, receive socket 119
 # send cpu 121, receive socket 121
 # send cpu 123, receive socket 123
 # send cpu 125, receive socket 125
 # send cpu 127, receive socket 127
 # ---- IPv6 TCP ----
publish-job-status: using request.json

It failed silently here. This can be reproduced 100% of the time with Groovy 5.8 and Focal 5.8.

This causes ubuntu_kernel_selftests to be interrupted, so the results of the other tests cannot be processed onto our results page.

Please find attached the complete "net" test result from this node with Groovy 5.8.0-52.59.

Adding the kqa-blocker tag as this might need to be manually verified.

Revision history for this message
Po-Hsu Lin (cypressyew) wrote :
description: updated
tags: added: 5.8 focal groovy kqa-blocker ppc64el sru-20210412 ubuntu-kernel-selftests
description: updated
Revision history for this message
Po-Hsu Lin (cypressyew) wrote :

It looks like this test causes a system reboot, without any suspicious error messages in syslog:

ubuntu@entei:~/autotest/client/tmp/ubuntu_kernel_selftests/src/linux/tools/testing/selftests/net$ sudo ./reuseport_bpf_cpu
....
send cpu 125, receive socket 125
send cpu 127, receive socket 127
---- IPv6 TCP ----
packet_write_wait: Connection to 10.245.71.180 port 22: Broken pipe
(system rebooted)

In syslog:
May 5 06:09:30 entei systemd[1]: motd-news.service: Succeeded.
May 5 06:09:30 entei systemd[1]: Finished Message of the Day.
May 5 06:09:31 entei systemd[1]: apt-daily-upgrade.service: Succeeded.
May 5 06:09:31 entei systemd[1]: Finished Daily apt upgrade and clean activities.
May 5 06:14:32 entei PackageKit: daemon quit
May 5 06:14:32 entei systemd[1]: packagekit.service: Succeeded.
May 5 06:14:53 entei ntpd[42145]: kernel reports TIME_ERROR: 0x2041: Clock Unsynchronized
May 5 06:15:12 entei systemd[1]: Started Session 4 of user ubuntu.
^@^@^@ [... run of NUL bytes written across the unclean reboot ...] May 5 06:21:15 entei systemd-sysctl[1479]: Not setting net/ipv4/conf/all/promote_secondaries (explicit setting exists).
May 5 06:21:15 entei systemd-sysctl[1479]: Not setting net/ipv4/conf/default/promote_secondaries (explicit setting exists).
May 5 06:21:15 entei lvm[1468]: /dev/sdc: open failed: No medium found

System rebooted around 06:15:12

This can be reproduced with 5.8.0-50-generic as well.

Revision history for this message
Po-Hsu Lin (cypressyew) wrote :

OK, it's a combination effect; this issue can be reproduced with the following steps:
1. Run the cpu-hotplug test
   sudo ./autotest/client/tmp/ubuntu_kernel_selftests/src/linux/tools/testing/selftests/cpu-hotplug/cpu-on-off-test.sh
2. Run the reuseport_bpf_cpu test
   sudo ./autotest/client/tmp/ubuntu_kernel_selftests/src/linux/tools/testing/selftests/net/reuseport_bpf_cpu

You may need to run reuseport_bpf_cpu multiple times to trigger this.
But it looks OK if the cpu-hotplug test is not executed first.

[ 287.477797] Oops: Exception in kernel mode, sig: 4 [#1]
[ 287.477841] LE PAGE_SIZE=64K MMU=Hash SMP NR_CPUS=2048 NUMA PowerNV
[ 287.477990] Modules linked in: binfmt_misc dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua joydev input_leds mac_hid ofpart cmdlinepart plx_dma powernv_flash mtd at24 ipmi_powernv uio_pdrv_genirq powernv_rng ipmi_devintf ibmpowernv ipmi_msghandler opal_prd uio vmx_crypto sch_fq_codel ip_tables x_tables autofs4 btrfs blake2b_generic hid_generic raid10 raid456 usbhid uas async_raid6_recov hid async_memcpy async_pq usb_storage async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear ast drm_vram_helper drm_ttm_helper i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops cec rc_core crct10dif_vpmsum crc32c_vpmsum drm ahci tg3 libahci drm_panel_orientation_quirks xhci_pci xhci_pci_renesas
[ 287.478276] CPU: 0 PID: 3267 Comm: reuseport_bpf_c Not tainted 5.8.0-50-generic #56-Ubuntu
[ 287.478294] NIP: c008000001592094 LR: c000000000ea092c CTR: c008000001592094
[ 287.478313] REGS: c0000007ff6eb510 TRAP: 0e40 Not tainted (5.8.0-50-generic)
[ 287.478330] MSR: 900000000288b033 <SF,HV,VEC,VSX,EE,FP,ME,IR,DR,RI,LE> CR: 24002488 XER: 20000000
[ 287.478356] CFAR: c000000000ea0928 IRQMASK: 0
[ 287.478356] GPR00: c000000000ea0b04 c0000007ff6eb7a0 c0000000020dd900 c000000712caf2e0
[ 287.478356] GPR04: c008000001260038 c008000001260000 c000000712caf2e0 0000000000000028
[ 287.478356] GPR08: 0000000129432812 0000000000000000 c00000077f82bd58 0000000000000000
[ 287.478356] GPR12: c008000001592094 c000000002380000 c000000002003e80 00000000000022b8
[ 287.478356] GPR16: 00000000000049c3 000000000000000a 0000000000000001 0000000000000001
[ 287.478356] GPR20: c00000077f82bd48 0000000000000000 00000000000022b8 0000000000000001
[ 287.478356] GPR24: 0000000000000001 0000000000000000 c008000001260000 0000000000000080
[ 287.478356] GPR28: c000000712caf2e0 0000000000000028 0000000000000028 c008000001260000
[ 287.478628] NIP [c008000001592094] 0xc008000001592094
[ 287.478645] LR [c000000000ea092c] __bpf_prog_run_save_cb+0x5c/0x190
[ 287.478660] Call Trace:
[ 287.478671] [c0000007ff6eb7a0] [c000000000f3f84c] __ip_queue_xmit+0x18c/0x4d0 (unreliable)
[ 287.478691] [c0000007ff6eb810] [c000000000ea0b04] run_bpf_filter+0xa4/0x1f0
[ 287.478709] [c0000007ff6eb870] [c000000000ea0cd0] reuseport_select_sock+0x80/0x170
[ 287.478728] [c0000007ff6eb8b0] [c0000000010838ec] inet6_lhash2_lookup+0x1dc/0x200
[ 287.478748] [c0000007ff6eb930] [c000000001083a7c] inet6_lookup_listener+0x16c/0x180
[ 287.478768] [c0000007ff6eba00] [c00000000105e968] tcp_v6_rcv+0x828/0xf50
[ 287.478785] [c000...

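For context on the trace above (inet6_lookup_listener → reuseport_select_sock → run_bpf_filter → the JITed program at the faulting NIP): reuseport_bpf_cpu builds an SO_REUSEPORT group with one listening socket per CPU and attaches a classic BPF filter that returns the current CPU id, which the group uses to pick the receiving socket. The sketch below shows roughly what that attach step looks like; it is illustrative only and simplified from the selftest (no per-CPU bind/listen loop, minimal error handling):

```
/* Hedged sketch: attach a CPU-selecting classic BPF filter to an
 * SO_REUSEPORT socket, roughly what reuseport_bpf_cpu does for its
 * receive group. The per-CPU bind/listen loop is omitted. */
#include <linux/filter.h>      /* struct sock_filter, SKF_AD_OFF, SKF_AD_CPU */
#include <sys/socket.h>
#include <netinet/in.h>
#include <stdio.h>

#ifndef SO_ATTACH_REUSEPORT_CBPF
#define SO_ATTACH_REUSEPORT_CBPF 51   /* asm-generic/socket.h */
#endif

int main(void)
{
	/* cBPF program: A = current CPU id; return A.
	 * The reuseport group uses the return value to index its sockets. */
	struct sock_filter code[] = {
		{ BPF_LD | BPF_W | BPF_ABS, 0, 0, SKF_AD_OFF + SKF_AD_CPU },
		{ BPF_RET | BPF_A, 0, 0, 0 },
	};
	struct sock_fprog prog = {
		.len = sizeof(code) / sizeof(code[0]),
		.filter = code,
	};
	int one = 1;
	int fd = socket(AF_INET6, SOCK_STREAM, 0);

	if (fd < 0) {
		perror("socket");
		return 1;
	}
	if (setsockopt(fd, SOL_SOCKET, SO_REUSEPORT, &one, sizeof(one)) ||
	    setsockopt(fd, SOL_SOCKET, SO_ATTACH_REUSEPORT_CBPF,
		       &prog, sizeof(prog))) {
		perror("setsockopt");
		return 1;
	}
	printf("CPU-selecting cBPF filter attached to fd %d\n", fd);
	return 0;
}
```

It is the JITed form of this kind of filter that the traces show crashing via reuseport_select_sock → run_bpf_filter → __bpf_prog_run_save_cb.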

summary: - IPv6 TCP in reuseport_bpf_cpu from ubuntu_kernel_selftests/net tend to
- fail on P8 node entei with 5.8 kernel
+ IPv6 TCP in reuseport_bpf_cpu from ubuntu_kernel_selftests/net crash P8
+ node entei with 5.8 kernel (Oops: Exception in kernel mode, sig: 4 [#1])
summary: IPv6 TCP in reuseport_bpf_cpu from ubuntu_kernel_selftests/net crash P8
- node entei with 5.8 kernel (Oops: Exception in kernel mode, sig: 4 [#1])
+ node entei on 5.8 kernel (Oops: Exception in kernel mode, sig: 4 [#1])
tags: removed: kqa-blocker
Revision history for this message
Po-Hsu Lin (cypressyew) wrote : Re: IPv6 TCP in reuseport_bpf_cpu from ubuntu_kernel_selftests/net crash P8 node entei on 5.8 kernel (Oops: Exception in kernel mode, sig: 4 [#1])

Removing the kqa-blocker tag, as this can be reproduced with the kernel already in -updates.

Revision history for this message
Ubuntu Kernel Bot (ubuntu-kernel-bot) wrote : Missing required logs.

This bug is missing log files that will aid in diagnosing the problem. While running an Ubuntu kernel (not a mainline or third-party kernel) please enter the following command in a terminal window:

apport-collect 1927076

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
bugproxy (bugproxy)
tags: added: architecture-ppc64le bugnameltc-192677 severity-high targetmilestone-inin---
Revision history for this message
Andrew Cloke (andrew-cloke) wrote : Re: IPv6 TCP in reuseport_bpf_cpu from ubuntu_kernel_selftests/net crash P8 node entei on 5.8 kernel (Oops: Exception in kernel mode, sig: 4 [#1])

Since the groovy 5.8 kernel is now EOL, can this be reproduced with the 5.11 kernel?

Or can we close this bug out?

Changed in ubuntu-power-systems:
status: New → Incomplete
Revision history for this message
Po-Hsu Lin (cypressyew) wrote :

Hi Andrew,

I just retested this manually on node entei with the steps in comment #3, and this issue can be reproduced (the system gets rebooted) with a different message on the IPMI console.

[ 417.696448] BUG: Unable to handle kernel instruction fetch (NULL pointer?)
[ 417.696522] Faulting instruction address: 0x00000000
[ 417.696677] Oops: Kernel access of bad area, sig: 11 [#1]
[ 417.696693] LE PAGE_SIZE=64K MMU=Hash SMP NR_CPUS=2048 NUMA PowerNV
[ 417.696715] Modules linked in: binfmt_misc dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua joydev input_leds mac_hid ofpart plx_dma cmdlinepart ipmi_powernv powernv_flash ipmi_devintf ibmpowernv at24 vmx_crypto opal_prd ipmi_msghandler powernv_rng mtd uio_pdrv_genirq uio sch_fq_codel ip_tables x_tables autofs4 btrfs blake2b_generic uas raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor hid_generic usbhid hid usb_storage async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear ast drm_vram_helper i2c_algo_bit drm_ttm_helper ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops crct10dif_vpmsum cec crc32c_vpmsum rc_core drm ahci tg3 xhci_pci libahci drm_panel_orientation_quirks xhci_pci_renesas
[ 417.697008] CPU: 0 PID: 3117 Comm: reuseport_bpf_c Not tainted 5.11.0-27-generic #29~20.04.1-Ubuntu
[ 417.697034] NIP: 0000000000000000 LR: c000000000e77ba8 CTR: 0000000000000000
[ 417.697055] REGS: c0000007ff6e74d0 TRAP: 0400 Not tainted (5.11.0-27-generic)
[ 417.697077] MSR: 9000000040009033 <SF,HV,EE,ME,IR,DR,RI,LE> CR: 28022444 XER: 20000000
[ 417.697309] CFAR: c000000000010300 IRQMASK: 0
[ 417.697309] GPR00: c000000000e77b80 c0000007ff6e7770 c000000001e99600 c000000014cecd00
[ 417.697309] GPR04: c008000004230038 c000000014cecd00 0000000000000008 0000000000000001
[ 417.697309] GPR08: 0000000000000001 0000000000000000 c0000000501e9580 0000000000000000
[ 417.697309] GPR12: 0000000000000000 c000000002150000 0000000000000000 0000000000000000
[ 417.697309] GPR16: 0000000000000040 c00000078e09a480 0000000000000001 0000000000000001
[ 417.697309] GPR20: 00000000000022b8 0000000000000000 000000000000cfb3 000000000100007f
[ 417.697309] GPR24: 0000000000000000 0000000000000008 c000000001dba880 c008000004230000
[ 417.697309] GPR28: 0000000000000080 c000000003292000 0000000090dd40dc c000000014cecd00
[ 417.697503] NIP [0000000000000000] 0x0
[ 417.697517] LR [c000000000e77ba8] reuseport_select_sock+0x108/0x3f0
[ 417.697541] Call Trace:
[ 417.697550] [c0000007ff6e7810] [c000000000f64314] udp4_lib_lookup2+0x1a4/0x2b0
[ 417.697576] [c0000007ff6e7890] [c000000000f65928] __udp4_lib_lookup+0x358/0x540
[ 417.697602] [c0000007ff6e79d0] [c000000000f66978] __udp4_lib_rcv+0x608/0xe10
[ 417.697626] [c0000007ff6e7af0] [c000000000f0fa20] ip_protocol_deliver_rcu+0x60/0x2c0
[ 417.697813] [c0000007ff6e7b40] [c000000000f0fcf0] ip_local_deliver_finish+0x70/0x90
[ 417.697838] [c0000007ff6e7b60] [c000000000f0fda0] ip_local_deliver+0x90/0x180
[ 417.697861] [c0000007ff6e7be0] [c000000000f0f140] ip_rcv_finish+0xc0/0xf0
[ 417.697883] [c0000007ff6e7c20] [c000000000f0ffa8] ip_rcv+0x118/0x130
[ 417.697904] [c0000007ff6e7ca0] [c000000000e3a3b4] __netif_receive_skb_one_...


Po-Hsu Lin (cypressyew)
Changed in ubuntu-power-systems:
status: Incomplete → Confirmed
Revision history for this message
Po-Hsu Lin (cypressyew) wrote :

This issue can be reproduced on P8 node entei with:
  * F-5.4 (5.4.0-81-generic)
  * F-5.11 (5.11.0-27-generic #29~20.04.1-Ubuntu)
  * H-5.11 (5.11.0-31-generic)

Revision history for this message
Patricia Domingues (patriciasd) wrote :

Po-Hsu Lin,
Thanks for the info.
I was able to reproduce the issue on 2 other Power8 servers, but only by running the `reuseport_bpf_cpu` test more than once (as you mentioned in comment #3).
I've tested this with focal-hwe (Linux thiel 5.11.0-27-generic) and hirsute (5.11.0-31-generic).
steps:
1. Run the cpu-hotplug test; 2. Run the reuseport_bpf_cpu test; 3. re-run the reuseport_bpf_cpu test.

```
thiel login:
[24669.414656] Oops: Exception in kernel mode, sig: 4 [#1]
[24669.414710] LE PAGE_SIZE=64K MMU=Hash SMP NR_CPUS=2048 NUMA PowerNV
```

```
gulpin login:
[277274.876010] Oops: Exception in kernel mode, sig: 4 [#1]
[277274.876235] LE PAGE_SIZE=64K MMU=Hash SMP NR_CPUS=2048 NUMA PowerNV
```

I've also tested the 2 servers above with the upstream kernel from `https://kernel.ubuntu.com/~kernel-ppa/mainline/v5.11.22/`.
I ran the `reuseport_bpf_cpu` test in a 20x loop and did not hit the issue, so I'd say we may have an issue with the Ubuntu kernel.
All 20 runs reported success and the server did not reboot.
```
send cpu 157, receive socket 157
send cpu 159, receive socket 159
SUCCESS
```

Revision history for this message
Krzysztof Kozlowski (krzk) wrote :

Can you repeat the tests with the latest kernels? I could not reproduce it on Focal with the following configurations:
1. Power8: P8LPAR05 MAAS
2. Power9: QEMU with 4 or 128 CPUs (and 4 GB of RAM)

Tested kernels:
F/5.4.0-84-generic
F/5.11.0-34-generic

Tried steps:
1. Freshly boot machine.
2. Log in via ssh.
3. sudo ./autotest/client/tmp/ubuntu_kernel_selftests/src/linux/tools/testing/selftests/cpu-hotplug/cpu-on-off-test.sh

4. for i in `seq 100`; do echo $i ; sleep 2 ; sudo ./autotest/client/tmp/ubuntu_kernel_selftests/src/linux/tools/testing/selftests/net/reuseport_bpf_cpu ; done

Revision history for this message
Patricia Domingues (patriciasd) wrote :
Revision history for this message
Patricia Domingues (patriciasd) wrote :
Revision history for this message
Patricia Domingues (patriciasd) wrote :

OK, I've re-run the test with the latest kernel versions on the same systems:

`thiel` (8001-22C) with focal-hwe (5.11.0-34-generic):
```
[ 3255.763649] Oops: Exception in kernel mode, sig: 5 [#1]
[ 3255.763723] LE PAGE_SIZE=64K MMU=Hash SMP NR_CPUS=2048 NUMA PowerNV
```

And

`gulpin` (8335-GTA) with hirsute (5.11.0-34-generic)
2nd run of `reuseport_bpf_cpu`:
```
[ 760.451968] BUG: Unable to handle kernel instruction fetch (NULL pointer?)
[ 760.452035] Faulting instruction address: 0x00000000
[ 760.452196] Oops: Kernel access of bad area, sig: 11 [#1]
[ 760.452212] LE PAGE_SIZE=64K MMU=Hash SMP NR_CPUS=2048 NUMA PowerNV
```

Revision history for this message
Patricia Domingues (patriciasd) wrote :

Also re-ran on `entei`, which is also a POWER8 (8335-GTA), with the latest Hirsute kernel (5.11.0-34-generic).

Hit the same error on the second run of `reuseport_bpf_cpu`:
```
[ 232.349547] Oops: Exception in kernel mode, sig: 4 [#1]
[ 232.349647] LE PAGE_SIZE=64K MMU=Hash SMP NR_CPUS=2048 NUMA PowerNV
...
[ 232.355607] LR [000008d19f3b15a8] 0x8d19f3b15a8
[ 232.355855] --- interrupt: c00
[ 232.355869] Instruction dump:
[ 232.356114] XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX
[ 232.356374] XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX
[ 232.356905] ---[ end trace c99c88cea832039b ]---
[ 232.508560]
[ 233.508662] Kernel panic - not syncing: Aiee, killing interrupt handler!
[ 233.950570] Rebooting in 10 seconds..

```

Revision history for this message
Patricia Domingues (patriciasd) wrote :
Revision history for this message
Patricia Domingues (patriciasd) wrote :

Krzysztof, you were on a PowerVM LPAR (P8LPAR05); just let me know if there's anything that needs to be tested.

Revision history for this message
Thadeu Lima de Souza Cascardo (cascardo) wrote :

Hi, Patricia.

Can you clarify whether you always need to run the hotplug test before reuseport_bpf_cpu in order to reproduce? I wonder what state the hotplug test leaves the system in, and how it is being run. The makefile runs it with the -a option, which on the systems I have available fails to offline the last CPU (which is expected, and different from x86, where cpu0 cannot be offlined). Running it without any options would only offline the last CPU and online it again.

I tried looking for differences between our kernels and 5.11.22 and the only cpuset changes I noticed were already present in the kernel you have just tested, and they were on paths unrelated to hotplug or BPF, so I am still baffled as to the real differences here.

And given that the different systems fail differently, it looks like this will require a dump or xmon so we can debug it.

Thanks for all the help. I may ask for system access next week in order to help there.
Cascardo.

Revision history for this message
Patricia Domingues (patriciasd) wrote :

Cascardo, I was trying to reproduce the issue as Po-Hsu Lin mentioned (#3); the hotplug test leaves the system in the same state (it shows this output):
```
./cpu-hotplug/cpu-on-off-test.sh
pid 21291's current affinity mask: ffffffffffffffffffffffffffffffff
pid 21291's new affinity mask: 1
CPU online/offline summary:
present_cpus = 0-127 present_max = 127
  Cpus in online state: 0-127
  Cpus in offline state: 0
Limited scope test: one hotplug cpu
  (leaves cpu in the original state):
  online to offline to online: cpu 127
```
I was running this way:
```
ubuntu@gulpin:~/ubuntu-hirsute/tools/testing/selftests$
make TARGETS=net
./cpu-hotplug/cpu-on-off-test.sh
./net/reuseport_bpf_cpu
```
Let me know if there's anything else needed.
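For reference, the "Limited scope test" step shown in the output above boils down to taking one CPU offline and bringing it back online through sysfs. A hedged userspace sketch of that step (CPU 127 is taken from the output above; this needs root):

```
/* Hedged sketch: toggle one CPU offline and back online via sysfs,
 * the same state change the limited-scope hotplug test performs. */
#include <stdio.h>

static int set_cpu_online(int cpu, int online)
{
	char path[64];
	FILE *f;

	snprintf(path, sizeof(path),
		 "/sys/devices/system/cpu/cpu%d/online", cpu);
	f = fopen(path, "w");
	if (!f) {
		perror(path);
		return -1;
	}
	fprintf(f, "%d\n", online);
	return fclose(f);	/* 0 on success */
}

int main(void)
{
	int cpu = 127;		/* last CPU, as in the output above */

	if (set_cpu_online(cpu, 0))	/* offline */
		return 1;
	if (set_cpu_online(cpu, 1))	/* back online */
		return 1;
	printf("cpu%d: offline -> online cycle done\n", cpu);
	return 0;
}
```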

Revision history for this message
Krzysztof Kozlowski (krzk) wrote :

Thanks, Patricia, for the tests. It looks like this was seen before: lp:1909286. I will mark it as a duplicate.

Revision history for this message
Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux (Ubuntu Hirsute):
status: New → Confirmed
Revision history for this message
Kleber Sacilotto de Souza (kleber-souza) wrote :

This is also failing with focal/linux on the node dryden.

The latest result is from 5.4.0-85.95, which stopped around this test:

04:57:26 DEBUG| [stdout] # selftests: net: reuseport_bpf_cpu
[...]
04:57:26 DEBUG| [stdout] # ---- IPv6 UDP ----
[...]
04:57:26 DEBUG| [stdout] # send cpu 145, receive socket 145
04:57:26 DEBUG| [stdout] # send cpu 147, receive socket 147
04:57:26 DEBUG| [stdout] # send cpu 149, receive socket 149
04:57:26 DEBUG| [stdout] # send cpu 151, receive socket 151
04:57:26 DEBUG| [stdout] # ---- IPv4 TCP ----

This is from an automated test run, so I don't have access to the kernel logs.

I have found the issue in all the regression test runs on this system, going back to the oldest results we still have (5.4.0-76.85).

Changed in linux (Ubuntu Focal):
status: New → Confirmed
Revision history for this message
Thadeu Lima de Souza Cascardo (cascardo) wrote :

I was looking at the latest changes to the powerpc64 BPF JIT since 5.11 and found the following commit:

commit 20ccb004bad659c186f9091015a956da220d615d
Author: Naveen N. Rao <email address hidden>
Date: Wed Jun 9 14:30:24 2021 +0530

    powerpc/bpf: Use bctrl for making function calls

    blrl corrupts the link stack. Instead use bctrl when making function
    calls from BPF programs.

    Reported-by: Anton Blanchard <email address hidden>
    Signed-off-by: Naveen N. Rao <email address hidden>
    Signed-off-by: Michael Ellerman <email address hidden>
    Link: https://<email address hidden>

Though the link stack is unarchitected (that is, it should be transparent to the user aside from branch prediction performance), perhaps there is a bug in the implementation. Considering we have only observed this on POWER8, and with different stack traces, I wouldn't discard the possibility.

As this is not present on 5.13 either, I am building a test kernel with a backport so it can be tested.

Cascardo.

Revision history for this message
Thadeu Lima de Souza Cascardo (cascardo) wrote :
Revision history for this message
Krzysztof Kozlowski (krzk) wrote :

Thadeu, it is present on v5.13 (tested v5.13.17).

Revision history for this message
Thadeu Lima de Souza Cascardo (cascardo) wrote :

Krzysztof mentioned that this has been found on 5.14 as well. Using a system he lent me (huggins), I also tested with the commit that changed the call to use CTR, and it failed as well. But it always failed when __bpf_prog_run_save_cb was calling the JITed bpf_func, and CTR always matched NIP (though in that case, it is the CTR from __bpf_prog_run_save_cb, not the JITed code). Sometimes it was NULL (all zeroes), sometimes it looked like a legit kernel address, and I got one 0xfe800000fe80000000 (or something like it), which looks like some corruption of the bpf_prog.

Also, I noticed it doesn't always happen on CPU 0, which would be odd on its own, though CPU 0 seems more likely. And either it's very hard to reproduce without doing the CPU hotplug, or the hotplug really is necessary; I left the program running in a loop for a long time without it and did not have any luck.

I also changed it to an eBPF program instead of cBPF, but still of the socket filter type, and used get_smp_processor_id instead of raw_processor_id (though I recall these being the same on ppc64el), and it still reproduced. When I returned a constant instead of doing the call, it also reproduced. No wonder: when it fails, the program never runs. So the way those programs are compiled makes no difference (a hedged sketch of such an eBPF variant follows below).

Cascardo.
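The eBPF variant mentioned above (still a socket filter, just returning the current CPU id from a helper call) can be built with the raw bpf() syscall along these lines. This is an illustrative, hedged sketch, not the exact test change; the two-instruction program and SO_ATTACH_REUSEPORT_EBPF are standard, but the surrounding setup is simplified:

```
/* Hedged sketch: load a two-instruction eBPF socket filter that returns
 * bpf_get_smp_processor_id() and attach it to an SO_REUSEPORT socket. */
#include <linux/bpf.h>
#include <sys/syscall.h>
#include <sys/socket.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>
#include <unistd.h>

#ifndef SO_ATTACH_REUSEPORT_EBPF
#define SO_ATTACH_REUSEPORT_EBPF 52   /* asm-generic/socket.h */
#endif

int main(void)
{
	/* r0 = bpf_get_smp_processor_id(); exit; */
	struct bpf_insn prog[] = {
		{ .code = BPF_JMP | BPF_CALL,
		  .imm  = BPF_FUNC_get_smp_processor_id },
		{ .code = BPF_JMP | BPF_EXIT },
	};
	union bpf_attr attr;
	int one = 1, bpf_fd, sk;

	memset(&attr, 0, sizeof(attr));
	attr.prog_type = BPF_PROG_TYPE_SOCKET_FILTER;
	attr.insn_cnt  = sizeof(prog) / sizeof(prog[0]);
	attr.insns     = (uintptr_t)prog;
	attr.license   = (uintptr_t)"GPL";

	bpf_fd = syscall(__NR_bpf, BPF_PROG_LOAD, &attr, sizeof(attr));
	if (bpf_fd < 0) {
		perror("BPF_PROG_LOAD");
		return 1;
	}

	sk = socket(AF_INET6, SOCK_STREAM, 0);
	if (sk < 0 ||
	    setsockopt(sk, SOL_SOCKET, SO_REUSEPORT, &one, sizeof(one)) ||
	    setsockopt(sk, SOL_SOCKET, SO_ATTACH_REUSEPORT_EBPF,
		       &bpf_fd, sizeof(bpf_fd))) {
		perror("attach");
		return 1;
	}
	printf("eBPF CPU-id filter attached\n");
	return 0;
}
```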

Revision history for this message
Thadeu Lima de Souza Cascardo (cascardo) wrote (last edit ):

I tested that when reuseport_bpf_cpu did not consider the last CPU (the one that had been hotplugged), it didn't crash. It didn't set affinity to that CPU and didn't even allocate a socket for it.

Then I realized that attaching the BPF code was happening on that CPU, as it happened right after the tests were run and the last test had set the CPU affinity to that CPU. So I set the affinity to CPU 0 right before attaching the BPF code. So far, the system did not crash either.

Cascardo.

Scratch that. It looks like the system was not reproducing the issue until I rebooted and tested it again. The last test didn't work out; that is, even setting affinity to a different CPU before attaching the BPF program resulted in a crash.
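The affinity experiment described above (pin the process to CPU 0 before attaching the BPF program, so the attach does not run on the freshly re-onlined CPU) amounts to roughly this hedged sketch:

```
/* Hedged sketch: pin the calling thread to CPU 0 before doing the
 * SO_ATTACH_REUSEPORT_{C,E}BPF call, as in the experiment above. */
#define _GNU_SOURCE
#include <sched.h>
#include <stdio.h>

static int pin_to_cpu0(void)
{
	cpu_set_t set;

	CPU_ZERO(&set);
	CPU_SET(0, &set);
	if (sched_setaffinity(0, sizeof(set), &set)) {
		perror("sched_setaffinity");
		return -1;
	}
	return 0;
}

int main(void)
{
	if (pin_to_cpu0())
		return 1;
	/* ... socket setup and BPF attach (see the earlier sketches) ... */
	printf("pinned to CPU 0\n");
	return 0;
}
```

As noted above, this did not end up avoiding the crash.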

Revision history for this message
Po-Hsu Lin (cypressyew) wrote :

I will mark the other 2 bugs as duplicates of this one, since Thadeu has provided some further investigation here.

Revision history for this message
Krzysztof Kozlowski (krzk) wrote (last edit ):

Since this is the leading bug, I will also copy & paste here the list of environments where this reproduces, taken from the other bug:

Also reproduced on (huggins, POWER8NVL, 8335-GTB):
* 5.11.0-20-generic mainline (v5.11.22).
* 5.13.17-051317-generic mainline fails even on first run of reuseport_bpf_cpu test.
* 5.14.4-051404-generic mainline after 4 tries of the test.

Revision history for this message
Thadeu Lima de Souza Cascardo (cascardo) wrote :

9f:mon> di c008000013566000 1000
c008000013566000 7fe00008 trap
 ...
c008000013566eb8 60000000 nop
 ...
c008000013566ec0 7c0802a6 mflr r0
c008000013566ec4 f8010010 std r0,16(r1)
c008000013566ec8 f821ffa1 stdu r1,-96(r1)
c008000013566ecc 3d80c000 lis r12,-16384
c008000013566ed0 798c07c6 rldicr r12,r12,32,31
c008000013566ed4 658c0036 oris r12,r12,54
c008000013566ed8 618c51e0 ori r12,r12,20960
c008000013566edc 7d8903a6 mtctr r12
c008000013566ee0 4e800421 bctrl
c008000013566ee4 7c681b78 mr r8,r3
c008000013566ee8 38210060 addi r1,r1,96
c008000013566eec e8010010 ld r0,16(r1)
c008000013566ef0 7c0803a6 mtlr r0
c008000013566ef4 7d034378 mr r3,r8
c008000013566ef8 4e800020 blr
c008000013566efc 7fe00008 trap
 ...
9f:mon> r
R00 = c0000000000173d8 R16 = c000007fe13e8cb0
R01 = c0000040074efda0 R17 = c000007f8f6a0000
R02 = c0000000022d9900 R18 = c000007f8f6a0080
R03 = c0000040074efbd8 R19 = c000007f8f6a0080
R04 = 0000000000200000 R20 = c0000000012ff767
R05 = 0000000000030000 R21 = c000007f8f6a0080
R06 = 0000000000020000 R22 = c00000000136cc78
R07 = 0000000000517782 R23 = 0000000000000001
R08 = 0000001ebf7e3a55 R24 = 000000000000009f
R09 = 0000000000000000 R25 = 0000000000000e60
R10 = 0000000000000001 R26 = 0000000000000900
R11 = 0000000000000f8e R27 = 0000000000000500
R12 = 0000000000004400 R28 = 0000000000000a00
R13 = c000007fff6f4f80 R29 = 0000000000000f00
R14 = 0000000000000000 R30 = 0000000000000002
R15 = c0000000012f6020 R31 = 0000000000000003
pc = c000000000017038 replay_soft_interrupts+0x68/0x2e0
cfar= 0000000000000000
lr = c0000000000173d8 arch_local_irq_restore+0x128/0x160
msr = 9000000000001033 cr = 24004428
ctr = c000000000042468 xer = 0000000020000000 trap = 500
9f:mon> c0
[link register ] c000000000f36d4c __bpf_prog_run_save_cb+0x5c/0x190
[c000003ffffa3780] c000000000fdf76c __ip_finish_output+0x8c/0x140 (unreliable)
[c000003ffffa37f0] c000000000f36f2c run_bpf_filter+0xac/0x200
[c000003ffffa3850] c000000000f37104 reuseport_select_sock+0x84/0x170
[c000003ffffa3890] c00000000112c1f8 inet6_lhash2_lookup+0x1c8/0x200
[c000003ffffa3910] c00000000112c48c inet6_lookup_listener+0x25c/0x280
[c000003ffffa3a00] c000000001105e58 tcp_v6_rcv+0x7b8/0xf50
[c000003ffffa3b50] c0000000010b79c0 ip6_protocol_deliver_rcu+0x110/0x630
[c000003ffffa3bc0] c0000000010b803c ip6_input+0x10c/0x130
[c000003ffffa3c40] c0000000010b76c4 ipv6_rcv+0x194/0x1c0
[c000003ffffa3cc0] c000000000ef68f4 __netif_receive_skb_one_core+0x74/0xb0
[c000003ffffa3d10] c000000000ef6d68 process_backlog+0x138/0x280
[c000003ffffa3d80] c000000000ef7e00 napi_poll+0x100/0x3c0
[c000003ffffa3e10] c000000000ef81b4 net_rx_action+0xf4/0x2d0
[c000003ffffa3ea0] c0000000011815f0 __do_softirq+0x150/0x428
[c000003ffffa3f90] c00000000002caec call_do_softirq+0x14/0x24
[c000000008fcf630] c000000000017448 do_softirq_own_stack+0x38/0x50
[c000000008fcf650] c0000000001640a0 do_softirq+0xa0/0xb0
[c000000008fcf680] c0000000001641a8 __local_bh_enable_ip+0xf8/0x120
[c000000008fcf6a0] c0000000010b1a98 ip6_finish_output2+0x248/0x7c0
[c...


Revision history for this message
Thadeu Lima de Souza Cascardo (cascardo) wrote :

About this latest comment: CPU #0 crashed at pc = c008000013566eb8, its CTR and r12 match, same as usual, and it was called by __bpf_prog_run_save_cb as the BPF JITed program. Dumping the program from CPU #0's perspective, it has traps at that address.

It turns out the JIT fills a whole page with traps and puts the JITed BPF program at a random offset within that page (see kernel/bpf/core.c:bpf_jit_binary_alloc).

When we go to the hotplugged CPU, however, CPU #9f (159), that same page looks different, with the code placed where it was expected.

Still, it looks like fp->aux->jit_data is NULL on both CPUs, which is not as expected.

I am wondering whether either the icache is not being flushed properly or RCU is not operating correctly. As other issues are not seen, it is more likely something related to the icache. But I don't see any IPIs involved when flushing the icache, so is it possibly firmware or micro-architecture related?

Cascardo.
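To make the layout described above concrete: the allocation the JIT hands back is pre-filled with trap instructions and the image sits at a randomized offset inside it, so a CPU that still sees a stale or inconsistent view of that memory fetches traps or garbage instead of the program, which would be consistent with the mix of sig: 4, sig: 5 and NULL-instruction-fetch oopses in this bug. The toy below is a userspace illustration only, not kernel code; the trap encoding and the sample instructions are copied from the xmon dump above, and the 64K size matches PAGE_SIZE=64K from the oops headers:

```
/* Userspace illustration of the allocation layout described above --
 * not kernel code. A 64K buffer is filled with the PPC "trap" encoding
 * and a small image is copied to a randomized, aligned offset; any
 * stale view of the buffer decodes as traps rather than the image. */
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

#define BUF_SZ   (64 * 1024)	/* PAGE_SIZE=64K per the oops headers */
#define PPC_TRAP 0x7fe00008u	/* "trap" opcode seen in the xmon dump */

int main(void)
{
	/* A few real instructions from the dump above:
	 * mflr r0; std r0,16(r1); stdu r1,-96(r1); blr */
	uint32_t image[] = { 0x7c0802a6, 0xf8010010, 0xf821ffa1, 0x4e800020 };
	uint32_t *buf = malloc(BUF_SZ);
	size_t i, off;

	if (!buf)
		return 1;
	for (i = 0; i < BUF_SZ / sizeof(uint32_t); i++)
		buf[i] = PPC_TRAP;	/* pre-fill with traps, as observed in the dump */

	srand(1);			/* fixed seed, demo only */
	off = ((size_t)rand() % (BUF_SZ - sizeof(image))) & ~63UL;
	memcpy((char *)buf + off, image, sizeof(image));

	printf("image placed at offset 0x%zx; everything else decodes as trap\n", off);
	free(buf);
	return 0;
}
```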

Revision history for this message
Krzysztof Kozlowski (krzk) wrote :
Revision history for this message
Thadeu Lima de Souza Cascardo (cascardo) wrote :

Sent a request upstream:

https://lore.kernel.org/linuxppc-dev/YUpIqytZqpohq4EM@mussarela/T/#u

I will ping some folks for some help there.

Cascardo.

Revision history for this message
Po-Hsu Lin (cypressyew) wrote :

Hey Krzysztof and Thadeu,
Thanks for the follow-up and the info!

bugproxy (bugproxy)
tags: added: bugnameltc-194783 severity-medium
removed: bugnameltc-192677 severity-high
Po-Hsu Lin (cypressyew)
summary: IPv6 TCP in reuseport_bpf_cpu from ubuntu_kernel_selftests/net crash P8
- node entei on 5.8 kernel (Oops: Exception in kernel mode, sig: 4 [#1])
+ node entei (Oops: Exception in kernel mode, sig: 4 [#1])
bugproxy (bugproxy)
tags: added: bugnameltc-192677 severity-high
removed: bugnameltc-194783 severity-medium
Revision history for this message
Daniel Axtens (daxtens) wrote (last edit ):

I can repro this with the latest Focal kernel (5.4.0-90) on:

    description: PowerNV
    product: 8247-22L (IBM Power System S822L)

Trying to see if I can repro it upstream.

FWIW my opening hypothesis is that something in a percpu data structure isn't getting updated over hotplug.

Revision history for this message
Daniel Axtens (daxtens) wrote :

I can repro on upstream, all the way back to 5.4.0. It might have existed before that - I haven't tested any earlier yet.

Was the test methodology changed just before this was found? I'm just wondering why it suddenly appeared ~a year after Focal was released. I thought it might have been a patch picked up for an SRU, but it's looking like the problem predates Focal by some way...

Revision history for this message
bugproxy (bugproxy) wrote : Comment bridged from LTC Bugzilla

I can repro this with the latest Focal kernel on:

Revision history for this message
Daniel Axtens (daxtens) wrote :

I've made some good progress here.

I found that older versions like 4.19 work, so I ran git bisect. I'm still doing the final check, but it looks like the series that causes the issue is the one containing these:

d53d2f78cead bpf: Use vmalloc special flag
1a7b7d922081 modules: Use vmalloc special flag
868b104d7379 mm/vmalloc: Add flag for freeing of special permsissions

In particular:

commit 868b104d7379e28013e9d48bdd2db25e0bdcf751 (HEAD)
Author: Rick Edgecombe <email address hidden>
Date: Thu Apr 25 17:11:36 2019 -0700

    mm/vmalloc: Add flag for freeing of special permsissions

    Add a new flag VM_FLUSH_RESET_PERMS, for enabling vfree operations to
    immediately clear executable TLB entries before freeing pages, and handle
    resetting permissions on the directmap. This flag is useful for any kind
    of memory with elevated permissions, or where there can be related
    permissions changes on the directmap. Today this is RO+X and RO memory.

    Although this enables directly vfreeing non-writeable memory now,
    non-writable memory cannot be freed in an interrupt because the allocation
    itself is used as a node on deferred free list. So when RO memory needs to
    be freed in an interrupt the code doing the vfree needs to have its own
    work queue, as was the case before the deferred vfree list was added to
    vmalloc.

    For architectures with set_direct_map_ implementations this whole operation
    can be done with one TLB flush when centralized like this. For others with
    directmap permissions, currently only arm64, a backup method using
    set_memory functions is used to reset the directmap. When arm64 adds
    set_direct_map_ functions, this backup can be removed.

    When the TLB is flushed to both remove TLB entries for the vmalloc range
    mapping and the direct map permissions, the lazy purge operation could be
    done to try to save a TLB flush later. However today vm_unmap_aliases
    could flush a TLB range that does not include the directmap. So a helper
    is added with extra parameters that can allow both the vmalloc address and
    the direct mapping to be flushed during this operation. The behavior of the
    normal vm_unmap_aliases function is unchanged.

and

commit d53d2f78ceadba081fc7785570798c3c8d50a718
Author: Rick Edgecombe <email address hidden>
Date: Thu Apr 25 17:11:38 2019 -0700

    bpf: Use vmalloc special flag

    Use new flag VM_FLUSH_RESET_PERMS for handling freeing of special
    permissioned memory in vmalloc and remove places where memory was set RW
    before freeing which is no longer needed. Don't track if the memory is RO
    anymore because it is now tracked in vmalloc.

This is _extremely_ deep in "subtly breaks under the hash MMU" territory.

Hopefully this is enough to get some Power MMU experts to weigh in. I will keep working on it.

Revision history for this message
Brian Murray (brian-murray) wrote :

The Hirsute Hippo has reached End of Life, so this bug will not be fixed for that release.

Changed in linux (Ubuntu Hirsute):
status: Confirmed → Won't Fix
Revision history for this message
Andrew Cloke (andrew-cloke) wrote :

Marking as "incomplete" while waiting for input from Power MMU experts.

Changed in ubuntu-power-systems:
status: Confirmed → Incomplete
Revision history for this message
bugproxy (bugproxy) wrote :

------- Comment From <email address hidden> 2022-02-22 09:29 EDT-------
(In reply to comment #32)
> Marking as "incomplete" while waiting for input from Power MMU experts.

Adding a couple of developers from MM team to review...

Po-Hsu Lin (cypressyew)
tags: added: 5.13 impish
bugproxy (bugproxy)
tags: added: bugnameltc-194783 severity-medium
removed: bugnameltc-192677 severity-high
Po-Hsu Lin (cypressyew)
tags: added: sru-20220711
bugproxy (bugproxy)
tags: added: bugnameltc-192677 severity-high
removed: bugnameltc-194783 severity-medium
bugproxy (bugproxy)
tags: added: bugnameltc-194783 severity-medium
removed: bugnameltc-192677 severity-high
Revision history for this message
Frank Heimes (fheimes) wrote :

Meanwhile Impish has reached its end of life.
Starting with Ubuntu 22.04 LTS, POWER9 and POWER10 processors are supported,
and support for POWER8 ended with Ubuntu 21.10, respectively Ubuntu 20.04 LTS
(https://ubuntu.com/download/server/power).
As this bug is limited to P8,
I'm going to set the 'affects Impish' entry to 'Won't Fix'.

Changed in linux (Ubuntu Impish):
status: New → Won't Fix
bugproxy (bugproxy)
tags: added: bugnameltc-192677 severity-high
removed: bugnameltc-194783 severity-medium
