Ubuntu17.04: Kernel Oops: Exception in kernel mode, sig: 5 [#1] during Avocado KVM Test runs [Regression]
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
The Ubuntu-power-systems project | New | Undecided | Unassigned |
linux (Ubuntu) | New | Undecided | Taco Screen team |
Bug Description
== Comment: #0 - Satheesh Rajendran <email address hidden> - 2017-03-27 12:30:45 ==
---Problem Description---
The kernel hit an Oops while running Avocado (KVM) tests: "Oops: Exception in kernel mode, sig: 5 [#1]"
Contact Information = <email address hidden>
---uname output---
Linux ltc-test-ci1 4.10.0-14-generic #16-Ubuntu SMP Fri Mar 17 15:19:05 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux
Machine Type = power 8 ppc64le
---Debugger---
A debugger is not configured
---Steps to Reproduce---
1. Run Avocado(kvm) tests
#git clone git://git.
#python avocado-setup.py --bootstrap --run-suite guest_cpu --guest-os Ubuntu.
2. After some time, the traces below were seen.
Stack trace output:
[20751.909458] ------------[ cut here ]------------
[20751.909461] kernel BUG at /build/
[20751.909542] Oops: Exception in kernel mode, sig: 5 [#1]
[20751.909549] SMP NR_CPUS=2048
[20751.909549] NUMA
[20751.909555] PowerNV
[20751.909583] Modules linked in: vhost_net macvtap macvlan xt_CHECKSUM ipt_MASQUERADE nf_nat_
[20751.910225] binfmt_misc powernv_rng powernv_op_panel ipmi_powernv ipmi_devintf ipmi_msghandler uio_pdrv_genirq uio leds_powernv vmx_crypto ib_iser rdma_cm iw_cm ib_cm ib_core nfsd auth_rpcgss nfs_acl lockd grace configfs iscsi_tcp libiscsi_tcp sunrpc libiscsi scsi_transport_
[20751.910629] CPU: 24 PID: 6926 Comm: CPU 24/KVM Not tainted 4.10.0-14-generic #16-Ubuntu
[20751.910700] task: c0000007b29ac000 task.stack: c0000007f07b8000
[20751.910759] NIP: c00000000030d748 LR: c00000000030d658 CTR: 0000000000000000
[20751.910828] REGS: c0000007f07bb3b0 TRAP: 0700 Not tainted (4.10.0-14-generic)
[20751.910897] MSR: 9000000000029033 <SF,HV,
[20751.910903] CR: 44882882 XER: 00000000
[20751.910984] CFAR: c00000000030d884 SOFTE: 1
[20751.911580] NIP [c00000000030d748] __migration_
[20751.911639] LR [c00000000030d658] __migration_
[20751.911697] Call Trace:
[20751.911722] [c0000007f07bb630] [c00000000030d658] __migration_
[20751.911807] [c0000007f07bb670] [c0000000002bbc6c] do_swap_
[20751.911866] [c0000007f07bb6f0] [c0000000002bfa98] handle_
[20751.911937] [c0000007f07bb7e0] [c0000000002b4104] __get_user_
[20751.912008] [c0000007f07bb890] [c0000000002b47e4] get_user_
[20751.912079] [c0000007f07bb930] [c0000000002b59ac] get_user_
[20751.912152] [c0000007f07bb980] [d00000000f66ca74] kvmppc_
[20751.912236] [c0000007f07bba70] [d00000000f6696f8] kvmppc_
[20751.912312] [c0000007f07bbb80] [d00000000f6131ac] kvmppc_
[20751.912387] [c0000007f07bbba0] [d00000000f61030c] kvm_arch_
[20751.912462] [c0000007f07bbbe0] [d00000000f603db8] kvm_vcpu_
[20751.912534] [c0000007f07bbd40] [c00000000035b1f4] do_vfs_
[20751.912594] [c0000007f07bbde0] [c00000000035bab4] SyS_ioctl+0xd4/0xf0
[20751.912654] [c0000007f07bbe30] [c00000000000b184] system_
[20751.912713] Instruction dump:
[20751.912749] 3d020017 79293448 39481a68 ebca0000 7fde4a14 e93e0020 712a0001 4082014c
[20751.912822] 7fc9f378 e9290000 7d2948f8 792907e0 <0b090000> 39400000 3bbe001c 39000001
[20751.912899] ---[ end trace 5eaae2f83c5daa20 ]---
System Dump Info:
The system is not configured to capture a system dump.
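The "Instruction dump" above marks the faulting instruction word with angle brackets (`<0b090000>`). As a minimal sketch, assuming the standard Power ISA D-form layout for trap-immediate instructions (primary opcode 2 for `tdi`, 3 for `twi`), the word can be decoded as follows. The helper names `find_faulting_word` and `decode_trap_immediate` are hypothetical and exist only for this illustration.

```python
import re


def find_faulting_word(dump_lines):
    """Return the instruction word the kernel marked with <...>, as an int."""
    for line in dump_lines:
        match = re.search(r"<([0-9a-f]{8})>", line)
        if match:
            return int(match.group(1), 16)
    return None


def decode_trap_immediate(word):
    """Decode a D-form trap-immediate instruction.

    Returns (mnemonic, TO, RA, SI), or None if the word is not tdi/twi.
    """
    opcode = (word >> 26) & 0x3F      # primary opcode: top 6 bits
    names = {2: "tdi", 3: "twi"}
    if opcode not in names:
        return None
    to = (word >> 21) & 0x1F          # trap condition mask
    ra = (word >> 16) & 0x1F          # register being tested
    si = word & 0xFFFF                # 16-bit signed immediate
    if si & 0x8000:
        si -= 0x10000
    return names[opcode], to, ra, si


# Second line of the instruction dump from the trace above.
dump = [
    "7fc9f378 e9290000 7d2948f8 792907e0 <0b090000> 39400000 3bbe001c 39000001",
]
word = find_faulting_word(dump)
print(decode_trap_immediate(word))  # ('tdi', 24, 9, 0)
```

A TO value of 24 selects the signed "less than" and "greater than" conditions, so the instruction traps when r9 is not equal to 0 (extended mnemonic `tdnei r9,0`). That is consistent with a BUG_ON-style check, with TRAP 0700 (program check) and with sig 5, and the register dumps later in this report show GPR09 as 0000000000000001 in both instances.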
== Comment: #3 - IRANNA D. ANKAD <email address hidden> - 2017-03-28 01:50:15 ==
This is a regression from the 4.10.0-13 kernel and is blocking our regression testing.
== Comment: #10 - VIPIN K. PARASHAR <email address hidden> - 2017-03-30 03:44:13 ==
root@ltc-
Linux ltc-test-ci1 4.10.0-15-generic #17-Ubuntu SMP Fri Mar 24 17:50:37 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux
root@ltc-
NAME="Ubuntu"
VERSION="17.04 (Zesty Zapus)"
ID=ubuntu
ID_LIKE=debian
PRETTY_NAME="Ubuntu Zesty Zapus (development branch)"
VERSION_ID="17.04"
HOME_URL="https:/
SUPPORT_URL="https:/
BUG_REPORT_URL="https:/
PRIVACY_
VERSION_
UBUNTU_
root@ltc-
processor : 79
cpu : POWER8E (raw), altivec supported
clock : 2061.000000MHz
revision : 2.1 (pvr 004b 0201)
timebase : 512000000
platform : PowerNV
model : 8247-21L
machine : PowerNV 8247-21L
firmware : OPAL
root@ltc-
== Comment: #11 - VIPIN K. PARASHAR <email address hidden> - 2017-03-30 03:58:04 ==
Mar 24 07:15:57
===========
[ 1955.041619] ------------[ cut here ]------------
[ 1955.041623] kernel BUG at /build/
[ 1955.041633] Oops: Exception in kernel mode, sig: 5 [#1]
[ 1955.041637] SMP NR_CPUS=2048
[ 1955.041638] NUMA
[ 1955.041641] PowerNV
[ 1955.041645] Modules linked in: vhost_net macvtap macvlan rpcsec_gss_krb5 nfsv4 nfs fscache xt_CHECKSUM ipt_MASQUERADE nf_nat_
[ 1955.041724] nf_defrag_ipv6 nf_nat nf_conntrack binfmt_misc powernv_rng powernv_op_panel ipmi_powernv ipmi_devintf ipmi_msghandler leds_powernv uio_pdrv_genirq uio vmx_crypto nfsd auth_rpcgss nfs_acl lockd grace sunrpc ib_iser rdma_cm iw_cm ib_cm ib_core configfs iscsi_tcp libiscsi_tcp libiscsi scsi_transport_
[ 1955.041786] CPU: 40 PID: 7306 Comm: CPU 13/KVM Not tainted 4.10.0-14-generic #16-Ubuntu
[ 1955.041792] task: c0000007937b5a00 task.stack: c0000007f2a48000
[ 1955.041796] NIP: c00000000030d748 LR: c00000000030d658 CTR: 0000000000000000
[ 1955.041801] REGS: c0000007f2a4b3b0 TRAP: 0700 Not tainted (4.10.0-14-generic)
[ 1955.041805] MSR: 900000000282b033 <SF,HV,
[ 1955.041813] CR: 44882882 XER: 00000000
[ 1955.041819] CFAR: c00000000030d884 SOFTE: 1
[ 1955.041819] GPR00: c00000000030d658 c0000007f2a4b630 c00000000144c900 f000000001fb8b30
[ 1955.041819] GPR04: c0000007ee2cd2c8 f000000001fb8b30 000000001f005977 000000007759001f
[ 1955.041819] GPR08: c0000000015bc900 0000000000000001 0000000000000001 0000000000d02cee
[ 1955.041819] GPR12: 0000000000002200 c00000000fb96800 0000000000000000 000000000000005a
[ 1955.041819] GPR16: 0000000000010000 00003ffe30590000 0000000000000000 0000000088000000
[ 1955.041819] GPR20: 0000000020000000 0000000088000000 0000000022000000 c0000000fd3d0000
[ 1955.041819] GPR24: c0000000015be3d8 c0000007c3c66880 c0000007ed9b40d8 0000000000000000
[ 1955.041819] GPR28: c0000007f2a4b710 3e00000000077759 f000000001ddd640 f000000001fb8b30
[ 1955.041874] NIP [c00000000030d748] __migration_
[ 1955.041879] LR [c00000000030d658] __migration_
[ 1955.041883] Call Trace:
[ 1955.041886] [c0000007f2a4b630] [c00000000030d658] __migration_
[ 1955.041894] [c0000007f2a4b670] [c0000000002bbc6c] do_swap_
[ 1955.041900] [c0000007f2a4b6f0] [c0000000002bfa98] handle_
[ 1955.041906] [c0000007f2a4b7e0] [c0000000002b4104] __get_user_
[ 1955.041912] [c0000007f2a4b890] [c0000000002b47e4] get_user_
[ 1955.041918] [c0000007f2a4b930] [c0000000002b59ac] get_user_
[ 1955.041927] [c0000007f2a4b980] [d00000000f7aca74] kvmppc_
[ 1955.041935] [c0000007f2a4ba70] [d00000000f7a96f8] kvmppc_
[ 1955.041947] [c0000007f2a4bb80] [d00000000f7531ac] kvmppc_
[ 1955.041958] [c0000007f2a4bba0] [d00000000f75030c] kvm_arch_
[ 1955.041967] [c0000007f2a4bbe0] [d00000000f743db8] kvm_vcpu_
[ 1955.041974] [c0000007f2a4bd40] [c00000000035b1f4] do_vfs_
[ 1955.041980] [c0000007f2a4bde0] [c00000000035bab4] SyS_ioctl+0xd4/0xf0
[ 1955.041986] [c0000007f2a4be30] [c00000000000b184] system_
[ 1955.041990] Instruction dump:
[ 1955.041994] 3d020017 79293448 39481a68 ebca0000 7fde4a14 e93e0020 712a0001 4082014c
[ 1955.042003] 7fc9f378 e9290000 7d2948f8 792907e0 <0b090000> 39400000 3bbe001c 39000001
[ 1955.042016] ---[ end trace 1c0e9a056f95491f ]---
Mar 27 11:54:27
============
[20751.909458] ------------[ cut here ]------------
[20751.909461] kernel BUG at /build/
[20751.909542] Oops: Exception in kernel mode, sig: 5 [#1]
[20751.909549] SMP NR_CPUS=2048
[20751.909549] NUMA
[20751.909555] PowerNV
[20751.909583] Modules linked in: vhost_net macvtap macvlan xt_CHECKSUM ipt_MASQUERADE nf_nat_
[20751.910225] binfmt_misc powernv_rng powernv_op_panel ipmi_powernv ipmi_devintf ipmi_msghandler uio_pdrv_genirq uio leds_powernv vmx_crypto ib_iser rdma_cm iw_cm ib_cm ib_core nfsd auth_rpcgss nfs_acl lockd grace configfs iscsi_tcp libiscsi_tcp sunrpc libiscsi scsi_transport_
[20751.910629] CPU: 24 PID: 6926 Comm: CPU 24/KVM Not tainted 4.10.0-14-generic #16-Ubuntu
[20751.910700] task: c0000007b29ac000 task.stack: c0000007f07b8000
[20751.910759] NIP: c00000000030d748 LR: c00000000030d658 CTR: 0000000000000000
[20751.910828] REGS: c0000007f07bb3b0 TRAP: 0700 Not tainted (4.10.0-14-generic)
[20751.910897] MSR: 9000000000029033 <SF,HV,
[20751.910903] CR: 44882882 XER: 00000000
[20751.910984] CFAR: c00000000030d884 SOFTE: 1
[20751.910984] GPR00: c00000000030d658 c0000007f07bb630 c00000000144c900 f000000001f9f1f0
[20751.910984] GPR04: c0000007e7c7e0e0 f000000001f9f1f0 000000001f001c61 00000000611c001f
[20751.910984] GPR08: c0000000015bc900 0000000000000001 0000000000000001 0000000000e0c7e7
[20751.910984] GPR12: 0000000000002200 c00000000fb8d800 0000000000000000 000000000007fe1d
[20751.910984] GPR16: 0000000000010000 00003fff121c0000 0000000000000000 0000000088000000
[20751.910984] GPR20: 0000000020000000 0000000088000000 0000000022000000 c000000004990000
[20751.910984] GPR24: c0000000015be3d8 c0000007f05d3700 c0000007b972c030 0000000000000000
[20751.910984] GPR28: c0000007f07bb710 3e000000000d611c f000000003584700 f000000001f9f1f0
[20751.911580] NIP [c00000000030d748] __migration_
[20751.911639] LR [c00000000030d658] __migration_
[20751.911697] Call Trace:
[20751.911722] [c0000007f07bb630] [c00000000030d658] __migration_
[20751.911807] [c0000007f07bb670] [c0000000002bbc6c] do_swap_
[20751.911866] [c0000007f07bb6f0] [c0000000002bfa98] handle_
[20751.911937] [c0000007f07bb7e0] [c0000000002b4104] __get_user_
[20751.912008] [c0000007f07bb890] [c0000000002b47e4] get_user_
[20751.912079] [c0000007f07bb930] [c0000000002b59ac] get_user_
[20751.912152] [c0000007f07bb980] [d00000000f66ca74] kvmppc_
[20751.912236] [c0000007f07bba70] [d00000000f6696f8] kvmppc_
[20751.912312] [c0000007f07bbb80] [d00000000f6131ac] kvmppc_
[20751.912387] [c0000007f07bbba0] [d00000000f61030c] kvm_arch_
[20751.912462] [c0000007f07bbbe0] [d00000000f603db8] kvm_vcpu_
[20751.912534] [c0000007f07bbd40] [c00000000035b1f4] do_vfs_
[20751.912594] [c0000007f07bbde0] [c00000000035bab4] SyS_ioctl+0xd4/0xf0
[20751.912654] [c0000007f07bbe30] [c00000000000b184] system_
[20751.912713] Instruction dump:
[20751.912749] 3d020017 79293448 39481a68 ebca0000 7fde4a14 e93e0020 712a0001 4082014c
[20751.912822] 7fc9f378 e9290000 7d2948f8 792907e0 <0b090000> 39400000 3bbe001c 39000001
[20751.912899] ---[ end trace 5eaae2f83c5daa20 ]---
As pasted above, two instances of the Oops are seen in the kernel logs.
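Whether the two instances are the same failure can be checked mechanically by comparing the faulting addresses. The following is a rough sketch, not part of the original analysis: it pulls the NIP, LR and CTR values and the CPU/PID from the header lines of each Oops, using lines copied from the two logs above. The function name `summarize_oops` is hypothetical.

```python
import re

REG_PAIR = re.compile(r"\b(NIP|LR|CTR):\s+([0-9a-f]{16})")
CPU_PID = re.compile(r"CPU:\s+(\d+)\s+PID:\s+(\d+)")


def summarize_oops(lines):
    """Collect NIP/LR/CTR and CPU/PID from the header lines of one Oops."""
    summary = {}
    for line in lines:
        for name, value in REG_PAIR.findall(line):
            summary[name] = int(value, 16)
        match = CPU_PID.search(line)
        if match:
            summary["cpu"] = int(match.group(1))
            summary["pid"] = int(match.group(2))
    return summary


# Header lines copied from the Mar 24 and Mar 27 instances.
mar24 = [
    "[ 1955.041786] CPU: 40 PID: 7306 Comm: CPU 13/KVM Not tainted 4.10.0-14-generic #16-Ubuntu",
    "[ 1955.041796] NIP: c00000000030d748 LR: c00000000030d658 CTR: 0000000000000000",
]
mar27 = [
    "[20751.910629] CPU: 24 PID: 6926 Comm: CPU 24/KVM Not tainted 4.10.0-14-generic #16-Ubuntu",
    "[20751.910759] NIP: c00000000030d748 LR: c00000000030d658 CTR: 0000000000000000",
]

first, second = summarize_oops(mar24), summarize_oops(mar27)
same_site = first["NIP"] == second["NIP"] and first["LR"] == second["LR"]
print(same_site)  # True
```

Both instances stop at the same NIP and LR on different CPUs and in different vCPU threads, which suggests a single code path rather than two unrelated faults.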
== Comment: #15 - VIPIN K. PARASHAR <email address hidden> - 2017-03-30 11:34:06 ==
From Linux source
============
/*
* Something used the pte of a page under migration. We need to
* get to the page and wait until migration is finished.
* When we return from this function the fault will be retried.
*/
void __migration_
{
..
..
page = migration_
static inline struct page *migration_
{
struct page *p = pfn_to_
/*
* Any use of migration entries may only occur while the
* corresponding page is locked
*/
return p;
}
The kernel Oops is triggered by a BUG_ON in the kernel, hit while servicing
a KVM ioctl and the subsequent page fault on pages that are being migrated.
The same issue has been noticed on Intel as well:
https:/
summary:
- Ubuntu 17.04: Kernel Oops: Exception in kernel mode, sig: 5 [#1] during
+ Ubuntu17.04: Kernel Oops: Exception in kernel mode, sig: 5 [#1] during Avocado KVM Test runs [Regression]
tags: added: severity-high; removed: severity-critical
tags: removed: bugnameltc-152928 severity-high
tags: added: bugnameltc-152928 severity-high