Ubuntu 17.04 KVM: stack trace generated when enabling SRIOV in power
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
The Ubuntu-power-systems project |
In Progress
|
Undecided
|
Canonical Kernel Team | ||
linux (Ubuntu) |
In Progress
|
Medium
|
Unassigned | ||
Zesty |
In Progress
|
Medium
|
Unassigned |
Bug Description
---Problem Description---
When enabling SRIOV with kernel 4.10.0-26-generic in power will see this stack trace:
[ 2084.079575] ------------[ cut here ]------------
[ 2084.079583] WARNING: CPU: 120 PID: 734 at /build/
[ 2084.079584] Modules linked in: mst_pciconf(OE) mst_pci(OE) xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_
[ 2084.079640] xor raid6_pq libcrc32c raid1 raid0 multipath linear mlx4_en(OE) ses enclosure scsi_transport_sas crc32c_vpmsum tg3 mlx5_core(OE) mlx4_core(OE) ipr devlink mlx_compat(OE)
[ 2084.079658] CPU: 120 PID: 734 Comm: kworker/120:0 Tainted: G W OE 4.10.0-26-generic #30-Ubuntu
[ 2084.079663] Workqueue: events work_for_cpu_fn
[ 2084.079665] task: c000000fee60dc00 task.stack: c000000fee534000
[ 2084.079666] NIP: c00000000009c210 LR: c00000000009d404 CTR: 0000000000000000
[ 2084.079668] REGS: c000000fee537700 TRAP: 0700 Tainted: G W OE (4.10.0-26-generic)
[ 2084.079669] MSR: 900000000282b033 <SF,HV,
[ 2084.079677] CR: 42004428 XER: 20000000
[ 2084.079678] CFAR: c00000000009d400 SOFTE: 1
[ 2084.079702] NIP [c00000000009c210] pnv_pci_
[ 2084.079704] LR [c00000000009d404] pnv_npu_
[ 2084.079705] Call Trace:
[ 2084.079708] [c000000fee5379b0] [c00000000009d404] pnv_npu_
[ 2084.079710] [c000000fee537a80] [c000000000096c74] pnv_pci_
[ 2084.079714] [c000000fee537b00] [c0000000000291a0] dma_set_
[ 2084.079728] [c000000fee537b20] [d0000000143531e4] init_one+
[ 2084.079732] [c000000fee537bd0] [c00000000066ba9c] local_pci_
[ 2084.079734] [c000000fee537c60] [c0000000001016b8] work_for_
[ 2084.079737] [c000000fee537c90] [c0000000001061a0] process_
[ 2084.079740] [c000000fee537d20] [c000000000106780] worker_
[ 2084.079742] [c000000fee537dc0] [c00000000010f0a4] kthread+0x164/0x1b0
[ 2084.079746] [c000000fee537e30] [c00000000000b4e8] ret_from_
[ 2084.079747] Instruction dump:
[ 2084.079748] 7c0802a6 fbe1fff8 f8010010 f821ffd1 7c690074 7929d182 0b090000 2fa30000
[ 2084.079753] 419e0060 e8630330 7c690074 7929d182 <0b090000> 2fa30000 419e0048 7c852378
[ 2084.079759] ---[ end trace 7bf01a937efd69d8 ]---
This issue was introduced by this commit:
https:/
and the solution will be to add this commit:
https:/
which reads: "powerpc/npu-dma: Remove spurious WARN_ON when a PCI device has no of_node"
Requesting fix inclusion in 17.04 and probably 16.04.3.
---uname output---
4.10.0-26-generic #30-Ubuntu SMP Tue Jun 27 09:29:34 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux
---Additional Hardware Info---
Need a Mellanox card that supports SRIOV.
Machine Type = P8
---Steps to Reproduce---
Just enable SRIOV in a power system with Mellanox CX4 or CX5 will be like this:
echo 1 > /sys/class/
Stack trace output:
[ 2084.079567] mlx5_core 0004:01:04.0: Using 64-bit DMA iommu bypass
[ 2084.079575] ------------[ cut here ]------------
[ 2084.079583] WARNING: CPU: 120 PID: 734 at /build/
[ 2084.079584] Modules linked in: mst_pciconf(OE) mst_pci(OE) xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_
[ 2084.079640] xor raid6_pq libcrc32c raid1 raid0 multipath linear mlx4_en(OE) ses enclosure scsi_transport_sas crc32c_vpmsum tg3 mlx5_core(OE) mlx4_core(OE) ipr devlink mlx_compat(OE)
[ 2084.079658] CPU: 120 PID: 734 Comm: kworker/120:0 Tainted: G W OE 4.10.0-26-generic #30-Ubuntu
[ 2084.079663] Workqueue: events work_for_cpu_fn
[ 2084.079665] task: c000000fee60dc00 task.stack: c000000fee534000
[ 2084.079666] NIP: c00000000009c210 LR: c00000000009d404 CTR: 0000000000000000
[ 2084.079668] REGS: c000000fee537700 TRAP: 0700 Tainted: G W OE (4.10.0-26-generic)
[ 2084.079669] MSR: 900000000282b033 <SF,HV,
[ 2084.079677] CR: 42004428 XER: 20000000
[ 2084.079678] CFAR: c00000000009d400 SOFTE: 1
[ 2084.079702] NIP [c00000000009c210] pnv_pci_
[ 2084.079704] LR [c00000000009d404] pnv_npu_
[ 2084.079705] Call Trace:
[ 2084.079708] [c000000fee5379b0] [c00000000009d404] pnv_npu_
[ 2084.079710] [c000000fee537a80] [c000000000096c74] pnv_pci_
[ 2084.079714] [c000000fee537b00] [c0000000000291a0] dma_set_
[ 2084.079728] [c000000fee537b20] [d0000000143531e4] init_one+
[ 2084.079732] [c000000fee537bd0] [c00000000066ba9c] local_pci_
[ 2084.079734] [c000000fee537c60] [c0000000001016b8] work_for_
[ 2084.079737] [c000000fee537c90] [c0000000001061a0] process_
[ 2084.079740] [c000000fee537d20] [c000000000106780] worker_
[ 2084.079742] [c000000fee537dc0] [c00000000010f0a4] kthread+0x164/0x1b0
[ 2084.079746] [c000000fee537e30] [c00000000000b4e8] ret_from_
[ 2084.079747] Instruction dump:
[ 2084.079748] 7c0802a6 fbe1fff8 f8010010 f821ffd1 7c690074 7929d182 0b090000 2fa30000
[ 2084.079753] 419e0060 e8630330 7c690074 7929d182 <0b090000> 2fa30000 419e0048 7c852378
[ 2084.079759] ---[ end trace 7bf01a937efd69d8 ]---
[ 2084.080096] mlx5_core 0004:01:04.0: firmware version: 12.20.1010
tags: | added: architecture-ppc64le bugnameltc-156405 severity-high targetmilestone-inin1704 |
Changed in ubuntu: | |
assignee: | nobody → Ubuntu on IBM Power Systems Bug Triage (ubuntu-power-triage) |
affects: | ubuntu → linux (Ubuntu) |
Changed in ubuntu-power-systems: | |
assignee: | nobody → Canonical Kernel Team (canonical-kernel-team) |
tags: | added: kernel-da-key |
Changed in ubuntu-power-systems: | |
status: | New → In Progress |
Changed in linux (Ubuntu Zesty): | |
status: | New → In Progress |
importance: | Undecided → Medium |
assignee: | nobody → Joseph Salisbury (jsalisbury) |
I built a test kernel with a pick of commit 377aa6b0efba. The test kernel can be downloaded from:
http:// kernel. ubuntu. com/~jsalisbury /lp1702768/
Can you test this kernel and see if it resolves this bug?