POWER9: Additional power9 patches

Bug #1671613 reported by bugproxy on 2017-03-09
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Undecided
Tim Gardner
Zesty
Undecided
Tim Gardner

Bug Description

Dear Canonical,

Currently Ubuntu is having some issues on a POWER9 machine, as:

    [ 443.933079] ------------[ cut here ]------------
    [ 443.933102] WARNING: CPU: 10 PID: 10243 at /build/linux-iMfvOW/linux-4.10.0/arch/powerpc/mm/fault.c:428 do_page_fault+0x5d8/0x7d0
    [ 443.933105] Modules linked in: vmx_crypto ofpart cmdlinepart ipmi_powernv powernv_flash ipmi_devintf mtd uio_pdrv_genirq ipmi_msghandler ibmpowernv opal_prd uio ib_iser rdma_cm iw_cm ib_cm ib_core configfs iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 ses enclosure scsi_transport_sas btrfs raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear ast i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops crc32c_vpmsum drm i40e aacraid
    [ 443.933206] CPU: 10 PID: 10243 Comm: apt-get Not tainted 4.10.0-9-generic #11-Ubuntu
    [ 443.933211] task: c0000007adbc2c00 task.stack: c0000007adc60000
    [ 443.933214] NIP: c0000000000519d8 LR: c0000000000515b4 CTR: 0000000000000003
    [ 443.933217] REGS: c0000007adc63b00 TRAP: 0700 Not tainted (4.10.0-9-generic)
    [ 443.933219] MSR: 9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE>
    [ 443.933237] CR: 48248422 XER: 00000000
    [ 443.933240] CFAR: c00000000005163c SOFTE: 1
                   GPR00: c0000000000515b4 c0000007adc63d80 c00000000142c400 c0000007ad0a3b60
                   GPR04: 00003fff7b7bc92c 0000000000000000 00000007fc020000 0000000000000000
                   GPR08: 0000000000000003 0000000000000001 c0000000012ec400 0000000000000000
                   GPR12: 0000000000004000 c00000000fb85a00 0000000000000001 00003fff7d640ea8
                   GPR16: 00003fffdc1b6e78 00003fffdc1b6e70 00003fffdc1b6e68 00003fffdc1b6e58
                   GPR20: 00003fff7d640e00 ffffffffffffffc0 0000000000030001 0000000000000000
                   GPR24: 0000000000000054 c0000007b21e0b68 c0000007ad0a3b60 c0000007b21e0b00
                   GPR28: 00003fff7b7bc92c 0000000008000000 0000000000000000 c0000007adc63ea0
    [ 443.933307] NIP [c0000000000519d8] do_page_fault+0x5d8/0x7d0
    [ 443.933312] LR [c0000000000515b4] do_page_fault+0x1b4/0x7d0
    [ 443.933314] Call Trace:
    [ 443.933320] [c0000007adc63d80] [c0000000000515b4] do_page_fault+0x1b4/0x7d0 (unreliable)
    [ 443.933328] [c0000007adc63e30] [c00000000000a4e8] handle_page_fault+0x10/0x30
    [ 443.933332] Instruction dump:
    [ 443.933337] 60000000 60420000 71280007 4182fe1c 77a90800 4182fc94 3d42ffec 892ab491
    [ 443.933351] 2f890000 409efc84 39200001 992ab491 <0fe00000> 4bfffc74 7f86e378 7ec5b378
    [ 443.933366] ---[ end trace 4798790ae3dab875 ]---

======

We would like to include the following patches to fix it:

* cee216a696b2004017a5ecb583366093d90b1568 mm/autonuma: don't use set_pte_at when updating protnone ptes

* 288bc54949fc2625a4fd811a188fb200cc498946 mm/autonuma: let architecture override how the write bit should be stashed in a protnone pte.

* 595cd8f256d24face93b2722927ec9c980419c26 mm/ksm: handle protnone saved writes when making page write protect

* c137a2757b88658ce61c74b25ff650ecda7f09d8 switch ppc64 to its own implementation of saved write

* db08f2030a173fdb95b2e8e28d82c4e8c04df2ac mm/gup: check for protnone only if it is a PTE entry

* 9a8b300f2f7812ebf4630b8b40499da38b38e882 mm/thp/autonuma: use TNF flag instead of vm fault

 * power/mm: update pte_write and pte_wrprotect to handle savedwrite: http://ozlabs.org/~akpm/mmots/broken-out/power-mm-update-pte_write-and-pte_wrprotect-to-handle-savedwrite.patch

* powerpc/mm: Handle protnone ptes on fork: http://ozlabs.org/~akpm/mmots/broken-out/powerpc-mm-handle-protnone-ptes-on-fork.patch

Thanks

bugproxy (bugproxy) on 2017-03-09
tags: added: architecture-ppc64le bugnameltc-152432 severity-high targetmilestone-inin1604
Changed in ubuntu:
assignee: nobody → Taco Screen team (taco-screen-team)
affects: ubuntu → linux (Ubuntu)
Download full text (4.3 KiB)

Leann,

A fresh set of Power9 patches for the kernel.

                  Michael

On 03/09/2017 11:29 AM, Launchpad Bug Tracker wrote:
> bugproxy (bugproxy) has assigned this bug to you for Ubuntu:
>
> Dear Canonical,
>
> Currently Ubuntu is having some issues on a POWER9 machine, as:
>
>
> [ 443.933079] ------------[ cut here ]------------
> [ 443.933102] WARNING: CPU: 10 PID: 10243 at /build/linux-iMfvOW/linux-4.10.0/arch/powerpc/mm/fault.c:428 do_page_fault+0x5d8/0x7d0
> [ 443.933105] Modules linked in: vmx_crypto ofpart cmdlinepart ipmi_powernv powernv_flash ipmi_devintf mtd uio_pdrv_genirq ipmi_msghandler ibmpowernv opal_prd uio ib_iser rdma_cm iw_cm ib_cm ib_core configfs iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 ses enclosure scsi_transport_sas btrfs raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear ast i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops crc32c_vpmsum drm i40e aacraid
> [ 443.933206] CPU: 10 PID: 10243 Comm: apt-get Not tainted 4.10.0-9-generic #11-Ubuntu
> [ 443.933211] task: c0000007adbc2c00 task.stack: c0000007adc60000
> [ 443.933214] NIP: c0000000000519d8 LR: c0000000000515b4 CTR: 0000000000000003
> [ 443.933217] REGS: c0000007adc63b00 TRAP: 0700 Not tainted (4.10.0-9-generic)
> [ 443.933219] MSR: 9000000000029033 <SF,HV,EE,ME,IR,DR,RI,LE>
> [ 443.933237] CR: 48248422 XER: 00000000
> [ 443.933240] CFAR: c00000000005163c SOFTE: 1
> GPR00: c0000000000515b4 c0000007adc63d80 c00000000142c400 c0000007ad0a3b60
> GPR04: 00003fff7b7bc92c 0000000000000000 00000007fc020000 0000000000000000
> GPR08: 0000000000000003 0000000000000001 c0000000012ec400 0000000000000000
> GPR12: 0000000000004000 c00000000fb85a00 0000000000000001 00003fff7d640ea8
> GPR16: 00003fffdc1b6e78 00003fffdc1b6e70 00003fffdc1b6e68 00003fffdc1b6e58
> GPR20: 00003fff7d640e00 ffffffffffffffc0 0000000000030001 0000000000000000
> GPR24: 0000000000000054 c0000007b21e0b68 c0000007ad0a3b60 c0000007b21e0b00
> GPR28: 00003fff7b7bc92c 0000000008000000 0000000000000000 c0000007adc63ea0
> [ 443.933307] NIP [c0000000000519d8] do_page_fault+0x5d8/0x7d0
> [ 443.933312] LR [c0000000000515b4] do_page_fault+0x1b4/0x7d0
> [ 443.933314] Call Trace:
> [ 443.933320] [c0000007adc63d80] [c0000000000515b4] do_page_fault+0x1b4/0x7d0 (unreliable)
> [ 443.933328] [c0000007adc63e30] [c00000000000a4e8] handle_page_fault+0x10/0x30
> [ 443.933332] Instruction dump:
> [ 443.933337] 60000000 60420000 71280007 4182fe1c 77a90800 4182fc94 3d42ffec 892ab491
> [ 443.933351] 2f890000 409efc84 39200001 992ab491 <0fe00000> 4bfffc74 7f86e378 7ec5b378
> [ 443.933366] ---[ end trace 4798790ae3dab875 ]---
>
>
> ======
>
>
> We would like to include the following patches to fix it:
>
> * cee216a696b2004017a5ecb583366093d90b1568 mm/autonuma: don't use
> set_pte_at when updating protnone ptes...

Read more...

Tim Gardner (timg-tpi) on 2017-03-09
Changed in linux (Ubuntu Zesty):
assignee: Taco Screen team (taco-screen-team) → Tim Gardner (timg-tpi)
status: New → In Progress
Tim Gardner (timg-tpi) wrote :

Test kernel at http://kernel.ubuntu.com/~rtg/power9-lp1671613/

git://kernel.ubuntu.com/rtg/ubuntu-zesty.git power9-lp1671613

Tim Gardner (timg-tpi) on 2017-03-17
Changed in linux (Ubuntu Zesty):
status: In Progress → Fix Committed
Launchpad Janitor (janitor) wrote :
Download full text (17.1 KiB)

This bug was fixed in the package linux - 4.10.0-14.16

---------------
linux (4.10.0-14.16) zesty; urgency=low

  [ Tim Gardner ]

  * Release Tracking Bug
    - LP: #1673805

  * msleep() bug causes Nuvoton I2C TPM device driver delays (LP: #1667567)
    - tpm: msleep() delays - replace with usleep_range() in i2c nuvoton driver
    - SAUCE: tpm: add sleep only for retry in i2c_nuvoton_write_status()

  * C++ demangling support missing from perf (LP: #1396654)
    - [Config] added binutils-dev to Build-deps

  * dm-queue-length module is not included in installer/initramfs (LP: #1673350)
    - [Config] d-i: Also add dm-queue-length to multipath modules

  * move aufs.ko from -extra to linux-image package (LP: #1673498)
    - [config] aufs.ko moved to linux-image package

  * Using an NVMe drive causes huge power drain (LP: #1664602)
    - nvme: Add a quirk mechanism that uses identify_ctrl
    - nvme: Enable autonomous power state transitions

  * Broadcom bluetooth modules sometimes fail to initialize (LP: #1483101)
    - Bluetooth: btbcm: Add a delay for module reset

  * Need support of Broadcom bluetooth device [413c:8143] (LP: #1166113)
    - Bluetooth: btusb: Add support for 413c:8143

  * Zesty update to v4.10.3 stable release (LP: #1673118)
    - serial: 8250_pci: Add MKS Tenta SCOM-0800 and SCOM-0801 cards
    - KVM: s390: Disable dirty log retrieval for UCONTROL guests
    - KVM: VMX: use correct vmcs_read/write for guest segment selector/base
    - Bluetooth: Add another AR3012 04ca:3018 device
    - phy: qcom-ufs: Don't kfree devres resource
    - phy: qcom-ufs: Fix misplaced jump label
    - s390/qdio: clear DSCI prior to scanning multiple input queues
    - s390/dcssblk: fix device size calculation in dcssblk_direct_access()
    - s390/kdump: Use "LINUX" ELF note name instead of "CORE"
    - s390/chsc: Add exception handler for CHSC instruction
    - s390: TASK_SIZE for kernel threads
    - s390/topology: correct allocation of topology information
    - s390: make setup_randomness work
    - s390: use correct input data address for setup_randomness
    - net: mvpp2: fix DMA address calculation in mvpp2_txq_inc_put()
    - cxl: Prevent read/write to AFU config space while AFU not configured
    - cxl: fix nested locking hang during EEH hotplug
    - brcmfmac: fix incorrect event channel deduction
    - mnt: Tuck mounts under others instead of creating shadow/side mounts.
    - IB/ipoib: Fix deadlock between rmmod and set_mode
    - IB/IPoIB: Add destination address when re-queue packet
    - IB/mlx5: Fix out-of-bound access
    - IB/SRP: Avoid using IB_MR_TYPE_SG_GAPS
    - IB/srp: Avoid that duplicate responses trigger a kernel bug
    - IB/srp: Fix race conditions related to task management
    - Btrfs: fix data loss after truncate when using the no-holes feature
    - orangefs: Use RCU for destroy_inode
    - memory/atmel-ebi: Fix ns <-> cycles conversions
    - tracing: Fix return value check in trace_benchmark_reg()
    - ktest: Fix child exit code processing
    - ceph: remove req from unsafe list when unregistering it
    - target: Fix NULL dereference during LUN lookup + active I/O shutdown
    - drivers/pci/hotplug: Han...

Changed in linux (Ubuntu Zesty):
status: Fix Committed → Fix Released
To post a comment you must log in.
This report contains Public information  Edit
Everyone can see this information.

Other bug subscribers