Revert or drop obsolete patches for PCI

Bug #1911205 reported by Marcelo Cerri on 2021-01-12
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
linux-azure (Ubuntu)
Undecided
Marcelo Cerri
Focal
Medium
Marcelo Cerri
Groovy
Medium
Marcelo Cerri

Bug Description

[Impact]

In both 5.4 and 5.8 kernel, there are two patches in PCI to workaround an issue with GPU and Infiniband at the time. Those two patches are no longer necessary in today's Azure environment. Those two patches introduces bugs to the latest Azure HPC VM SKU.

4.15 kernel is not affected.

Microsoft has regression tested and verified that the following two patches can be dropped or reverted in 5.4 and 5.8.

5.4 kernel:
Patch 1: Revert "PCI: hv: Make sure the bus domain is really unique"
https://git.launchpad.net/~canonical-kernel/ubuntu/+source/linux-azure/+git/bionic/commit/drivers/pci/controller/pci-hyperv.c?h=azure-5.4&id=faf2750396d60001046333bd46eee757bb4cac7b

Patch 2: UBUNTU: SAUCE: pci-hyperv: Use only 16 bit integer for PCI domain
https://git.launchpad.net/~canonical-kernel/ubuntu/+source/linux-azure/+git/bionic/commit/drivers/pci/controller/pci-hyperv.c?h=azure-5.4&id=04ac3e06ab708131c1c18cb7db10ca5dd6feedce

5.8 kernel:
Patch 1: Revert "PCI: hv: Make sure the bus domain is really unique"
https://git.launchpad.net/~canonical-kernel/ubuntu/+source/linux-azure/+git/groovy/commit/drivers/pci/controller/pci-hyperv.c?h=Ubuntu-azure-5.8.0-1016.17&id=7c85402203e26d4d0474f69b99b5a06ffb765630

Patch 2: UBUNTU: SAUCE: pci-hyperv: Use only 16 bit integer for PCI domain
https://git.launchpad.net/~canonical-kernel/ubuntu/+source/linux-azure/+git/groovy/commit/drivers/pci/controller/pci-hyperv.c?h=Ubuntu-azure-5.8.0-1016.17&id=b72f314d15ab0ad7b80b761d8167962300c1bd35

[Test Case]

As per #1816106, instances with multiple GPUs should still be able to see all of them.

[Where problems could occur]

Issues or oops in instances types with multiple GPUs.

Marcelo Cerri (mhcerri) on 2021-01-12
Changed in linux (Ubuntu Focal):
status: New → In Progress
Changed in linux (Ubuntu Groovy):
status: New → In Progress

This bug is missing log files that will aid in diagnosing the problem. While running an Ubuntu kernel (not a mainline or third-party kernel) please enter the following command in a terminal window:

apport-collect 1911205

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
Marcelo Cerri (mhcerri) on 2021-01-12
no longer affects: linux (Ubuntu)
no longer affects: linux (Ubuntu Focal)
no longer affects: linux (Ubuntu Groovy)
Changed in linux-azure (Ubuntu Focal):
status: New → In Progress
Changed in linux-azure (Ubuntu Groovy):
status: New → In Progress
Changed in linux-azure (Ubuntu):
assignee: nobody → Marcelo Cerri (mhcerri)
Changed in linux-azure (Ubuntu Focal):
assignee: nobody → Marcelo Cerri (mhcerri)
Changed in linux-azure (Ubuntu Groovy):
assignee: nobody → Marcelo Cerri (mhcerri)
Stefan Bader (smb) on 2021-01-19
Changed in linux-azure (Ubuntu Focal):
importance: Undecided → Medium
Changed in linux-azure (Ubuntu Groovy):
importance: Undecided → Medium
Changed in linux-azure (Ubuntu Focal):
status: In Progress → Fix Committed
Changed in linux-azure (Ubuntu Groovy):
status: In Progress → Fix Committed
Launchpad Janitor (janitor) wrote :
Download full text (63.7 KiB)

This bug was fixed in the package linux-azure - 5.4.0-1040.42

---------------
linux-azure (5.4.0-1040.42) focal; urgency=medium

  * focal/linux-azure: 5.4.0-1040.42 -proposed tracker (LP: #1913138)

  * Focal update: v5.4.80 upstream stable release (LP: #1908561)
    - [Config] azure: updateconfigs for INFINIBAND_VIRT_DMA

  * Focal update: v5.4.84 upstream stable release (LP: #1910816)
    - [Config] azure: updateconfigs for PGTABLE_MAPPING

  * CIFS: Request for backport (LP: #1911438)
    - cifs: fix interrupted close commands
    - cifs: do not fail __smb_send_rqst if non-fatal signals are pending

  * Sync FSGSBASE patches in azure kernel with mainline (LP: #1913294)
    - x86/process/64: Make save_fsgs_for_kvm() ready for FSGSBASE
    - x86/entry/64: Do not use RDPID in paranoid entry to accomodate KVM

  * Revert or drop obsolete patches for PCI (LP: #1911205)
    - Revert "UBUNTU: SAUCE: pci-hyperv: Use only 16 bit integer for PCI domain"
    - Revert "Revert "PCI: hv: Make sure the bus domain is really unique""

  * Disable Atari partition support for cloud kernels (LP: #1908264)
    - [Config] azure: Disable ATARI_PARTITION

  * Enable boot from NVMe device for all images (LP: #1908571)
    - [Config] azure: CONFIG_PCI_HYPERV=y

  [ Ubuntu: 5.4.0-66.74 ]

  * focal/linux: 5.4.0-66.74 -proposed tracker (LP: #1913152)
  * Add support for selective build of special drivers (LP: #1912789)
    - [Packaging] Add support for ODM drivers
    - [Packaging] Turn on ODM support for amd64
  * Packaging resync (LP: #1786013)
    - update dkms package versions
    - update dkms package versions
  * Introduce the new NVIDIA 460-server series and update the 460 series
    (LP: #1913200)
    - [Config] dkms-versions -- drop NVIDIA 435 455 and 440-server
    - [Config] dkms-versions -- add the 460-server nvidia driver
  * Enable mute and micmute LED on HP EliteBook 850 G7 (LP: #1910102)
    - ALSA: hda/realtek: Enable mute and micmute LED on HP EliteBook 850 G7
  * SYNA30B4:00 06CB:CE09 Mouse on HP EliteBook 850 G7 not working at all
    (LP: #1908992)
    - HID: multitouch: Enable multi-input for Synaptics pointstick/touchpad device
  * HD Audio Device PCI ID for the Intel Cometlake-R platform (LP: #1912427)
    - SAUCE: ALSA: hda: Add Cometlake-R PCI ID
  * switch to an autogenerated nvidia series based core via dkms-versions
    (LP: #1912803)
    - [Packaging] nvidia -- use dkms-versions to define versions built
    - [Packaging] update-version-dkms -- maintain flags fields
    - [Config] dkms-versions -- add transitional/skip information for nvidia
      packages
  * udpgro.sh in net from ubuntu_kernel_selftests seems not reflecting sub-test
    result (LP: #1908499)
    - selftests: fix the return value for UDP GRO test
  * qede: Kubernetes Internal DNS Failure due to QL41xxx NIC not supporting IPIP
    tx csum offload (LP: #1909062)
    - qede: fix offload for IPIP tunnel packets
  * Use DCPD to control HP DreamColor panel (LP: #1911001)
    - SAUCE: drm/dp: Another HP DreamColor panel brigntness fix
  * kvm: Windows 2k19 with Hyper-v role gets stuck on pending hypervisor
    requests on cascadelake based kvm hosts (LP: #1911848)
    - KVM:...

Changed in linux-azure (Ubuntu Focal):
status: Fix Committed → Fix Released
Launchpad Janitor (janitor) wrote :
Download full text (130.6 KiB)

This bug was fixed in the package linux-azure - 5.8.0-1023.25

---------------
linux-azure (5.8.0-1023.25) groovy; urgency=medium

  * groovy/linux-azure: 5.8.0-1023.25 -proposed tracker (LP: #1914792)

  * Groovy update: upstream stable patchset 2021-01-12 (LP: #1911235)
    - [Config] azure: update config for INFINIBAND_VIRT_DMA

  * Groovy update: upstream stable patchset 2021-01-13 (LP: #1911476)
    - [Config] azure: updateconfigs for ZSMALLOC_PGTABLE_MAPPING

  * Revert or drop obsolete patches for PCI (LP: #1911205)
    - Revert "UBUNTU: SAUCE: pci-hyperv: Use only 16 bit integer for PCI domain"
    - Revert "Revert "PCI: hv: Make sure the bus domain is really unique""

  * Disable Atari partition support for cloud kernels (LP: #1908264)
    - [Config] azure: Disable ATARI_PARTITION

  * Enable boot from NVMe device for all images (LP: #1908571)
    - [Config] azure: CONFIG_PCI_HYPERV=y

  [ Ubuntu: 5.8.0-44.50 ]

  * groovy/linux: 5.8.0-44.50 -proposed tracker (LP: #1914805)
  * Packaging resync (LP: #1786013)
    - update dkms package versions
    - update dkms package versions
  * Introduce the new NVIDIA 460-server series and update the 460 series
    (LP: #1913200)
    - [Config] dkms-versions -- drop NVIDIA 435 455 and 440-server
    - [Config] dkms-versions -- add the 460-server nvidia driver
  * [SRU][G/H/U/OEM-5.10] re-enable s0ix of e1000e (LP: #1910541)
    - Revert "UBUNTU: SAUCE: e1000e: bump up timeout to wait when ME un-configure
      ULP mode"
    - e1000e: Only run S0ix flows if shutdown succeeded
    - Revert "e1000e: disable s0ix entry and exit flows for ME systems"
    - e1000e: Export S0ix flags to ethtool
  * suspend only works once on ThinkPad X1 Carbon gen 7 (LP: #1865570) //
    [SRU][G/H/U/OEM-5.10] re-enable s0ix of e1000e (LP: #1910541)
    - e1000e: bump up timeout to wait when ME un-configures ULP mode
  * Cannot probe sata disk on sata controller behind VMD: ata1.00: failed to
    IDENTIFY (I/O error, err_mask=0x4) (LP: #1894778)
    - PCI: vmd: Offset Client VMD MSI-X vectors
  * Enable mute and micmute LED on HP EliteBook 850 G7 (LP: #1910102)
    - ALSA: hda/realtek: Enable mute and micmute LED on HP EliteBook 850 G7
  * SYNA30B4:00 06CB:CE09 Mouse on HP EliteBook 850 G7 not working at all
    (LP: #1908992)
    - HID: multitouch: Enable multi-input for Synaptics pointstick/touchpad device
  * HD Audio Device PCI ID for the Intel Cometlake-R platform (LP: #1912427)
    - SAUCE: ALSA: hda: Add Cometlake-R PCI ID
  * switch to an autogenerated nvidia series based core via dkms-versions
    (LP: #1912803)
    - [Packaging] nvidia -- use dkms-versions to define versions built
    - [Packaging] update-version-dkms -- maintain flags fields
    - [Config] dkms-versions -- add transitional/skip information for nvidia
      packages
  * udpgro.sh in net from ubuntu_kernel_selftests seems not reflecting sub-test
    result (LP: #1908499)
    - selftests: fix the return value for UDP GRO test
  * [UBUNTU 21.04] vfio: pass DMA availability information to userspace
    (LP: #1907421)
    - vfio/type1: Refactor vfio_iommu_type1_ioctl()
    - vfio iommu: Add dma available capability
  * qede: Kubernetes Internal DN...

Changed in linux-azure (Ubuntu Groovy):
status: Fix Committed → Fix Released
Launchpad Janitor (janitor) wrote :
Download full text (149.7 KiB)

This bug was fixed in the package linux-azure - 5.8.0-1024.26+21.04.1

---------------
linux-azure (5.8.0-1024.26+21.04.1) hirsute; urgency=medium

  * hirsute/linux-azure: 5.8.0-1024.26+21.04.1 -proposed tracker (LP: #1916129)

  * Packaging resync (LP: #1786013)
    - update dkms package versions

  [ Ubuntu: 5.8.0-1024.26 ]

  * groovy/linux-azure: 5.8.0-1024.26 -proposed tracker (LP: #1916130)
  * Azure kernel for focal misses binder and ashmem kernel modules
    (LP: #1914568)
    - [Config] Enable CONFIG_ASHMEM=m, CONFIG_ANDROID_BINDER_IPC=m
    - azure: [Config] Disable ION
  * Please trust Canonical Livepatch Service kmod signing key (LP: #1898716)
    - azure: [Config] enable CONFIG_MODVERSIONS=y
    - azure: [Packaging] build canonical-certs.pem from branch/arch certs
    - azure: [Config] Allow ASM_MODVERSIONS
  * groovy/linux: 5.8.0-45.51 -proposed tracker (LP: #1916143)
  * Please trust Canonical Livepatch Service kmod signing key (LP: #1898716)
    - [Config] enable CONFIG_MODVERSIONS=y
    - [Packaging] build canonical-certs.pem from branch/arch certs
    - [Config] add Canonical Livepatch Service key to SYSTEM_TRUSTED_KEYS
    - [Config] add ubuntu-drivers key to SYSTEM_TRUSTED_KEYS
    - [Config] Allow ASM_MODVERSIONS and MODULE_REL_CRCS
  * CVE-2021-20194
    - bpf, cgroup: Fix optlen WARN_ON_ONCE toctou
    - bpf, cgroup: Fix problematic bounds check
  * Missing device id for Intel TGL-H ISH [8086:43fc] in intel-ish-hid driver
    (LP: #1914543)
    - HID: intel-ish-hid: ipc: Add Tiger Lake H PCI device ID
  * Prevent thermal shutdown during boot process (LP: #1906168)
    - thermal/core: Emit a warning if the thermal zone is updated without ops
    - thermal/core: Add critical and hot ops
    - thermal/drivers/acpi: Use hot and critical ops
    - thermal/drivers/rcar: Remove notification usage
    - thermal: int340x: Fix unexpected shutdown at critical temperature
    - thermal: intel: pch: Fix unexpected shutdown at critical temperature
  * geneve overlay network on vlan interface broken with offload enabled
    (LP: #1914447)
    - net/mlx5e: Fix SWP offsets when vlan inserted by driver
  * Groovy update: upstream stable patchset 2021-02-11 (LP: #1915473)
    - net: cdc_ncm: correct overhead in delayed_ndp_size
    - net: hns3: fix the number of queues actually used by ARQ
    - net: hns3: fix a phy loopback fail issue
    - net: stmmac: dwmac-sun8i: Balance internal PHY resource references
    - net: stmmac: dwmac-sun8i: Balance internal PHY power
    - net: vlan: avoid leaks on register_vlan_dev() failures
    - net/sonic: Fix some resource leaks in error handling paths
    - net: ipv6: fib: flush exceptions when purging route
    - tools: selftests: add test for changing routes with PTMU exceptions
    - net: fix pmtu check in nopmtudisc mode
    - net: ip: always refragment ip defragmented packets
    - octeontx2-af: fix memory leak of lmac and lmac->name
    - nexthop: Fix off-by-one error in error path
    - nexthop: Unlink nexthop group entry in error path
    - s390/qeth: fix L2 header access in qeth_l3_osa_features_check()
    - net: dsa: lantiq_gswip: Exclude RMII from modes that report 1 GbE
    - net/mlx5: Us...

Changed in linux-azure (Ubuntu):
status: New → Fix Released
To post a comment you must log in.
This report contains Public information  Edit
Everyone can see this information.

Other bug subscribers