lpfc.ko module doesn't work

Bug #1746970 reported by Koms Prav on 2018-02-02
18
This bug affects 3 people
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Medium
Joseph Salisbury
Artful
Medium
Joseph Salisbury
Bionic
Medium
Joseph Salisbury

Bug Description

== SRU Justification ==
The bug reporter stated that the HBA Emulex LPe12000 Fibre channel adapter
does not see the disk on direct attach. Broadcom support confirmed that bug
and suggested to update module. It was found that commit 2877cbf is the
fix for this bug.

Commit 2877cbf is in mainline as of v4.14-rc1.

== Fix ==
commit 2877cbffb79ed121a6bcc5edbe629d3aba36cd29
Author: Dick Kennedy <email address hidden>
Date: Wed Aug 23 16:55:31 2017 -0700

    scsi: lpfc: Fix loop mode target discovery

== Regression Potential ==
Low. Clean cherry pick and only adds a new check for loop mode.

== Test Case ==
A test kernel was built with this patch and tested by the original bug reporter.
The bug reporter states the test kernel resolved the bug.

== Original Bug Description ==
Hi.

I have problem with HBA Emulex LPe12000 Fibre channel adapter. It doesn't see disk on direct attach.

[ 4.015493] lpfc 0000:04:00.0: 0:1303 Link Up Event x1 received Data: x1 x1 x10 x2 x0 x0 0
[ 4.015570] lpfc 0000:04:00.0: 0:1309 Link Up Event npiv not supported in loop topology
[ 4.016671] lpfc 0000:04:00.0: 0:(0):2858 FLOGI failure Status:x3/x18 TMO:x0 Data x1000 x0
[ 4.017627] lpfc 0000:04:00.0: 0:(0):2858 FLOGI failure Status:x3/x18 TMO:x0 Data x1000 x0
[ 4.018561] lpfc 0000:04:00.0: 0:(0):2858 FLOGI failure Status:x3/x18 TMO:x0 Data x1000 x0
[ 4.018636] lpfc 0000:04:00.0: 0:(0):0100 FLOGI failure Status:x3/x18 TMO:x0
[ 4.019713] lpfc 0000:04:00.0: 0:(0):0266 Issue NameServer Req x117 err 1 Data: x80000 x0

I have made a little investigation.

Ubuntu 16.04.3 LTS, kernel 4.10.0-28, lpfc 11.2.0.2 - no problem.
Ubuntu 17.10.1, kernel 4.13.0-21, lpfc 11.4.0.1 - problem.
Debian 9.3, kernel 4.9.0-4, lpfc 11.2.0.0 - no problem.
Fedora 27, kernel 4.13.9-300, lpfc 11.4.0.1 - problem.
Proxmox 5.0, kernel 4.10.15-1, lpfc 11.2.0.2 - no problem.
Proxmox 5.1, kernel 4.13.13-5, lpfc 11.4.0.1 - problem.

Another user with this problem said that if fiber switch is added between the server and the storage it's working as expected.

I think problem is in lpfc.ko 11.4.0.1. Any kernel (4.13 now) that has this module version is affected. Broadcom support confirmed that bug and suggested to update module. But Broadcom supports only RHEL and SLES. I tried to compile, but no luck. Please, could you update lpfc.ko module in linux-image-generic package?

There are version 11.4.142.21 for SLES 12 and 11.4.204.11 for RHEL 6 and 7 on Broadcom site.

Thanks.

---

04:00.0 Fibre Channel [0c04]: Emulex Corporation Saturn-X: LightPulse Fibre Channel Host Adapter [10df:f100] (rev 03)
 Subsystem: Emulex Corporation Saturn-X: LightPulse Fibre Channel Host Adapter [10df:f100]
 Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr+ Stepping- SERR+ FastB2B- DisINTx-
 Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
 Interrupt: pin A routed to IRQ 27
 Region 0: Memory at fbaff000 (64-bit, non-prefetchable) [size=4K]
 Region 2: Memory at fbaf8000 (64-bit, non-prefetchable) [size=16K]
 Expansion ROM at fba80000 [disabled] [size=256K]
 Capabilities: [58] Power Management version 3
  Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
  Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
 Capabilities: [60] MSI: Enable- Count=1/16 Maskable+ 64bit+
  Address: 0000000000000000 Data: 0000
  Masking: 00000000 Pending: 00000000
 Capabilities: [78] MSI-X: Enable- Count=32 Masked-
  Vector table: BAR=2 offset=00002000
  PBA: BAR=2 offset=00003000
 Capabilities: [84] Vital Product Data
  Product Name: LPE12000, 8GB/S, 1-PORT FC, PCI EXPRESS HBA, SFP, DIAG, OPTICS, DELL
  Read-only fields:
   [PN] Part number: LPE12000-M8
   [SN] Serial number: FC24751316
   [V0] Vendor specific: FC24751316
   [V1] Vendor specific: Emulex LPe12000-M8 8Gb PCIe Fibre Channel Adapter
   [V2] Vendor specific: LPe12000-M8
   [V3] Vendor specific: T2:78,T3:79,7A,7B,7D,7E,7F,T7:73,TB:73,TFF:78
   [V4] Vendor specific:
   [RV] Reserved: checksum bad, 20 byte(s) reserved
  End
 Capabilities: [94] Express (v2) Endpoint, MSI 00
  DevCap: MaxPayload 2048 bytes, PhantFunc 0, Latency L0s <1us, L1 unlimited
   ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset- SlotPowerLimit 25.000W
  DevCtl: Report errors: Correctable- Non-Fatal- Fatal- Unsupported-
   RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop+
   MaxPayload 256 bytes, MaxReadReq 4096 bytes
  DevSta: CorrErr+ UncorrErr- FatalErr- UnsuppReq+ AuxPwr- TransPend-
  LnkCap: Port #0, Speed 5GT/s, Width x8, ASPM L0s, Exit Latency L0s <1us, L1 <64us
   ClockPM- Surprise- LLActRep- BwNot- ASPMOptComp-
  LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- CommClk+
   ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
  LnkSta: Speed 5GT/s, Width x4, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
  DevCap2: Completion Timeout: Range ABCD, TimeoutDis+, LTR-, OBFF Not Supported
  DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-, LTR-, OBFF Disabled
  LnkCtl2: Target Link Speed: 5GT/s, EnterCompliance- SpeedDis-
    Transmit Margin: Normal Operating Range, EnterModifiedCompliance- ComplianceSOS-
    Compliance De-emphasis: -6dB
  LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete-, EqualizationPhase1-
    EqualizationPhase2-, EqualizationPhase3-, LinkEqualizationRequest-
 Capabilities: [100 v1] Advanced Error Reporting
  UESta: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
  UEMsk: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt+ RxOF- MalfTLP- ECRC- UnsupReq+ ACSViol-
  UESvrt: DLP+ SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF+ MalfTLP+ ECRC- UnsupReq- ACSViol-
  CESta: RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr-
  CEMsk: RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr+
  AERCap: First Error Pointer: 00, GenCap+ CGenEn- ChkCap+ ChkEn-
 Capabilities: [12c v1] Power Budgeting <?>
 Kernel modules: lpfc
---
ApportVersion: 2.20.7-0ubuntu3.7
Architecture: amd64
AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
CasperVersion: 1.387
DistroRelease: Ubuntu 17.10
LiveMediaBuild: Ubuntu 17.10 "Artful Aardvark" - Release amd64 (20180105.1)
MachineType: HP ProLiant DL160 G6
Package: linux (not installed)
ProcEnviron:
 TERM=xterm-256color
 PATH=(custom, no user)
 LANG=ru_RU.UTF-8
 SHELL=/bin/bash
ProcFB: 0 mgadrmfb
ProcKernelCmdLine: file=/cdrom/preseed/hostname.seed boot=casper initrd=/casper/initrd.lz quiet splash --- debian-installer/language=ru keyboard-configuration/layoutcode?=ru
ProcVersionSignature: Ubuntu 4.13.0-21.24-generic 4.13.13
PulseList:
 Error: command ['pacmd', 'list'] failed with exit code 1: Home directory not accessible: Permission denied
 No PulseAudio daemon running, or not running as session daemon.
RelatedPackageVersions:
 linux-restricted-modules-4.13.0-21-generic N/A
 linux-backports-modules-4.13.0-21-generic N/A
 linux-firmware 1.169.1
RfKill:

Tags: artful
Uname: Linux 4.13.0-21-generic x86_64
UpgradeStatus: No upgrade log present (probably fresh install)
UserGroups:

_MarkForUpload: True
dmi.bios.date: 05/19/2010
dmi.bios.vendor: HP
dmi.bios.version: O33
dmi.chassis.type: 23
dmi.chassis.vendor: HP
dmi.modalias: dmi:bvnHP:bvrO33:bd05/19/2010:svnHP:pnProLiantDL160G6:pvr:cvnHP:ct23:cvr:
dmi.product.name: ProLiant DL160 G6
dmi.sys.vendor: HP

This bug is missing log files that will aid in diagnosing the problem. While running an Ubuntu kernel (not a mainline or third-party kernel) please enter the following command in a terminal window:

apport-collect 1746970

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
Mathieu Gallas (mamatov) wrote :
Download full text (3.7 KiB)

Hi, I confirm the bug on direct attach.

Using a fiber switch is working as expected for me.

---

1a:00.0 Fibre Channel: Emulex Corporation Zephyr-X LightPulse Fibre Channel Host Adapter (rev 02)
        Subsystem: Emulex Corporation Zephyr-X LightPulse Fibre Channel Host Adapter
        Physical Slot: 3
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+ Stepping- SERR- FastB2B- DisINTx+
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0, Cache Line Size: 64 bytes
        Interrupt: pin A routed to IRQ 30
        Region 0: Memory at 97a00000 (64-bit, non-prefetchable) [size=4K]
        Region 2: Memory at 97a01000 (64-bit, non-prefetchable) [size=256]
        Region 4: I/O ports at 2100 [size=256]
        Expansion ROM at 97a40000 [disabled] [size=256K]
        Capabilities: [58] Power Management version 2
                Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
                Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
        Capabilities: [60] MSI: Enable+ Count=1/8 Maskable- 64bit+
                Address: 00000000fee00000 Data: 40d2
        Capabilities: [70] Vital Product Data
                Product Name: LPE11002 4GB/S 2-PORT FC: PCI EXPRESS HBA SFF DIAG OPTICS
                Read-only fields:
                        [PN] Part number: 42C2071
                        [SN] Serial number: 11S43W7511Y650HY1C2YTK
                        [V0] Vendor specific: FC14803247
                        [VL] Vendor specific: FRU 43W7512
                        [V1] Vendor specific: IBM 42C2071 4Gb 2-Port PCIe FC HBA for System x
                        [V2] Vendor specific: 42C2071
                        [V3] Vendor specific: T2:E1,E8,T3:E1,E2,E3,E5,E6,E7,T6:E1,T7:E1,TB:E1
                        [V4] Vendor specific: 0
                        [V5] Vendor specific: LPe11002
                        [RV] Reserved: checksum good, 1 byte(s) reserved
                End
        Capabilities: [44] Express (v1) Endpoint, MSI 00
                DevCap: MaxPayload 2048 bytes, PhantFunc 0, Latency L0s <4us, L1 <16us
                        ExtTag+ AttnBtn- AttnInd- PwrInd- RBE- FLReset- SlotPowerLimit 0.000W
                DevCtl: Report errors: Correctable+ Non-Fatal+ Fatal+ Unsupported+
                        RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop+
                        MaxPayload 256 bytes, MaxReadReq 4096 bytes
                DevSta: CorrErr- UncorrErr+ FatalErr- UnsuppReq+ AuxPwr- TransPend-
                LnkCap: Port #0, Speed 2.5GT/s, Width x4, ASPM L0s, Exit Latency L0s <4us, L1 unlimited
                        ClockPM- Surprise- LLActRep- BwNot- ASPMOptComp-
                LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- CommClk-
                        ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
                LnkSta: Speed 2.5GT/s, Width x4, TrErr- Train- SlotClk- DLActive- BWMgmt- ABWMgmt-
        Capabilities: [100 v1] Advanced Error Reporting
                UESta: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq+ ACSViol-
                UEMsk: DLP...

Read more...

apport information

tags: added: apport-collected artful
description: updated
Koms Prav (komspr) wrote : CRDA.txt

apport information

apport information

apport information

apport information

Koms Prav (komspr) wrote : Lspci.txt

apport information

Koms Prav (komspr) wrote : Lsusb.txt

apport information

apport information

apport information

apport information

apport information

apport information

apport information

Changed in linux (Ubuntu):
status: Incomplete → Confirmed
Joseph Salisbury (jsalisbury) wrote :

Would it be possible for you to test the latest upstream kernel? Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Please test the latest v4.15 kernel[0].

If this bug is fixed in the mainline kernel, please add the following tag 'kernel-fixed-upstream'.

If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'.

Once testing of the upstream kernel is complete, please mark this bug as "Confirmed".

Thanks in advance.

[0] http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.15

Changed in linux (Ubuntu):
importance: Undecided → Medium
status: Confirmed → Incomplete
tags: added: kernel-da-key
Koms Prav (komspr) wrote :

Installed latest upstream kernel. And it works.

Linux ubuntu 4.15.0-041500-generic #201802011154 SMP Thu Feb 1 11:55:45 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux

# modinfo lpfc
filename: /lib/modules/4.15.0-041500-generic/kernel/drivers/scsi/lpfc/lpfc.ko
version: 0:11.4.0.4
author: Broadcom
description: Emulex LightPulse Fibre Channel SCSI driver 11.4.0.4

# lshw
           *-fiber
                description: Fibre Channel
                product: Saturn-X: LightPulse Fibre Channel Host Adapter
                vendor: Emulex Corporation
                physical id: 0
                bus info: pci@0000:04:00.0
                logical name: scsi6
                version: 03
                width: 64 bits
                clock: 33MHz
                capabilities: pm msi msix vpd pciexpress bus_master cap_list rom
                configuration: driver=lpfc latency=0
                resources: irq:85 memory:fbaff000-fbafffff memory:fbaf8000-fbafbfff memory:fba80000-fbabffff
              *-disk
                   description: SCSI Disk
                   product: MSA2312fc
                   vendor: HP
                   physical id: 0.0.0
                   bus info: scsi@6:0.0.0
                   logical name: /dev/sdb
                   version: M114
                   serial: tx1lD9-opeP-7ldc-GwFV-d5ZA-u2xA-GVEQWH
                   size: 3350GiB
                   capabilities: 10000rpm lvm2
                   configuration: ansiversion=5 logicalsectorsize=512 sectorsize=512

So it would be great to see new module version in stable 4.13 kernel.

tags: added: kernel-fixed-upstream
Changed in linux (Ubuntu):
status: Incomplete → Confirmed
Oliver (oliwel) wrote :

Adding https://github.com/torvalds/linux/commit/2877cbffb79ed121a6bcc5edbe629d3aba36cd29

On top of UBUNTU: Ubuntu-hwe-4.13.0-33.36~16.04.1 solves the problem for me

Oliver (oliwel) wrote :
tags: added: patch
Changed in linux (Ubuntu):
status: Confirmed → In Progress
assignee: nobody → Joseph Salisbury (jsalisbury)
Changed in linux (Ubuntu Artful):
status: New → In Progress
importance: Undecided → Medium
Changed in linux (Ubuntu Bionic):
status: In Progress → Fix Committed
Changed in linux (Ubuntu Artful):
assignee: nobody → Joseph Salisbury (jsalisbury)
Joseph Salisbury (jsalisbury) wrote :

I built a test kernel with commit 2877cbffb79. The test kernel can be downloaded from:
http://kernel.ubuntu.com/~jsalisbury/lp1746970

Can you test this kernel and see if it resolves this bug?

Note, to test this kernel, you need to install both the linux-image and linux-image-extra .deb packages.

Thanks in advance!

Koms Prav (komspr) wrote :

Joseph Salisbury, I tested your kernel and it works as expected, no bug.
Thanks!

Steve Markgraf (steve-m) wrote :

I can also confirm that the Kernel posted in #20 resolves the issue with my Emulex Zephyr-X FC adapter.

Changed in linux (Ubuntu Artful):
status: In Progress → Fix Committed
Stefan Bader (smb) wrote :

This bug is awaiting verification that the kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-artful' to 'verification-done-artful'. If the problem still exists, change the tag 'verification-needed-artful' to 'verification-failed-artful'.

If verification is not done by 5 working days from today, this fix will be dropped from the source code, and this bug will be closed.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you!

tags: added: verification-needed-artful
Koms Prav (komspr) wrote :

I verified kernel from -proposal (Ubuntu 17.10 (GNU/Linux 4.13.0-38-generic x86_64)) and it works as expected.

tags: added: verification-done-artful
removed: verification-needed-artful
Launchpad Janitor (janitor) wrote :
Download full text (18.9 KiB)

This bug was fixed in the package linux - 4.13.0-38.43

---------------
linux (4.13.0-38.43) artful; urgency=medium

  * linux: 4.13.0-38.43 -proposed tracker (LP: #1755762)

  * Servers going OOM after updating kernel from 4.10 to 4.13 (LP: #1748408)
    - i40e: Fix memory leak related filter programming status
    - i40e: Add programming descriptors to cleaned_count

  * [SRU] Lenovo E41 Mic mute hotkey is not responding (LP: #1753347)
    - platform/x86: ideapad-laptop: Increase timeout to wait for EC answer

  * fails to dump with latest kpti fixes (LP: #1750021)
    - kdump: write correct address of mem_section into vmcoreinfo

  * headset mic can't be detected on two Dell machines (LP: #1748807)
    - ALSA: hda/realtek - Support headset mode for ALC215/ALC285/ALC289
    - ALSA: hda - Fix headset mic detection problem for two Dell machines
    - ALSA: hda - Fix a wrong FIXUP for alc289 on Dell machines

  * CIFS SMB2/SMB3 does not work for domain based DFS (LP: #1747572)
    - CIFS: make IPC a regular tcon
    - CIFS: use tcon_ipc instead of use_ipc parameter of SMB2_ioctl
    - CIFS: dump IPC tcon in debug proc file

  * i2c-thunderx: erroneous error message "unhandled state: 0" (LP: #1754076)
    - i2c: octeon: Prevent error message on bus error

  * hisi_sas: Add disk LED support (LP: #1752695)
    - scsi: hisi_sas: directly attached disk LED feature for v2 hw

  * EDAC, sb_edac: Backport 1 patch to Ubuntu 17.10 (Fix missing DIMM sysfs
    entries with KNL SNC2/SNC4 mode) (LP: #1743856)
    - EDAC, sb_edac: Fix missing DIMM sysfs entries with KNL SNC2/SNC4 mode

  * [regression] Colour banding and artefacts appear system-wide on an Asus
    Zenbook UX303LA with Intel HD 4400 graphics (LP: #1749420)
    - drm/edid: Add 6 bpc quirk for CPT panel in Asus UX303LA

  * DVB Card with SAA7146 chipset not working (LP: #1742316)
    - vmalloc: fix __GFP_HIGHMEM usage for vmalloc_32 on 32b systems

  * [Asus UX360UA] battery status in unity-panel is not changing when battery is
    being charged (LP: #1661876) // AC adapter status not detected on Asus
    ZenBook UX410UAK (LP: #1745032)
    - ACPI / battery: Add quirk for Asus UX360UA and UX410UAK

  * ASUS UX305LA - Battery state not detected correctly (LP: #1482390)
    - ACPI / battery: Add quirk for Asus GL502VSK and UX305LA

  * support thunderx2 vendor pmu events (LP: #1747523)
    - perf pmu: Extract function to get JSON alias map
    - perf pmu: Pass pmu as a parameter to get_cpuid_str()
    - perf tools arm64: Add support for get_cpuid_str function.
    - perf pmu: Add helper function is_pmu_core to detect PMU CORE devices
    - perf vendor events arm64: Add ThunderX2 implementation defined pmu core
      events
    - perf pmu: Add check for valid cpuid in perf_pmu__find_map()

  * lpfc.ko module doesn't work (LP: #1746970)
    - scsi: lpfc: Fix loop mode target discovery

  * Ubuntu 17.10 crashes on vmalloc.c (LP: #1739498)
    - powerpc/mm/book3s64: Make KERN_IO_START a variable
    - powerpc/mm/slb: Move comment next to the code it's referring to
    - powerpc/mm/hash64: Make vmalloc 56T on hash

  * ethtool -p fails to light NIC LED on HiSilicon D05 systems (LP: #1748567)
    - net...

Changed in linux (Ubuntu Artful):
status: Fix Committed → Fix Released
To post a comment you must log in.
This report contains Public information  Edit
Everyone can see this information.

Other bug subscribers