qla2xxx driver fails all paths

Bug #2027988 reported by Gabriel Tzagkarakis
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
linux-signed-hwe-5.19 (Ubuntu)
New
Undecided
Unassigned

Bug Description

After upgrading an HP BL460 G7 from 20.04 to 22.04, the storage mounted from an hp msa2324fc using the QLogic QMH2462 dual channel adapter fails all paths after a while, usually less than an hour.
This occured with kernel 5.15.0-76-generic installed by upgrading, but gave 5.19.0-46-generic a shot hoping to be bug free.

In the same blade enclosure are other server blades connected to the same storage via the same switches
with ubuntu 16.04 18.04 20.04 which operate as intended plus a couple more non-blade servers.

When the issue occurs all cpu goes to 100 iowait, load average keeps climbing up until i do
echo 1 > /sys/class/fc_host/host2/issue_lip ( and host3 ) and flush the multipath device after using umount ( if mounted ) because it gets stuck without even mounting it.

I verified multipathd.conf configuration if correct.
=== multipath.conf ===
blacklist {
        devnode "^sda"
}

defaults {
        polling_interval 5
        fast_io_fail_tmo 5
        dev_loss_tmo 10
        checker_timeout 15
}

devices {
 device {
  vendor "HP"
  product "MSA2312fc|MSA2324fc"
  hardware_handler "0"
  uid_attribute ID_SERIAL
  path_selector "round-robin 0"
  path_grouping_policy group_by_prio
  failback immediate
  rr_weight uniform
  no_path_retry 18
  rr_min_io 100
  path_checker tur
  prio alua
 }
}

multipaths {
 multipath {
  wwid 3600c0ff000daa122bf012d5001000000
  alias wtsql
  path_grouping_policy group_by_prio
 }
}
=== end multipath.conf ===

qla2xxx module parameters: "ql2xmaxqdepth=16 qlport_down_retry=10 ql2xloginretrycount=30 ql2xextended_error_logging=0x1e400000"

lsb_release -rd
Description: Ubuntu 22.04.2 LTS
Release: 22.04

apt-cache policy linux-generic-hwe-22.04
linux-generic-hwe-22.04:
  Installed: 5.19.0.46.47~22.04.21
  Candidate: 5.19.0.46.47~22.04.21
  Version table:
 *** 5.19.0.46.47~22.04.21 500
        500 http://gr.archive.ubuntu.com/ubuntu jammy-updates/main amd64 Packages
        500 http://security.ubuntu.com/ubuntu jammy-security/main amd64 Packages
        100 /var/lib/dpkg/status
     5.15.0.25.27 500
        500 http://gr.archive.ubuntu.com/ubuntu jammy/main amd64 Packages

ProblemType: Bug
DistroRelease: Ubuntu 22.04
Package: linux-image-5.19.0-46-generic 5.19.0-46.47~22.04.1
ProcVersionSignature: Ubuntu 5.19.0-46.47~22.04.1-generic 5.19.17
Uname: Linux 5.19.0-46-generic x86_64
ApportVersion: 2.20.11-0ubuntu82.5
Architecture: amd64
CasperMD5CheckResult: unknown
Date: Mon Jul 17 16:40:10 2023
SourcePackage: linux-signed-hwe-5.19
UpgradeStatus: Upgraded to jammy on 2023-07-12 (5 days ago)
mtime.conffile..etc.logrotate.d.apport: 2017-08-03T08:14:27.712298

Revision history for this message
Gabriel Tzagkarakis (gabrieltz) wrote :
Revision history for this message
Gabriel Tzagkarakis (gabrieltz) wrote :
Revision history for this message
Gabriel Tzagkarakis (gabrieltz) wrote :
Revision history for this message
Gabriel Tzagkarakis (gabrieltz) wrote :

When all paths are fails multipath -ll or any other command that accesses those disks ( fdisk , hdparm ) hangs indefinitely until echo 1 > issue_lip is issued

dmesg output after the issue_lip is attached as well

Revision history for this message
Gabriel Tzagkarakis (gabrieltz) wrote :

I upgraded to linux-image-6.2.0-26-generic this morning and problem has not manifested after 4 hours of use

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.