Server Crash while running IO and switch port bounce test with 2K login session
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
Incomplete
|
Undecided
|
Unassigned | ||
Jammy |
New
|
Undecided
|
Unassigned |
Bug Description
[Impact]
Server crash and Call trace reported on one of the servers running IO and
switch port bounce test from the 2K login session configuration.
Call Trace:
[56048.470488] Call Trace:
[56048.470489] _raw_spin_
[56048.470489] lpfc_dmp_
[56048.470490] lpfc_cmpl_
[56048.470490] lpfc_sli_
[56048.470490] lpfc_els_
[56048.470491] lpfc_els_
[56048.470491] lpfc_sli4_
[56048.470492] lpfc_do_
[56048.470492] ? __schedule+
[56048.470492] ? finish_
[56048.470493] ? lpfc_unregister
[56048.470493] kthread+0x112/0x130
[56048.470493] ? kthread_
[56048.470494] ret_from_
[56048.470494] Kernel panic - not syncing: Hard LOCKUP
[56048.470495] CPU: 0 PID: 682 Comm: lpfc_worker_0 Kdump: loaded Tainted: G
IOE --------- - - 4.18.0-
[56048.470496] Hardware name: Dell Inc. PowerEdge R740/0DY2X0, BIOS 2.11.2
004/21/2021
[56048.470496] Call Trace:
[56048.470496] <NMI>
[56048.470496] dump_stack+
[56048.470497] panic+0xe7/0x2a9
[56048.470497] ? __switch_
[56048.470497] nmi_panic.
[56048.470498] watchdog_
[56048.470498] __perf_
[56048.470499] handle_
[56048.470499] ? __set_pte_
[56048.470499] ? __native_
[56048.470500] ? ghes_copy_
[56048.470500] ? __ghes_
[56048.470500] intel_pmu_
[56048.470501] perf_event_
[56048.470501] nmi_handle+
[56048.470501] default_
[56048.470502] do_nmi+0x128/0x190
[56048.470502] end_repeat_
[56048.470503] RIP: 0010:native_
[56048.470504] Code: 0f ba 2f 08 0f 92 c0 0f b6 c0 c1 e0 08 89 c2 8b 07 30 e4
09 d0 a9 00 01 ff ff 75 47 85 c0 74 0e 8b 07 84 c0 74 08 f3 90 8b 07 <84> c0 75
f8 b8 01 00 00 00 66 89 07 c3 8b 37 81 fe 00 01 00 00 75
[56048.470504] RSP: 0018:ffffacebc7
[56048.470505] RAX: 0000000000000101 RBX: 0000000000000246 RCX:
000000000000001f
[56048.470505] RDX: 0000000000000000 RSI: 0000000000000000 RDI:
ffff94dcf5341dc0
[56048.470506] RBP: ffff94dcf5340000 R08: 0000000000000002 R09:
0000000000029600
[56048.470506] R10: 000060d29656a45c R11: ffff94dcf534fd12 R12:
ffff94dcf5341db0
[56048.470507] R13: ffff94dcf5341dc0 R14: ffff94dcc4ae8a00 R15:
0000000000000003
[56048.470507] ? native_
[56048.470507] ? native_
[56048.470508] </NMI>
[56048.470508] _raw_spin_
[56048.470509] lpfc_dmp_
[56048.470509] lpfc_cmpl_
[56048.470509] lpfc_sli_
[56048.470510] lpfc_els_
[56048.470510] lpfc_els_
[56048.470510] lpfc_sli4_
[56048.470511] lpfc_do_
[56048.470511] ? __schedule+
[56048.470511] ? finish_
[56048.470512] ? lpfc_unregister
[56048.470512] kthread+0x112/0x130
[56048.470513] ? kthread_
[56048.470513] ret_from_
[root@ms-
[root@ms-
/etc/redhat-release
Red Hat Enterprise Linux release 8.3 (Ootpa)
[root@ms-
/sys/module/
0:14.0.390.2
[root@ms-
/sys/class/
Emulex LightPulse LPe32002-M2 2-Port 32Gb Fibre Channel Adapter
Emulex LightPulse LPe32002-M2 2-Port 32Gb Fibre Channel Adapter
[root@ms-
/sys/class/
14.0.390.1, sli-4:2:c
14.0.390.1, sli-4:2:c
[root@ms-
/sys/class/
0x10000090faf09459
0x10000090faf0945a
[root@ms-
HBA Attributes for 10:00:00:
Host Name : ms-svr3-
Manufacturer : Emulex Corporation
Serial Number : FC70793283
Model : LPe32002-M2
Model Desc : Emulex LightPulse LPe32002-M2 2-Port 32Gb Fibre
Channel Adapter
Node WWN : 20 00 00 90 fa f0 94 59
Node Symname :
HW Version : 0000000c 00000001 00000000
FW Version : 14.0.390.1
Vendor Spec ID : 10DF
Number of Ports : 1
Driver Name : lpfc
Driver Version : 14.0.390.2; HBAAPI(I) v2.3.d, 07-12-10
Device ID : E300
HBA Type : LPe32002-M2
Operational FW : 14.0.390.1
IEEE Address : 00 90 fa f0 94 59
Boot Code : Enabled
Boot Version : 14.0.390.1
Board Temperature : Normal
Function Type : FC
Sub Device ID : E300
PCI Bus Number : 94
PCI Func Number : 0
Sub Vendor ID : 10DF
IPL Filename : H62LEX1
Service Processor FW Name : 14.0.390.1
ULP FW Name : 14.0.390.1
FC Universal BIOS Version : 14.0.390.1
FC x86 BIOS Version : 14.0.390.1
FC EFI BIOS Version : 14.0.388.0
FC FCODE Version : 14.0.386.0
Flash Firmware Version : 14.0.390.1
Secure Firmware : Enabled
[root@ms-
Port Attributes for 10:00:00:
Node WWN : 20 00 00 90 fa f0 94 59
Port WWN : 10 00 00 90 fa f0 94 59
Port Symname :
Port FCID : 0000
Port Type : Unknown
Port State : Link Down
Port Service Type : 8
Port Supported FC4 : 00 00 01 00 00 00 00 01
Port Active FC4 : 00 00 01 00 00 00 00 01
Port Supported Speed : 8 16 32 Gbit/sec
Configured Port Speed : Auto Detect
Port Speed : Not Available
Max Frame Size : 2048
OS Device Name : /sys/class/
Num Discovered Ports : 0
Fabric Name : 00 00 00 00 00 00 00 00
Function Type : FC
FEC : Enabled
[Fixes]
The following patch will resolve the issue:
scsi: lpfc: Move cfg_log_verbose check before calling lpfc_dmp_dbg()
In an attempt to log message 0126 with LOG_TRACE_EVENT, the following hard
lockup call trace hangs the system.
[Testcase]
[root@ms-
[reply] [-]Comment 3James Smart 2022-04-13 09:12:37 PDT
Patches pushed upstream 4/12/22:
https://<email address hidden>/T/#t
tags: | added: servcert-345 |
Changed in linux (Ubuntu): | |
status: | Expired → New |
This bug is missing log files that will aid in diagnosing the problem. While running an Ubuntu kernel (not a mainline or third-party kernel) please enter the following command in a terminal window:
apport-collect 1971193
and then change the status of the bug to 'Confirmed'.
If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.
This change has been made by an automated script, maintained by the Ubuntu Kernel Team.