kernel module bnx2x

Bug #2009806 reported by gu
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Confirmed
Undecided
Unassigned

Bug Description

I encountered a bug. The network card driver caused a kernel crash. I am not sure whether it is a hardware failure or a software bug. I ask the community to help analyze it.

kernel 4.15.0-177
bnx2x driver 1.712.30-0
firmware:7.13.1.0.fw

NETDEV WATCHDOG: p4p1 (bnx2x): transmit queue 6 timed ou

Mar 8 03:13:55 tst04cn01 kernel: [209257.142599] ------------[ cut here ]------------
Mar 8 03:13:55 tst04cn01 kernel: [209257.142614] NETDEV WATCHDOG: p4p1 (bnx2x): transmit queue 6 timed out
Mar 8 03:13:55 tst04cn01 kernel: [209257.142634] WARNING: CPU: 3 PID: 0 at /build/linux-hwe-6eCHXL/linux-hwe-4.15.0/net/sched/sch_generic.c:323 dev_watchdog+0x222/0x230
Mar 8 03:13:55 tst04cn01 kernel: [209257.142637] Modules linked in: vhost_net vhost tap act_police cls_u32 sch_ingress cls_fw sch_sfq sch_htb 8021q garp mrp ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter ip_tables x_tables ipmi_ssif crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel aes_x86_64 crypto_simd glue_helper cryptd kvm_intel kvm irqbypass ipmi_si ipmi_devintf ipmi_msghandler sunrpc br_netfilter bridge stp llc bonding lp parport autofs4 xfs btrfs xor zstd_compress raid6_pq ixgbe igb bnx2x dca mdio ahci ptp i2c_algo_bit libahci megaraid_sas pps_core libcrc32c
Mar 8 03:13:55 tst04cn01 kernel: [209257.142670] CPU: 3 PID: 0 Comm: swapper/3 Not tainted 4.15.0-177-generic #186~16.04.1-Ubuntu
Mar 8 03:13:55 tst04cn01 kernel: [209257.142671] Hardware name: Dell Inc. PowerEdge R730xd/0H21J3, BIOS 2.14.0 01/14/2022
Mar 8 03:13:55 tst04cn01 kernel: [209257.142673] RIP: 0010:dev_watchdog+0x222/0x230
Mar 8 03:13:55 tst04cn01 kernel: [209257.142674] RSP: 0018:ffff923a8f243e68 EFLAGS: 00010282
Mar 8 03:13:55 tst04cn01 kernel: [209257.142676] RAX: 0000000000000000 RBX: 0000000000000006 RCX: 0000000000000000
Mar 8 03:13:55 tst04cn01 kernel: [209257.142677] RDX: 0000000000040400 RSI: 00000000000000f6 RDI: 0000000000000300
Mar 8 03:13:55 tst04cn01 kernel: [209257.142678] RBP: ffff923a8f243e98 R08: 0000000000020cf0 R09: ffffffff86d694c4
Mar 8 03:13:55 tst04cn01 kernel: [209257.142679] R10: 0000000000000000 R11: 000000000000061b R12: 000000000000005b
Mar 8 03:13:55 tst04cn01 kernel: [209257.142680] R13: ffff923a843a0000 R14: ffff923a843a0478 R15: ffff923a843af100
Mar 8 03:13:55 tst04cn01 kernel: [209257.142681] FS: 0000000000000000(0000) GS:ffff923a8f240000(0000) knlGS:0000000000000000
Mar 8 03:13:55 tst04cn01 kernel: [209257.142682] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 8 03:13:55 tst04cn01 kernel: [209257.142683] CR2: 00005609afeef260 CR3: 000000298a80a004 CR4: 00000000001626e0
Mar 8 03:13:55 tst04cn01 kernel: [209257.142684] Call Trace:
Mar 8 03:13:55 tst04cn01 kernel: [209257.142686] <IRQ>
Mar 8 03:13:55 tst04cn01 kernel: [209257.142690] ? dev_deactivate_queue.constprop.33+0x60/0x60
Mar 8 03:13:55 tst04cn01 kernel: [209257.142693] call_timer_fn+0x32/0x140
Mar 8 03:13:55 tst04cn01 kernel: [209257.142695] run_timer_softirq+0x1e9/0x430
Mar 8 03:13:55 tst04cn01 kernel: [209257.142697] ? ktime_get+0x3e/0xb0
Mar 8 03:13:55 tst04cn01 kernel: [209257.142701] ? lapic_next_deadline+0x2c/0x40
Mar 8 03:13:55 tst04cn01 kernel: [209257.142705] __do_softirq+0xf5/0x2a8
Mar 8 03:13:55 tst04cn01 kernel: [209257.142710] irq_exit+0xca/0xd0
Mar 8 03:13:55 tst04cn01 kernel: [209257.142712] smp_apic_timer_interrupt+0x79/0x150
Mar 8 03:13:55 tst04cn01 kernel: [209257.142714] apic_timer_interrupt+0x90/0xa0
Mar 8 03:13:55 tst04cn01 kernel: [209257.142715] </IRQ>
Mar 8 03:13:55 tst04cn01 kernel: [209257.142716] RIP: 0010:cpu_idle_poll+0x3c/0x150
Mar 8 03:13:55 tst04cn01 kernel: [209257.142717] RSP: 0018:ffffa5c10011bea8 EFLAGS: 00000202 ORIG_RAX: ffffffffffffff11
Mar 8 03:13:55 tst04cn01 kernel: [209257.142719] RAX: 0000000080200000 RBX: ffff923a88f30000 RCX: 000000000001f6c0
Mar 8 03:13:55 tst04cn01 kernel: [209257.142720] RDX: 00000000392002d2 RSI: ffff923a8f25f6c0 RDI: 0000000000000000
Mar 8 03:13:55 tst04cn01 kernel: [209257.142721] RBP: ffffa5c10011beb8 R08: 0000000000000000 R09: 0000000000000000
Mar 8 03:13:55 tst04cn01 kernel: [209257.142722] R10: ffffa5c10011be18 R11: 00000000000003f8 R12: 0000000000000003
Mar 8 03:13:55 tst04cn01 kernel: [209257.142723] R13: ffff923a88f30000 R14: 0000000000000000 R15: 0000000000000000
Mar 8 03:13:55 tst04cn01 kernel: [209257.142725] ? cpu_idle_poll+0x11/0x150
Mar 8 03:13:55 tst04cn01 kernel: [209257.142729] do_idle+0x4c/0x200
Mar 8 03:13:55 tst04cn01 kernel: [209257.142731] cpu_startup_entry+0x73/0x80
Mar 8 03:13:55 tst04cn01 kernel: [209257.142733] start_secondary+0x1ab/0x200
Mar 8 03:13:55 tst04cn01 kernel: [209257.142737] secondary_startup_64+0xa5/0xb0
Mar 8 03:13:55 tst04cn01 kernel: [209257.142739] Code: 35 00 49 63 4e e8 eb 92 4c 89 ef c6 05 65 52 d6 00 01 e8 82 14 fd ff 89 d9 48 89 c2 4c 89 ee 48 c7 c7 28 3a 59 86 e8 5e 2b 7e ff <0f> 0b eb c0 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 55 48
Mar 8 03:13:55 tst04cn01 kernel: [209257.142764] ---[ end trace 5f4d5379e29afb76 ]---
Mar 8 03:13:55 tst04cn01 kernel: [209257.142960] bnx2x: [bnx2x_sp_rtnl_task:10349(p4p1)]Indicating link is down due to Tx-timeout
Mar 8 03:13:55 tst04cn01 kernel: [209257.220334] bnx2x: [bnx2x_stats_comp:205(p4p1)]timeout waiting for stats finished
Mar 8 03:13:55 tst04cn01 kernel: [209257.305371] bnx2x: [bnx2x_stats_comp:205(p4p1)]timeout waiting for stats finished
Mar 8 03:13:57 tst04cn01 kernel: [209259.386610] bnx2x: [bnx2x_clean_tx_queue:1207(p4p1)]timeout waiting for queue[0]: txdata->tx_pkt_prod(26205) != txdata->tx_pkt_cons(26191)
Mar 8 03:13:59 tst04cn01 kernel: [209261.468729] bnx2x: [bnx2x_clean_tx_queue:1207(p4p1)]timeout waiting for queue[1]: txdata->tx_pkt_prod(10280) != txdata->tx_pkt_cons(10264)
Mar 8 03:14:01 tst04cn01 kernel: [209263.552667] bnx2x: [bnx2x_clean_tx_queue:1207(p4p1)]timeout waiting for queue[2]: txdata->tx_pkt_prod(50370) != txdata->tx_pkt_cons(50354)
Mar 8 03:14:03 tst04cn01 kernel: [209265.648772] bnx2x: [bnx2x_clean_tx_queue:1207(p4p1)]timeout waiting for queue[4]: txdata->tx_pkt_prod(34060) != txdata->tx_pkt_cons(34042)
Mar 8 03:14:05 tst04cn01 kernel: [209267.758666] bnx2x: [bnx2x_clean_tx_queue:1207(p4p1)]timeout waiting for queue[5]: txdata->tx_pkt_prod(22349) != txdata->tx_pkt_cons(22344)
Mar 8 03:14:07 tst04cn01 kernel: [209269.864665] bnx2x: [bnx2x_clean_tx_queue:1207(p4p1)]timeout waiting for queue[6]: txdata->tx_pkt_prod(64995) != txdata->tx_pkt_cons(63781)
Mar 8 03:14:10 tst04cn01 kernel: [209271.978689] bnx2x: [bnx2x_clean_tx_queue:1207(p4p1)]timeout waiting for queue[0]: txdata->tx_pkt_prod(26205) != txdata->tx_pkt_cons(26191)
Mar 8 03:14:12 tst04cn01 kernel: [209274.090660] bnx2x: [bnx2x_clean_tx_queue:1207(p4p1)]timeout waiting for queue[1]: txdata->tx_pkt_prod(10280) != txdata->tx_pkt_cons(10264)
Mar 8 03:14:14 tst04cn01 kernel: [209276.212140] bnx2x: [bnx2x_clean_tx_queue:1207(p4p1)]timeout waiting for queue[2]: txdata->tx_pkt_prod(50370) != txdata->tx_pkt_cons(50354)
Mar 8 03:14:16 tst04cn01 kernel: [209278.327848] bnx2x: [bnx2x_clean_tx_queue:1207(p4p1)]timeout waiting for queue[4]: txdata->tx_pkt_prod(34060) != txdata->tx_pkt_cons(34042)
Mar 8 03:14:18 tst04cn01 kernel: [209280.452718] bnx2x: [bnx2x_clean_tx_queue:1207(p4p1)]timeout waiting for queue[5]: txdata->tx_pkt_prod(22349) != txdata->tx_pkt_cons(22344)
Mar 8 03:14:20 tst04cn01 kernel: [209282.538695] bnx2x: [bnx2x_clean_tx_queue:1207(p4p1)]timeout waiting for queue[6]: txdata->tx_pkt_prod(64995) != txdata->tx_pkt_cons(63781)
Mar 8 03:14:30 tst04cn01 kernel: [209292.513780] bnx2x: [bnx2x_state_wait:310(p4p1)]timeout waiting for state 0
Mar 8 03:14:30 tst04cn01 kernel: [209292.575878] bnx2x: [bnx2x_del_all_macs:8545(p4p1)]Failed to delete MACs: -16
Mar 8 03:14:30 tst04cn01 kernel: [209292.637424] bnx2x: [bnx2x_chip_cleanup:9365(p4p1)]Failed to schedule DEL commands for UC MACs list: -16
Mar 8 03:14:40 tst04cn01 kernel: [209302.760151] bnx2x: [bnx2x_state_wait:310(p4p1)]timeout waiting for state 1
Mar 8 03:14:40 tst04cn01 kernel: [209302.820645] bnx2x: [bnx2x_chip_cleanup:9375(p4p1)]Failed to delete all VLANs
Mar 8 03:14:50 tst04cn01 kernel: [209312.916796] bnx2x: [bnx2x_state_wait:310(p4p1)]timeout waiting for state 9
Mar 8 03:15:00 tst04cn01 kernel: [209322.836940] bnx2x: [bnx2x_state_wait:310(p4p1)]timeout waiting for state 2
Mar 8 03:15:00 tst04cn01 kernel: [209322.894601] bnx2x: [bnx2x_func_stop:9124(p4p1)]FUNC_STOP ramrod failed. Running a dry transaction

Revision history for this message
gu (cshaven) wrote :

attachment

Revision history for this message
Ubuntu Kernel Bot (ubuntu-kernel-bot) wrote : Missing required logs.

This bug is missing log files that will aid in diagnosing the problem. While running an Ubuntu kernel (not a mainline or third-party kernel) please enter the following command in a terminal window:

apport-collect 2009806

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
gu (cshaven)
Changed in linux (Ubuntu):
status: Incomplete → Confirmed
Revision history for this message
gu (cshaven) wrote :

Temporarily unable to run apport-collect, can it be analyzed according to the existing logs, looking forward to the team's reply

Revision history for this message
gu (cshaven) wrote :

Hi community, can you help analyze it?

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Bug attachments

Remote bug watches

Bug watches keep track of this bug in other bug trackers.