Comment 27 for bug 1913350

Revision history for this message
In , paula (paula-linux-kernel-bugs) wrote :

I'm going to tag along on this bug as I'm seeing likely the same problem.
The crash that I see is not that ramdom, I can easily trigger it by doing
a 1GB curl transfer which won't ever complete as the crash happens to frequently. As far as I can tell a reboot is necessary to clear the fault.
I use one machine as a testbed for wireless cards as it has a mPCIe slot.
Thus far I've used Intel 5100, 6205, 200AX, and now 7260AC cards in this machine.
I've only had problems with 7260AC cards, of which I have two, both exhibit the same problem.
So far I have found that the problem only occurs when connected at 2.4GHz.
Also, only when connected at 40MHz, 20MHz connections OK, xfer BW ~10MB/s.
Additionally, only when connected at 40MHz at boot.
If found that if I connect at 2.4GHz/20MHz and make a test transfer, I can then
reconfigure the AP to force 40MHz BW following which subsequent transfers
complete successfully at ~24MB/s, what I'm expecting from that configuration.
The 5GHz band does not exhibit any firmware crashes but does suffer from highly
variable transfer bandwidth.
When connected at 80MHz BW, maxes out at 30MB/s but that's not consistent. Most
of the time I only see half that, 15MB/s and that's without moving anything.
I'm using Debian Buster with kernel 4.19, though I've also tried 5.6 and 5.7
backported kernels and backported firmware. Same problem seen with the newer kernels and firmware.
I've looked at a number of wireless cards over the years and I haven't seen this kind of flaky problem before. I hope this report is taken seriously.
I regard this card as particularly important in that it is the newest and most
capable Intel Wireless card available in the mPCIE form factor. I would really like to see it working properly under linux.

qm77 motherboard, 3820QM CPU
Linux imb170 4.19.0-10-amd64 #1 SMP Debian 4.19.132-1 (2020-07-24) x86_64 GNU/Linux
03:00.0 Network controller: Intel Corporation Wireless 7260 (rev bb)
       description: Wireless interface
       product: Wireless 7260
       vendor: Intel Corporation
       physical id: 0
       bus info: pci@0000:03:00.0
       logical name: wlan0
       version: bb
       serial: 00:16:6f:e7:16:2a
       width: 64 bits
       clock: 33MHz
       capabilities: pm msi pciexpress bus_master cap_list ethernet physical wireless
       configuration: broadcast=yes driver=iwlwifi driverversion=4.19.0-10-amd64 firmware=17.3216344376.0 ip=192.168.1.126 latency=0 link=yes multicast=yes wireless=IEEE 802.11
       resources: irq:38 memory:f7a00000-f7a01fff
odule = "iwlwifi"

  Attributes:
    coresize = "249856"
    initsize = "0"
    initstate = "live"
    refcnt = "1"
    taint = ""
    uevent = <store method only>

  Parameters:
    11n_disable = "0"
    amsdu_size = "0"
    antenna_coupling = "0"
    bt_coex_active = "Y"
    d0i3_disable = "Y"
    d0i3_timeout = "1000"
    disable_11ac = "N"
    disable_11ax = "N"
    fw_monitor = "N"
    fw_restart = "Y"
    lar_disable = "N"
    led_mode = "0"
    nvm_file = "(null)"
    power_level = "0"
    power_save = "N"
    remove_when_gone = "N"
    swcrypto = "0"
    uapsd_disable = "3"

[ 179.224409] iwlwifi 0000:03:00.0: Failed to wake NIC for hcmd
[ 179.224451] iwlwifi 0000:03:00.0: Error sending SCAN_OFFLOAD_REQUEST_CMD: enqueue_hcmd failed: -5
[ 179.224458] iwlwifi 0000:03:00.0: Scan failed! ret -5
[ 182.226400] ------------[ cut here ]------------
[ 182.226405] Timeout waiting for hardware access (CSR_GP_CNTRL 0xffffffff)
[ 182.226455] WARNING: CPU: 1 PID: 0 at drivers/net/wireless/intel/iwlwifi/pcie/trans.c:2033 iwl_trans_pcie_grab_nic_access+0x1e8/0x220 [iwlwifi]
[ 182.226456] Modules linked in: cpufreq_powersave cpufreq_conservative cpufreq_userspace snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic ccm algif_aead cbc des_generic intel_rapl arc4 algif_skcipher cmac sha512_ssse3 sha512_generic md4 algif_hash af_alg x86_pkg_temp_thermal intel_powerclamp coretemp iwlmvm kvm_intel i915 kvm mac80211 snd_hda_intel irqbypass snd_hda_codec crct10dif_pclmul crc32_pclmul snd_hda_core ghash_clmulni_intel mei_wdt iwlwifi drm_kms_helper snd_hwdep snd_pcm intel_cstate btusb snd_timer btrtl btbcm snd ppdev evdev btintel pcc_cpufreq intel_uncore mei_me sg drm cfg80211 mei soundcore bluetooth intel_rapl_perf i2c_algo_bit pcspkr iTCO_wdt parport_pc iTCO_vendor_support parport drbg ansi_cprng video button ecdh_generic rfkill nfsd auth_rpcgss nfs_acl lockd grace
[ 182.226496] sunrpc ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 crc32c_generic fscrypto ecb sd_mod crc32c_intel ahci xhci_pci libahci xhci_hcd libata nvme ehci_pci aesni_intel ehci_hcd e1000e aes_x86_64 scsi_mod crypto_simd usbcore cryptd glue_helper nvme_core i2c_i801 lpc_ich mfd_core usb_common thermal fan
[ 182.226521] CPU: 1 PID: 0 Comm: swapper/1 Not tainted 4.19.0-10-amd64 #1 Debian 4.19.132-1
[ 182.226522] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./IMB-170, BIOS P1.90 04/30/2018
[ 182.226534] RIP: 0010:iwl_trans_pcie_grab_nic_access+0x1e8/0x220 [iwlwifi]
[ 182.226537] Code: 07 e2 49 8d 56 08 bf 00 02 00 00 e8 a2 5e fe e0 e9 33 ff ff ff 89 c6 48 c7 c7 80 f5 ac c0 c6 05 69 46 02 00 01 e8 c2 8a fc e0 <0f> 0b e9 ee fe ff ff 48 8b 7b 30 48 c7 c1 e8 f5 ac c0 31 d2 31 f6
[ 182.226539] RSP: 0018:ffff8d3156443e00 EFLAGS: 00010086
[ 182.226541] RAX: 0000000000000000 RBX: ffff8d314fb60018 RCX: 0000000000000006
[ 182.226542] RDX: 0000000000000007 RSI: 0000000000000082 RDI: ffff8d31564566b0
[ 182.226544] RBP: 0000000000000000 R08: 0000000000000306 R09: 0000000000000004
[ 182.226545] R10: 0000000000000000 R11: 0000000000000001 R12: ffff8d314fb6a258
[ 182.226547] R13: ffff8d3156443e30 R14: 00000000ffffffff R15: 0000000000000004
[ 182.226549] FS: 0000000000000000(0000) GS:ffff8d3156440000(0000) knlGS:0000000000000000
[ 182.226550] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 182.226552] CR2: 00007fe3e5625d78 CR3: 000000014c80a006 CR4: 00000000001606e0
[ 182.226554] Call Trace:
[ 182.226558] <IRQ>
[ 182.226569] iwl_read_prph+0x32/0x90 [iwlwifi]
[ 182.226581] iwl_trans_pcie_log_scd_error+0x13a/0x210 [iwlwifi]
[ 182.226591] iwl_pcie_txq_stuck_timer+0x46/0x70 [iwlwifi]
[ 182.226601] ? iwl_pcie_clear_cmd_in_flight+0x80/0x80 [iwlwifi]
[ 182.226608] call_timer_fn+0x2b/0x130
[ 182.226612] run_timer_softirq+0x1c7/0x3e0
[ 182.226617] ? tick_sched_timer+0x37/0x70
[ 182.226621] ? __hrtimer_run_queues+0x110/0x280
[ 182.226626] ? recalibrate_cpu_khz+0x10/0x10
[ 182.226628] ? ktime_get+0x3a/0xa0
[ 182.226634] __do_softirq+0xde/0x2d8
[ 182.226640] irq_exit+0xba/0xc0
[ 182.226644] smp_apic_timer_interrupt+0x74/0x140
[ 182.226648] apic_timer_interrupt+0xf/0x20
[ 182.226650] </IRQ>
[ 182.226654] RIP: 0010:cpuidle_enter_state+0xb9/0x320
[ 182.226656] Code: e8 4c b9 b0 ff 80 7c 24 0b 00 74 17 9c 58 0f 1f 44 00 00 f6 c4 02 0f 85 3b 02 00 00 31 ff e8 1e a9 b6 ff fb 66 0f 1f 44 00 00 <48> b8 ff ff ff ff f3 01 00 00 48 2b 1c 24 ba ff ff ff 7f 48 39 c3
[ 182.226658] RSP: 0018:ffffa9cf80cd3e90 EFLAGS: 00000246 ORIG_RAX: ffffffffffffff13
[ 182.226660] RAX: ffff8d31564620c0 RBX: 0000002a6c74a32e RCX: 000000000000001f
[ 182.226662] RDX: 0000002a6c74a32e RSI: 000000002f83de23 RDI: 0000000000000000
[ 182.226663] RBP: ffff8d315646a300 R08: 0000000000000002 R09: 0000000000021980
[ 182.226664] R10: 0000007a3e62ed4b R11: ffff8d31564610a8 R12: 0000000000000005
[ 182.226666] R13: ffffffffa2ab71f8 R14: 0000000000000005 R15: 0000000000000000
[ 182.226673] do_idle+0x228/0x270
[ 182.226677] cpu_startup_entry+0x6f/0x80
[ 182.226680] start_secondary+0x1a4/0x200
[ 182.226685] secondary_startup_64+0xa4/0xb0
[ 182.226688] ---[ end trace ed4ef1147e5a66cf ]---
[ 182.226694] iwlwifi 0000:03:00.0: iwlwifi transaction failed, dumping registers
[ 182.226704] iwlwifi 0000:03:00.0: iwlwifi device config registers:
[ 182.226760] iwlwifi 0000:03:00.0: 00000000: 08b18086 00100000 028000bb 00000000 00000004 00000000 00000000 00000000
[ 182.226767] iwlwifi 0000:03:00.0: 00000020: 00000000 00000000 00000000 40708086 00000000 000000c8 00000000 00000100
[ 182.226773] iwlwifi 0000:03:00.0: iwlwifi device memory mapped registers:
[ 182.226812] iwlwifi 0000:03:00.0: 00000000: ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff
[ 182.226819] iwlwifi 0000:03:00.0: 00000020: ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff
[ 182.226828] iwlwifi 0000:03:00.0: iwlwifi device AER capability structure:
[ 182.226860] iwlwifi 0000:03:00.0: 00000000: 14010001 00100000 00000000 00462031 000031c1 00002000 00000014 40000001
[ 182.226866] iwlwifi 0000:03:00.0: 00000020: 0000000f f7a00460 00000000
[ 182.226870] iwlwifi 0000:03:00.0: iwlwifi parent port (0000:00:1c.2) config registers:
[ 182.226899] iwlwifi 0000:00:1c.2: 00000000: 1e148086 00100007 060400c4 00810010 00000000 00000000 00030300 200000f0
[ 182.226907] iwlwifi 0000:00:1c.2: 00000020: f7a0f7a0 0001fff1 00000000 00000000 00000000 00000040 00000000 0010030b
[ 182.281442] iwlwifi 0000:03:00.0: Queue 4 is active on fifo 2 and stuck for 10000 ms. SW [138, 143] HW [90, 90] FH TRB=0x05a5a5a5a