iwlwifi 0000:04:00.0: Queue 10 is active on fifo 2 and stuck for 10000 ms. SW [247, 164] HW [90, 90] FH TRB=0x05a5a5a5a

Bug #1793997 reported by berend
This bug affects 6 people
Affects: linux-signed (Ubuntu)
Status: Confirmed
Importance: Medium
Assigned to: Unassigned

Bug Description

Every few days iwlwifi stops working. The only recourse is a restart.

From syslog:

Sep 24 11:42:10 bonobo kernel: [109525.538612] iwlwifi 0000:04:00.0: iwlwifi transaction failed, dumping registers
Sep 24 11:42:10 bonobo kernel: [109525.538622] iwlwifi 0000:04:00.0: iwlwifi device config registers:
Sep 24 11:42:10 bonobo kernel: [109525.538669] iwlwifi 0000:04:00.0: 00000000: 08b18086 00100000 028000bb 00000000 00000004 00000000 00000000 00000000
Sep 24 11:42:10 bonobo kernel: [109525.538674] iwlwifi 0000:04:00.0: 00000020: 00000000 00000000 00000000 40708086 00000000 000000c8 00000000 00000100
Sep 24 11:42:10 bonobo kernel: [109525.538677] iwlwifi 0000:04:00.0: iwlwifi device memory mapped registers:
Sep 24 11:42:10 bonobo kernel: [109525.538709] iwlwifi 0000:04:00.0: 00000000: ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff
Sep 24 11:42:10 bonobo kernel: [109525.538713] iwlwifi 0000:04:00.0: 00000020: ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff
Sep 24 11:42:10 bonobo kernel: [109525.538719] iwlwifi 0000:04:00.0: iwlwifi device AER capability structure:
Sep 24 11:42:10 bonobo kernel: [109525.538746] iwlwifi 0000:04:00.0: 00000000: 14010001 00100000 00000000 00462031 00003141 00002000 00000014 40000001
Sep 24 11:42:10 bonobo kernel: [109525.538749] iwlwifi 0000:04:00.0: 00000020: 0000000f ec100460 00000000
Sep 24 11:42:10 bonobo kernel: [109525.538752] iwlwifi 0000:04:00.0: iwlwifi parent port (0000:00:1c.3) config registers:
Sep 24 11:42:10 bonobo kernel: [109525.538775] iwlwifi 0000:00:1c.3: 00000000: 8c168086 00100007 060400d5 00810010 00000000 00000000 00040400 200000f0
Sep 24 11:42:10 bonobo kernel: [109525.538779] iwlwifi 0000:00:1c.3: 00000020: ec10ec10 0001fff1 00000000 00000000 00000000 00000040 00000000 00100405
Sep 24 11:42:10 bonobo kernel: [109525.538784] ------------[ cut here ]------------
Sep 24 11:42:10 bonobo kernel: [109525.538785] Timeout waiting for hardware access (CSR_GP_CNTRL 0xffffffff)
Sep 24 11:42:10 bonobo kernel: [109525.538842] WARNING: CPU: 5 PID: 0 at /build/linux-SlLHxe/linux-4.15.0/drivers/net/wireless/intel/iwlwifi/pcie/trans.c:1973 iwl_trans_pcie_grab_nic_access+0xea/0xf0 [iwlwifi]
Sep 24 11:42:10 bonobo kernel: [109525.538843] Modules linked in: rpcsec_gss_krb5 auth_rpcgss nfsv4 nfs lockd grace fscache rfcomm ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 xt_comment vmnet(OE) pci_stub vmw_vsock_vmci_transport vsock xfrm_user vboxpci(OE) vmw_vmci xfrm4_tunnel tunnel4 vboxnetadp(OE) ipcomp xfrm_ipcomp vmmon(OE) vboxnetflt(OE) esp4 vboxdrv(OE) ah4 af_key xfrm_algo ccm xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack libcrc32c ipt_REJECT nf_reject_ipv4 xt_tcpudp bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables devlink iptable_filter cmac bnep ec_sys binfmt_misc zfs(PO) zunicode(PO) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) algif_skcipher af_alg dm_crypt intel_rapl x86_pkg_temp_thermal
Sep 24 11:42:10 bonobo kernel: [109525.538898] intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel aes_x86_64 crypto_simd glue_helper cryptd snd_hda_codec_hdmi arc4 intel_cstate snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel snd_hda_codec snd_usb_audio snd_hda_core snd_usbmidi_lib snd_hwdep snd_pcm btusb btrtl snd_seq_midi btbcm snd_seq_midi_event btintel uvcvideo intel_rapl_perf bluetooth serio_raw snd_rawmidi videobuf2_vmalloc iwlmvm mxm_wmi mac80211 rtsx_pci_ms videobuf2_memops ecdh_generic input_leds iwlwifi snd_seq videobuf2_v4l2 memstick videobuf2_core cfg80211 snd_seq_device videodev snd_timer media joydev snd mei_me soundcore nvidia_uvm(POE) mei mac_hid ie31200_edac shpchp lpc_ich wmi_bmof sch_fq_codel parport_pc ppdev sunrpc lp parport ip_tables
Sep 24 11:42:10 bonobo kernel: [109525.538952] x_tables autofs4 btrfs xor zstd_compress raid6_pq dm_mirror dm_region_hash dm_log hid_generic usbhid hid nvidia_drm(POE) nvidia_modeset(POE) rtsx_pci_sdmmc nvidia(POE) drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm rtsx_pci psmouse ahci r8169 libahci ipmi_devintf mii ipmi_msghandler video wmi
Sep 24 11:42:10 bonobo kernel: [109525.538982] CPU: 5 PID: 0 Comm: swapper/5 Tainted: P OE 4.15.0-34-generic #37-Ubuntu
Sep 24 11:42:10 bonobo kernel: [109525.538983] Hardware name: System76, Inc. Bonobo WS /Bonobo WS , BIOS 4.6.5 05/15/2015
Sep 24 11:42:10 bonobo kernel: [109525.538993] RIP: 0010:iwl_trans_pcie_grab_nic_access+0xea/0xf0 [iwlwifi]
Sep 24 11:42:10 bonobo kernel: [109525.538995] RSP: 0018:ffff89ce6ed43dc8 EFLAGS: 00010086
Sep 24 11:42:10 bonobo kernel: [109525.538997] RAX: 0000000000000000 RBX: ffff89ce442c0018 RCX: 0000000000000006
Sep 24 11:42:10 bonobo kernel: [109525.538999] RDX: 0000000000000007 RSI: 0000000000000092 RDI: ffff89ce6ed56490
Sep 24 11:42:10 bonobo kernel: [109525.539000] RBP: ffff89ce6ed43de0 R08: 0000000000000001 R09: 0000000000000e23
Sep 24 11:42:10 bonobo kernel: [109525.539001] R10: ffffeee191aff540 R11: 0000000000000000 R12: ffff89ce442c8f00
Sep 24 11:42:10 bonobo kernel: [109525.539002] R13: ffff89ce6ed43df0 R14: 000000000000000a R15: ffff89ce442c0018
Sep 24 11:42:10 bonobo kernel: [109525.539005] FS: 0000000000000000(0000) GS:ffff89ce6ed40000(0000) knlGS:0000000000000000
Sep 24 11:42:10 bonobo kernel: [109525.539006] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Sep 24 11:42:10 bonobo kernel: [109525.539008] CR2: 00002181c2c8b000 CR3: 00000002e640a005 CR4: 00000000001606e0
Sep 24 11:42:10 bonobo kernel: [109525.539009] Call Trace:
Sep 24 11:42:10 bonobo kernel: [109525.539011] <IRQ>
Sep 24 11:42:10 bonobo kernel: [109525.539020] iwl_read_prph+0x38/0x90 [iwlwifi]
Sep 24 11:42:10 bonobo kernel: [109525.539030] iwl_trans_pcie_log_scd_error+0x125/0x1f0 [iwlwifi]
Sep 24 11:42:10 bonobo kernel: [109525.539038] ? iwl_pcie_txq_build_tfd+0xe0/0xe0 [iwlwifi]
Sep 24 11:42:10 bonobo kernel: [109525.539045] iwl_pcie_txq_stuck_timer+0x46/0x70 [iwlwifi]
Sep 24 11:42:10 bonobo kernel: [109525.539052] call_timer_fn+0x30/0x130
Sep 24 11:42:10 bonobo kernel: [109525.539056] run_timer_softirq+0x3fb/0x450
Sep 24 11:42:10 bonobo kernel: [109525.539059] ? ktime_get+0x43/0xa0
Sep 24 11:42:10 bonobo kernel: [109525.539063] ? lapic_next_deadline+0x26/0x30
Sep 24 11:42:10 bonobo kernel: [109525.539066] __do_softirq+0xe4/0x2bb
Sep 24 11:42:10 bonobo kernel: [109525.539071] irq_exit+0xb8/0xc0
Sep 24 11:42:10 bonobo kernel: [109525.539073] smp_apic_timer_interrupt+0x79/0x130
Sep 24 11:42:10 bonobo kernel: [109525.539075] apic_timer_interrupt+0x84/0x90
Sep 24 11:42:10 bonobo kernel: [109525.539076] </IRQ>
Sep 24 11:42:10 bonobo kernel: [109525.539082] RIP: 0010:cpuidle_enter_state+0xa7/0x2f0
Sep 24 11:42:10 bonobo kernel: [109525.539083] RSP: 0018:ffffb61c431c7e68 EFLAGS: 00000246 ORIG_RAX: ffffffffffffff11
Sep 24 11:42:10 bonobo kernel: [109525.539085] RAX: ffff89ce6ed62880 RBX: 0000639ce5c60570 RCX: 000000000000001f
Sep 24 11:42:10 bonobo kernel: [109525.539087] RDX: 0000639ce5c60570 RSI: 00005408b814daf7 RDI: 0000000000000000
Sep 24 11:42:10 bonobo kernel: [109525.539088] RBP: ffffb61c431c7ea8 R08: 0000000000000bfb R09: 00000000000001ed
Sep 24 11:42:10 bonobo kernel: [109525.539089] R10: ffffb61c431c7e38 R11: 00000000000009c1 R12: ffff89ce6ed6ca68
Sep 24 11:42:10 bonobo kernel: [109525.539091] R13: 0000000000000004 R14: ffffffff9a771c78 R15: 0000000000000000
Sep 24 11:42:10 bonobo kernel: [109525.539095] ? cpuidle_enter_state+0x97/0x2f0
Sep 24 11:42:10 bonobo kernel: [109525.539098] cpuidle_enter+0x17/0x20
Sep 24 11:42:10 bonobo kernel: [109525.539103] call_cpuidle+0x23/0x40
Sep 24 11:42:10 bonobo kernel: [109525.539106] do_idle+0x18c/0x1f0
Sep 24 11:42:10 bonobo kernel: [109525.539109] cpu_startup_entry+0x73/0x80
Sep 24 11:42:10 bonobo kernel: [109525.539112] start_secondary+0x1ab/0x200
Sep 24 11:42:10 bonobo kernel: [109525.539116] secondary_startup_64+0xa5/0xb0
Sep 24 11:42:10 bonobo kernel: [109525.539118] Code: 00 00 e8 aa 1f 36 d8 eb 9d 48 89 df be 24 00 00 00 c6 05 f2 62 02 00 01 e8 a4 ec fe ff 48 c7 c7 20 61 84 c1 89 c6 e8 16 5c a6 d7 <0f> 0b eb bb 66 90 0f 1f 44 00 00 55 49 c7 c0 60 61 84 c1 48 c7
Sep 24 11:42:10 bonobo kernel: [109525.539163] ---[ end trace 4fee7c8dba8479d7 ]---
Sep 24 11:42:10 bonobo kernel: [109525.592084] iwlwifi 0000:04:00.0: Queue 10 is active on fifo 2 and stuck for 10000 ms. SW [247, 164] HW [90, 90] FH TRB=0x05a5a5a5a
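
A note on reading the dump above: every word under "device memory mapped registers" is ffffffff, and the warning reports CSR_GP_CNTRL 0xffffffff. All-ones reads from a PCIe device usually mean the device has stopped answering on the bus, although the log itself does not say so. The Python sketch below checks a saved log for that pattern. The function names are illustrative and not part of any iwlwifi tooling, and the regular expressions assume the line layout shown in this report.

```python
import re

# Section headers such as
# "iwlwifi 0000:04:00.0: iwlwifi device memory mapped registers:"
SECTION = re.compile(r"iwlwifi \S+: iwlwifi (.+):$")
# Hex-dump lines such as
# "iwlwifi 0000:04:00.0: 00000020: ffffffff ffffffff ..."
DUMP_LINE = re.compile(r"iwlwifi \S+: ([0-9a-f]{8}): ((?:[0-9a-f]{8} ?)+)$")


def dump_sections(log_lines):
    """Group the dumped 32-bit words by the section header preceding them."""
    result = {}
    current = None
    for raw in log_lines:
        line = raw.rstrip()
        header = SECTION.search(line)
        if header:
            current = header.group(1)
            result[current] = []
            continue
        dump = DUMP_LINE.search(line)
        if dump and current is not None:
            result[current].extend(dump.group(2).split())
    return result


def mmio_reads_all_ones(log_lines):
    """True if the memory-mapped register dump exists and is all ffffffff."""
    words = dump_sections(log_lines).get("device memory mapped registers", [])
    return bool(words) and all(word == "ffffffff" for word in words)
```

Calling mmio_reads_all_ones(open("syslog").readlines()) on a saved copy of the log (the file name is only an example) gives a quick yes/no answer. The config-register section is still readable in this dump, which is why the check looks at the memory-mapped section only.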

berend (berenddeboer) wrote :

lsb_release -a
No LSB modules are available.
Distributor ID: Ubuntu
Description: Ubuntu 18.04.1 LTS
Release: 18.04
Codename: bionic

Noted elsewhere, for example here: https://askubuntu.com/questions/1062676/losing-connection-to-wifi-randomly-ubuntu-18-04-lts-on-dell-xps-15-9530

Ubuntu Foundations Team Bug Bot (crichton) wrote :

Thank you for taking the time to report this bug and helping to make Ubuntu better. It seems that your bug report is not filed about a specific source package though, rather it is just filed against Ubuntu in general. It is important that bug reports be filed about source packages so that people interested in the package can find the bugs about it. You can find some hints about determining what package your bug might be about at https://wiki.ubuntu.com/Bugs/FindRightPackage. You might also ask for help in the #ubuntu-bugs irc channel on Freenode.

To change the source package that this bug is filed about visit https://bugs.launchpad.net/ubuntu/+bug/1793997/+editstatus and add the package name in the text box next to the word Package.

[This is an automated message. I apologize if it reached you inappropriately; please just reply to this message indicating so.]

tags: added: bot-comment
berend (berenddeboer)
affects: ubuntu → linux-signed-hwe (Ubuntu)
affects: linux-signed-hwe (Ubuntu) → linux-signed (Ubuntu)
Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux-signed (Ubuntu):
status: New → Confirmed
Joseph Salisbury (jsalisbury) wrote :

Did this issue start happening after an update/upgrade? Was there a prior kernel version where you were not having this particular problem?

Would it be possible for you to test the latest upstream kernel? Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Please test the latest v4.19 kernel[0].

If this bug is fixed in the mainline kernel, please add the following tag 'kernel-fixed-upstream'.

If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'.

Once testing of the upstream kernel is complete, please mark this bug as "Confirmed".

Thanks in advance.

[0] http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.19-rc5

Changed in linux-signed (Ubuntu):
importance: Undecided → Medium
status: Confirmed → Incomplete
berend (berenddeboer) wrote :

I think I may have seen it in 17.10.

Definitely not in 16.04. I don't think I had it with 17.04.

Yura Pakhuchiy (yura-p) wrote :

This started happening to me a few days ago as well. It happens with 4.15.0-34-generic and 4.15.0-34-generic.

Yura Pakhuchiy (yura-p) wrote :

I meant with 33 and 34.

berend (berenddeboer) wrote :

Although I could boot the mainline kernel, I can't use it, as it doesn't have zfs. Any tips on how I can get zfs working? I assume I can't just copy the zfs module?

Yura Pakhuchiy (yura-p) wrote :

I've tested with the mainline kernel and it crashes as well. I've also tried booting previous Ubuntu kernels (4.15.0-20-generic, 4.15.0-29-generic, 4.15.0-32-generic, 4.15.0-33-generic), and the bug happens with all of them. This is strange because I've used these kernels without any problems for months. Wi-Fi was extremely stable until the problem recently appeared. So it looks like the bug was present for a long time but had not manifested itself, and some non-kernel update uncovered it.

The bug is pretty severe on my system: it happens within a few minutes of active Wi-Fi usage (e.g. opening speedtest.net and running it a few times). If I do not use Wi-Fi at maximum speed (e.g. just listening to online radio), it may work for hours.

Log from mainline kernel:
[ 119.671276] ------------[ cut here ]------------
[ 119.671280] Timeout waiting for hardware access (CSR_GP_CNTRL 0xffffffff)
[ 119.671333] WARNING: CPU: 7 PID: 0 at drivers/net/wireless/intel/iwlwifi/pcie/trans.c:2009 iwl_trans_pcie_grab_nic_access+0x1e8/0x220 [iwlwifi]
[ 119.671335] Modules linked in: rfcomm ccm ip6t_REJECT nf_reject_ipv6 ip6table_nat nf_nat_ipv6 ipt_MASQUERADE xt_CHECKSUM iptable_nat nf_nat_ipv4 nf_nat iptable_mangle bridge stp llc ip6table_filter ip6_tables ipt_REJECT nf_reject_ipv4 xt_comment xt_mac xt_tcpudp xt_conntrack nf_conntrack vmw_vsock_vmci_transport vsock nf_defrag_ipv6 nf_defrag_ipv4 vmw_vmci iptable_filter bpfilter cmac bnep binfmt_misc nls_iso8859_1 arc4 cmdlinepart intel_spi_platform intel_spi spi_nor asus_nb_wmi mtd asus_wmi mxm_wmi sparse_keymap gpio_ich snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic uvcvideo snd_hda_intel videobuf2_vmalloc videobuf2_memops intel_rapl videobuf2_v4l2 x86_pkg_temp_thermal videobuf2_common intel_powerclamp snd_hda_codec videodev btusb coretemp snd_hda_core btrtl btbcm snd_hwdep media
[ 119.671388] btintel kvm_intel bluetooth snd_pcm snd_seq_dummy iwlmvm snd_seq_oss mac80211 snd_seq_midi intel_cstate ecdh_generic iwlwifi snd_seq_midi_event snd_rawmidi intel_rapl_perf joydev rtsx_pci_ms memstick cfg80211 snd_seq input_leds serio_raw snd_seq_device snd_timer snd mei_me mei soundcore mac_hid ie31200_edac lpc_ich wmi sch_fq_codel nfsd auth_rpcgss nfs_acl lockd grace sunrpc parport_pc ppdev lp parport ip_tables x_tables autofs4 btrfs xor zstd_compress raid6_pq libcrc32c algif_skcipher af_alg dm_crypt hid_generic usbhid hid i915 kvmgt vfio_mdev mdev vfio_iommu_type1 crct10dif_pclmul vfio crc32_pclmul ghash_clmulni_intel pcbc kvm rtsx_pci_sdmmc irqbypass i2c_algo_bit aesni_intel drm_kms_helper syscopyarea aes_x86_64 sysfillrect crypto_simd cryptd sysimgblt fb_sys_fops glue_helper drm
[ 119.671457] psmouse ahci rtsx_pci r8169 libahci video
[ 119.671467] CPU: 7 PID: 0 Comm: swapper/7 Not tainted 4.19.0-041900rc5-generic #201809231830
[ 119.671468] Hardware name: ASUSTeK COMPUTER INC. N751JK/N751JK, BIOS N751JK.205 03/11/2015
[ 119.671480] RIP: 0010:iwl_trans_pcie_grab_nic_access+0x1e8/0x220 [iwlwifi]
[ 119.671482] Code: 05 ee 49 8d 56 08 bf 00 20 00 00 e8 92 1a 87 ec e9 31 ff ff ff 89 c6 48 c7 c7 60 f3 c5 c0 c6 05 5c 7a 02 00 01 e8 fa 3f 85 ec <0f> 0b e9 ec fe ff ff 48 8b 7b 30 48 c7 c1 c8 f3 c5 c0 31 d2 31 f6
[ 119.671...


Changed in linux-signed (Ubuntu):
status: Incomplete → Confirmed
tags: added: kernel-bug-exists-upstream
berend (berenddeboer) wrote :

The latest Ubuntu kernel, 4.15.0-36, is even more unstable. It can last only 24 hours before it crashes. Here is the latest log:

Oct 3 09:55:58 bonobo kernel: [17560.275628] iwlwifi 0000:04:00.0: iwlwifi transaction failed, dumping registers
Oct 3 09:55:58 bonobo kernel: [17560.275637] iwlwifi 0000:04:00.0: iwlwifi device config registers:
Oct 3 09:55:58 bonobo kernel: [17560.275688] iwlwifi 0000:04:00.0: 00000000: 08b18086 00100000 028000bb 00000000 00000004 00000000 00000000 00000000
Oct 3 09:55:58 bonobo kernel: [17560.275694] iwlwifi 0000:04:00.0: 00000020: 00000000 00000000 00000000 40708086 00000000 000000c8 00000000 00000100
Oct 3 09:55:58 bonobo kernel: [17560.275698] iwlwifi 0000:04:00.0: iwlwifi device memory mapped registers:
Oct 3 09:55:58 bonobo kernel: [17560.275732] iwlwifi 0000:04:00.0: 00000000: ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff
Oct 3 09:55:58 bonobo kernel: [17560.275738] iwlwifi 0000:04:00.0: 00000020: ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff
Oct 3 09:55:58 bonobo kernel: [17560.275745] iwlwifi 0000:04:00.0: iwlwifi device AER capability structure:
Oct 3 09:55:58 bonobo kernel: [17560.275774] iwlwifi 0000:04:00.0: 00000000: 14010001 00100000 00000000 00462031 00003141 00002000 00000014 40000001
Oct 3 09:55:58 bonobo kernel: [17560.275778] iwlwifi 0000:04:00.0: 00000020: 0000000f ec100460 00000000
Oct 3 09:55:58 bonobo kernel: [17560.275783] iwlwifi 0000:04:00.0: iwlwifi parent port (0000:00:1c.3) config registers:
Oct 3 09:55:58 bonobo kernel: [17560.275807] iwlwifi 0000:00:1c.3: 00000000: 8c168086 00100007 060400d5 00810010 00000000 00000000 00040400 200000f0
Oct 3 09:55:58 bonobo kernel: [17560.275813] iwlwifi 0000:00:1c.3: 00000020: ec10ec10 0001fff1 00000000 00000000 00000000 00000040 00000000 00100405
Oct 3 09:55:58 bonobo kernel: [17560.275818] ------------[ cut here ]------------
Oct 3 09:55:58 bonobo kernel: [17560.275820] Timeout waiting for hardware access (CSR_GP_CNTRL 0xffffffff)
Oct 3 09:55:58 bonobo kernel: [17560.275891] WARNING: CPU: 7 PID: 0 at /build/linux-39dmni/linux-4.15.0/drivers/net/wireless/intel/iwlwifi/pcie/trans.c:1973 iwl_trans_pcie_grab_nic_access+0xea/0xf0 [iwlwifi]
Oct 3 09:55:58 bonobo kernel: [17560.275892] Modules linked in: rfcomm ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 xt_comment vmnet(OE) pci_stub vmw_vsock_vmci_transport vsock vboxpci(OE) xfrm_user vboxnetadp(OE) vmw_vmci xfrm4_tunnel tunnel4 vboxnetflt(OE) ipcomp vmmon(OE) xfrm_ipcomp vboxdrv(OE) esp4 ah4 af_key xfrm_algo ccm xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat ec_sys nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack libcrc32c ipt_REJECT nf_reject_ipv4 xt_tcpudp bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables devlink iptable_filter cmac bnep binfmt_misc zfs(PO) zunicode(PO) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) algif_skcipher af_alg dm_crypt snd_hda_codec_hdmi intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp arc4
Oct 3 09:55:58 bonobo kernel: [17560.275963] kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc ...


berend (berenddeboer) wrote :

I have been working with the manufacturer, System76, and they asked me for the output of "dmesg | grep iwlwifi".

There were quite a few messages like this:

[24160.313920] iwlwifi 0000:04:00.0: Due to high temperature thermal throttling initiated
[24162.314845] iwlwifi 0000:04:00.0: Temperature is back to normal thermal throttling stopped

They think this might be a hardware issue. I'll be monitoring the temperature for a bit, and if the iwlwifi crashes and the temperature are indeed correlated, I'll take them up on their offer to send me a new wifi card.
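
One way to check for such a correlation is to compare the dmesg timestamps of the throttling messages with those of the crashes. The Python sketch below is only an illustration: the function names and the 60-second window are assumptions, not anything System76 suggested, and the matched substrings are taken from the messages quoted in this report. dmesg timestamps count seconds since boot, so only lines from the same boot should be compared.

```python
import re

# dmesg-style timestamp, e.g. "[24160.313920]" or "[  119.671276]"
TIMESTAMP = re.compile(r"\[\s*(\d+\.\d+)\]")


def event_times(lines, needle):
    """Timestamps (seconds since boot) of every line containing needle."""
    times = []
    for line in lines:
        if needle in line:
            match = TIMESTAMP.search(line)
            if match:
                times.append(float(match.group(1)))
    return times


def crashes_near_throttling(lines, window_s=60.0):
    """For each crash, report the gap to the last earlier throttling event.

    Returns a list of (crash_time, gap_or_None, within_window) tuples.
    window_s is an arbitrary threshold chosen for illustration.
    """
    throttles = event_times(lines, "thermal throttling initiated")
    crashes = event_times(lines, "transaction failed, dumping registers")
    report = []
    for crash in crashes:
        earlier = [t for t in throttles if t <= crash]
        gap = crash - max(earlier) if earlier else None
        report.append((crash, gap, gap is not None and gap <= window_s))
    return report
```

If most crashes come back with a gap of None or a gap far larger than the window, temperature is probably not the trigger.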

Yura Pakhuchiy (yura-p) wrote :

I've grepped through dmesg and journalctl, and there are zero messages about temperature from iwlwifi in my case. I have some from the CPU, but they are probably not related.

berend (berenddeboer) wrote :

Another crash, without any temperature message.

What I forgot to mention is that a crash only happens after a sleep. I don't think I have seen a crash after a fresh boot, only the next day after the machine has slept at least once.

> dmesg | grep iwlwifi
[45654.419172] iwlwifi 0000:04:00.0: iwlwifi transaction failed, dumping registers
[45654.419184] iwlwifi 0000:04:00.0: iwlwifi device config registers:
[45654.419236] iwlwifi 0000:04:00.0: 00000000: 08b18086 00100000 028000bb 00000000 00000004 00000000 00000000 00000000
[45654.419243] iwlwifi 0000:04:00.0: 00000020: 00000000 00000000 00000000 40708086 00000000 000000c8 00000000 00000100
[45654.419247] iwlwifi 0000:04:00.0: iwlwifi device memory mapped registers:
[45654.419282] iwlwifi 0000:04:00.0: 00000000: ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff
[45654.419288] iwlwifi 0000:04:00.0: 00000020: ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff ffffffff
[45654.419296] iwlwifi 0000:04:00.0: iwlwifi device AER capability structure:
[45654.419325] iwlwifi 0000:04:00.0: 00000000: 14010001 00100000 00000000 00462031 000031c1 00002000 00000014 40000001
[45654.419329] iwlwifi 0000:04:00.0: 00000020: 0000000f ec100460 00000000
[45654.419335] iwlwifi 0000:04:00.0: iwlwifi parent port (0000:00:1c.3) config registers:
[45654.419361] iwlwifi 0000:00:1c.3: 00000000: 8c168086 00100007 060400d5 00810010 00000000 00000000 00040400 200000f0
[45654.419367] iwlwifi 0000:00:1c.3: 00000020: ec10ec10 0001fff1 00000000 00000000 00000000 00000040 00000000 00100405
[45654.419450] WARNING: CPU: 4 PID: 0 at /build/linux-39dmni/linux-4.15.0/drivers/net/wireless/intel/iwlwifi/pcie/trans.c:1973 iwl_trans_pcie_grab_nic_access+0xea/0xf0 [iwlwifi]
[45654.419536] intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp arc4 snd_hda_codec_realtek snd_hda_codec_generic kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel uvcvideo snd_hda_intel videobuf2_vmalloc videobuf2_memops aes_x86_64 crypto_simd snd_hda_codec glue_helper cryptd videobuf2_v4l2 snd_hda_core snd_usb_audio intel_cstate videobuf2_core snd_usbmidi_lib snd_seq_midi intel_rapl_perf videodev btusb snd_hwdep snd_seq_midi_event media btrtl snd_pcm snd_rawmidi btbcm iwlmvm mac80211 btintel iwlwifi bluetooth snd_seq input_leds ecdh_generic rtsx_pci_ms joydev nvidia_uvm(POE) snd_seq_device memstick serio_raw snd_timer cfg80211 mxm_wmi snd wmi_bmof soundcore shpchp sch_fq_codel mei_me mei mac_hid lpc_ich ie31200_edac parport_pc ppdev sunrpc lp parport
[45654.419675] RIP: 0010:iwl_trans_pcie_grab_nic_access+0xea/0xf0 [iwlwifi]
[45654.419715] iwl_read_prph+0x38/0x90 [iwlwifi]
[45654.419729] iwl_trans_pcie_log_scd_error+0x125/0x1f0 [iwlwifi]
[45654.419742] ? iwl_pcie_txq_build_tfd+0xe0/0xe0 [iwlwifi]
[45654.419753] iwl_pcie_txq_stuck_timer+0x46/0x70 [iwlwifi]
[45654.472666] iwlwifi 0000:04:00.0: Queue 10 is active on fifo 2 and stuck for 10000 ms. SW [31, 126] HW [90, 90] FH TRB=0x05a5a5a5a

Yura Pakhuchiy (yura-p) wrote :

In my case the crash can happen on a freshly booted system as well. It is more likely to happen when I'm using Wi-Fi actively, although it sometimes happens even if I barely use Wi-Fi (e.g. when I'm connected via ethernet cable and Wi-Fi at the same time and routing is configured to use the cable).

@berend have you tried replacing the card?

Yura Pakhuchiy (yura-p) wrote :

Joseph, do you need any more information? Any ideas to try?

berend (berenddeboer) wrote :

Haven't asked for a new wifi card yet, as I don't think it's the hardware. But I might get annoyed enough to try.

Yura Pakhuchiy (yura-p) wrote :

lspci no longer shows the "Intel Corporation Wireless 7260" entry on my system, so I guess it is a hardware problem. The funny thing is that Bluetooth, which is located on the same chip, still works.
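
For anyone who wants to check the same thing without lspci, the kernel exposes each enumerated PCI device under /sys/bus/pci/devices. The Python sketch below lists network controllers from a given vendor. The function name is made up for illustration. Vendor 0x8086 and class 0x0280 are read from the config-register dump earlier in this report (08b18086 and 028000bb), which came from berend's machine, so treat them as assumptions for other systems.

```python
from pathlib import Path

# PCI class 0x02 (network controller), subclass 0x80 (other),
# as seen in the "028000bb" word of the config dump in this report.
NETWORK_OTHER = 0x0280


def find_network_controllers(vendor_id, root="/sys/bus/pci/devices"):
    """Return PCI addresses of 'other' network controllers from vendor_id."""
    base = Path(root)
    if not base.is_dir():
        return []
    matches = []
    for entry in sorted(base.iterdir()):
        try:
            vendor = int((entry / "vendor").read_text().strip(), 16)
            # sysfs reports a 24-bit class code, e.g. 0x028000;
            # dropping the low byte leaves class and subclass.
            pci_class = int((entry / "class").read_text().strip(), 16) >> 8
        except (OSError, ValueError):
            continue
        if vendor == vendor_id and pci_class == NETWORK_OTHER:
            matches.append(entry.name)
    return matches
```

An empty list from find_network_controllers(0x8086) on a machine that should have an Intel wireless card would match the lspci observation above: the device is no longer enumerated on the bus.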
