ath10k_pci often crashes in focal

Bug #1886588 reported by Lars Bahner
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Confirmed
Undecided
Unassigned

Bug Description

WIreless connectivity has been very flaky in focal ever since release. Thought I'd start investigating a bit. This is all I found, but it clearly signisfies an error. Please let med know, if I can provide more information.

Description: Ubuntu 20.04 LTS
Release: 20.04

[ma. juli 6 23:16:50 2020] ath10k_pci 0000:3b:00.0: firmware crashed! (guid ba24fee9-d0cb-42e2-9aef-06f0d07a053e)
[ma. juli 6 23:16:50 2020] ath10k_pci 0000:3b:00.0: qca6174 hw3.2 target 0x05030000 chip_id 0x00340aff sub 1a56:1535
[ma. juli 6 23:16:50 2020] ath10k_pci 0000:3b:00.0: kconfig debug 0 debugfs 1 tracing 1 dfs 0 testmode 0
[ma. juli 6 23:16:50 2020] ath10k_pci 0000:3b:00.0: firmware ver WLAN.RM.4.4.1-00140-QCARMSWPZ-1 api 6 features wowlan,ignore-otp,mfp crc32 29eb8ca1
[ma. juli 6 23:16:50 2020] ath10k_pci 0000:3b:00.0: board_file api 2 bmi_id N/A crc32 4ac0889b
[ma. juli 6 23:16:50 2020] ath10k_pci 0000:3b:00.0: htt-ver 3.60 wmi-op 4 htt-op 3 cal otp max-sta 32 raw 0 hwcrypto 1
[ma. juli 6 23:16:50 2020] ath10k_pci 0000:3b:00.0: failed to get memcpy hi address for firmware address 4: -16
[ma. juli 6 23:16:50 2020] ath10k_pci 0000:3b:00.0: failed to read firmware dump area: -16
[ma. juli 6 23:16:50 2020] ath10k_pci 0000:3b:00.0: Copy Engine register dump:
[ma. juli 6 23:16:50 2020] ath10k_pci 0000:3b:00.0: [00]: 0x00034400 11 11 3 3
[ma. juli 6 23:16:50 2020] ath10k_pci 0000:3b:00.0: [01]: 0x00034800 3 2 175 176
[ma. juli 6 23:16:50 2020] ath10k_pci 0000:3b:00.0: [02]: 0x00034c00 12 11 10 11
[ma. juli 6 23:16:50 2020] ath10k_pci 0000:3b:00.0: [03]: 0x00035000 12 12 14 12
[ma. juli 6 23:16:50 2020] ath10k_pci 0000:3b:00.0: [04]: 0x00035400 6557 6549 11 203
[ma. juli 6 23:16:50 2020] ath10k_pci 0000:3b:00.0: [05]: 0x00035800 0 0 64 0
[ma. juli 6 23:16:50 2020] ath10k_pci 0000:3b:00.0: [06]: 0x00035c00 3 1 10 8
[ma. juli 6 23:16:50 2020] ath10k_pci 0000:3b:00.0: [07]: 0x00036000 1 0 1 0
[ma. juli 6 23:16:50 2020] ath10k_pci 0000:3b:00.0: failed to read hi_board_data address: -28
[ma. juli 6 23:16:51 2020] ieee80211 phy0: Hardware restart was requested
[ma. juli 6 23:16:51 2020] ath10k_pci 0000:3b:00.0: unsupported HTC service id: 1536
[ma. juli 6 23:16:51 2020] ath10k_pci 0000:3b:00.0: device successfully recovered
[ma. juli 6 23:19:57 2020] mce: CPU2: Core temperature above threshold, cpu clock throttled (total events = 6714)
[ma. juli 6 23:19:57 2020] mce: CPU1: Package temperature above threshold, cpu clock throttled (total events = 20892)
[ma. juli 6 23:19:57 2020] mce: CPU5: Package temperature above threshold, cpu clock throttled (total events = 20892)
[ma. juli 6 23:19:57 2020] mce: CPU4: Package temperature above threshold, cpu clock throttled (total events = 20892)
[ma. juli 6 23:19:57 2020] mce: CPU2: Package temperature above threshold, cpu clock throttled (total events = 20893)
[ma. juli 6 23:19:57 2020] mce: CPU0: Package temperature above threshold, cpu clock throttled (total events = 20893)
[ma. juli 6 23:19:57 2020] mce: CPU3: Package temperature above threshold, cpu clock throttled (total events = 20893)
[ma. juli 6 23:19:57 2020] mce: CPU2: Core temperature/speed normal
[ma. juli 6 23:19:57 2020] mce: CPU1: Package temperature/speed normal
[ma. juli 6 23:19:57 2020] mce: CPU5: Package temperature/speed normal
[ma. juli 6 23:19:57 2020] mce: CPU4: Package temperature/speed normal
[ma. juli 6 23:19:57 2020] mce: CPU2: Package temperature/speed normal
[ma. juli 6 23:19:57 2020] mce: CPU0: Package temperature/speed normal
[ma. juli 6 23:19:57 2020] mce: CPU3: Package temperature/speed normal
[ma. juli 6 23:20:08 2020] docker0: port 1(veth76af7ee) entered disabled state
[ma. juli 6 23:20:08 2020] veth76e34af: renamed from eth0
[ma. juli 6 23:20:08 2020] docker0: port 1(veth76af7ee) entered disabled state
[ma. juli 6 23:20:08 2020] device veth76af7ee left promiscuous mode
[ma. juli 6 23:20:08 2020] docker0: port 1(veth76af7ee) entered disabled state
[ma. juli 6 23:20:18 2020] wlp59s0: deauthenticating from 00:22:07:80:6c:2e by local choice (Reason: 3=DEAUTH_LEAVING)
[ma. juli 6 23:21:46 2020] ath10k_pci 0000:3b:00.0: pci irq msi oper_irq_mode 2 irq_mode 0 reset_mode 0
[ma. juli 6 23:21:46 2020] ath10k_pci 0000:3b:00.0: qca6174 hw3.2 target 0x05030000 chip_id 0x00340aff sub 1a56:1535
[ma. juli 6 23:21:46 2020] ath10k_pci 0000:3b:00.0: kconfig debug 0 debugfs 1 tracing 1 dfs 0 testmode 0
[ma. juli 6 23:21:46 2020] ath10k_pci 0000:3b:00.0: firmware ver WLAN.RM.4.4.1-00140-QCARMSWPZ-1 api 6 features wowlan,ignore-otp,mfp crc32 29eb8ca1
[ma. juli 6 23:21:46 2020] ath10k_pci 0000:3b:00.0: board_file api 2 bmi_id N/A crc32 4ac0889b
[ma. juli 6 23:21:46 2020] ath10k_pci 0000:3b:00.0: unsupported HTC service id: 1536
[ma. juli 6 23:21:46 2020] ath10k_pci 0000:3b:00.0: htt-ver 3.60 wmi-op 4 htt-op 3 cal otp max-sta 32 raw 0 hwcrypto 1
[ma. juli 6 23:21:46 2020] ath: EEPROM regdomain: 0x6c
[ma. juli 6 23:21:46 2020] ath: EEPROM indicates we should expect a direct regpair map
[ma. juli 6 23:21:46 2020] ath: Country alpha2 being used: 00
[ma. juli 6 23:21:46 2020] ath: Regpair used: 0x6c
[ma. juli 6 23:21:46 2020] ath10k_pci 0000:3b:00.0 wlp59s0: renamed from wlan0
[ma. juli 6 23:21:47 2020] ath10k_pci 0000:3b:00.0: unsupported HTC service id: 1536

ProblemType: Bug
DistroRelease: Ubuntu 20.04
Package: linux-modules-extra-5.4.0-26-generic 5.4.0-26.30
ProcVersionSignature: Ubuntu 5.4.0-29.33-generic 5.4.30
Uname: Linux 5.4.0-29-generic x86_64
NonfreeKernelModules: nvidia_modeset nvidia
ApportVersion: 2.20.11-0ubuntu27
Architecture: amd64
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC0: bahner 3397 F.... pulseaudio
CasperMD5CheckResult: skip
Date: Mon Jul 6 23:25:53 2020
InstallationDate: Installed on 2020-05-16 (51 days ago)
InstallationMedia: Ubuntu 20.04 LTS "Focal Fossa" - Release amd64 (20200423)
Lsusb:
 Bus 002 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
 Bus 001 Device 003: ID 27c6:5395 Shenzhen Goodix Technology Co.,Ltd. Fingerprint Reader
 Bus 001 Device 002: ID 0cf3:e300 Qualcomm Atheros Communications
 Bus 001 Device 004: ID 0c45:671d Microdia Integrated_Webcam_HD
 Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
MachineType: Dell Inc. XPS 15 9570
ProcEnviron:
 LANGUAGE=nb_NO:nb:no_NO:no:nn_NO:nn:en
 TERM=xterm-256color
 PATH=(custom, no user)
 LANG=nb_NO.UTF-8
 SHELL=/bin/bash
ProcFB: 0 i915drmfb
ProcKernelCmdLine: BOOT_IMAGE=/vmlinuz-5.4.0-29-generic root=UUID=e486dbb6-3ebb-484b-a363-b27da572e54a ro quiet splash
PulseList: Error: command ['pacmd', 'list'] failed with exit code 1: No PulseAudio daemon running, or not running as session daemon.
RelatedPackageVersions:
 linux-restricted-modules-5.4.0-29-generic N/A
 linux-backports-modules-5.4.0-29-generic N/A
 linux-firmware 1.187
SourcePackage: linux
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 04/21/2020
dmi.bios.vendor: Dell Inc.
dmi.bios.version: 1.16.2
dmi.board.name: 0D0T05
dmi.board.vendor: Dell Inc.
dmi.board.version: A00
dmi.chassis.type: 10
dmi.chassis.vendor: Dell Inc.
dmi.modalias: dmi:bvnDellInc.:bvr1.16.2:bd04/21/2020:svnDellInc.:pnXPS159570:pvr:rvnDellInc.:rn0D0T05:rvrA00:cvnDellInc.:ct10:cvr:
dmi.product.family: XPS
dmi.product.name: XPS 15 9570
dmi.product.sku: 087C
dmi.sys.vendor: Dell Inc.

Revision history for this message
Lars Bahner (bahner) wrote :
Revision history for this message
Ubuntu Kernel Bot (ubuntu-kernel-bot) wrote : Status changed to Confirmed

This change was made by a bot.

Changed in linux (Ubuntu):
status: New → Confirmed
Revision history for this message
Alex Hung (alexhung) wrote :

I have also a Qualcomm Atheros QCA6174 and can observe it fails from time to time, not in focal only. More discussion can be found in LP:1872351

While there aren't solution now, I use "sudo modprobe -r ath10k_pci ; sudo modprobe ath10k_pci" to avoid rebooting the system.

Revision history for this message
Alex Hung (alexhung) wrote :

The log in description clearly points out "firmware crashed", and that may be something we can forward to the hardware vendor.

Revision history for this message
Alex Hung (alexhung) wrote :

vicamo posted a probable solution @ https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1891405. See #19 for more details

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.