AER PCIe Bus Error for Asus X541UVK

Bug #1944752 reported by Riccardo Belli
12
This bug affects 1 person
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Expired
Undecided
Unassigned

Bug Description

I already posted this on the Ubuntu Forums and I'm revisiting it now with more information and accuracy.

Whenever I try to install any flavor of *buntu, I always run into this problem; a process taking up all of my CPU and after a while, it crashes the system and the whole screen goes black with text spamming. It shows some PCI errors non-stop, and following some guides online i tried to fix it but never actually succeeded.

The distro I was trying to install this time was the default 20.04.2 Ubuntu.

My PC is a laptop, it's an ASUS X541UVK with an i7 7500U and Geforce 920MX. At first I thought that there was a problem with the Nvidia GPU but it appears to be with the Realtek BT card. I went to follow this guide:

https://forums.linuxmint.com/viewtopic.php?t=305970

(since using pci=nomsi once solved the issue some time ago)
And the reproduced steps from the guide above are here (from the live usb):

https://imgur.com/a/enq5UCt

However this still doesn't solve my problems. I submitted my information at the time on this bug:

https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1521173

But now I made a duplicate of the bug since my pc has slightly different hardware. Also, I have a little bit more free time now. So, this time, I tried again with Ubuntu 20.04.3 and the results were pretty much the same; after booting up Ubuntu reported an "internal error" and I just managed to open the Gnome System Monitor (and sort the processes by CPU usage) just before the entire pc froze. Here are some pictures:

https://imgur.com/a/Sd7TO4y

Then, as suggested in the original bug report ( https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1521173/comments/139 ), I tried booting using the "pci=noaer" option, and the desktop at first booted fine, much quicker than before also, but for a few seconds the screen went black and these appeared on screen:

https://imgur.com/a/Ppq2D0W

Then after that it just went back to normal, and I didn't notice any anomaly. Also here (again, as suggested in the other bug report) I copy-pasted the results of "lspci -nn" and "dmesg":

https://pastebin.com/HMvdhvnz

https://pastebin.com/pnjPNcWr

I really hope this helps towards finding the issue :) for anything else for me to try, feel free to suggest and I'll try it and report the results.

Tags: patch
Revision history for this message
Ubuntu Kernel Bot (ubuntu-kernel-bot) wrote : Missing required logs.

This bug is missing log files that will aid in diagnosing the problem. While running an Ubuntu kernel (not a mainline or third-party kernel) please enter the following command in a terminal window:

apport-collect 1944752

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
Revision history for this message
Bjorn Helgaas (bjorn-helgaas) wrote :

The "ACPI region does not cover" errors are not related to the AER PCIe bus error.

Neither is the nouveau MMIO read fault (that looks like a null pointer problem).

Naveen Naidu is looking at the AER problem, so I hope we'll have a patch that makes "pci=noaer" unnecessary.

Revision history for this message
Riccardo Belli (rbelli97) wrote :

Ok great! If there is anything else I can do or try let me know.

tags: added: patch
Revision history for this message
Naveen Naidu (theprophet26) wrote (last edit ):

Hello Riccardo o/

Please find the patch attached.

Unfortunately, I do not have a system that outputs the same AER errors as yours so I am unable to test it out. It would really help if you could please test this patch out and let me know, if it solves the AER message spew.

Thanks,
Naveen

[UPDATE]: Deleted the patch attached here and re-attached below after fixing it up. Apologies for the inconvenience.

Revision history for this message
Riccardo Belli (rbelli97) wrote :

Hi,
I currently do not have access to a running linux system to compile and test the patch with, since the only pc that I have is my laptop affected by this bug, and I also really need windows on it for some windows-specific software for my university. So, It's not possible for me right now to test the patch.
I'll leave some more specific info about my system (hoping that somebody else has the same exact machine) for anyone that is willing to test this (I'm copying this from System Information).

System Model X541UVK
Processor Intel(R) Core(TM) i7-7500U CPU @ 2.70GHz
Adapter Description Intel(R) HD Graphics 620
Adapter Description NVIDIA GeForce 920MX
BIOS Version/Date American Megatrends Inc. X541UVK.308, 30-Jan-18

Thanks to everyone for the help, I'll keep in touch.
Riccardo

Revision history for this message
Naveen Naidu (theprophet26) wrote :
Revision history for this message
Launchpad Janitor (janitor) wrote :

[Expired for linux (Ubuntu) because there has been no activity for 60 days.]

Changed in linux (Ubuntu):
status: Incomplete → Expired
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.