Acer Aspire A315 IOAPIC failure on Ubuntu 18.04, kernel hangs, can't load, kernel freeze (AMD Ryzen 5/Radeon/Raven) / AMDGPU Hybrid crash

Bug #1776563 reported by Richard Baka on 2018-06-12
52
This bug affects 10 people
Affects Status Importance Assigned to Milestone
Linux
Incomplete
Medium
amd
Undecided
Unassigned
linux (Ubuntu)
Medium
Unassigned
linux-firmware (Ubuntu)
Undecided
Unassigned

Bug Description

CPU: Ryzen 5 2500U
VGA: Radeon 535
Notebook: Acer Aspire A315

This is a brand new notebook on the market with Ryzen 5/Radeon.
The default kernel of Ubuntu(18.04) hangs at loading with message:

tsc: Refined TSC clocksource calibration: 1996.250 MHz
clocksource: tsc: mask: 0xffffffffffffffff max_cycles: (...), max_idle_ns: (...)
Soft lockup

Using pci=noacpi kernel parameter kernel loads without any problem but my notebook produces more heat than on Win10. If I know right Acer notebooks need ACPI to the correct power management.

The same thing happens on mainline 4.17,4.18rc1-2.
BIOS upgrade to the latest version: 1.08 hasn't helped

This problem has been reported upstream:
https://bugzilla.kernel.org/show_bug.cgi?id=200087

The latest correctly working kernel was 4.13.* but the heat problem was present with this too.

This bug is missing log files that will aid in diagnosing the problem. While running an Ubuntu kernel (not a mainline or third-party kernel) please enter the following command in a terminal window:

apport-collect 1776563

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
tags: added: bionic

apport-collect 1776563 can't be entered because the kernel can not load.

Changed in linux (Ubuntu):
status: Incomplete → Confirmed
summary: - Acer Aspire A315 (Ryzen5/Radeon/FHD) Ubuntu 18.04 kernel cant load
+ Ubuntu 18.04 kernel can't load kernel on Acer Aspire A315
+ (Ryzen5/Radeon/FHD)
summary: - Ubuntu 18.04 kernel can't load kernel on Acer Aspire A315
- (Ryzen5/Radeon/FHD)
+ Ubuntu 18.04 can't load kernel on Acer Aspire A315 (Ryzen5/Radeon/FHD)
no longer affects: bugzilla (Ubuntu)
Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in xserver-xorg-video-amdgpu (Ubuntu):
status: New → Confirmed
Freihut (freihut) wrote :

Had this on my A315 too, but I returned it to the vendor. Seems to be an UEFI-Bug, because it doesn't happened with my Ryzen 2500U from HP. Could also be related to that Ryzen/Radeon 535 combination (Vega/CGN 3).

On Grub-Menu press E and add "pci=noacpi" as kernel-parameter (where normally "quite splash" is). Then go on booting by pressing F10.
Sometimes (XFCE) it was also necessary to add "nomodeset" to boot, Gnome for example didn't need it (AFAIK).

I remember, I also needed to install amd's pro driver (for 18.04) via amdgpu-pro-install to get rid of the "nomodeset". I was able to run amdgpu-pro-uninstall later and still not needed the "nomodeset". Could be related to my system, but you may give it a try.
I was also using Kernel 4.17 (Mainline), which is available on http://kernel.ubuntu.com/~kernel-ppa/mainline/ or with UKUU https://www.omgubuntu.co.uk/2017/02/ukuu-easy-way-to-install-mainline-kernel-ubuntu

Richard Baka (bakarichard91) wrote :

Thanks Freihut, I will try this.

Richard Baka (bakarichard91) wrote :

It works but very slow. This could be an ACPI problem.

Richard Baka (bakarichard91) wrote :

I installed the new amdgpu pro driver and everything is very fast now. This bug should be reported to freedesktop, would you like somebody to do it? :D

Richard Baka (bakarichard91) wrote :

*Sorry correction: Who would like to do it? :D

Richard Baka (bakarichard91) wrote :

"The fact that ACPI was designed by a group of monkeys high on LSD, and is some of the worst designs in the industry obviously makes running it at any point pretty damn ugly."
Torvalds, Linus (2005-07-31). Message. linux-kernel mailing list. IU. Retrieved on 2006-08-28.

Richard Baka (bakarichard91) wrote :

Power management doesn't work well this way. It was hot a little. I've changed back to win10. This should be fixed by kernel developers or with a downstream patch.

2 comments hidden view all 297 comments

Created attachment 276583
dmesg after starting kernel with pci=noacpi

This is a brand new notebook on the market with Ryzen 5/Radeon. With disabled ACPI kernel boots without any problem but my notebook produces more heat than on Win10. Otherwise this happens when it is stayed on the bios screen in a while.

CPU: AMD Ryzen 5 2500U
GPU1: AMD Radeon Vega 8
GPU2: AMD Radeon 535

(I wrote to Acer to fix their bios problems but they said Linux is not supported. I don't think they are right but what can I do?)

Created attachment 276585
attachment-31427-0.html

Out of office 6/18-6/27

Created attachment 276587
Soft lockup failure without noacpi

Nothing changes with disabled iommu.

5 comments hidden view all 297 comments
6 comments hidden view all 297 comments

Created attachment 276589
dmesg after amd_iommu_dump=1

[ 0.000000] AMD-Vi: Using IVHD type 0x11
[ 0.000000] AMD-Vi: device: 00:00.2 cap: 0040 seg: 0 flags: b0 info 0000
[ 0.000000] AMD-Vi: mmio-addr: 00000000fd900000
[ 0.000000] AMD-Vi: DEV_SELECT_RANGE_START devid: 00:01.0 flags: 00
[ 0.000000] AMD-Vi: DEV_RANGE_END devid: ff:1f.6
[ 0.000000] AMD-Vi: DEV_ALIAS_RANGE devid: ff:00.0 flags: 00 devid_to: 00:14.4
[ 0.000000] AMD-Vi: DEV_RANGE_END devid: ff:1f.7
[ 0.000000] AMD-Vi: DEV_SPECIAL(HPET[0]) devid: 00:14.0
[ 0.000000] AMD-Vi: DEV_SPECIAL(IOAPIC[33]) devid: 00:14.0
[ 0.000000] AMD-Vi: DEV_SPECIAL(IOAPIC[34]) devid: 00:00.1
[ 0.000000] [Firmware Bug]: AMD-Vi: No southbridge IOAPIC found

no longer affects: xserver-xorg-video-amdgpu (Ubuntu)

Created attachment 276591
Error message before freezing (without quite splash)

Please try booting with linux 4.18-rc1 or later. Also, please try 4.18-rc1+ with/without ACPI

Hi Erik,

Absolutely the same thing on 4.18rc1 and on rc2 too.

Fedora loads without any additional parameters(mysterious).

[ 0.000000] Switched APIC routing to physical flat.
[ 0.002000] ..TIMER: vector=0x30 apic1=0 pin1=2 apic2=-1 pin2=-1
[ 0.007000] tsc: Fast TSC calibration using PIT
[ 0.008000] tsc: Detected 1996.299 MHz processor
[ 0.008000] clocksource: tsc-early: mask: 0xffffffffffffffff max_cycles: 0x398d0c7513b, max_idle_ns: 881590744042 ns
[ 0.008000] Calibrating delay loop (skipped), value calculated using timer frequency.. 3992.59 BogoMIPS (lpj=1996299)

Heat production may be still present but I can't measure it because there is no temperature values in "sensors" (there is 5 values in Win10).

Created attachment 277069
Fedora loads without noacpi

summary: - Ubuntu 18.04 can't load kernel on Acer Aspire A315 (Ryzen5/Radeon/FHD)
+ Acer Aspire A315 ACPI failure on Ubuntu 18.04 (Ryzen5/Radeon/FHD)
summary: - Acer Aspire A315 ACPI failure on Ubuntu 18.04 (Ryzen5/Radeon/FHD)
+ Acer Aspire A315 ACPI failure on Ubuntu 18.04 (Ryzen5/Radeon)
11 comments hidden view all 297 comments
summary: - Acer Aspire A315 ACPI failure on Ubuntu 18.04 (Ryzen5/Radeon)
+ Acer Aspire A315 ACPI failure on Ubuntu, kernel hangs, can't load 18.04
+ (Ryzen5/Radeon)
summary: - Acer Aspire A315 ACPI failure on Ubuntu, kernel hangs, can't load 18.04
+ Acer Aspire A315 ACPI failure on Ubuntu 18.04, kernel hangs, can't load
(Ryzen5/Radeon)
description: updated
summary: Acer Aspire A315 ACPI failure on Ubuntu 18.04, kernel hangs, can't load
- (Ryzen5/Radeon)
+ (AMD Ryzen 5/Radeon/Raven)
summary: - Acer Aspire A315 ACPI failure on Ubuntu 18.04, kernel hangs, can't load
- (AMD Ryzen 5/Radeon/Raven)
+ Acer Aspire A315 ACPI failure on Ubuntu 18.04, kernel hangs, can't load,
+ kernel freeze (AMD Ryzen 5/Radeon/Raven)
12 comments hidden view all 297 comments

Erik, I think this is in connection with clocksource calibration but I'm not an expert.

This works:
[ 0.007000] tsc: Fast TSC calibration using PIT
[ 0.008000] tsc: Detected 1996.299 MHz processor
[ 0.008000] clocksource: tsc-early: mask: 0xffffffffffffffff max_cycles: 0x398d0c7513b, max_idle_ns: 881590744042 ns

This doesn't:
[...] tsc: Refined tsc clocksource calibration: ...
[...] clocksource: tsc: mask: 0xfff...f (...)

Changed in linux:
importance: Unknown → Medium
status: Unknown → Incomplete
2 comments hidden view all 297 comments

Hi, I was trying another kernel parameters and noapic seems to work. It is not needed to disable the whole ACPI "service", however I don't know how important apic is. On kernel 4.18 even temperature sensors appear.
Power management is almost perfect if cpu governor is set to powersave.

At least amdgpu crashes now so kernel doesn't start without nomodeset. Could this be an acpi problem or I should ask kernel firmware developers?

Hi,
amdgpu doesn't crash on my a315-41g-r40x (BIOS V1.08) with
  linux-next-next-20180713 compiled with VGA_SWITCHEROO=N
and with
  kernel parameters: ivrs_ioapic[4]=00:14.0 ivrs_ioapic[5]=00:00.2

gg71, where have you been till now? :D
Thanks, I will try it.

gg71, it works almost perfectly, thanks again. I have been working on this for ca one month. Please write a mail to me if you have any new info.

4 comments hidden view all 297 comments

The solution for Acer A315-41G-* notebooks: (USE AT YOUR OWN RISK - PLS be very careful)

1. Load kernel with these parameters: ivrs_ioapic[4]=00:14.0 ivrs_ioapic[5]=00:00.2 nomodeset
This is how it can be done (1. answer/first half 1-4): https://askubuntu.com/questions/19486/how-do-i-add-a-kernel-boot-parameter

1/b.(if it is not installed) Install ubuntu and load installed kernel again using the parameters (see 1.)

2. Start a terminal and do these steps:
> cd ~
> mkdir kernelbuild
> cd kernelbuild
> wget -c https://cdn.kernel.org/pub/linux/kernel/v4.x/linux-4.17.6.tar.xz
> tar -xvf linux-4.17.6.tar.xz
> cd linux-4.17.6
> sudo apt install git build-essential kernel-package fakeroot libncurses5-dev libssl-dev ccache bison flex
> make menuconfig
+> Save,OK,EXIT
> nano .config
+> ctrl+w and search for CONFIG_VGA_SWITCHEROO=y
+> replace y with n (this is not ideal and should be fixed later)
+> ctrl+o, enter
> make -j4 (this will take a while, be patient)
> make modules_install
> sudo make install
> sudo nano /etc/default/grub
+> Edit the correct line and add the parameters: GRUB_CMDLINE_LINUX_DEFAULT="quiet splash ivrs_ioapic[4]=00:14.0 ivrs_ioapic[5]=00:00.2"
+>CTRL+O, enter
>sudo update-grub
+> reboot and start the correct kernel

If you install xsensors (sudo apt install xsensors) and start it (xsensors) you can monitor the temperature values of your notebook. (Recommended)

Richard Baka (bakarichard91) wrote :

Dear Ubuntu maintainers,

couldn't this be fixed by an ubuntu kernel patch? The hardest part is to disable gpu switching at kernel load time. APIC fixing parameters can be hardcoded for these models I think or search for the correct pci controller using a smart script.

This was a hell of an investigation, never again. Thanks for gg71, he/she is a lifesaver.

4 comments hidden view all 297 comments

Hi Richard:

This issue should be related to the buggy BIOS ivrs table.
Kernel panic when found no southbridge device ID.

Could you try boot kernel with "amd_iommu_dump=1 amd_iommu=off" (remove other kernel parameters you tried to solve this issue).

If it works, please attach the dmesg here.
I will try to make a kernel patch to make kernel boot with irq map disabled instead of panic.

Richard Baka (bakarichard91) wrote :

Hi AaronMa,

thanks for the response. I tried it but it didn't work. I think iommu problem is not the main reason of the kernel hang. Otherwise it can be disabled in BIOS and there is no change.

The main reason is: https://bugzilla.kernel.org/attachment.cgi?id=276587 like you can se on this picture is that IOAPIC[4] and IOAPIC[5] are not in the invrs table so we should search the correct pci controllers using lspci and give them to the kernel.

In this way:
LINUX_DEFAULT="quiet splash ivrs_ioapic[4]=00:14.0 ivrs_ioapic[5]=00:00.2"

Kernel can be started even with noapic but two sensors will be missing and the advanced touchpad functions will not work. This is the reason of CONFIG_VGA_SWITCHEROO=n compile time kernel parameter.

There is an another problem: this notebook has two GPUs and amdgpu (or the kernel, I don't know) can not handle this correctly so gpu switching has to be disabled

Richard Baka (bakarichard91) wrote :

Kernel can be started even with noapic but two sensors will be missing and the advanced touchpad functions will not work.

!!!This line is not here: This is the reason of CONFIG_VGA_SWITCHEROO=n compile time kernel parameter.

There is an another problem: this notebook has two GPUs and amdgpu (or the kernel, I don't know) can not handle this correctly so gpu switching has to be disabled
!!!But here: This is the reason of CONFIG_VGA_SWITCHEROO=n compile time kernel parameter.

Richard Baka (bakarichard91) wrote :

AaronMa,

This is the iommu debug:

[ 0.000000] AMD-Vi: Using IVHD type 0x11
[ 0.000000] AMD-Vi: device: 00:00.2 cap: 0040 seg: 0 flags: b0 info 0000
[ 0.000000] AMD-Vi: mmio-addr: 00000000fd900000
[ 0.000000] AMD-Vi: DEV_SELECT_RANGE_START devid: 00:01.0 flags: 00
[ 0.000000] AMD-Vi: DEV_RANGE_END devid: ff:1f.6
[ 0.000000] AMD-Vi: DEV_ALIAS_RANGE devid: ff:00.0 flags: 00 devid_to: 00:14.4
[ 0.000000] AMD-Vi: DEV_RANGE_END devid: ff:1f.7
[ 0.000000] AMD-Vi: DEV_SPECIAL(HPET[0]) devid: 00:14.0
[ 0.000000] AMD-Vi: DEV_SPECIAL(IOAPIC[33]) devid: 00:14.0
[ 0.000000] AMD-Vi: DEV_SPECIAL(IOAPIC[34]) devid: 00:00.1
[ 0.000000] [Firmware Bug]: AMD-Vi: No southbridge IOAPIC found

I will give you the correct iommu "addresses" after dinner :).

Richard Baka (bakarichard91) wrote :

HOT NEWS!!

CONFIG_VGA_SWITCHEROO=n can be avoided using these kernel parameters amdgpu.runpm=0 radeon.modeset=0.
Further investigation is in progress...

Richard Baka (bakarichard91) wrote :

This could be the better solution because of the notebook's lowest heating but I'm not sure.

Richard Baka (bakarichard91) wrote :
Download full text (4.5 KiB)

Hi all,

After a bit of testing the power management seems to be better but it is far away from perfect. I don't see any anomaly watching temperature sensors (instead of ath10k_hwmon-pci(?!??)) but my notebook is definitely warm if I hold it on my lap.
This is more better on win10, I don't know why.

mosomaci@pc:~$ sensors
k10temp-pci-00c3
Adapter: PCI adapter
Tdie: +55.0°C (high = +70.0°C)
Tctl: +55.0°C

amdgpu-pci-0100
Adapter: PCI adapter
vddgfx: +0.81 V
fan1: N/A
temp1: +50.0°C (crit = +104000.0°C, hyst = -273.1°C)
power1: 1.13 kW (cap = 28.00 W)

ath10k_hwmon-pci-0300
Adapter: PCI adapter
temp1: +91.0°C

amdgpu-pci-0400
Adapter: PCI adapter
vddgfx: N/A
vddnb: N/A
fan1: N/A
temp1: +55.0°C (crit = +80.0°C, hyst = +0.0°C)
power1: N/A

Could our APIC fix not a perfect solution for this problem? I know that the DSDT is totally broken:

[ 0.088280] ACPI: Added _OSI(Module Device)
[ 0.088280] ACPI: Added _OSI(Processor Device)
[ 0.088280] ACPI: Added _OSI(3.0 _SCP Extensions)
[ 0.088280] ACPI: Added _OSI(Processor Aggregator Device)
[ 0.088280] ACPI: Added _OSI(Linux-Dell-Video)
[ 0.092591] ACPI: [Firmware Bug]: BIOS _OSI(Linux) query ignored
[ 0.100296] ACPI BIOS Error (bug): Failure creating [\_SB.PCI0.LPC0.EC0._Q46], AE_ALREADY_EXISTS (20180531/dswload2-316)
[ 0.100309] ACPI Error: AE_ALREADY_EXISTS, During name lookup/catalog (20180531/psobject-221)
[ 0.100313] ACPI Error: Ignore error and continue table load (20180531/psobject-604)
[ 0.100321] ACPI BIOS Error (bug): Could not resolve [\_SB.PCI0.LPC0.EC0.UX**], AE_NOT_FOUND (20180531/psargs-330)
[ 0.100326] ACPI Error: Ignore error and continue table load (20180531/psobject-604)
[ 0.100332] ACPI BIOS Error (bug): Could not resolve [\_SB.PCI0.LPC0.EC0.M000], AE_NOT_FOUND (20180531/psargs-330)
[ 0.100336] ACPI Error: Ignore error and continue table load (20180531/psobject-604)
[ 0.100343] ACPI BIOS Error (bug): Could not resolve [\_SB.PCI0.LPC0.EC0.M049], AE_NOT_FOUND (20180531/psargs-330)
[ 0.100347] ACPI Error: Ignore error and continue table load (20180531/psobject-604)
[ 0.100353] ACPI BIOS Error (bug): Could not resolve [\_SB.PCI0.LPC0.EC0.M280], AE_NOT_FOUND (20180531/psargs-330)
[ 0.100357] ACPI Error: Ignore error and continue table load (20180531/psobject-604)
[ 0.100364] ACPI BIOS Error (bug): Could not resolve [\_SB.PCI0.LPC0.EC0.M009], AE_NOT_FOUND (20180531/psargs-330)
[ 0.100369] ACPI Error: Ignore error and continue table load (20180531/psobject-604)
[ 0.100372] ACPI Error: Skipping While/If block (20180531/psloop-594)
[ 0.100378] ACPI BIOS Error (bug): Could not resolve [\_SB.PCI0.LPC0.EC0.M000], AE_NOT_FOUND (20180531/psargs-330)
[ 0.100383] ACPI Error: Ignore error and continue table load (20180531/psobject-604)
[ 0.100390] ACPI Error: Cannot release Mutex [QMUX], not acquired (20180531/exmutex-359)
[ 0.100394] ACPI Error: Ignore error and continue table load (20180531/psobject-604)
[ 0.100402] ACPI BIOS Error (bug): Could not resolve [\_SB.PCI0.GPP2.BCM5], AE_NOT_FOUND (20180531...

Read more...

summary: - Acer Aspire A315 ACPI failure on Ubuntu 18.04, kernel hangs, can't load,
- kernel freeze (AMD Ryzen 5/Radeon/Raven)
+ Acer Aspire A315 IOAPIC failure on Ubuntu 18.04, kernel hangs, can't
+ load, kernel freeze (AMD Ryzen 5/Radeon/Raven) / AMDGPU Hybrid crash
tags: added: patch
Changed in linux (Ubuntu):
importance: Undecided → Medium
status: Confirmed → Triaged
Changed in linux-firmware (Ubuntu):
status: New → Confirmed
siyia (siyia) on 2018-11-18
Changed in amd:
status: New → Fix Committed
Aroiu Ovidiu (ixanneo) on 2018-11-20
Changed in amd:
status: Fix Committed → Fix Released
217 comments hidden view all 297 comments
Richard Baka (bakarichard91) wrote :

I patched the ssdt tables and my ubuntu 18.10 works perfectly. There is no need to fix everything but the most important parts.

Kalin Stoyanov (kgstoyanov) wrote :

Do you mean "ivrs_ioapic[4]=00:14.0 ivrs_ioapic[5]=00:00.2"? That made my laptop perform worse than with "noacpi" (on ubuntu 18.04 with acer 315-41 (not G)) - it started heating up in idle, and battery drained faster. I didn't try building the kernel like you suggested earlier but to be honest at this point I am sick of reinstalling linux over and over again, and want something that would be guaranteed to work.

Aroiu Ovidiu (ixanneo) wrote :

With "pci=noacpi amd_iommu=on" and touchpad on "basic" I have good result on Ubuntu 18.10. Sleep/resume working fine, idle cpu temp 38 degree, performance fine too.

siyia (siyia) wrote :

post output of watch lscpu

siyia (siyia) wrote :

after resuming from suspend

Aroiu Ovidiu (ixanneo) wrote :

I can't :( Just instaled Solus, update it and now it's stuck on bios splash.... I'll install Ubuntu again when I have time and post lscpu.

siyia (siyia) wrote :

In manjaro 18 pci=noacpi disables dgpu.

siyia (siyia) wrote :

noapic works fine only the bloody suspend isnt working

Richard Baka (bakarichard91) wrote :

noapic, noacpi, etc: all of them are incorrect fixes for this problems. These are just workarounds. With using these you limit the one of the most important part of your notebook hardvare functions/behaviour. It's better if you install Windows 10 instead of using these limitations.

https://wiki.ubuntu.com/Kernel/Reference/ACPITricksAndTips

siyia (siyia) wrote :

anything is better than Windows 10

Richard Baka (bakarichard91) wrote :

I sent you my hacked acpi tables, why don't you use them?

siyia (siyia) wrote :

Well, i used your tables and they work like pci=noacpi, they disable my dgpu and suspend doesnt work. If you have patched your tables again, against bios 1.11 please send them again for testing.

Richard Baka (bakarichard91) wrote :

siyia, why would I ask it if they didn't worked well? They work correctly so you did something wrong. Show me a dmesg with custom table loading.

siyia (siyia) wrote :

all i did differently was load my own dsdt with your ssdts tables, with your dsdt table and your ssdts my pc wouldnt boot,please post them here as attachments so everyone can test please

Richard Baka (bakarichard91) wrote :

siyia, you have exactly the same laptop model as I have, they should work. You do something wrong. Maybe you use an incorrect grub or kernel parameter.

this a section of my /boot/grub/grub.cfg. Yours should look like this.
### BEGIN /etc/grub.d/40_custom_proxy ###
menuentry "Ubuntu" --class ubuntu --class gnu-linux --class gnu --class os $menuentry_id_option 'gnulinux-simple-0986d268-5f46-4555-9214-1d9dffa79286' {
 recordfail
 load_video
 gfxmode $linux_gfx_mode
 insmod gzio
 if [ x$grub_platform = xxen ]; then insmod xzio; insmod lzopio; fi
 insmod part_gpt
 insmod ext2
 set root='hd0,gpt2'
   acpi /boot/amls/ssdt10.aml
   acpi /boot/amls/ssdt11.aml
   acpi /boot/amls/ssdt13.aml
   acpi /boot/amls/ssdt14.aml
   acpi /boot/amls/ssdt1.aml
   acpi /boot/amls/ssdt2.aml
   acpi /boot/amls/ssdt3.aml
   acpi /boot/amls/ssdt4.aml
   acpi /boot/amls/ssdt5.aml
   acpi /boot/amls/ssdt6.aml
   acpi /boot/amls/ssdt8.aml
   acpi /boot/amls/ssdt9.aml
 if [ x$feature_platform_search_hint = xy ]; then
   search --no-floppy --fs-uuid --set=root --hint-bios=hd0,gpt2 --hint-efi=hd0,gpt2 --hint-baremetal=ahci0,gpt2 0986d268-5f46-4555-9214-1d9dffa79286
 else
   search --no-floppy --fs-uuid --set=root 0986d268-5f46-4555-9214-1d9dffa79286
 fi
 linux /boot/vmlinuz-4.18.0-10-generic root=UUID=0986d268-5f46-4555-9214-1d9dffa79286 ro ivrs_ioapic[4]=00:14.0 ivrs_ioapic[5]=0:00.2 quiet splash $vt_handoff
 initrd /boot/initrd.img-4.18.0-10-generic

Additional kernel parameters aren't needed. Don't change iommu settings or anything else. Iommu should be enabled in bios menu.

siyia (siyia) wrote :

"ivrs_ioapic[4]=00:14.0 ivrs_ioapic[5]=0:00.2" why are you using ioapic remapping if they work?what exactly did your patched tables fixed?

Michael J (drvrmike1) wrote :

Hi All...Have A315-41-R8UU Bios v. 1.11. What I have tested: Debian 9 Stretch [with firmware] kernel 4.9x, xfce4...This boots, without issue, but haven't tested out it's stability long-term. No boot args needed. Brightness controls do not work. Also tested openSUSE Leap 15, no boot args, KDE edition. Brightness controls do not work. Will eventually freeze after some time idle. On both distros, get the same acpi bugs during boot. To try and mitigate some of the BIOS bugs, I turned off WoL, SATA ch1, Open Lid resume, Disable TPM (would get tpm errors on boot), D2D Recovery disabled, secure boot disabled, and basic touchpad. Oddly, when attempting to install xubuntu 18 LTS or debian 9 testing, that's when I get all the errors described above. I have added the ivrs_ioapic[4]=00:14.0 ivrs_ioapic[5]=0:00.2 to grub when trying to run the installers, and they will indeed run, at low resolution. Why would older kernels run without the boot params for acpi, while these newer kernels will not? Maybe this can help someone? This thing so far, is a paperweight.

Richard Baka (bakarichard91) wrote :

siyia (siyia), ok you are right, 1.11 messed it up, there is no suspend, I was wrong. With previous versions it worked perfectly but remapping was needed.

Richard Baka (bakarichard91) wrote :
Download full text (3.2 KiB)

Ok, I've jus done a restart and it works now. I don't understand.

[ 485.749519] [drm:amdgpu_device_ip_suspend [amdgpu]] *ERROR* suspend of IP block <vcn_v1_0> failed -12
[ 485.812592] amdgpu 0000:04:00.0: 000000005ebb32cf unpin not necessary
[ 485.896218] ACPI: EC: interrupt blocked
[ 485.934777] ACPI: Preparing to enter system sleep state S3
[ 485.936578] ACPI: EC: event blocked
[ 485.936579] ACPI: EC: EC stopped
[ 485.936580] PM: Saving platform NVS memory
[ 485.936876] Disabling non-boot CPUs ...
[ 485.951469] irq_migrate_all_off_this_cpu: 11 callbacks suppressed
[ 485.951470] IRQ 57: no longer affine to CPU1
[ 485.952546] smpboot: CPU 1 is now offline
[ 485.976477] smpboot: CPU 2 is now offline
[ 485.999454] IRQ 27: no longer affine to CPU3
[ 485.999466] IRQ 36: no longer affine to CPU3
[ 485.999474] IRQ 37: no longer affine to CPU3
[ 485.999484] IRQ 45: no longer affine to CPU3
[ 485.999491] IRQ 46: no longer affine to CPU3
[ 485.999502] IRQ 58: no longer affine to CPU3
[ 486.000520] smpboot: CPU 3 is now offline
[ 486.024544] smpboot: CPU 4 is now offline
[ 486.047503] IRQ 28: no longer affine to CPU5
[ 486.047512] IRQ 30: no longer affine to CPU5
[ 486.047522] IRQ 34: no longer affine to CPU5
[ 486.048585] smpboot: CPU 5 is now offline
[ 486.072036] smpboot: CPU 6 is now offline
[ 486.096002] smpboot: CPU 7 is now offline
[ 486.098409] ACPI: Low-level resume complete
[ 486.098463] ACPI: EC: EC started
[ 486.098463] PM: Restoring platform NVS memory
[ 486.287964] Enabling non-boot CPUs ...
[ 486.288008] x86: Booting SMP configuration:
[ 486.288008] smpboot: Booting Node 0 Processor 1 APIC 0x1
[ 486.290582] cache: parent cpu1 should not be sleeping
[ 486.290678] microcode: CPU1: patch_level=0x0810100b
[ 486.290825] CPU1 is up
[ 486.290862] smpboot: Booting Node 0 Processor 2 APIC 0x2
[ 486.293528] cache: parent cpu2 should not be sleeping
[ 486.293664] microcode: CPU2: patch_level=0x0810100b
[ 486.293875] CPU2 is up
[ 486.293899] smpboot: Booting Node 0 Processor 3 APIC 0x3
[ 486.296526] cache: parent cpu3 should not be sleeping
[ 486.296650] microcode: CPU3: patch_level=0x0810100b
[ 486.296872] CPU3 is up
[ 486.296946] smpboot: Booting Node 0 Processor 4 APIC 0x4
[ 486.300467] cache: parent cpu4 should not be sleeping
[ 486.300610] microcode: CPU4: patch_level=0x0810100b
[ 486.300850] CPU4 is up
[ 486.300874] smpboot: Booting Node 0 Processor 5 APIC 0x5
[ 486.303506] cache: parent cpu5 should not be sleeping
[ 486.303628] microcode: CPU5: patch_level=0x0810100b
[ 486.303966] CPU5 is up
[ 486.303996] smpboot: Booting Node 0 Processor 6 APIC 0x6
[ 486.306660] cache: parent cpu6 should not be sleeping
[ 486.306790] microcode: CPU6: patch_level=0x0810100b
[ 486.307076] CPU6 is up
[ 486.307100] smpboot: Booting Node 0 Processor 7 APIC 0x7
[ 486.309745] cache: parent cpu7 should not be sleeping
[ 486.309900] microcode: CPU7: patch_level=0x0810100b
[ 486.310242] CPU7 is up
[ 486.311320] ACPI: Waking up from system sleep state S3
[ 486.312939] ACPI: EC: interrupt unblocked
[ 486.377731] ACPI: EC: event unblocked
[ 486.379966] [drm] PCIE GART of 1024M enabled (table at 0x0000...

Read more...

siyia (siyia) wrote :

ok please send them to me at <email address hidden> or post them here in a .zip named with the laptop model so we can test.

siyia (siyia) wrote :

maybe my laptop is slightly different all though our iommu dump is the same, i have ryzen 3 with radeon 535, you have ryzen 5 with radeon 535

siyia (siyia) wrote :

also if you check with watch lscpu after resuming cpu frequency runs at max if you used remap kernel parameters

Richard Baka (bakarichard91) wrote :

Vendor ID: AuthenticAMD
CPU family: 23
Model: 17
Model name: AMD Ryzen 5 2500U with Radeon Vega Mobile Gfx
Stepping: 0
CPU MHz: 1402.460
CPU max MHz: 2000,0000
CPU min MHz: 1600,0000
BogoMIPS: 3992.22
Virtualization: AMD-V

acpitool -c
ű CPU type : AMD Ryzen 5 2500U with Radeon Vega Mobile Gfx
  Min/Max frequency : 1600/2000 MHz
  Current frequency : 1466 MHz
  Frequency governor : ondemand
  Freq. scaling driver : acpi-cpufreq
  Cache size : 1466.690 KB
  Bogomips : 3992.22
  Bogomips : 3992.22
  Bogomips : 3992.22
  Bogomips : 3992.22
  Bogomips : 3992.22
  Bogomips : 3992.22
  Bogomips : 3992.22
  Bogomips : 3992.22
  Function Show_CPU_Info : could not read directory /proc/acpi/processor/
  Make sure your kernel has ACPI processor support enabled.

cpufreq: CPU frequency scaling tests.
--------------------------------------------------------------------------------
Test 1 of 7: CPU frequency table consistency
PASSED: Test 1, CPU frequency tables are consistent

Test 2 of 7: CPU frequency table duplicates
PASSED: Test 2, No duplicates in CPU frequency table

Test 3 of 7: CPU frequency firmware limits
PASSED: Test 3, CPU BIOS limit OK

Test 4 of 7: CPU frequency claimed maximum
PASSED: Test 4, CPU max frequencies OK

Test 5 of 7: CPU frequency SW_ANY control
PASSED: Test 5, P-state coordination under hardware control.

Test 6 of 7: CPU frequency SW_ALL control
PASSED: Test 6, Firmware SW_ALL OK

Test 7 of 7: CPU frequency performance tests.
PASSED: Test 7, CPU performance scaling OK

maxfreq: Test max CPU frequencies against max scaling frequency.
--------------------------------------------------------------------------------
Test 1 of 1: Maximum CPU frequency test.
This test checks the maximum CPU frequency as detected by the kernel for each
CPU against maxiumum frequency as specified by the BIOS frequency scaling
settings.
SKIPPED: Test 1, Cannot read CPU frequencies from /proc/cpuinfo, this generally
happens on AMD CPUs, skipping test.

siyia (siyia) wrote :

they seem alright, send me your patched tables or send them to my email please i want to test them again.

siyia (siyia) wrote :

you only use ivrs_ioapic[4]=00:14.0 ivrs_ioapic[5]=0:00.2 and load your tables, is this right?

Richard Baka (bakarichard91) wrote :

yep, I'm trying to make a better patch, please be patient :D

Michael J (drvrmike1) wrote :

FWIW...Last night, I upgraded Debian9 from stock kernel 4.9 -> 4.18 using debian-backports. Very important, forced the amd-gpu firmware to the backported version also. Have ivrs_ioapic[4]=00:14.0 ivrs_ioapic[5]=0:00.2 added to my grub config permanently. No boot errors and it is using the raven ridge dgpu drivers [DRM]. Suspend works a treat and left it on for almost 24hrs, no lockups. Runs cool, with the fan hardly coming on [low RPM]. Seems quite stable. This may be of help to anyone that needs a distro to use for the time being. According to my research, seems a lot of these issues are due to buggy BIOS acpi tables and lack of raven ridge dgpu support in the kernel.

Kalin Stoyanov (kgstoyanov) wrote :

Hi all, I decided to try "ivrs_ioapic[4]=00:14.0 ivrs_ioapic[5]=0:00.2" and it actually worked for me. It is possible that I am a complete idiot and misspelled it 5 times in a row before, but I could have sworn it did nothing before. Now suspend and brightness keys work..:

k@k-Aspire-A315-41:~$ journalctl -b | grep AMD-Vi
Dec 13 21:57:49 k-Aspire-A315-41 kernel: AMD-Vi: IOMMU performance counters supported
Dec 13 21:57:49 k-Aspire-A315-41 kernel: AMD-Vi: Found IOMMU at 0000:00:00.2 cap 0x40
Dec 13 21:57:49 k-Aspire-A315-41 kernel: AMD-Vi: Extended features (0x4f77ef22294ada):
Dec 13 21:57:49 k-Aspire-A315-41 kernel: AMD-Vi: Interrupt remapping enabled
Dec 13 21:57:49 k-Aspire-A315-41 kernel: AMD-Vi: virtual APIC enabled
Dec 13 21:57:49 k-Aspire-A315-41 kernel: AMD-Vi: Lazy IO/TLB flushing enabled

However it the sensors are still not working:
(this is exactly the same as before)
k@k-Aspire-A315-41:~$ sensors
amdgpu-pci-0300
Adapter: PCI adapter
fan1: N/A
temp1: +0.0°C (crit = +0.0°C, hyst = +0.0°C)

k10temp-pci-00c3
Adapter: PCI adapter
temp1: +44.2°C (high = +70.0°C)

I am still running Ubuntu 18.04 with the oldest bios. Full model is A315-41-R88G
dmesg: https://paste.ubuntu.com/p/vmPdsQSRt4/
lspci: https://paste.ubuntu.com/p/g5NPcBhFpw/

siyia (siyia) wrote :

amdgpu-pci-0300 sensor works only if the gpu is under load

Kalin Stoyanov (kgstoyanov) wrote :

Ok, I'll try putting it under load when I have the time. But I thought there were supposed to be more cpu sensors.. or am I wrong?

Richard Baka (bakarichard91) wrote :

siyia (siyia), e-mail has been sent.

schlauf (schlauf) wrote :

I performed a full 18.10 install and my A315-41 with Ryzen 3 2200U performed flawlessly. Everything works out of the box, except for graphical performance.

Everything I am doing on the desktop has a little lag and it seems to me that CPU load is rising quickly already with simple tasks on the desktop. Is it possible that there is software rendering taking place? How would I find out? According to glxinfo, direct rendering is in effect.

Would it be the right action to install AMDGPU Pro driver? Or is there any other chance to make this piece of hardware sprint away?

siyia (siyia) wrote :

disable amd DC with kernel parameter amdgpu.dc=0

siyia (siyia) wrote :

on my polaris desktop i had low 2d performance and screen flickering the above parameter solved it,amdgpu dc is still under heavy development.

Aroiu Ovidiu (ixanneo) wrote :

2019 and no real fix....this is realy sad...
I'm going back to windows 10.....

siyia (siyia) wrote :

Hello all gentoo and gentoo based distros can boot on my laptop without any parameters!!!!

siyia (siyia) wrote :

Model is Aspire A315-41G

I made the same experience on A315-41 R2GU. But I needed two kernel options. If I remember correctly it was SME Encryption = No (under Processor and Features?) and another one, to prevent freezes in one of the later C-States.

Ok. According to https://wiki.gentoo.org/wiki/AMDGPU disabling SME is needed for AMDGPU: "Currently, amdgpu does not work with AMD Secure Memory Encryption (SME) enabled, so you need to disable its use either in the kernel configuration or on the kernel command line (i.e. in /etc/default/grub as part of GRUB_CMDLINE_LINUX)."

siyia (siyia) wrote :

i have only the udev gets stuck problem on other linuxes,with gentoo i do not need to add another kernel parameter weird isnt it?

Displaying first 40 and last 40 comments. View all 297 comments or add a comment.
This report contains Public information  Edit
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.