amdgpu PSP load sos failed

Bug #1952747 reported by Tuomas Heino
12
This bug affects 2 people
Affects Status Importance Assigned to Milestone
linux-signed-hwe-5.11 (Ubuntu)
Confirmed
Undecided
Unassigned

Bug Description

Regression to be analyzed further later.

03:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Navi 14 [Radeon RX 5500/5500M / Pro 5500M] (rev c5)

Most relevant part of boot logs (more in attachment):

Nov 30 13:08:00 ub3ff kernel: microcode: microcode updated early to revision 0x21, date = 2019-02-13
Nov 30 13:08:00 ub3ff kernel: Linux version 5.11.0-41-generic (buildd@lgw01-amd64-005) (gcc (Ubuntu 9.3.0-17ubuntu1~20.04) 9.3.0, GNU ld (GNU Binutils for Ubuntu) 2.34) #45~20.04.1-Ubuntu SMP Wed Nov 10 10:20:10 UTC 2021 (Ubuntu 5.11.0-41.45~20.04.1-generic 5.11.22)
Nov 30 13:08:00 ub3ff kernel: Command line: BOOT_IMAGE=/boot/vmlinuz-5.11.0-41-generic root=UUID=2b9e2f7d-99b7-45f9-aabc-5ca23e66a7a2 ro drm.debug=0x4 splash vt.handoff=7
[...]
Nov 30 13:08:04 ub3ff kernel: amdgpu 0000:03:00.0: amdgpu: Will use PSP to load VCN firmware
Nov 30 13:08:04 ub3ff systemd[1]: Finished Flush Journal to Persistent Storage.
Nov 30 13:08:04 ub3ff kernel: snd_hda_intel 0000:03:00.1: refused to change power state from D0 to D3hot
Nov 30 13:08:04 ub3ff kernel: [drm:psp_hw_start [amdgpu]] *ERROR* PSP load sos failed!
Nov 30 13:08:04 ub3ff kernel: [drm:psp_hw_init [amdgpu]] *ERROR* PSP firmware loading failed
Nov 30 13:08:04 ub3ff kernel: [drm:amdgpu_device_fw_loading [amdgpu]] *ERROR* hw_init of IP block <psp> failed -22
Nov 30 13:08:04 ub3ff kernel: amdgpu 0000:03:00.0: amdgpu: amdgpu_device_ip_init failed
Nov 30 13:08:04 ub3ff kernel: amdgpu 0000:03:00.0: amdgpu: Fatal error during GPU init
Nov 30 13:08:04 ub3ff kernel: amdgpu 0000:03:00.0: amdgpu: amdgpu: finishing device.
[...]
Nov 30 13:08:04 ub3ff kernel: RIP: 0010:drm_mm_takedown+0x23/0x30 [drm]
Nov 30 13:08:04 ub3ff kernel: RIP: 0033:0x7f147a8c589d
Nov 30 13:08:04 ub3ff kernel: RIP: 0010:drm_mm_takedown+0x23/0x30 [drm]
Nov 30 13:08:04 ub3ff kernel: RIP: 0033:0x7f147a8c589d
[...]
Nov 30 13:08:04 ub3ff kernel: amdgpu: probe of 0000:03:00.0 failed with error -22

ProblemType: Bug
DistroRelease: Ubuntu 20.04
Package: linux-image-5.11.0-41-generic 5.11.0-41.45~20.04.1
ProcVersionSignature: Ubuntu 5.11.0-41.45~20.04.1-generic 5.11.22
Uname: Linux 5.11.0-41-generic x86_64
ApportVersion: 2.20.11-0ubuntu27.21
Architecture: amd64
CasperMD5CheckResult: skip
CurrentDesktop: ubuntu:GNOME
Date: Tue Nov 30 13:58:57 2021
InstallationDate: Installed on 2020-04-10 (598 days ago)
InstallationMedia: Ubuntu 20.04 LTS "Focal Fossa" - Beta amd64 (20200402)
SourcePackage: linux-signed-hwe-5.11
UpgradeStatus: No upgrade log present (probably fresh install)

Revision history for this message
Tuomas Heino (iheino+ub) wrote :
Revision history for this message
Tuomas Heino (iheino+ub) wrote :

Turning off secondary and tertiary monitors for the duration of the initial boot appears to work around this issue. Suspend-to-ram works with all monitors powered (same hardware configuration as in #1926400).

Tuomas Heino (iheino+ub)
description: updated
summary: - amdgpu PSP load sos failed after hwe 5.11.0-27 -> -41 update
+ amdgpu PSP load sos failed
Revision history for this message
Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux-signed-hwe-5.11 (Ubuntu):
status: New → Confirmed
Revision history for this message
Jarett (jarett-millard) wrote :

I'm affected by this too. It appears to be a timing issue in the amdgpu driver, judging by this Superuser post: https://superuser.com/questions/1747738/amd-radeon-instinct-mi25-fails-to-initialize-drmamdgpu-device-fw-loading-amd

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.