Fix system hang while entering suspend with AMD Navi3x graphics
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
HWE Next |
New
|
Undecided
|
Unassigned | ||
linux (Ubuntu) |
Invalid
|
Undecided
|
Unassigned | ||
Jammy |
Invalid
|
Undecided
|
Unassigned | ||
Noble |
Fix Released
|
Medium
|
Unassigned | ||
linux-firmware (Ubuntu) |
Invalid
|
Undecided
|
Unassigned | ||
Jammy |
Fix Released
|
Undecided
|
Unassigned | ||
Noble |
Fix Released
|
Undecided
|
Unassigned | ||
linux-oem-6.5 (Ubuntu) |
Invalid
|
Undecided
|
Unassigned | ||
Jammy |
Fix Released
|
Undecided
|
Unassigned | ||
Noble |
Invalid
|
Undecided
|
Unassigned |
Bug Description
SRU Jusitification for Kernel
[Impact]
The system with AMD W7500/W7600/W7700 graphics will randomly hang when entering suspend. The page fault would keep happening and the system can't handle other tasks.
BUG: unable to handle page fault for address: 000000000a980148
[Fix]
Backport the fix from upstream
drm/amdgpu: skip to program GFXDEC registers for suspend abort · torvalds/
drm/amdgpu: Reset dGPU if suspend got aborted · torvalds/
https:/
[Test Case]
1. Install AMD W7500/W7600/W7700 graphics
2. Install latest firmware with dcn_3_2_0_dmcub.bin for Navi31 and 32 and dcn_3_2_1_dmcub.bin for Navi33.
3. Running fwts s3 stress test to check if system hangs
[Where problems could occur]
Improve the error handling when suspend and add the fallback mechanism in MES engine. Only observed on particular AMD models. Need to test w/ more combinations
=======
SRU Jusitification for linux-firmware
[Impact]
The system will randomly hang due to page fault while suspending.
[Fix]
Add release FW binary from AMD to linux-firmware
dcn_3_2_0_dmcub.bin for Navi31 and 32: https:/
dcn_3_2_1_dmcub.bin for Navi33: https:/
[Test Case]
1. Install AMD W7500/W7600/W7700 graphics
2. Test with latest linux kernel and linux-firmware
3. Running fwts s3 stress test to check if system hangs
[Where problems could occur]
The dcn_3_2_0_dmcub only for Navi31 and dcn_3_2_1_dmcub only for Navi33. The impact are restricted to particular series.
Changed in linux-oem-6.5 (Ubuntu Jammy): | |
status: | New → In Progress |
tags: | added: oem-priority originate-from-2048051 somerville |
tags: | added: kern-10794 |
Changed in linux-oem-6.5 (Ubuntu): | |
status: | Confirmed → Invalid |
Changed in linux-oem-6.5 (Ubuntu Noble): | |
status: | Confirmed → Invalid |
Changed in linux-oem-6.5 (Ubuntu Jammy): | |
status: | In Progress → Fix Committed |
Changed in linux-firmware (Ubuntu): | |
status: | Confirmed → Invalid |
tags: |
added: verification-done-jammy-linux-oem-6.5 removed: verification-needed-jammy-linux-oem-6.5 |
tags: |
added: verification-done-noble-linux removed: verification-needed-noble-linux |
tags: |
added: verification-done-noble-linux-nvidia-lowlatency removed: verification-needed-noble-linux-nvidia-lowlatency |
Status changed to 'Confirmed' because the bug affects multiple users.