2019-04-17 02:20:00 |
You-Sheng Yang |
bug |
|
|
added bug |
2019-04-17 02:20:00 |
You-Sheng Yang |
attachment added |
|
kernel log https://bugs.launchpad.net/bugs/1825074/+attachment/5256441/+files/kern.txt.gz |
|
2019-04-17 02:22:44 |
You-Sheng Yang |
nominated for series |
|
Ubuntu Bionic |
|
2019-04-17 02:22:44 |
You-Sheng Yang |
bug task added |
|
linux (Ubuntu Bionic) |
|
2019-04-17 02:30:06 |
Ubuntu Kernel Bot |
linux (Ubuntu): status |
New |
Incomplete |
|
2019-04-17 02:30:07 |
Ubuntu Kernel Bot |
linux (Ubuntu Bionic): status |
New |
Incomplete |
|
2019-04-17 02:36:49 |
You-Sheng Yang |
bug |
|
|
added subscriber Canonical Hardware Enablement |
2019-04-17 02:36:53 |
You-Sheng Yang |
tags |
|
originate-from-1824453 somerville |
|
2019-04-17 02:37:38 |
You-Sheng Yang |
linux (Ubuntu Bionic): assignee |
|
You-Sheng Yang (vicamo) |
|
2019-04-17 02:37:46 |
You-Sheng Yang |
linux (Ubuntu): status |
Incomplete |
In Progress |
|
2019-04-17 02:37:50 |
You-Sheng Yang |
linux (Ubuntu Bionic): status |
Incomplete |
In Progress |
|
2019-04-17 05:37:12 |
You-Sheng Yang |
attachment added |
|
0001-drm-amdgpu-dump-amdgpu_wb-usage.patch https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1825074/+attachment/5256463/+files/0001-drm-amdgpu-dump-amdgpu_wb-usage.patch |
|
2019-04-17 06:53:07 |
You-Sheng Yang |
description |
[Summary]
When do the S3 stress test with AMD RX550 installed, the system hung after resume from S3 at 112nd S3.
The kernel message:
[ 8120.977916] amdgpu 0000:01:00.0: (-22) failed to allocate wb slot
[ 8120.977941] [drm:amdgpu_ib_ring_tests [amdgpu]] *ERROR* amdgpu: failed testing IB on ring 11 (-22).
[ 8120.979662] [drm] ib test on ring 12 succeeded
[ 8120.981952] [drm] ib test on ring 13 succeeded
[ 8120.984578] [drm] ib test on ring 14 succeeded
[ 8120.984813] [drm] ib test on ring 15 succeeded
[ 8120.984825] [drm:amdgpu_device_resume [amdgpu]] *ERROR* ib ring test failed (-22).
[ 8120.997655] [drm] Type 1 DP-HDMI passive dongle 165Mhz:
[ 8121.022465] [drm] 92GH: [Block 0]
[ 8121.022465] [drm] 92GH: [Block 1]
[ 8121.022467] [drm] dc_link_detect: manufacturer_id = B838, product_id = 9202, serial_number = 1, manufacture_week = 29, manufacture_year = 18, display_name = 92GH, speaker_flag = 1, audio_mode_count = 1
[ 8121.022467] [drm] dc_link_detect: mode number = 0, format_code = 1, channel_count = 2, sample_rate = 7, sample_size = 7
[ 8121.022573] PM: resume of devices complete after 412.170 msecs
[ 8121.023076] acpi LNXPOWER:04: Turning OFF
[ 8121.023113] PM: Finishing wakeup.
[ 8121.023114] OOM killer enabled.
[ 8121.023114] Restarting tasks ...
[ 8121.023455] pci_bus 0000:04: Allocating resources
[ 8121.023471] pci 0000:03:00.0: bridge window [io 0x1000-0x0fff] to [bus 04] add_size 1000
[ 8121.023473] pci 0000:03:00.0: bridge window [mem 0x00100000-0x000fffff 64bit pref] to [bus 04] add_size 200000 add_align 100000
[ 8121.023474] pci 0000:03:00.0: bridge window [mem 0x00100000-0x000fffff] to [bus 04] add_size 200000 add_align 100000
[ 8121.023476] pci 0000:03:00.0: BAR 14: no space for [mem size 0x00200000]
[ 8121.023477] pci 0000:03:00.0: BAR 14: failed to assign [mem size 0x00200000]
[ 8121.023478] pci 0000:03:00.0: BAR 15: no space for [mem size 0x00200000 64bit pref]
[ 8121.023478] pci 0000:03:00.0: BAR 15: failed to assign [mem size 0x00200000 64bit pref]
[ 8121.023479] pci 0000:03:00.0: BAR 13: no space for [io size 0x1000]
[ 8121.023479] pci 0000:03:00.0: BAR 13: failed to assign [io size 0x1000]
[ 8121.023481] pci 0000:03:00.0: BAR 14: no space for [mem size 0x00200000]
[ 8121.023481] pci 0000:03:00.0: BAR 14: failed to assign [mem size 0x00200000]
[ 8121.023482] pci 0000:03:00.0: BAR 15: no space for [mem size 0x00200000 64bit pref]
[ 8121.023482] pci 0000:03:00.0: BAR 15: failed to assign [mem size 0x00200000 64bit pref]
[ 8121.023483] pci 0000:03:00.0: BAR 13: no space for [io size 0x1000]
[ 8121.023483] pci 0000:03:00.0: BAR 13: failed to assign [io size 0x1000]
[ 8121.023485] pci 0000:03:00.0: PCI bridge to [bus 04]
[ 8121.024358] done.
[ 8121.082344] video LNXVIDEO:00: Restoring backlight state
[ 8121.082346] PM: suspend exit
[ 8121.094634] IPv6: ADDRCONF(NETDEV_UP): eno1: link is not ready
[ 8121.112417] ata4: SATA link down (SStatus 4 SControl 300)
[ 8121.113212] ata3: SATA link down (SStatus 4 SControl 300)
[ 8121.113279] ata2: SATA link down (SStatus 4 SControl 300)
[ 8121.114133] ata1: SATA link down (SStatus 4 SControl 300)
[ 8121.192056] [drm] {1440x900, 1904x934@106500Khz}
[ 8121.282351] IPv6: ADDRCONF(NETDEV_UP): eno1: link is not ready
[ 8121.298481] amdgpu 0000:01:00.0: couldn't schedule ib on ring <sdma1>
[ 8121.298517] [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22)
[ 8121.298536] [drm:amd_sched_main [amdgpu]] *ERROR* Failed to run job!
[ 8122.183439] [drm] RC6 on
[ 8124.257908] e1000e: eno1 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: Rx/Tx
[ 8124.258035] IPv6: ADDRCONF(NETDEV_CHANGE): eno1: link becomes ready
[ 8124.269506] amdgpu 0000:01:00.0: couldn't schedule ib on ring <sdma1>
[ 8124.269539] [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22)
[ 8124.269558] [drm:amd_sched_main [amdgpu]] *ERROR* Failed to run job!
[ 8125.089361] amdgpu 0000:01:00.0: couldn't schedule ib on ring <sdma1>
[ 8125.089429] [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22)
[ 8125.089448] [drm:amd_sched_main [amdgpu]] *ERROR* Failed to run job!
[Reproduce Steps]
1. apt-get install -y fwts
2. fwts s3 --s3-multiple=1000 --s3-min-delay=60 --s3-max-delay=60
[Results]
Expected: pass the S3 stress test
Actual: system hung at 112nd S3
[Additional Information]
Kernel Version: 4.15.0-1035-oem
GPU: AMD RX550 (OPGA14) 1002:699f |
[Impact]
Systems with video cards using amdgpu driver may fail to resume due to resource leakage.
[Fix]
73469585510d drm/amdgpu: fix&cleanups for wb_clear
[Test Case]
Verified with fwts for a thounsand runs.
[Regression Risk]
Low. This patch has been included in stable kernel v4.16.y and on, and
it's mostly a trivial bug fix.
==== Original Bug Report ====
[Summary]
When do the S3 stress test with AMD RX550 installed, the system hung after resume from S3 at 112nd S3.
The kernel message:
[ 8120.977916] amdgpu 0000:01:00.0: (-22) failed to allocate wb slot
[ 8120.977941] [drm:amdgpu_ib_ring_tests [amdgpu]] *ERROR* amdgpu: failed testing IB on ring 11 (-22).
[ 8120.979662] [drm] ib test on ring 12 succeeded
[ 8120.981952] [drm] ib test on ring 13 succeeded
[ 8120.984578] [drm] ib test on ring 14 succeeded
[ 8120.984813] [drm] ib test on ring 15 succeeded
[ 8120.984825] [drm:amdgpu_device_resume [amdgpu]] *ERROR* ib ring test failed (-22).
[ 8120.997655] [drm] Type 1 DP-HDMI passive dongle 165Mhz:
[ 8121.022465] [drm] 92GH: [Block 0]
[ 8121.022465] [drm] 92GH: [Block 1]
[ 8121.022467] [drm] dc_link_detect: manufacturer_id = B838, product_id = 9202, serial_number = 1, manufacture_week = 29, manufacture_year = 18, display_name = 92GH, speaker_flag = 1, audio_mode_count = 1
[ 8121.022467] [drm] dc_link_detect: mode number = 0, format_code = 1, channel_count = 2, sample_rate = 7, sample_size = 7
[ 8121.022573] PM: resume of devices complete after 412.170 msecs
[ 8121.023076] acpi LNXPOWER:04: Turning OFF
[ 8121.023113] PM: Finishing wakeup.
[ 8121.023114] OOM killer enabled.
[ 8121.023114] Restarting tasks ...
[ 8121.023455] pci_bus 0000:04: Allocating resources
[ 8121.023471] pci 0000:03:00.0: bridge window [io 0x1000-0x0fff] to [bus 04] add_size 1000
[ 8121.023473] pci 0000:03:00.0: bridge window [mem 0x00100000-0x000fffff 64bit pref] to [bus 04] add_size 200000 add_align 100000
[ 8121.023474] pci 0000:03:00.0: bridge window [mem 0x00100000-0x000fffff] to [bus 04] add_size 200000 add_align 100000
[ 8121.023476] pci 0000:03:00.0: BAR 14: no space for [mem size 0x00200000]
[ 8121.023477] pci 0000:03:00.0: BAR 14: failed to assign [mem size 0x00200000]
[ 8121.023478] pci 0000:03:00.0: BAR 15: no space for [mem size 0x00200000 64bit pref]
[ 8121.023478] pci 0000:03:00.0: BAR 15: failed to assign [mem size 0x00200000 64bit pref]
[ 8121.023479] pci 0000:03:00.0: BAR 13: no space for [io size 0x1000]
[ 8121.023479] pci 0000:03:00.0: BAR 13: failed to assign [io size 0x1000]
[ 8121.023481] pci 0000:03:00.0: BAR 14: no space for [mem size 0x00200000]
[ 8121.023481] pci 0000:03:00.0: BAR 14: failed to assign [mem size 0x00200000]
[ 8121.023482] pci 0000:03:00.0: BAR 15: no space for [mem size 0x00200000 64bit pref]
[ 8121.023482] pci 0000:03:00.0: BAR 15: failed to assign [mem size 0x00200000 64bit pref]
[ 8121.023483] pci 0000:03:00.0: BAR 13: no space for [io size 0x1000]
[ 8121.023483] pci 0000:03:00.0: BAR 13: failed to assign [io size 0x1000]
[ 8121.023485] pci 0000:03:00.0: PCI bridge to [bus 04]
[ 8121.024358] done.
[ 8121.082344] video LNXVIDEO:00: Restoring backlight state
[ 8121.082346] PM: suspend exit
[ 8121.094634] IPv6: ADDRCONF(NETDEV_UP): eno1: link is not ready
[ 8121.112417] ata4: SATA link down (SStatus 4 SControl 300)
[ 8121.113212] ata3: SATA link down (SStatus 4 SControl 300)
[ 8121.113279] ata2: SATA link down (SStatus 4 SControl 300)
[ 8121.114133] ata1: SATA link down (SStatus 4 SControl 300)
[ 8121.192056] [drm] {1440x900, 1904x934@106500Khz}
[ 8121.282351] IPv6: ADDRCONF(NETDEV_UP): eno1: link is not ready
[ 8121.298481] amdgpu 0000:01:00.0: couldn't schedule ib on ring <sdma1>
[ 8121.298517] [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22)
[ 8121.298536] [drm:amd_sched_main [amdgpu]] *ERROR* Failed to run job!
[ 8122.183439] [drm] RC6 on
[ 8124.257908] e1000e: eno1 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: Rx/Tx
[ 8124.258035] IPv6: ADDRCONF(NETDEV_CHANGE): eno1: link becomes ready
[ 8124.269506] amdgpu 0000:01:00.0: couldn't schedule ib on ring <sdma1>
[ 8124.269539] [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22)
[ 8124.269558] [drm:amd_sched_main [amdgpu]] *ERROR* Failed to run job!
[ 8125.089361] amdgpu 0000:01:00.0: couldn't schedule ib on ring <sdma1>
[ 8125.089429] [drm:amdgpu_job_run [amdgpu]] *ERROR* Error scheduling IBs (-22)
[ 8125.089448] [drm:amd_sched_main [amdgpu]] *ERROR* Failed to run job!
[Reproduce Steps]
1. apt-get install -y fwts
2. fwts s3 --s3-multiple=1000 --s3-min-delay=60 --s3-max-delay=60
[Results]
Expected: pass the S3 stress test
Actual: system hung at 112nd S3
[Additional Information]
Kernel Version: 4.15.0-1035-oem
GPU: AMD RX550 (OPGA14) 1002:699f |
|
2019-04-17 12:28:29 |
Ubuntu Foundations Team Bug Bot |
tags |
originate-from-1824453 somerville |
originate-from-1824453 patch somerville |
|
2019-04-17 12:28:30 |
Ubuntu Foundations Team Bug Bot |
bug |
|
|
added subscriber Terry Rudd |
2019-04-23 04:46:56 |
Khaled El Mously |
linux (Ubuntu Bionic): status |
In Progress |
Fix Committed |
|
2019-04-29 16:04:16 |
Ubuntu Kernel Bot |
tags |
originate-from-1824453 patch somerville |
originate-from-1824453 patch somerville verification-needed-bionic |
|
2019-04-30 03:28:51 |
You-Sheng Yang |
tags |
originate-from-1824453 patch somerville verification-needed-bionic |
originate-from-1824453 patch somerville verification-done-bionic |
|
2019-05-14 19:00:51 |
Launchpad Janitor |
linux (Ubuntu Bionic): status |
Fix Committed |
Fix Released |
|
2019-05-14 19:00:51 |
Launchpad Janitor |
cve linked |
|
2017-5715 |
|
2019-05-14 19:00:51 |
Launchpad Janitor |
cve linked |
|
2017-5753 |
|
2019-05-14 19:00:51 |
Launchpad Janitor |
cve linked |
|
2017-5754 |
|
2019-05-14 19:00:51 |
Launchpad Janitor |
cve linked |
|
2018-12126 |
|
2019-05-14 19:00:51 |
Launchpad Janitor |
cve linked |
|
2018-12127 |
|
2019-05-14 19:00:51 |
Launchpad Janitor |
cve linked |
|
2018-12130 |
|
2019-05-14 19:00:51 |
Launchpad Janitor |
cve linked |
|
2018-16884 |
|
2019-05-14 19:00:51 |
Launchpad Janitor |
cve linked |
|
2018-3620 |
|
2019-05-14 19:00:51 |
Launchpad Janitor |
cve linked |
|
2018-3639 |
|
2019-05-14 19:00:51 |
Launchpad Janitor |
cve linked |
|
2018-3646 |
|
2019-05-14 19:00:51 |
Launchpad Janitor |
cve linked |
|
2019-3874 |
|
2019-05-14 19:00:51 |
Launchpad Janitor |
cve linked |
|
2019-3882 |
|
2019-05-14 19:00:51 |
Launchpad Janitor |
cve linked |
|
2019-9500 |
|
2019-05-14 19:00:51 |
Launchpad Janitor |
cve linked |
|
2019-9503 |
|
2022-09-05 10:29:44 |
Timo Aaltonen |
linux (Ubuntu): status |
In Progress |
Fix Released |
|
2022-09-05 10:29:47 |
Timo Aaltonen |
hwe-next: status |
New |
Fix Released |
|