kernel upgrade radeon "GPU lockup CP stall"

Bug #1056370 reported by Grant Bowman
This bug affects 3 people
Affects: linux (Ubuntu)
Status: Invalid
Importance: Medium
Assigned to: Unassigned
Milestone: (none listed)

Bug Description

Description: Ubuntu 12.04.1 LTS
Release: 12.04

I ran apt-get dist-upgrade and moved from linux-image-3.2.0-30-generic version 3.2.0-30.48 to linux-image-3.2.0-31-generic version 3.2.0-31.50. Upon reboot I had a black screen. Even virtual terminals didn't work. Rebooting back to -30 made the machine usable again.

After looking in /var/log/kern.log I googled for: radeon ubuntu 12.04 "GPU lockup CP stall"
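
A minimal Python sketch of that kern.log check. The marker string is taken from the log output quoted in this report; the helper names `find_gpu_lockups` and `scan_file` are illustrative assumptions, and reading /var/log/kern.log may need elevated privileges on some systems.

```python
from typing import Iterable, List

# Marker text as it appears in the radeon kernel messages quoted in this report.
MARKER = "GPU lockup CP stall"


def find_gpu_lockups(lines: Iterable[str], marker: str = MARKER) -> List[str]:
    """Return the log lines that contain the marker, without trailing newlines."""
    return [line.rstrip("\n") for line in lines if marker in line]


def scan_file(path: str = "/var/log/kern.log") -> List[str]:
    """Hypothetical convenience wrapper: scan a kernel log file on disk."""
    # errors="replace" keeps the scan going if the log holds undecodable bytes.
    with open(path, errors="replace") as log:
        return find_gpu_lockups(log)
```

Calling `scan_file()` on an affected boot should list the lockup lines; an empty list only means the marker was not found in that file.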

Several Ubuntu bugs came up, describing random, suspend/resume-related, and boot-time video problems: 986524, 936159, 980019

This upstream bug indicates that a kernel change exposed a bug in the driver/mesa/drmlib: https://bugs.freedesktop.org/show_bug.cgi?id=45366

Attached are some relevant apport-cli --save results.

ProblemType: Bug
DistroRelease: Ubuntu 12.04
Package: linux-image-3.2.0-30-generic 3.2.0-30.48
ProcVersionSignature: Ubuntu 3.2.0-30.48-generic 3.2.27
Uname: Linux 3.2.0-30-generic x86_64
AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.24.
ApportVersion: 2.0.1-0ubuntu13
Architecture: amd64
Date: Tue Sep 25 10:12:48 2012
InstallationMedia: Ubuntu 12.04 LTS "Precise Pangolin" - Alpha amd64 (20120315)
MachineType: LENOVO 4383
ProcEnviron:
 TERM=screen
 SHELL=/bin/bash
 PATH=(custom, user)
 LANG=en_US.UTF-8
ProcFB: 0 radeondrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-3.2.0-30-generic root=UUID=09e2c052-9217-452d-a45c-e92a7801dbdb ro quiet splash vt.handoff=7
RelatedPackageVersions:
 linux-restricted-modules-3.2.0-30-generic N/A
 linux-backports-modules-3.2.0-30-generic N/A
 linux-firmware 1.79.1
SourcePackage: linux
StagingDrivers: rts5139
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 01/12/2012
dmi.bios.vendor: LENOVO
dmi.bios.version: 41CN27WW(V2.03)
dmi.board.asset.tag: Base Board Asset Tag
dmi.board.name: Inagua
dmi.board.vendor: LENOVO
dmi.board.version: Base Board Version
dmi.chassis.asset.tag: Chassis Asset Tag
dmi.chassis.type: 10
dmi.chassis.vendor: Chassis Manufacturer
dmi.chassis.version: Chassis Version
dmi.modalias: dmi:bvnLENOVO:bvr41CN27WW(V2.03):bd01/12/2012:svnLENOVO:pn4383:pvrLenovoG575:rvnLENOVO:rnInagua:rvrBaseBoardVersion:cvnChassisManufacturer:ct10:cvrChassisVersion:
dmi.product.name: 4383
dmi.product.version: Lenovo G575
dmi.sys.vendor: LENOVO

Grant Bowman (grantbow) wrote :
summary: - radeon kernel regression "GPU lockup CP stall"
+ kernel upgrade radeon "GPU lockup CP stall"
Brad Figg (brad-figg)
Changed in linux (Ubuntu):
status: New → Confirmed
Joseph Salisbury (jsalisbury) wrote :

Would it be possible for you to test the latest upstream kernel? Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Please test the latest v3.6 kernel[0] (Not a kernel in the daily directory) and install both the linux-image and linux-image-extra .deb packages.

Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag. Please only remove that one tag and leave the other tags. This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text.

If this bug is fixed in the mainline kernel, please add the following tag 'kernel-fixed-upstream'.

If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'.

If you are unable to test the mainline kernel, for example it will not boot, please add the tag: 'kernel-unable-to-test-upstream'.
Once testing of the upstream kernel is complete, please mark this bug as "Confirmed".

Thanks in advance.

[0] http://kernel.ubuntu.com/~kernel-ppa/mainline/v3.6-rc7-quantal/

tags: added: kernel-da-key needs-upstream-testing regression-update
Changed in linux (Ubuntu):
importance: Undecided → Medium
Grant Bowman (grantbow) wrote :

I will try that when I have time.

I booted -31 and compared its kern.log output with the output from -30. I have attached the difference. It begins:

--- kern-30.log 2012-09-25 08:52:29.512919977 -0700
+++ kern-31.log 2012-09-25 08:51:14.992919712 -0700
@@ -1,5 +1,5 @@
 [drm] Initialized drm 1.1.0 20060810
-input: Video Bus as /devices/LNXSYSTM:00/device:00/PNP0A08:00/LNXVIDEO:00/input/input6
+input: Video Bus as /devices/LNXSYSTM:00/device:00/PNP0A08:00/LNXVIDEO:00/input/input5
 ACPI: Video Device [VGA] (multi-head: yes rom: no post: no)
 [drm] radeon defaulting to kernel modesetting.
 [drm] radeon kernel modesetting enabled.
@@ -64,7 +64,6 @@
 [drm] Internal thermal controller without fan control
 [drm] radeon: power management initialized
 psmouse serio1: synaptics: Touchpad model: 1, fw: 7.2, id: 0x1c0b1, caps: 0xd04733/0xa40000/0xa0000
-init: failsafe main process (792) killed by TERM signal
 [drm] fb mappable at 0xE0142000
 [drm] vram apper at 0xE0000000
 [drm] size 4325376
@@ -77,3 +76,354 @@
 [drm] Initialized radeon 2.12.0 20080528 for 0000:00:01.0 on minor 0
 snd_hda_intel 0000:00:14.2: PCI INT A -> GSI 16 (level, low) -> IRQ 16
 snd_hda_intel 0000:00:14.2: setting latency timer to 64
+radeon 0000:00:01.0: GPU lockup CP stall for more than 10000msec
+GPU lockup (waiting for 0x00000003 last fence id 0x00000001)
+radeon 0000:00:01.0: GPU softreset
+radeon 0000:00:01.0: GRBM_STATUS=0xF5704828
+radeon 0000:00:01.0: GRBM_STATUS_SE0=0xFE000001
+radeon 0000:00:01.0: GRBM_STATUS_SE1=0x00000007
+radeon 0000:00:01.0: SRBM_STATUS=0x20020940
+radeon 0000:00:01.0: Wait for MC idle timedout !
+radeon 0000:00:01.0: GRBM_SOFT_RESET=0x00007F6B
+radeon 0000:00:01.0: GRBM_STATUS=0x00003828
+radeon 0000:00:01.0: GRBM_STATUS_SE0=0x00000007
+radeon 0000:00:01.0: GRBM_STATUS_SE1=0x00000007
+radeon 0000:00:01.0: SRBM_STATUS=0x20020940
+radeon 0000:00:01.0: GPU reset succeed
+radeon 0000:00:01.0: Wait for MC idle timedout !
+radeon 0000:00:01.0: Wait for MC idle timedout !
+[drm] PCIE GART of 512M enabled (table at 0x0000000000040000).
+radeon 0000:00:01.0: WB enabled
+[drm:r600_ring_test] *ERROR* radeon: ring test failed (scratch(0x8504)=0xCAFEDEAD)
+[drm:evergreen_resume] *ERROR* evergreen startup failed on resume
+[drm:drm_mode_getfb] *ERROR* invalid framebuffer id
+[drm:radeon_ib_schedule] *ERROR* radeon: couldn't schedule IB(2).
+[drm:radeon_cs_ioctl] *ERROR* Failed to schedule IB !
+[drm:radeon_ib_schedule] *ERROR* radeon: couldn't schedule IB(3).
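
A comparison like the one above can be sketched in Python with the standard `difflib` module. This is not necessarily how the attached diff was produced. The prefix pattern assumes syslog-style kern.log lines of the form "Sep 25 08:51:14 host kernel: [   12.345678] message"; that format and the helper names `strip_prefix` and `diff_logs` are assumptions for illustration.

```python
import difflib
import re
from typing import List

# Assumed prefix: "<date> <host> kernel: [<seconds since boot>] ".
PREFIX = re.compile(r"^.*?kernel: \[\s*\d+\.\d+\]\s?")


def strip_prefix(line: str) -> str:
    """Drop the timestamp/host prefix so only the kernel message remains."""
    return PREFIX.sub("", line.rstrip("\n"))


def diff_logs(old: List[str], new: List[str]) -> List[str]:
    """Unified diff of two kernel logs, ignoring their timestamp prefixes."""
    a = [strip_prefix(line) for line in old]
    b = [strip_prefix(line) for line in new]
    return list(
        difflib.unified_diff(
            a, b, fromfile="kern-30.log", tofile="kern-31.log", lineterm=""
        )
    )
```

Stripping the prefixes first keeps the diff limited to messages that actually differ between the two boots, rather than flagging every line because its timestamp changed.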

penalvch (penalvch) wrote :

Pending mainline test results.

Changed in linux (Ubuntu):
status: Confirmed → Incomplete
Launchpad Janitor (janitor) wrote :

[Expired for linux (Ubuntu) because there has been no activity for 60 days.]

Changed in linux (Ubuntu):
status: Incomplete → Expired
Grant Bowman (grantbow) wrote :

I am unclear how to perform the requested test, and it seems some docs need updating.

I read the instructions provided but am not clear exactly how to test with a mainline kernel. I also haven't had time to do this until now, so the versions you might want tested have probably changed. I read https://help.launchpad.net/Packaging/PPA/InstallingSoftware but it wasn't clear whether just running "sudo apt-add-repository ppa:kernel-ppa/ppa" was enough. Adding the PPA gave this warning:

You are about to add the following PPA to your system:
 Ubuntu Kernel Team Daily Build PPA - this PPA typically contains experimental packages. The quality of these packages is such that you had better know what you're doing. Don't come crying to the kernel team if it kills all of your kittens.
 More info: https://launchpad.net/~kernel-ppa/+archive/ppa

The information at that URL talks about Lucid, not Precise.

Running apt-get update now gives these errors, which leads me to believe more is required somewhere.

W: Failed to fetch http://ppa.launchpad.net/kernel-ppa/ppa/ubuntu/dists/precise/main/source/Sources 404 Not Found
W: Failed to fetch http://ppa.launchpad.net/kernel-ppa/ppa/ubuntu/dists/precise/main/binary-amd64/Packages 404 Not Found
W: Failed to fetch http://ppa.launchpad.net/kernel-ppa/ppa/ubuntu/dists/precise/main/binary-i386/Packages 404 Not Found
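
All three warnings are 404s under dists/precise of the same PPA, which may mean the PPA publishes no index for the precise series (an inference, not something confirmed in this report). A small Python sketch that groups such warnings by archive and series to make that pattern visible; the regular expression and the helper name `summarize_fetch_failures` are illustrative assumptions based only on the warning format quoted above.

```python
import re
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# Matches lines like:
# "W: Failed to fetch <base>/dists/<series>/<index path> 404 Not Found"
FETCH_FAIL = re.compile(
    r"^W: Failed to fetch (?P<base>\S+?)/dists/(?P<series>[^/\s]+)/"
    r"(?P<rest>\S+)\s+(?P<reason>.+)$"
)


def summarize_fetch_failures(
    lines: Iterable[str],
) -> Dict[Tuple[str, str], List[Tuple[str, str]]]:
    """Group failed index fetches by (archive base URL, series)."""
    groups: Dict[Tuple[str, str], List[Tuple[str, str]]] = defaultdict(list)
    for line in lines:
        match = FETCH_FAIL.match(line.strip())
        if match:
            key = (match["base"], match["series"])
            groups[key].append((match["rest"], match["reason"]))
    return dict(groups)
```

If every failure lands under a single (archive, series) key, the series itself is the likely problem rather than one missing package index.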

Changed in linux (Ubuntu):
status: Expired → Incomplete
Grant Bowman (grantbow) wrote :

Thankfully booting the 3.2.0-39-generic kernel works fine.

penalvch (penalvch) wrote :

Grant Bowman, this bug report is being closed due to your last comment regarding this being fixed with an update. For future reference you can manage the status of your own bugs by clicking on the current status in the yellow line and then choosing a new status in the revealed drop down box. You can learn more about bug statuses at https://wiki.ubuntu.com/Bugs/Status. Thank you again for taking the time to report this bug and helping to make Ubuntu better. Please submit any future bugs you may find.

Changed in linux (Ubuntu):
status: Incomplete → Invalid