Bug #986524 “1002:6739 [Asus P6T] radeon 0000:02:00.0: GPU locku...” : Bugs : linux package : Ubuntu

Yvon TANGUY (vono22) wrote on 2012-04-21:

#1

lsb_release (64 bytes, text/plain)
AcpiTables.txt (311.2 KiB, text/plain; charset="utf-8")
AlsaDevices.txt (1.1 KiB, text/plain; charset="utf-8")
AplayDevices.txt (32.9 KiB, text/plain; charset="utf-8")
ArecordDevices.txt (381 bytes, text/plain; charset="utf-8")
BootDmesg.txt (65.0 KiB, text/plain; charset="utf-8")
Card0.Amixer.values.txt (4.2 KiB, text/plain; charset="utf-8")
Card0.Codecs.codec.0.txt (14.3 KiB, text/plain; charset="utf-8")
Card1.Amixer.values.txt (3.1 KiB, text/plain; charset="utf-8")
Card2.Codecs.codec.0.txt (1.1 KiB, text/plain; charset="utf-8")
Dependencies.txt (2.0 KiB, text/plain; charset="utf-8")
Lspci.txt (26.4 KiB, text/plain; charset="utf-8")
Lsusb.txt (578 bytes, text/plain; charset="utf-8")
PciMultimedia.txt (1.9 KiB, text/plain; charset="utf-8")
ProcCpuinfo.txt (6.5 KiB, text/plain; charset="utf-8")
ProcInterrupts.txt (3.5 KiB, text/plain; charset="utf-8")
ProcModules.txt (2.6 KiB, text/plain; charset="utf-8")
PulseList.txt (31.3 KiB, text/plain; charset="utf-8")
UdevDb.txt (169.7 KiB, text/plain; charset="utf-8")
UdevLog.txt (334.2 KiB, text/plain; charset="utf-8")
WifiSyslog.txt (106.5 KiB, text/plain; charset="utf-8")

Yvon TANGUY (vono22) wrote on 2012-04-21:

#3

$ cat /proc/version_signature
Ubuntu 3.2.0-23.36-generic 3.2.14

Yvon TANGUY (vono22) wrote on 2012-04-21:

#4

dmesg_screen_freeze.txt (68.7 KiB, text/plain)

Example dmesg output from a crash session

Fabio Marconi (fabiomarconi) on 2012-04-23

affects:

ubuntu → linux (Ubuntu)

Brad Figg (brad-figg) on 2012-04-23

Changed in linux:
status:	New → Confirmed

Joseph Salisbury (jsalisbury) wrote on 2012-04-23:

#5

Would it be possible for you to test the latest upstream kernel? Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Please test the latest v3.4 kernel [1] (not a kernel in the daily directory). Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag (only that one tag; please leave the other tags). This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text.

If this bug is fixed in the mainline kernel, please add the following tag 'kernel-fixed-upstream'.

If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'.

If you are unable to test the mainline kernel, for example it will not boot, please add the tag: 'kernel-unable-to-test-upstream'.
Once testing of the upstream kernel is complete, please mark this bug as "Confirmed".

Thanks in advance.

[1] http://kernel.ubuntu.com/~kernel-ppa/mainline/v3.4-rc4-precise/

Changed in linux (Ubuntu):
importance:	Undecided → Medium
tags:	added: needs-upstream-testing
Changed in linux (Ubuntu):
status:	New → Incomplete

Yvon TANGUY (vono22) wrote on 2012-04-23:

#6

Thank you for your reply.
I have just installed the new kernel and rebooted; now I just have to wait and see whether the bug occurs again.

Yvon TANGUY (vono22) wrote on 2012-04-25:

#7

I had no crash on Monday or yesterday, after installing the kernel "v3.4-rc4-precise". That does not mean it will never happen again, but I am hopeful that this kernel fixes the problem.

I will update this bug report if I see it again.

Thank you.

Changed in linux (Ubuntu):
status:	Incomplete → Confirmed

Yvon TANGUY (vono22) on 2012-04-25

tags:	removed: needs-upstream-testing
tags:	added: kernel-fixed-upstream

Nikolaus Waxweiler (madleser) wrote on 2012-05-01:

#8

I'm hit by the same bug on my 5870. An additional call trace that I could get from syslog is attached just for reference; it repeats a few times. I will try the newer kernel and report back.

Yvon TANGUY (vono22) wrote on 2012-05-01:

#9

This morning I had my first crash with the 3.4-rc4 kernel.

It is exactly the same crash, but instead of happening about once or twice a day, it now happens less than once a week.
That is far better, but the problem is not completely solved.

I have no call trace. What can I do to get one too?
Thanks.

Nikolaus Waxweiler (madleser) wrote on 2012-05-01:

#10

@Yvon: I got it from /var/log/syslog. Does the hanging also happen for you after suspend/resume, or sometimes directly after boot? Because that's what happened to me...
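
For anyone else looking for the relevant messages, here is a minimal Python sketch of pulling GPU-related lines out of syslog text. The filter pattern (lines mentioning "radeon" or a "[drm" tag) is an assumption based on the messages quoted in this report, and `gpu_lines` is a hypothetical helper name, not part of any existing tool.

```python
import re

# Assumption: messages of interest mention the radeon driver or carry a
# "[drm]" / "[drm:...]" tag, as in the dmesg excerpts in this report.
GPU_PATTERN = re.compile(r"radeon|\[drm[:\]]")


def gpu_lines(lines):
    """Return the log lines that mention the radeon driver or drm."""
    return [line.rstrip("\n") for line in lines if GPU_PATTERN.search(line)]
```

It could be applied to an open file, for example `gpu_lines(open("/var/log/syslog", errors="replace"))`; reading that file may require elevated privileges on some systems.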

Yvon TANGUY (vono22) wrote on 2012-05-01:

#11

No, it happens at any time. I do not suspend/resume this computer; it is a desktop.
It can happen 1 minute, 3 hours, or more after the system starts. The timing is completely random for me. Some days it does not crash at all, but sometimes it crashes 5 or 6 times a day...

Joseph Salisbury (jsalisbury) wrote on 2012-05-01:

#12

This issue appears to be an upstream bug, since you tested the latest upstream kernel. Would it be possible for you to open an upstream bug report at bugzilla.kernel.org [1]? That will allow the upstream developers to examine the issue, and may provide a quicker resolution to the bug.

If you are comfortable with opening a bug upstream, it would be great if you could report the upstream bug number back in this bug report. That will allow us to link this bug to the upstream report.

[1] https://wiki.ubuntu.com/Bugs/Upstream/kernel

Changed in linux (Ubuntu):
status:	Confirmed → Triaged

Bart Verwilst (verwilst) wrote on 2012-05-01:

#13

I also have a Radeon HD6850, and I can't even get to a working desktop. It shows the desktop wallpaper and the mouse pointer, while the same message appears in syslog. I tried the 3.4.0-... kernel provided in comment #5, but that gives no improvement.

Nikolaus Waxweiler (madleser) wrote on 2012-05-02:

#14

Seems to be known: https://bugzilla.kernel.org/show_bug.cgi?id=42678

Joseph Salisbury (jsalisbury) on 2012-05-02

no longer affects:

linux

Bug Watch Updater (bug-watch-updater) on 2012-05-02

Changed in linux:
importance:	Unknown → Medium
status:	Unknown → Confirmed

Yvon TANGUY (vono22) wrote on 2012-05-03:

#15

Yes, it seems to be that one.

BenginM (sary) wrote on 2012-05-27:

#16

Salutation.

https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-ati/+bug/936159

https://bugs.launchpad.net/ubuntu/+source/xorg/+bug/1001066

Nikolaus Waxweiler (madleser) wrote on 2012-06-07:

#17

Still happens after the latest mesa update :(

brimlar (brimlar) wrote on 2012-06-08:

#18

I just want to add that I started using Unity2D on 12.04 as a result of these crashes with this GPU lockup CP stall issue, and have had zero issues or crashes since doing so (if that helps anyone work around this in the meantime.)

On a strange note, I have not noticed this issue on a fresh Debian Testing KDE install on the same computer...I would think it would be using the same open Radeon driver (similar-ish kernel), but perhaps there is some difference I am unaware of. Sorry if this is meaningless / anecdotal.

Bart Verwilst (verwilst) wrote on 2012-07-05:

#19

I was told (in that same bug report) that https://bugzilla.kernel.org/show_bug.cgi?id=42678 is about the inability to recover from a lockup. In the meantime, a couple of months later, I still can't work for 10 minutes on the normal Unity desktop without having to hard-reset my PC because of lockups. Unity 2D works fine, but I want to get back to 3D. So how do we proceed? Is there anything I can test? I really want to get back to the 11.10 state (without actually reverting to 11.10).

Bart Verwilst (verwilst) wrote on 2012-07-05:

#20

I have upgraded my stack with the packages from https://launchpad.net/~oibaf/+archive/graphics-drivers, and for the past 30 minutes I have been free of lockups on Unity 3D. Still a little early to tell, but looking good! If this holds, maybe we can learn from this and try to distill a small fix?

Nikolaus Waxweiler (madleser) wrote on 2012-07-06:

#21

Thanks for the tip, I'm also running those newer drivers now :) Let's see.

Nikolaus Waxweiler (madleser) wrote on 2012-07-06:

#22

Side effect: after suspending and resuming, I always got display stutter when scrolling, moving windows, etc., at least in 11.10 and iirc also 12.04 -- with the new drivers, everything is approximately or exactly as smooth as before suspending. Nice.

Nikolaus Waxweiler (madleser) wrote on 2012-07-12:

#23

Several days later and I have still not encountered any hangs or crashes.

Nikolaus Waxweiler (madleser) wrote on 2012-08-17:

#24

Over a month later and still not a single hang with oibaf's repo :)

Sledge Sulaweyo (sledge-sulaweyo) wrote on 2012-08-18:

#25

I switched to that driver as well, and so far it works fine for me too.

Laurent Bonnaud (laurent-bonnaud) wrote on 2012-10-04:

#26

I also suffer from this bug in quantal, including with today's latest driver version:

ii xserver-xorg-video-radeon 1:6.99.99~git20120913.8637f772-0ubuntu1 i386 X.Org X server -- AMD/ATI Radeon display driver

Strangely I never saw this bug in precise.

The symptoms are not always the same:

- sometimes the GPU hangs and the kernel and X server recover correctly
- sometimes the GPU hangs again and again, the X server becomes unusable and I have to reboot (which is a data loss bug)
- once the X server even crashed (see bug #1061750)

Laurent Bonnaud (laurent-bonnaud) wrote on 2012-10-04:

#27

Let me add the following comment from bug #980019:

In fact there are 2 different bugs:

1. the fact that the GPU hangs in the first place
2. the fact that the kernel is not able to recover

The first bug is probably in the xorg radeon driver and the second one in the radeon DRM driver in the kernel.

Laurent Bonnaud (laurent-bonnaud) wrote on 2012-10-04:

#28

Here is a dmesg output in a case where the GPU hung twice in a row and the GPU resume failed the second time:

[72317.916119] radeon 0000:01:00.0: GPU lockup CP stall for more than 10000msec
[72317.916132] radeon 0000:01:00.0: GPU lockup (waiting for 0x00000000001d62ad last fence id 0x00000000001d62a9)
[72317.917267] radeon 0000:01:00.0: GPU softreset 
[72317.917275] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
[72317.917282] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
[72317.917289] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200000C0
[72317.917303] radeon 0000:01:00.0:   R_008020_GRBM_SOFT_RESET=0x00007FEE
[72317.932197] radeon 0000:01:00.0: R_008020_GRBM_SOFT_RESET=0x00000001
[72317.948082] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
[72317.948090] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
[72317.948097] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200080C0
[72317.949094] radeon 0000:01:00.0: GPU reset succeed
[72317.953205] [drm] PCIE GART of 512M enabled (table at 0x0000000000040000).
[72317.953238] radeon 0000:01:00.0: WB enabled
[72317.953240] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000010000c00 and cpu addr 0xffc8ec00
[72317.984407] [drm] ring test on 0 succeeded in 1 usecs
[72317.984644] [drm] ib test on ring 0 succeeded in 0 usecs
[72319.220350] type=1701 audit(1349193168.886:83): auid=4294967295 uid=1000 gid=1000 ses=4294967295 pid=26825 comm="chrome" reason="seccomp" sig=0 syscall=20 compat=0 ip=0xb3510424 code=0x50000
[72329.872338] radeon 0000:01:00.0: GPU lockup CP stall for more than 10000msec
[72329.872350] radeon 0000:01:00.0: GPU lockup (waiting for 0x00000000001d63b5 last fence id 0x00000000001d63b3)
[72329.873488] radeon 0000:01:00.0: GPU softreset 
[72329.873496] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xE57C2030
[72329.873503] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00113303
[72329.873509] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200230C0
[72330.036637] radeon 0000:01:00.0: Wait for MC idle timedout !
[72330.036648] radeon 0000:01:00.0:   R_008020_GRBM_SOFT_RESET=0x00007FEE
[72330.051531] radeon 0000:01:00.0: R_008020_GRBM_SOFT_RESET=0x00000001
[72330.067415] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
[72330.067421] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
[72330.067428] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x2002B0C0
[72330.068435] radeon 0000:01:00.0: GPU reset succeed
[72330.232982] radeon 0000:01:00.0: Wait for MC idle timedout !
[72330.396837] radeon 0000:01:00.0: Wait for MC idle timedout !
[72330.399208] [drm] PCIE GART of 512M enabled (table at 0x0000000000040000).
[72330.399300] radeon 0000:01:00.0: WB enabled
[72330.399304] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000010000c00 and cpu addr 0xffc8ec00
[72330.430520] [drm] ring test on 0 succeeded in 1 usecs
[72340.928074] radeon 0000:01:00.0: GPU lockup CP stall for more than 10000msec
[72340.928087] radeon 0000:01:00.0: GPU lockup (waiting for 0x00000000001d63b7 last fence id 0x00000000001d63b6)
[72340.928095] [drm:r600_ib_test] *ERROR* radeon: fence wait failed (-35).
[72340.928101] [drm:radeon_ib_ring_tests] *ERROR* radeon: failed testing IB on GFX ring (-35).
[72340.928106] [drm:r600_resume] *ERROR* r600 startup failed on resume
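
A log like the one above can be summarized programmatically. The following is a rough Python sketch, not an official tool: it starts a new record at every "GPU lockup (waiting for ...)" line, computes how far the last signalled fence was behind the awaited one, and notes whether a "GPU reset succeed" line or any "*ERROR*" lines followed. The function name `lockup_episodes` and the record layout are assumptions made for illustration.

```python
import re

# Matches e.g.
# [72317.916132] radeon 0000:01:00.0: GPU lockup (waiting for 0x... last fence id 0x...)
LOCKUP = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\] radeon \S+ GPU lockup "
    r"\(waiting for (?P<want>0x[0-9a-fA-F]+) last fence id (?P<last>0x[0-9a-fA-F]+)\)"
)


def lockup_episodes(dmesg_text):
    """Group dmesg lines into one record per reported GPU lockup."""
    episodes = []
    for line in dmesg_text.splitlines():
        match = LOCKUP.search(line)
        if match:
            episodes.append({
                "time": float(match.group("ts")),
                # Difference between the awaited fence and the last signalled one.
                "fences_behind": int(match.group("want"), 16) - int(match.group("last"), 16),
                "reset_reported": False,
                "errors": [],
            })
        elif episodes and "GPU reset succeed" in line:
            episodes[-1]["reset_reported"] = True
        elif episodes and "*ERROR*" in line:
            # Keep only the text after the *ERROR* marker.
            episodes[-1]["errors"].append(line.split("*ERROR*", 1)[1].strip())
    return episodes
```

A record with `reset_reported` set to False and a non-empty `errors` list corresponds to the case where the kernel could not bring the GPU back.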

Laurent Bonnaud (laurent-bonnaud) wrote on 2012-10-04:

#29

Here is another instance of the problem where the GPU was in a hang/reset loop and was reset every 10 seconds until I rebooted the computer:

[47622.397324] radeon 0000:01:00.0: GPU lockup CP stall for more than 10000msec
[47622.397337] radeon 0000:01:00.0: GPU lockup (waiting for 0x00000000003c03a3 last fence id 0x00000000003c037a)
[47622.398472] radeon 0000:01:00.0: GPU softreset 
[47622.398481] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
[47622.398488] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
[47622.398495] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200210C0
[47622.549233] radeon 0000:01:00.0: Wait for MC idle timedout !
[47622.549238] radeon 0000:01:00.0:   R_008020_GRBM_SOFT_RESET=0x00007FEE
[47622.564244] radeon 0000:01:00.0: R_008020_GRBM_SOFT_RESET=0x00000001
[47622.580250] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
[47622.580252] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
[47622.580255] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200290C0
[47622.581258] radeon 0000:01:00.0: GPU reset succeed
[47622.730801] radeon 0000:01:00.0: Wait for MC idle timedout !
[47622.879870] radeon 0000:01:00.0: Wait for MC idle timedout !
[47622.881837] [drm] PCIE GART of 512M enabled (table at 0x0000000000040000).
[47622.881861] radeon 0000:01:00.0: WB enabled
[47622.881864] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000010000c00 and cpu addr 0xffc8fc00
[47622.913269] [drm] ring test on 0 succeeded in 1 usecs
[47622.913316] [drm] ib test on ring 0 succeeded in 0 usecs
[47633.870095] radeon 0000:01:00.0: GPU lockup CP stall for more than 10000msec
[47633.870109] radeon 0000:01:00.0: GPU lockup (waiting for 0x00000000003c03a7 last fence id 0x00000000003c03a4)
[47633.871254] radeon 0000:01:00.0: GPU softreset 
[47633.871263] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
[47633.871270] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
[47633.871277] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200210C0
[47634.022603] radeon 0000:01:00.0: Wait for MC idle timedout !
[47634.022607] radeon 0000:01:00.0:   R_008020_GRBM_SOFT_RESET=0x00007FEE
[47634.037612] radeon 0000:01:00.0: R_008020_GRBM_SOFT_RESET=0x00000001
[47634.053618] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
[47634.053621] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
[47634.053623] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200290C0
[47634.054626] radeon 0000:01:00.0: GPU reset succeed
[47634.205099] radeon 0000:01:00.0: Wait for MC idle timedout !
[47634.353397] radeon 0000:01:00.0: Wait for MC idle timedout !
[47634.355358] [drm] PCIE GART of 512M enabled (table at 0x0000000000040000).
[47634.355382] radeon 0000:01:00.0: WB enabled
[47634.355385] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000010000c00 and cpu addr 0xffc8fc00
[47634.387191] [drm] ring test on 0 succeeded in 1 usecs
[47634.387251] [drm] ib test on ring 0 succeeded in 0 usecs
[47645.348335] radeon 0000:01:00.0: GPU lockup CP stall for more than 10000msec
[47645.348348] radeon 0000:01:00.0: GPU lockup (waiting for 0x00000000003c03ac last fence id 0x00000000003c03a8)
[47645.349492] radeon 0000:01:00.0: GPU softreset 
[47645.349500] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
[47645.349507] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
[47645.349514] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200210C0
[47645.497740] radeon 0000:01:00.0: Wait for MC idle timedout !
[47645.497745] radeon 0000:01:00.0:   R_008020_GRBM_SOFT_RESET=0x00007FEE
[47645.512750] radeon 0000:01:00.0: R_008020_GRBM_SOFT_RESET=0x00000001
[47645.528763] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
[47645.528768] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
[47645.528771] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200290C0
[47645.529775] radeon 0000:01:00.0: GPU reset succeed
[47645.678408] radeon 0000:01:00.0: Wait for MC idle timedout !
[47645.826061] radeon 0000:01:00.0: Wait for MC idle timedout !
[47645.828059] [drm] PCIE GART of 512M enabled (table at 0x0000000000040000).
[47645.828085] radeon 0000:01:00.0: WB enabled
[47645.828088] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000010000c00 and cpu addr 0xffc8fc00
[47645.859455] [drm] ring test on 0 succeeded in 1 usecs
[47645.859502] [drm] ib test on ring 0 succeeded in 0 usecs
[47656.819075] radeon 0000:01:00.0: GPU lockup CP stall for more than 10000msec
[47656.819088] radeon 0000:01:00.0: GPU lockup (waiting for 0x00000000003c03b9 last fence id 0x00000000003c03ad)
[47656.820222] radeon 0000:01:00.0: GPU softreset 
[47656.820230] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
[47656.820237] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
[47656.820244] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200210C0
[47656.971673] radeon 0000:01:00.0: Wait for MC idle timedout !
[47656.971679] radeon 0000:01:00.0:   R_008020_GRBM_SOFT_RESET=0x00007FEE
[47656.986684] radeon 0000:01:00.0: R_008020_GRBM_SOFT_RESET=0x00000001
[47657.002690] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
[47657.002693] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
[47657.002695] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200290C0
[47657.003698] radeon 0000:01:00.0: GPU reset succeed
[47657.152921] radeon 0000:01:00.0: Wait for MC idle timedout !
[47657.300085] radeon 0000:01:00.0: Wait for MC idle timedout !
[47657.302059] [drm] PCIE GART of 512M enabled (table at 0x0000000000040000).
[47657.302084] radeon 0000:01:00.0: WB enabled
[47657.302087] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000010000c00 and cpu addr 0xffc8fc00
[47657.333455] [drm] ring test on 0 succeeded in 1 usecs
[47657.333507] [drm] ib test on ring 0 succeeded in 0 usecs
[47668.291349] radeon 0000:01:00.0: GPU lockup CP stall for more than 10000msec
[47668.291362] radeon 0000:01:00.0: GPU lockup (waiting for 0x00000000003c03be last fence id 0x00000000003c03ba)
[47668.292495] radeon 0000:01:00.0: GPU softreset 
[47668.292503] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
[47668.292510] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
[47668.292517] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200210C0
[47668.442939] radeon 0000:01:00.0: Wait for MC idle timedout !
[47668.442945] radeon 0000:01:00.0:   R_008020_GRBM_SOFT_RESET=0x00007FEE
[47668.457950] radeon 0000:01:00.0: R_008020_GRBM_SOFT_RESET=0x00000001
[47668.473955] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
[47668.473958] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
[47668.473961] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200290C0
[47668.474964] radeon 0000:01:00.0: GPU reset succeed
[47668.623656] radeon 0000:01:00.0: Wait for MC idle timedout !
[47668.771301] radeon 0000:01:00.0: Wait for MC idle timedout !
[47668.773323] [drm] PCIE GART of 512M enabled (table at 0x0000000000040000).
[47668.773348] radeon 0000:01:00.0: WB enabled
[47668.773351] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000010000c00 and cpu addr 0xffc8fc00
[47668.804742] [drm] ring test on 0 succeeded in 1 usecs
[47668.804794] [drm] ib test on ring 0 succeeded in 0 usecs
[47679.798125] radeon 0000:01:00.0: GPU lockup CP stall for more than 10000msec
[47679.803081] radeon 0000:01:00.0: GPU lockup (waiting for 0x00000000003c03c1 last fence id 0x00000000003c03bf)
[47679.804220] radeon 0000:01:00.0: GPU softreset 
[47679.804229] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
[47679.804236] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
[47679.804243] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200210C0
[47679.954923] radeon 0000:01:00.0: Wait for MC idle timedout !
[47679.954929] radeon 0000:01:00.0:   R_008020_GRBM_SOFT_RESET=0x00007FEE
[47679.969933] radeon 0000:01:00.0: R_008020_GRBM_SOFT_RESET=0x00000001
[47679.985939] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
[47679.985942] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
[47679.985944] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200290C0
[47679.986947] radeon 0000:01:00.0: GPU reset succeed
[47680.136039] radeon 0000:01:00.0: Wait for MC idle timedout !
[47680.283378] radeon 0000:01:00.0: Wait for MC idle timedout !
[47680.285378] [drm] PCIE GART of 512M enabled (table at 0x0000000000040000).
[47680.285409] radeon 0000:01:00.0: WB enabled
[47680.285412] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000010000c00 and cpu addr 0xffc8fc00
[47680.316816] [drm] ring test on 0 succeeded in 1 usecs
[47680.316868] [drm] ib test on ring 0 succeeded in 0 usecs
[47681.260091] radeon 0000:01:00.0: GPU lockup CP stall for more than 11462msec
[47681.265108] radeon 0000:01:00.0: GPU lockup (waiting for 0x00000000003c03c4 last fence id 0x00000000003c03c2)
[47681.266242] radeon 0000:01:00.0: GPU softreset 
[47681.266251] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
[47681.266258] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
[47681.266265] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200210C0
[47681.415697] radeon 0000:01:00.0: Wait for MC idle timedout !
[47681.415702] radeon 0000:01:00.0:   R_008020_GRBM_SOFT_RESET=0x00007FEE
[47681.430710] radeon 0000:01:00.0: R_008020_GRBM_SOFT_RESET=0x00000001
[47681.446724] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xA0003030
[47681.446728] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0x00000003
[47681.446731] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0x200290C0
[47681.447734] radeon 0000:01:00.0: GPU reset succeed
[47681.597282] radeon 0000:01:00.0: Wait for MC idle timedout !
[47681.746353] radeon 0000:01:00.0: Wait for MC idle timedout !
[47681.748308] [drm] PCIE GART of 512M enabled (table at 0x0000000000040000).
[47681.748333] radeon 0000:01:00.0: WB enabled
[47681.748336] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000010000c00 and cpu addr 0xffc8fc00
[47681.779711] [drm] ring test on 0 succeeded in 1 usecs
[47681.779758] [drm] ib test on ring 0 succeeded in 0 usecs
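
To check how regular such a hang/reset loop is, the spacing between successive "CP stall" messages can be computed from the dmesg timestamps. This is only a sketch under the assumption that every stall is reported with the "GPU lockup CP stall" wording shown above; `stall_intervals` is a hypothetical helper name.

```python
import re

# Captures the kernel timestamp of each "GPU lockup CP stall" message.
STALL = re.compile(r"\[\s*(\d+\.\d+)\] radeon \S+ GPU lockup CP stall")


def stall_intervals(dmesg_text):
    """Return the seconds elapsed between consecutive CP stall reports."""
    times = [float(m.group(1)) for m in STALL.finditer(dmesg_text)]
    return [round(later - earlier, 3) for earlier, later in zip(times, times[1:])]
```

A run of near-identical intervals suggests the GPU is being reset and then hanging again almost immediately, rather than failing at unrelated moments.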

Laurent Bonnaud (laurent-bonnaud) wrote on 2012-10-04:

#30

From bug #1061750 here is a reproducible way to trigger this bug:

1. install the google-chrome-unstable package from the Google apt repository. Currently it is this version:

ii google-chrome-unstable 24.0.1284.2-r159713 i386 The web browser from Google

The google-chrome-stable package is perhaps also worth a try (currently at version 22.0.1229.79-r158531)

2. run Google Chrome with:

google-chrome --ignore-gpu-blacklist

3. visit this page:

http://ie.microsoft.com/testdrive/Views/SiteMap/Default.html

4. try the "Fireworks" demo for a few seconds (other demos might also produce "interesting" results)

Can other people running quantal also reproduce this bug?

Nikolaus Waxweiler (madleser) wrote on 2012-11-10:

#31

I'm on 12.10 x86_64 now with stock mesa and haven't encountered the bug yet.

Erik S (ofenfisch) wrote on 2012-11-12:

#32

I didn't have this issue with Ubuntu 11.10 or 12.04. Three weeks ago I deleted all partitions and set up a brand-new Xubuntu 12.10. These lockups occur randomly. Some of them blank the screen just once (after 10 seconds during which the mouse does not respond); others keep blanking the screen every 10 seconds. Switching to a tty works, but you can't get back to a running system...
They only occur when I try to open or run graphics-heavy applications (YouTube videos, OpenGL games, ...).

Yesterday it took me 3 hard resets and then I installed the PPA (comment #20) with:

sudo add-apt-repository ppa:oibaf/graphics-drivers

No more problems yesterday and today (so far)... Thanks for this, I'll keep testing ;)

Denis Pauk (pauk-denis) wrote on 2012-11-15:

#33

I get the same bug (message) when booting my laptop: the screen blinks after a couple of seconds (~2 sec) and the login screen never appears. After a hard power-off followed by a power-on, I don't see any problem.
---------------
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.103668] radeon 0000:00:01.0: GPU lockup CP stall for more than 10000msec
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.103686] radeon 0000:00:01.0: GPU lockup (waiting for 0x0000000000000003 last fence id 0x0000000000000001)
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.104758] radeon 0000:00:01.0: GPU softreset 
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.104762] radeon 0000:00:01.0:   GRBM_STATUS=0xA0003828
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.104766] radeon 0000:00:01.0:   GRBM_STATUS_SE0=0x00000007
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.104770] radeon 0000:00:01.0:   GRBM_STATUS_SE1=0x00000007
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.104773] radeon 0000:00:01.0:   SRBM_STATUS=0x20020940
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.263492] radeon 0000:00:01.0: Wait for MC idle timedout !
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.263496] radeon 0000:00:01.0:   GRBM_SOFT_RESET=0x00007F6B
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.263601] radeon 0000:00:01.0:   GRBM_STATUS=0x00003828
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.263604] radeon 0000:00:01.0:   GRBM_STATUS_SE0=0x00000007
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.263607] radeon 0000:00:01.0:   GRBM_STATUS_SE1=0x00000007
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.263611] radeon 0000:00:01.0:   SRBM_STATUS=0x20020940
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.264606] radeon 0000:00:01.0: GPU reset succeed
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.420521] radeon 0000:00:01.0: Wait for MC idle timedout !
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.581382] radeon 0000:00:01.0: Wait for MC idle timedout !
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.583891] [drm] PCIE GART of 512M enabled (table at 0x0000000000040000).
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.583998] radeon 0000:00:01.0: WB enabled
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.584004] radeon 0000:00:01.0: fence driver on ring 0 use gpu addr 0x0000000018000c00 and cpu addr 0xffff8800618dcc00
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.755198] [drm:r600_ring_test] *ERROR* radeon: ring 0 test failed (scratch(0x8500)=0xCAFEDEAD)
Nov 10 13:09:16 denis-Lenovo-IdeaPad-S206 kernel: [   28.755204] [drm:evergreen_resume] *ERROR* evergreen startup failed on resume
---------------

I have attached the full log for this day.
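
For a full day's log like the one attached, a quick tally of lockups, successful resets and failed ring tests can show whether the GPU recovered or, as in the excerpt above, failed on resume. The following is only a sketch: the function name is hypothetical, and the patterns are taken from the messages quoted in this comment.

```shell
# Hypothetical helper: read kernel log text on stdin and count
# three kinds of radeon messages seen in the excerpt above.
radeon_reset_summary() {
    awk '
        /GPU lockup CP stall/      { lockups++ }   # lockup detected
        /GPU reset succeed/        { resets++ }    # soft reset reported success
        /ring [0-9]+ test failed/  { ringfail++ }  # ring test failed after reset
        END {
            printf "lockups=%d resets_ok=%d ring_test_failures=%d\n",
                   lockups, resets, ringfail
        }
    '
}
```

It could be fed from a saved log, for example `radeon_reset_summary < /var/log/syslog`, or from `dmesg | radeon_reset_summary`.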

Erik S (ofenfisch) wrote on 2012-11-17:

#34

No good news for me...

Today my system crashed 2 times while running the radeon-driver and "supertuxkart" (both from oibaf's repository)...

But I found (at least for me) a way to reproduce a freeze:
1. Start system.
2. Open a terminal and type "supertuxkart". Install it first if you haven't yet...
3. Enter and click through the "new race" menu...
4. Start playing and wait for freeze
5. Black screen every 10 seconds
6. (Try to get a running tty and save some log-files)

Have you experienced just one lockup, or have you had a series of these lockups?
-> Mostly a series of lockups until I reboot my laptop. Sometimes even the kernel freezes...
-> With the stock radeon-driver 5-10 times a week. With oibaf's packages only with some applications (without them 0 times a week ;) )

When did you first notice it?
-> After deleting my old Ubuntu and reinstalling Xubuntu

Under what conditions does it seem most likely to reproduce?
-> Start some OpenGL-heavy application/games (like supertuxkart) and play a bit around.
-> Even 10 seconds after boot up, this can occur. So: No relation to Suspend-to-Any.

I will try to get some backtraces... But could somebody please tell me where I should start? :)

Erik S (ofenfisch) wrote on 2012-11-17:

#35

I'm going to add the output of "dmesg", "/var/log/Xorg.0.log" and the regDump of radeon as described here: https://wiki.ubuntu.com/X/Troubleshooting/Freeze#Reporting_GPU_lockup_Bugs

Sylvain Rabot (sylr) wrote on 2013-02-05:

#38

I have the same problem on a freshly installed 12.10 amd64 when I play (for a few seconds) CS:1.6 on Steam Linux beta.
I tried comment #20 but it did not help.

rabosy@isis:~$ lspci -vvnn | grep VGA
	Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx-
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx+
	BridgeCtl: Parity- SERR+ NoISA- VGA+ MAbort- >Reset- FastB2B-
	Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
	Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx+
	Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
	Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
	Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
	Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx-
	Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx+
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx+
	BridgeCtl: Parity- SERR+ NoISA- VGA- MAbort- >Reset- FastB2B-
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx+
	BridgeCtl: Parity- SERR+ NoISA- VGA- MAbort- >Reset- FastB2B-
	Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
	Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
	Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
	Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx-
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx-
	BridgeCtl: Parity- SERR+ NoISA- VGA- MAbort- >Reset- FastB2B-
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx-
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
	Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx-
	Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
01:00.0 VGA compatible controller [0300]: Advanced Micro Devices [AMD] nee ATI RV620 LE [Radeon HD 3450] [1002:95c5] (prog-if 00 [VGA controller])
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
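
The output above is noisy because `grep VGA` also matches the `VGASnoop` flag on every `Control:` line. A narrower filter, sketched below, keeps only the device header line. The function name is hypothetical; the match string is the class name and code exactly as `lspci -nn` prints them above.

```shell
# Hypothetical helper: keep only VGA device header lines from
# lspci -nn style text on stdin. Matching the full class string
# avoids the "VGASnoop" flag on every Control: line.
vga_devices() {
    grep -F 'VGA compatible controller [0300]'
}
```

Typical use would be `lspci -nn | vga_devices`.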

Erik S (ofenfisch) wrote on 2013-02-05:

#39

This problem is still reproducible (I'm still using oibaf's PPA).

But I got a new clue: lockups only occur when I start graphics-heavy stuff (games, OpenGL, ... but not Flash on YouTube or other media playback) with the power profile set to "low" on my card (HD 3650). Doing the same things on "high" causes no problems. I can even avert some lockups by quickly switching from the "low" to the "high" profile when my mouse freezes and the screen starts blanking...

That's my workaround for now ;)
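
On kernels with the legacy radeon power management, the profile switch described above is done through two sysfs files. A minimal sketch, assuming the card is card0 and that `power_method` and `power_profile` exist under its device directory; the function name and its optional second argument (the device directory) are made up for illustration:

```shell
# Sketch: select a radeon power profile through sysfs.
# Assumption: legacy (non-DPM) radeon power management on card0.
set_radeon_profile() {
    profile=$1
    dev=${2:-/sys/class/drm/card0/device}   # hypothetical override for other cards
    case $profile in
        default|auto|low|mid|high) ;;
        *) echo "unknown profile: $profile" >&2; return 1 ;;
    esac
    [ -w "$dev/power_profile" ] || {
        echo "cannot write $dev/power_profile" >&2
        return 1
    }
    # Switch to profile-based management, then pick the profile.
    echo profile > "$dev/power_method" &&
    echo "$profile" > "$dev/power_profile"
}
```

Writing to sysfs needs root, so this would be called from a root shell, for example `set_radeon_profile high`. The setting does not survive a reboot.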

Tom Yan (tom-ty89) wrote on 2013-02-15:

#40

Thanks for the workaround, it works for me too.

But my case is a bit different. I am using Arch Linux and an HD5850. With mesa 9.0.2 and xf86-video-ati 7.1.0, once the profile is not "default", the issue is gone. With the git versions of those from an unofficial repo, once the profile has been changed to non-default once (even if I change it back to default again), the issue is gone too. (Though there is some "stuttering" instead.)

Craig (craig-st) wrote on 2013-03-15:

#41

I think I've encountered the same bug:

***UNAME -A***:
Linux hostname 3.2.0-38-generic #61-Ubuntu SMP Tue Feb 19 12:18:21 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

***VIDEO DRIVER***:
fglrx-updates 2:9.000-0ubuntu0.3

***X WINDOWS***:
X.Org X Server 1.11.3
Release Date: 2011-12-16
X Protocol Version 11, Revision 0
Build Operating System: Linux 2.6.42-37-generic x86_64 Ubuntu
Current Operating System: Linux brain 3.2.0-38-generic #61-Ubuntu SMP Tue Feb 19 12:18:21 UTC 2013 x86_64
Kernel command line: BOOT_IMAGE=/vmlinuz root=/dev/mapper/volgroup0-root ro
Build Date: 27 February 2013  02:07:42AM
xorg-server 2:1.11.4-0ubuntu10.12 (For technical support please see http://www.ubuntu.com/support) 
Current version of pixman: 0.24.4

***SYSLOG***:
Mar 14 22:43:47 brain kernel: [162059.501652] Clocksource tsc unstable (delta = -417633604 ns)
Mar 14 22:43:47 brain kernel: [162059.501800] Switching to clocksource hpet
Mar 14 22:44:10 brain kernel: [162082.999465] sched: RT throttling activated
Mar 14 22:44:25 brain kernel: [162096.021897] radeon 0000:01:00.0: GPU lockup CP stall for more than 14700msec
Mar 14 22:44:25 brain kernel: [162096.021901] GPU lockup (waiting for 0x0042896F last fence id 0x0042896E)
Mar 14 22:44:32 brain kernel: [162103.166372] radeon 0000:01:00.0: GPU softreset 
Mar 14 22:44:32 brain kernel: [162103.376060] radeon 0000:01:00.0:   R_008010_GRBM_STATUS=0xFFFFFFFF
Mar 14 22:44:35 brain kernel: [162103.585767] radeon 0000:01:00.0:   R_008014_GRBM_STATUS2=0xFFFFFFFF
Mar 14 22:44:35 brain kernel: [162103.795453] radeon 0000:01:00.0:   R_000E50_SRBM_STATUS=0xFFFFFFFF
Mar 14 22:44:56 brain kernel: [162127.910957] hrtimer: interrupt took 629189350 ns
Mar 14 22:44:56 brain kernel: [162128.330282] BUG: soft lockup - CPU#2 stuck for 23s! [Xorg:1685]
Mar 14 22:44:56 brain kernel: [162128.330285] Modules linked in: nls_utf8 udf crc_itu_t nls_iso8859_1 nls_cp437 vfat fat pci_stub vboxpci(O) vboxnetadp(O) vboxnetflt(O) vboxdrv(O) bnep rfcomm parport_pc ppdev bluetooth usblp snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep radeon pl2303 cdc_acm snd_pcm usbserial joydev snd_seq_midi ttm drm_kms_helper drm sp5100_tco snd_rawmidi snd_seq_midi_event snd_seq i2c_piix4 snd_timer i2c_algo_bit snd_seq_device nfsd nfs lockd snd fscache amd64_edac_mod edac_core soundcore snd_page_alloc edac_mce_amd binfmt_misc auth_rpcgss nfs_acl shpchp psmouse fam15h_power k10temp eeepc_wmi asus_wmi mac_hid serio_raw sparse_keymap sunrpc it87 hwmon_vid lp parport osst st dm_crypt usbhid hid usb_storage mxm_wmi aic79xx e1000e wmi
Mar 14 22:44:56 brain kernel: [162128.330338] CPU 2 
Mar 14 22:45:06 brain kernel: [162128.330340] Modules linked in: nls_utf8 udf crc_itu_t nls_iso8859_1 nls_cp437 vfat fat pci_stub vboxpci(O) vboxnetadp(O) vboxnetflt(O) vboxdrv(O) bnep rfcomm parport_pc ppdev bluetooth usblp snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep radeon pl2303 cdc_acm snd_pcm usbserial joydev snd_seq_midi ttm drm_kms_helper drm sp5100_tco snd_rawmidi snd_seq_midi_event snd_seq i2c_piix4 snd_timer i2c_algo_bit snd_seq_device nfsd nfs lockd snd fscache amd64_edac_mod edac_core soundcore snd_page_alloc edac_mce_amd binfmt_misc auth_rpcgss nfs_acl shpchp psmouse fam15h_power k10temp eeepc_wmi asus_wmi mac_hid serio_raw sparse_keymap sunrpc it87 hwmon_vid lp parport osst st dm_crypt usbhid hid usb_storage mxm_wmi aic79xx e1000e wmi
Mar 14 22:45:06 brain kernel: [162128.330380] 
Mar 14 22:45:06 brain kernel: [162128.330382] Pid: 1685, comm: Xorg Tainted: G           O 3.2.0-38-generic #61-Ubuntu To be filled by O.E.M. To be filled by O.E.M./Crosshair V Formula
Mar 14 22:45:06 brain kernel: [162128.330387] RIP: 0010:[<ffffffffa03f4159>]  [<ffffffffa03f4159>] r100_mm_rreg+0x39/0x40 [radeon]
Mar 14 22:45:06 brain kernel: [162128.330408] RSP: 0018:ffff88021ef97b48  EFLAGS: 00000286
Mar 14 22:45:09 brain rtkit-daemon[2133]: The canary thread is apparently starving. Taking action.
Mar 14 22:45:12 brain kernel: [162128.330409] RAX: 00000000ffffffff RBX: 000000007a0d2b72 RCX: 000000003c97b281
Mar 14 22:45:12 brain kernel: [162128.330412] RDX: 0000000000000e2b RSI: 0000000000000e50 RDI: ffff88021b826000
Mar 14 22:45:12 brain kernel: [162128.330414] RBP: ffff88021ef97b48 R08: 000000000000000a R09: 0000000000000000
Mar 14 22:45:12 brain kernel: [162128.330415] R10: 0000000000000000 R11: 00000000000186a0 R12: 000000000000000a
Mar 14 22:45:14 brain kernel: [162128.330417] R13: 0000000000000000 R14: 0000000000000000 R15: 00000000000186a0
Mar 14 22:45:14 brain kernel: [162128.330419] FS:  00007f0ec8364880(0000) GS:ffff88022ec80000(0000) knlGS:000000000124fb40
Mar 14 22:45:14 brain kernel: [162128.330422] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 14 22:45:14 brain kernel: [162128.330424] CR2: 00007ff969830000 CR3: 000000021bca2000 CR4: 00000000000406e0
Mar 14 22:45:15 brain kernel: [162128.330426] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Mar 14 22:45:16 brain kernel: [162128.330428] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Mar 14 22:45:16 brain kernel: [162128.330430] Process Xorg (pid: 1685, threadinfo ffff88021ef96000, task ffff88021ff6c500)
Mar 14 22:45:16 brain kernel: [162128.330431] Stack:
Mar 14 22:45:16 brain kernel: [162128.330433]  ffff88021ef97b68 ffffffffa0406a7c ffff88021b826000 ffff88021ff6c500
Mar 14 22:45:16 brain kernel: [162128.330438]  ffff88021ef97b98 ffffffffa0407065 ffff88021b826000 ffffffffffffffff
Mar 14 22:45:16 brain kernel: [162128.330442]  ffff88021b826000 ffff88021ff6c500 ffff88021ef97ba8 ffffffffa040730e
Mar 14 22:45:16 brain kernel: [162128.330445] Call Trace:
Mar 14 22:45:18 brain kernel: [162128.330458]  [<ffffffffa0406a7c>] r600_mc_wait_for_idle+0x4c/0x70 [radeon]
Mar 14 22:45:18 brain kernel: [162128.330471]  [<ffffffffa0407065>] r600_gpu_soft_reset+0xb5/0x260 [radeon]
Mar 14 22:45:18 brain kernel: [162128.330484]  [<ffffffffa040730e>] r600_asic_reset+0xe/0x10 [radeon]
Mar 14 22:45:22 brain rtkit-daemon[2133]: Demoting known real-time threads.
Mar 14 22:45:23 brain rtkit-daemon[2133]: Successfully demoted thread 2973 of process 2968 (n/a).
Mar 14 22:45:24 brain kernel: [162128.330494]  [<ffffffffa03c65c6>] radeon_gpu_reset+0x86/0x170 [radeon]
Mar 14 22:45:24 brain kernel: [162128.330505]  [<ffffffffa03d9cc0>] radeon_fence_wait+0x400/0x430 [radeon]
Mar 14 22:45:24 brain kernel: [162128.330511]  [<ffffffff811648cd>] ? kmem_cache_alloc_trace+0x11d/0x140
Mar 14 22:45:24 brain kernel: [162128.330515]  [<ffffffff8108bdc0>] ? add_wait_queue+0x60/0x60
Mar 14 22:45:24 brain kernel: [162128.330527]  [<ffffffffa03ecff0>] radeon_ib_get+0x110/0x1a0 [radeon]
Mar 14 22:45:24 brain kernel: [162128.330539]  [<ffffffffa03ee8e7>] radeon_cs_ioctl+0xb7/0x250 [radeon]
Mar 14 22:45:25 brain rtkit-daemon[2133]: Successfully demoted thread 2972 of process 2968 (n/a).
Mar 14 22:45:25 brain rtkit-daemon[2133]: Successfully demoted thread 2971 of process 2968 (n/a).

Craig (craig-st) wrote on 2013-03-15:

#42

***GRAPHICS CARD***:
VisionTek ATI Radeon HD 4670 1 GB DDR3 PCI Express Graphics Card 900251

renegat (rozbujnik) wrote on 2013-03-24:

#43

Happens to me also when I launch trine2 game on linux steam. After a few seconds I get the GPU lockup error. Ubuntu 12.10;
However if I use xfce I am able to experience the bug seconds later than with unity.

***uname -a***
Linux rumcajs-Satellite-A300 3.5.0-26-generic #42-Ubuntu SMP Fri Mar 8 23:20:06 UTC 2013 i686 i686 i686 GNU/Linux

***lspci -nnk***
01:00.0 VGA compatible controller [0300]: Advanced Micro Devices [AMD] nee ATI RV620 [Mobility Radeon HD 3400 Series] [1002:95c4]
Subsystem: Toshiba America Info Systems Device [1179:ff1c]
Kernel driver in use: radeon
Kernel modules: radeon

Roland (Rolandixor) Taylor (rolandixor) wrote on 2013-03-30:

#44

Still happens on Ubuntu 13.04 with kernel 3.8 and on any Desktop Environment. It happens when using Blender, any 3D game or SDL game, or even at random if using a 3D desktop environment.

With KDE, the system will freeze even if left idle for a while.

nuew (nuew) wrote on 2013-03-31:

#45

Having this issue too, on Fedora though, so it's certainly upstream from you. The issue is worst when playing Minecraft (often not recoverable); it also happens with TF2 on Steam for Linux, though in that case it's recoverable. Not seeing anything odd in Xorg.0.log, but I've got this from my dmesg:
[ 3067.859336] radeon 0000:01:00.0: GPU lockup CP stall for more than 10000msec
[ 3067.859344] radeon 0000:01:00.0: GPU lockup (waiting for 0x0000000000099aed last fence id 0x0000000000099ae6)
[ 3067.860419] radeon 0000:01:00.0: Saved 343 dwords of commands on ring 0.
[ 3067.860426] radeon 0000:01:00.0: GPU softreset: 0x00000003
[ 3067.864215] radeon 0000:01:00.0:   GRBM_STATUS               = 0xE5500828
[ 3067.864220] radeon 0000:01:00.0:   GRBM_STATUS_SE0           = 0xEC000001
[ 3067.864224] radeon 0000:01:00.0:   GRBM_STATUS_SE1           = 0x00000007
[ 3067.864227] radeon 0000:01:00.0:   SRBM_STATUS               = 0x200000C0
[ 3067.864230] radeon 0000:01:00.0:   R_008674_CP_STALLED_STAT1 = 0x00000000
[ 3067.864233] radeon 0000:01:00.0:   R_008678_CP_STALLED_STAT2 = 0x00010800
[ 3067.864237] radeon 0000:01:00.0:   R_00867C_CP_BUSY_STAT     = 0x00028006
[ 3067.864240] radeon 0000:01:00.0:   R_008680_CP_STAT          = 0x80038647
[ 3067.864243] radeon 0000:01:00.0:   GRBM_SOFT_RESET=0x00007F6B
[ 3067.864297] radeon 0000:01:00.0:   GRBM_STATUS               = 0x00003828
[ 3067.864320] radeon 0000:01:00.0:   GRBM_STATUS_SE0           = 0x00000007
[ 3067.864326] radeon 0000:01:00.0:   GRBM_STATUS_SE1           = 0x00000007
[ 3067.864343] radeon 0000:01:00.0:   SRBM_STATUS               = 0x200000C0
[ 3067.864361] radeon 0000:01:00.0:   R_008674_CP_STALLED_STAT1 = 0x00000000
[ 3067.864376] radeon 0000:01:00.0:   R_008678_CP_STALLED_STAT2 = 0x00000000
[ 3067.864382] radeon 0000:01:00.0:   R_00867C_CP_BUSY_STAT     = 0x00000000
[ 3067.864387] radeon 0000:01:00.0:   R_008680_CP_STAT          = 0x00000000
[ 3067.881741] radeon 0000:01:00.0: GPU reset succeeded, trying to resume
[ 3067.899981] [drm] probing gen 2 caps for device 8086:151 = 3/e
[ 3067.899985] [drm] PCIE gen 2 link speeds already enabled
[ 3067.901926] [drm] PCIE GART of 512M enabled (table at 0x0000000000040000).
[ 3067.902030] radeon 0000:01:00.0: WB enabled
[ 3067.902032] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000040000c00 and cpu addr 0xffff88014235ac00
[ 3067.902033] radeon 0000:01:00.0: fence driver on ring 3 use gpu addr 0x0000000040000c0c and cpu addr 0xffff88014235ac0c
[ 3067.918240] [drm] ring test on 0 succeeded in 1 usecs
[ 3067.918305] [drm] ring test on 3 succeeded in 1 usecs
[ 3067.930434] [drm] ib test on ring 0 succeeded in 0 usecs
[ 3067.930477] [drm] ib test on ring 3 succeeded in 0 usecs
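
The "waiting for ... last fence id ..." line in logs like this one gives the fence the driver was waiting on and the last fence reported as completed. Their difference is a rough indication of how much work was queued when the GPU stalled. A small sketch for pulling that out of a log; the function name is hypothetical, and the pattern is based only on the message format shown above.

```shell
# Hypothetical helper: read kernel log text on stdin and, for each
# "GPU lockup (waiting for X last fence id Y)" line, print X, Y and X - Y.
fence_gap() {
    sed -n 's/.*waiting for \(0x[0-9A-Fa-f]*\) last fence id \(0x[0-9A-Fa-f]*\)).*/\1 \2/p' |
    while read -r want last; do
        # Shell arithmetic accepts the 0x-prefixed hex values directly.
        echo "waiting=$want last=$last gap=$(($want - $last))"
    done
}
```

For the log above, `dmesg | fence_gap` would report a gap of 7 (0x99aed minus 0x99ae6).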

nuew (nuew) wrote on 2013-03-31:

#46

And this (repeated about a thousand times) in my tee'd log of Minecraft:
radeon: The kernel rejected CS, see dmesg for more information.
EE r600_pipe.c:80 r600_create_fence - r600: too many concurrent fences

Craig (craig-st) wrote on 2013-04-10:

#47

I switched from the open source fglrx-updates 2:9.000-0ubuntu0.3 driver to the proprietary ATI/AMD FGLRX driver 2:8.960-0ubuntu1.1 and am still having problems, but can't swear that the problems are related to this bug, or are caused by a different bug. The problems I am now having after switching drivers look like the "[fglrx] ASIC hang happened" Bug #1059422.

Revision history for this message

Chascon (chascone) wrote on 2013-05-01:

#48

I get the mentioned output when X locks up (mouse movement excluded) on a Radeon X1600 video card, Gallium 0.4 on ATI RV530. I usually manage to switch to tty* and see the error posted repeatedly, sometimes with the addition of "[drm:radeon_cs_chunk] *ERROR* failed to schedule IB!".

Circumstances around error:

To elaborate on the conditions around the time of the error: I've noticed the problem when running shadows, transparency, transitions, fading, and menu transparency via Compton, and shadows, fading, and transparency via xcompmgr.

While I write this, X is functioning properly, but dmesg | grep radeon outputs:

radeon 0000:01:00.0: registered panic notifier ... [drm:radeon_cs_ib_chunk] *ERROR* Invalid command stream !

From there, killing the desktop session throws me back into a visually scrambled lightdm, and the error continues to show up in tty1*. Doing sudo service lightdm restart (from tty*) does not alleviate the problem.

I don't catch any other errors. /var/log/Xorg.0.log does not report any (after a reboot, mind you) other than complaining, "Failed to load module "fglrx" (module does not exist, 0)" twice, but that's normal, or so I've been told by an Ubuntu dev.

Thoughts:

Since I no longer use the Unity desktop and Compiz but xcompmgr and Compton, and I've gotten the error with both of the latter, I can assume it's not a compositor issue. I should say that I used to get Compiz (and Unity) crashing all the time two years ago or so, which is why I stopped using Unity, but I can't say I ever noticed this particular error back then. I've been happily using OB with xcompmgr for the past two years or so, until now.

I've always assumed the problems with Unity were video driver issues. So there's my two bits. Hopefully a savvy dev can make sense of the source of the problem and issue a patch both locally and upstream.

I'm running Raring by the way.

Revision history for this message

Oibaf (oibaf) wrote on 2013-09-03:

#49

This is a kernel bug, not ati ddx.

Changed in xf86-video-ati:
status:	New → Invalid

Revision history for this message

wdesmet (kromagg) wrote on 2013-09-26:

#50

Not sure if this is the same bug, but mine starts slightly differently, i.e.:
kernel: [79731.820027] radeon 0000:01:00.0: GPU lockup CP stall for more than 10000msec
kernel: [79731.820033] radeon 0000:01:00.0: GPU lockup (waiting for 0x000000000006efb7 last fence id 0x000000000006efb6)
kernel: [79731.820090] radeon 0000:01:00.0: couldn't schedule ib
kernel: [79731.820093] [drm:radeon_cs_ib_chunk] *ERROR* Failed to schedule IB !
kernel: [79731.820119] radeon 0000:01:00.0: couldn't schedule ib
kernel: [79731.820123] [drm:radeon_cs_ib_chunk] *ERROR* Failed to schedule IB !
kernel: [79731.821373] radeon 0000:01:00.0: couldn't schedule ib

And then just keeps going. After a while, it will start to output lines about failed allocations:
kernel: [79799.398689] [drm:radeon_gem_object_create] *ERROR* Failed to allocate GEM object (14327808, 6, 4096, -12)
kernel: [79799.398766] [drm:radeon_gem_object_create] *ERROR* Failed to allocate GEM object (14327808, 6, 4096, -12)
kernel: [79799.398849] [drm:radeon_gem_object_create] *ERROR* Failed to allocate GEM object (14327808, 6, 4096, -12)
kernel: [79799.398929] [drm:radeon_gem_object_create] *ERROR* Failed to allocate GEM object (14327808, 6, 4096, -12)
kernel: [79799.398971] [drm:radeon_gem_object_create] *ERROR* Failed to allocate GEM object (14327808, 6, 4096, -12)

The system quickly becomes unusable when this happens. The only solution is to reboot and wait till it happens again. It seems to always happen while scrolling or selecting text in google chrome (which I imagine can be taxing on some sites).
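
For anyone triaging similar logs: the last number in those GEM lines is a negative errno, and -12 corresponds to ENOMEM (out of memory). Below is a minimal Python sketch for pulling the fields out of such a line. The field names (size, domain, alignment) are an assumed reading of the message, not something confirmed in this thread, and `parse_gem_failure` is a hypothetical helper rather than part of any existing tool.

```python
import errno
import os
import re

# Matches e.g. "... Failed to allocate GEM object (14327808, 6, 4096, -12)"
GEM_FAIL_RE = re.compile(
    r"Failed to allocate GEM object \((\d+), (\d+), (\d+), (-?\d+)\)"
)


def parse_gem_failure(line):
    """Return a dict describing a GEM allocation failure line, or None."""
    m = GEM_FAIL_RE.search(line)
    if m is None:
        return None
    size, domain, alignment, ret = (int(g) for g in m.groups())
    code = -ret  # kernel code reports errors as negative errno values
    return {
        "size_bytes": size,      # assumed meaning of the first field
        "domain": domain,        # assumed meaning of the second field
        "alignment": alignment,  # assumed meaning of the third field
        "errno_name": errno.errorcode.get(code, "UNKNOWN"),
        "errno_text": os.strerror(code),
    }


if __name__ == "__main__":
    sample = ("kernel: [79799.398689] [drm:radeon_gem_object_create] *ERROR* "
              "Failed to allocate GEM object (14327808, 6, 4096, -12)")
    print(parse_gem_failure(sample))
```

Feeding it the output of dmesg line by line and counting the non-None results gives a rough idea of how fast the allocation failures pile up after the initial lockup.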

Revision history for this message

Oibaf (oibaf) wrote on 2013-09-26:

#51

Wdesmet, you should report your problem here: https://bugs.freedesktop.org/enter_bug.cgi?product=xorg (on drivers/radeon).

Revision history for this message

Laurent Bonnaud (laurent-bonnaud) wrote on 2013-10-04:

#52

This bug still occurs in saucy when /sys/class/drm/card0/device/power_profile contains "low". I have not seen any lockup with "high" or "mid".

I also tried dynpm in /sys/class/drm/card0/device/power_method and had an immediate lockup that forced me to reboot my computer.
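
To make it easier to report which setting was active when a lockup happened, here is a small Python sketch that reads the two sysfs files mentioned above. The directory is the one from this comment and may be a different card number on other machines; `read_power_settings` is a hypothetical helper name, and the files may simply not exist with other drivers or kernel versions, in which case the sketch reports None.

```python
from pathlib import Path

# Location taken from the comment above; the card index may differ per system.
DEFAULT_DEVICE_DIR = Path("/sys/class/drm/card0/device")


def read_power_settings(device_dir=DEFAULT_DEVICE_DIR):
    """Return the contents of power_method and power_profile (None if unreadable)."""
    settings = {}
    for name in ("power_method", "power_profile"):
        path = Path(device_dir) / name
        try:
            settings[name] = path.read_text().strip()
        except OSError:
            # File is missing or not readable on this system.
            settings[name] = None
    return settings


if __name__ == "__main__":
    print(read_power_settings())
```

The sketch only reads the files. Changing the profile means writing to them as root, and as noted above, switching to dynpm caused an immediate lockup here.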

Revision history for this message

penalvch (penalvch) wrote on 2013-11-17:

#53

Yvon TANGUY, as per http://www.asus.com/Motherboards/P6T/#support an update is available for your BIOS (1408). If you update to this following https://help.ubuntu.com/community/BiosUpdate , does it change anything?

If not, could you please both specify what happened, and provide the output of the following terminal command:
sudo dmidecode -s bios-version && sudo dmidecode -s bios-release-date

Please note your current BIOS is already in the Bug Description, so posting this on the old BIOS would not be helpful.

For more on BIOS updates and linux, please see https://help.ubuntu.com/community/ReportingBugs#Bug_reporting_etiquette .

Thank you for your understanding.

tags:	added: bios-outdated-1408
tags:	added: kernel-bug-exists-upstream needs-upstream-testing regression-potential removed: kernel-fixed-upstream
no longer affects:	linux (Ubuntu)
affects:	xf86-video-ati → linux (Ubuntu)
Changed in linux (Ubuntu):
status:	Invalid → Incomplete
importance:	Undecided → Medium
summary:
- radeon 0000:02:00.0: GPU lockup CP stall for more than 10000msec
+ 1002:6739 [Asus P6T] radeon 0000:02:00.0: GPU lockup CP stall for more
+ than 10000msec

Revision history for this message

penalvch (penalvch) wrote on 2013-11-17:

#54

https://bugzilla.kernel.org/show_bug.cgi?id=42678#c19

Changed in linux:
importance:	Medium → Undecided
status:	Confirmed → New

Revision history for this message

Yvon TANGUY (vono22) wrote on 2014-03-14:

#55

Hi,

Sorry I didn't reply ... or very late!

Since I had this bug, I switched to the fglrx driver.
I tested Ubuntu 13.10 for a few days with the mesa driver, and the bug did not recur.
I have updated my BIOS too, but there was no change. Anyway, I no longer have Ubuntu 12.04.0 installed; I now run the 3.11 kernel on my Ubuntu 12.04.
I will soon start testing Ubuntu 14.04 on my main computer (I currently only run it on my second one). But I think this problem is solved with the newer kernel + mesa + Xorg stack.

Thank you for the support.

penalvch (penalvch) on 2014-03-16

no longer affects:

linux (Ubuntu)

Revision history for this message

Launchpad Janitor (janitor) wrote on 2014-03-16:

#56

Status changed to 'Confirmed' because the bug affects multiple users.

Revision history for this message

penalvch (penalvch) wrote on 2014-03-16:

#57

Yvon TANGUY, this bug report is being closed due to your last comment https://bugs.launchpad.net/linux/+bug/986524/comments/55 regarding this being fixed with an update. For future reference you can manage the status of your own bugs by clicking on the current status in the yellow line and then choosing a new status in the revealed drop down box. You can learn more about bug statuses at https://wiki.ubuntu.com/Bugs/Status. Thank you again for taking the time to report this bug and helping to make Ubuntu better. Please submit any future bugs you may find.

Launchpad Janitor (janitor) on 2014-03-16

Changed in linux (Ubuntu):
status:	New → Confirmed

penalvch (penalvch) on 2014-03-16

affects:	linux → linux (Ubuntu)
Changed in linux (Ubuntu):
status:	New → Invalid

Revision history for this message

Alexey S (ostrokach) wrote on 2014-07-15:

#58

Could someone please explain how I get the update that led to the closing of this bug report? I am experiencing the same problem on an up-to-date Ubuntu 14.04 and Gallium 0.4 on AMD RV620 graphics.

Thanks

Revision history for this message

penalvch (penalvch) wrote on 2014-07-18:

#59

Alexey S, thank you for your comment. So your hardware and problem may be tracked, could you please file a new report with Ubuntu by executing the following in a terminal while booted into the default Ubuntu kernel (not a mainline one) via:
ubuntu-bug linux

For more on this, please read the official Ubuntu documentation:
Ubuntu Bug Control and Ubuntu Bug Squad: https://wiki.ubuntu.com/Bugs/BestPractices#X.2BAC8-Reporting.Focus_on_One_Issue
Ubuntu Kernel Team: https://wiki.ubuntu.com/KernelTeam/KernelTeamBugPolicies#Filing_Kernel_Bug_reports
Ubuntu Community: https://help.ubuntu.com/community/ReportingBugs#Bug_reporting_etiquette

When opening up the new report, please feel free to subscribe me to it.

Thank you for your understanding.

Helpful bug reporting tips:
https://wiki.ubuntu.com/ReportingBugs

Revision history for this message

Tomi Mikkonen (2-tomi) wrote on 2014-12-19:

#60

I have the same issue with Ubuntu 14.04 and Lubuntu 14.04
uname -r
3.13.0-32-generic

Radeon driver crashes roughly once every hour :

[84392.248120] radeon 0000:02:00.0: GPU lockup CP stall for more than 10000msec
[84392.248126] radeon 0000:02:00.0: GPU lockup (waiting for 0x0000000000000004 last fence id 0x0000000000000002 on ring 5)
[84392.248129] [drm:uvd_v1_0_ib_test] *ERROR* radeon: fence wait failed (-35).
[84392.248133] [drm:radeon_ib_ring_tests] *ERROR* radeon: failed testing IB on ring 5 (-35).
[84392.414032] [drm:rv770_dpm_set_power_state] *ERROR* rv770_set_sw_state failed

lspci |grep Radeon
02:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] RV710 [Radeon HD 4350/4550]
02:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] RV710/730 HDMI Audio [Radeon HD 4000 series]
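
When comparing lockup reports like the ones in this thread, it can help to extract the two fence ids and the ring number from the "GPU lockup (waiting for ...)" line. The following Python sketch handles both forms seen above, with and without the "on ring N" suffix. `parse_lockup` is a hypothetical helper, and the "gap" value is just the arithmetic difference between the two ids, with no claim about how the driver interprets it.

```python
import re

# Matches both forms seen in this thread:
#   GPU lockup (waiting for 0x... last fence id 0x...)
#   GPU lockup (waiting for 0x... last fence id 0x... on ring 5)
LOCKUP_RE = re.compile(
    r"GPU lockup \(waiting for (0x[0-9a-fA-F]+) last fence id (0x[0-9a-fA-F]+)"
    r"(?: on ring (\d+))?\)"
)


def parse_lockup(line):
    """Return fence ids and ring number from a lockup line, or None."""
    m = LOCKUP_RE.search(line)
    if m is None:
        return None
    waiting = int(m.group(1), 16)
    last = int(m.group(2), 16)
    ring = int(m.group(3)) if m.group(3) is not None else None
    return {
        "waiting_for": waiting,
        "last_fence": last,
        "gap": waiting - last,  # plain difference between the two ids
        "ring": ring,           # None when the line does not name a ring
    }


if __name__ == "__main__":
    sample = ("[84392.248126] radeon 0000:02:00.0: GPU lockup (waiting for "
              "0x0000000000000004 last fence id 0x0000000000000002 on ring 5)")
    print(parse_lockup(sample))
```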

Revision history for this message

penalvch (penalvch) wrote on 2014-12-19:

#61

Tomi Mikkonen, thank you for your comment. Unfortunately, as this bug report is closed, it is not scoped to you, your hardware, or your problem. So your problem and hardware may be tracked, could you please file a new report with Ubuntu by executing the following in a terminal while booted into the default Ubuntu kernel (not a mainline one) via:
ubuntu-bug linux

For more on this, please read the official Ubuntu documentation:
Ubuntu Bug Control and Ubuntu Bug Squad: https://wiki.ubuntu.com/Bugs/BestPractices#X.2BAC8-Reporting.Focus_on_One_Issue
Ubuntu Kernel Team: https://wiki.ubuntu.com/KernelTeam/KernelTeamBugPolicies#Filing_Kernel_Bug_reports
https://wiki.ubuntu.com/Kernel/Policies/DuplicateBugs
Ubuntu Community: https://help.ubuntu.com/community/ReportingBugs#Bug_reporting_etiquette

When opening up the new report, please feel free to subscribe me to it.

As well, please do not announce in this report you created a new bug report.

Thank you for your understanding.

Helpful bug reporting tips:
https://wiki.ubuntu.com/ReportingBugs
