Bug #946899 “8086:2a02 [drm:i915_hangcheck_elapsed] *ERROR* Hang...” : Bugs : Linux Mint

Revision history for this message

Matt Zimmerman (mdz) wrote on 2012-03-05:

#1

AcpiTables.txt Edit (329.8 KiB, text/plain; charset="utf-8")
AlsaDevices.txt Edit (657 bytes, text/plain; charset="utf-8")
AplayDevices.txt Edit (273 bytes, text/plain; charset="utf-8")
ArecordDevices.txt Edit (299 bytes, text/plain; charset="utf-8")
BootDmesg.txt Edit (71.3 KiB, text/plain; charset="utf-8")
CRDA.txt Edit (402 bytes, text/plain; charset="utf-8")
Card0.Amixer.values.txt Edit (3.8 KiB, text/plain; charset="utf-8")
Card0.Codecs.codec.0.txt Edit (11.9 KiB, text/plain; charset="utf-8")
CurrentDmesg.txt Edit (716 bytes, text/plain; charset="utf-8")
Dependencies.txt Edit (1.9 KiB, text/plain; charset="utf-8")
IwConfig.txt Edit (357 bytes, text/plain; charset="utf-8")
Lspci.txt Edit (17.5 KiB, text/plain; charset="utf-8")
Lsusb.txt Edit (656 bytes, text/plain; charset="utf-8")
PciMultimedia.txt Edit (598 bytes, text/plain; charset="utf-8")
ProcCpuinfo.txt Edit (1.5 KiB, text/plain; charset="utf-8")
ProcInterrupts.txt Edit (2.0 KiB, text/plain; charset="utf-8")
ProcModules.txt Edit (4.8 KiB, text/plain; charset="utf-8")
PulseSinks.txt Edit (2.1 KiB, text/plain; charset="utf-8")
PulseSources.txt Edit (5.5 KiB, text/plain; charset="utf-8")
RfKill.txt Edit (201 bytes, text/plain; charset="utf-8")
UdevDb.txt Edit (122.4 KiB, text/plain; charset="utf-8")
UdevLog.txt Edit (285.5 KiB, text/plain; charset="utf-8")
WifiSyslog.txt Edit (604.7 KiB, text/plain; charset="utf-8")

Revision history for this message

Brad Figg (brad-figg) wrote on 2012-03-05: Test with newer development kernel (3.2.0-18.28)

#2

Thank you for taking the time to file a bug report on this issue.

However, given the number of bugs that the Kernel Team receives during any development cycle it is impossible for us to review them all. Therefore, we occasionally resort to using automated bots to request further testing. This is such a request.

We have noted that there is a newer version of the development kernel than the one you last tested when this issue was found. Please test again with the newer kernel and indicate in the bug if this issue still exists or not.

You can update to the latest development kernel by simply running the following commands in a terminal window:

sudo apt-get update
sudo apt-get upgrade

If the bug still exists, change the bug status from Incomplete to Confirmed. If the bug no longer exists, change the bug status from Incomplete to Fix Released.

If you want this bot to quit automatically requesting kernel tests, add a tag named: bot-stop-nagging.

Thank you for your help, we really do appreciate it.

Changed in linux (Ubuntu):
status:	New → Confirmed
status:	Confirmed → Incomplete
tags:	added: kernel-request-3.2.0-18.28

Joseph Salisbury (jsalisbury) on 2012-03-05

Changed in linux (Ubuntu):
importance:	Undecided → Medium

Revision history for this message

Matt Zimmerman (mdz) wrote on 2012-03-06: Re: [Bug 946899] Re: [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung

#3

Hi Brad,

It's still happening, multiple times per day, with 3.2.0-18.28.

On Mon, Mar 05, 2012 at 07:31:01AM -0000, Brad Figg wrote:
> Thank you for taking the time to file a bug report on this issue.
>
> However, given the number of bugs that the Kernel Team receives during
> any development cycle it is impossible for us to review them all.
> Therefore, we occasionally resort to using automated bots to request
> further testing. This is such a request.
>
> We have noted that there is a newer version of the development kernel
> than the one you last tested when this issue was found. Please test
> again with the newer kernel and indicate in the bug if this issue still
> exists or not.
>
> You can update to the latest development kernel by simply running the
> following commands in a terminal window:
>
> sudo apt-get update
> sudo apt-get upgrade
>
> If the bug still exists, change the bug status from Incomplete to
> Confirmed. If the bug no longer exists, change the bug status from
> Incomplete to Fix Released.
>
> If you want this bot to quit automatically requesting kernel tests, add
> a tag named: bot-stop-nagging.
>
> Thank you for your help, we really do appreciate it.
>
>
> ** Changed in: linux (Ubuntu)
> Status: New => Confirmed
>
> ** Changed in: linux (Ubuntu)
> Status: Confirmed => Incomplete
>
> ** Tags added: kernel-request-3.2.0-18.28
>
> --
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/946899
>
> Title:
> [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU
> hung
>
> To manage notifications about this bug go to:
> https://bugs.launchpad.net/ubuntu/+source/linux/+bug/946899/+subscriptions

--
- mdz

tags:	added: bot-stop-nagging
Changed in linux (Ubuntu):
status:	Incomplete → Confirmed

Revision history for this message

Joseph Salisbury (jsalisbury) wrote on 2012-03-06: Re: [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung

#4

Would it be possible for you to test the latest upstream kernel? Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Please test the latest v3.3 kernel[1] (Not a kernel in the daily directory). Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag(Only that one tag, please leave the other tags). This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text.

If this bug is fixed by the mainline kernel, please add the following tag 'kernel-fixed-upstream-KERNEL-VERSION'. For example, if kernel version 3.3-rc6 fixed the issue, the tag would be: 'kernel-fixed-upstream-v3.3-rc6'.

If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'.

If you are unable to test the mainline kernel, for example it will not boot, please add the tag: 'kernel-unable-to-test-upstream'.
Once testing of the upstream kernel is complete, please mark this bug as "Confirmed".

Thanks in advance.

[1] http://kernel.ubuntu.com/~kernel-ppa/mainline/v3.3-rc6-precise/

tags:

added: needs-upstream-testing

Revision history for this message

Phoenix (phoenix-dominion) wrote on 2012-03-17:

#5

Sorry for being desperate to get rid of the problem. I installed the 3.3 Kernel, but it didn't work, X still freezed. Then I rememberd I had to add the kernel nolapic to being able to boot at all. New kernel, new luck. I removed that option from Grub and so far it looks very good.

Revision history for this message

Umbrella Dish (floritiv) wrote on 2012-04-28:

#6

Experiencing this problem on Ubuntu 12.04 (linux-3.2.0-24-generic) without nolapic boot option though. When the error happens, a prior mouse hang of several seconds is a sure sign, the tft gets dark and restores, gets dark and restores, then graphics acceleration appears disabled, font displayed with faults and I have to log out and log in. The problem seems to be gone in the new kde session with desktop effects on ... until I boot anew.

My dmesg succeeding the error is:

[ 63.965111] [<ffffffff8106712f>] warn_slowpath_common+0x7f/0xc0
[ 63.965112] [<ffffffff8106718a>] warn_slowpath_null+0x1a/0x20
[ 63.965116] [<ffffffffa00779e4>] __gen6_gt_wait_for_fifo+0x94/0xa0 [i915]
[ 63.965121] [<ffffffffa0078065>] i915_write32+0xe5/0xf0 [i915]
[ 63.965126] [<ffffffffa00b4752>] gen6_ring_put_irq+0xa2/0xc0 [i915]
[ 63.965131] [<ffffffffa00b47c8>] gen6_render_ring_put_irq+0x18/0x20 [i915]
[ 63.965136] [<ffffffffa0089ba7>] i915_wait_request+0x1b7/0x560 [i915]
[ 63.965138] [<ffffffff8108aec0>] ? add_wait_queue+0x60/0x60
[ 63.965143] [<ffffffffa0089f82>] i915_gem_object_wait_rendering+0x32/0x40 [i
915]
[ 63.965148] [<ffffffffa008ed3d>] i915_gem_execbuffer_sync_rings+0xdd/0x160 [
i915]
[ 63.965153] [<ffffffffa008ef2e>] i915_gem_execbuffer_move_to_gpu+0x16e/0x200
[i915]
[ 63.965157] [<ffffffffa008f65b>] i915_gem_do_execbuffer.isra.8+0x69b/0x940 [
i915]
[ 63.965163] [<ffffffffa009f349>] ? intel_mark_busy+0xd9/0x110 [i915]
[ 63.965168] [<ffffffffa008fdc3>] i915_gem_execbuffer2+0xa3/0x270 [i915]
[ 63.965172] [<ffffffffa00165d4>] drm_ioctl+0x444/0x510 [drm]
[ 63.965177] [<ffffffffa008fd20>] ? i915_gem_execbuffer+0x420/0x420 [i915]
[ 63.965179] [<ffffffff8101dbd4>] ? restore_user_xstate+0x54/0xa0
[ 63.965181] [<ffffffff81189cfa>] do_vfs_ioctl+0x8a/0x340
[ 63.965183] [<ffffffff8118a041>] sys_ioctl+0x91/0xa0
[ 63.965185] [<ffffffff81664a82>] system_call_fastpath+0x16/0x1b
[ 63.965186] ---[ end trace 20690ee302d12d8d ]---
[ 68.296329] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 68.297855] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
[ 68.311854] [drm:i915_wait_request] *ERROR* i915_wait_request returns -11 (awaiting 11846 at 11845, next 11847)
[ 68.315568] [drm:init_ring_common] *ERROR* render ring initialization failed ctl 00000000 head 00000000 tail 00000000 start 00000000
[ 68.319138] [drm:init_ring_common] *ERROR* gen6 bsd ring initialization failed ctl 00000000 head 00000000 tail 00000000 start 00000000
[ 68.322179] [drm:init_ring_common] *ERROR* blt ring initialization failed ctl 00000000 head 00000000 tail 00000000 start 00000000
[ 70.159916] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 70.161991] [drm:i915_wait_request] *ERROR* i915_wait_request returns -11 (awaiting 11847 at 11844, next 11848)
[ 70.162096] [drm:i915_reset] *ERROR* GPU hanging too fast, declaring wedged!
[ 70.162100] [drm:i915_reset] *ERROR* Failed to reset chip.

Experiencing this problem on Ubuntu 12.04 (linux-3.2.0-24-generic) without nolapic boot option though. When the error happens, a prior mouse hang of several seconds is a sure sign, the tft gets dark and restores, gets dark and restores, then graphics acceleration appears disabled, font displayed with faults and I have to log out and log in. The problem seems to be gone in the new kde session with desktop effects on ... until I boot anew.

My dmesg succeeding the error is:

[   63.965111]  [<ffffffff8106712f>] warn_slowpath_common+0x7f/0xc0
[   63.965112]  [<ffffffff8106718a>] warn_slowpath_null+0x1a/0x20
[   63.965116]  [<ffffffffa00779e4>] __gen6_gt_wait_for_fifo+0x94/0xa0 [i915]
[   63.965121]  [<ffffffffa0078065>] i915_write32+0xe5/0xf0 [i915]
[   63.965126]  [<ffffffffa00b4752>] gen6_ring_put_irq+0xa2/0xc0 [i915]
[   63.965131]  [<ffffffffa00b47c8>] gen6_render_ring_put_irq+0x18/0x20 [i915]
[   63.965136]  [<ffffffffa0089ba7>] i915_wait_request+0x1b7/0x560 [i915]
[   63.965138]  [<ffffffff8108aec0>] ? add_wait_queue+0x60/0x60
[   63.965143]  [<ffffffffa0089f82>] i915_gem_object_wait_rendering+0x32/0x40 [i
915]
[   63.965148]  [<ffffffffa008ed3d>] i915_gem_execbuffer_sync_rings+0xdd/0x160 [
i915]
[   63.965153]  [<ffffffffa008ef2e>] i915_gem_execbuffer_move_to_gpu+0x16e/0x200
 [i915]
[   63.965157]  [<ffffffffa008f65b>] i915_gem_do_execbuffer.isra.8+0x69b/0x940 [
i915]
[   63.965163]  [<ffffffffa009f349>] ? intel_mark_busy+0xd9/0x110 [i915]
[   63.965168]  [<ffffffffa008fdc3>] i915_gem_execbuffer2+0xa3/0x270 [i915]
[   63.965172]  [<ffffffffa00165d4>] drm_ioctl+0x444/0x510 [drm]
[   63.965177]  [<ffffffffa008fd20>] ? i915_gem_execbuffer+0x420/0x420 [i915]
[   63.965179]  [<ffffffff8101dbd4>] ? restore_user_xstate+0x54/0xa0
[   63.965181]  [<ffffffff81189cfa>] do_vfs_ioctl+0x8a/0x340
[   63.965183]  [<ffffffff8118a041>] sys_ioctl+0x91/0xa0
[   63.965185]  [<ffffffff81664a82>] system_call_fastpath+0x16/0x1b
[   63.965186] ---[ end trace 20690ee302d12d8d ]---
[   68.296329] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[   68.297855] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
[   68.311854] [drm:i915_wait_request] *ERROR* i915_wait_request returns -11 (awaiting 11846 at 11845, next 11847)
[   68.315568] [drm:init_ring_common] *ERROR* render ring initialization failed ctl 00000000 head 00000000 tail 00000000 start 00000000
[   68.319138] [drm:init_ring_common] *ERROR* gen6 bsd ring initialization failed ctl 00000000 head 00000000 tail 00000000 start 00000000
[   68.322179] [drm:init_ring_common] *ERROR* blt ring initialization failed ctl 00000000 head 00000000 tail 00000000 start 00000000
[   70.159916] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[   70.161991] [drm:i915_wait_request] *ERROR* i915_wait_request returns -11 (awaiting 11847 at 11844, next 11848)
[   70.162096] [drm:i915_reset] *ERROR* GPU hanging too fast, declaring wedged!
[   70.162100] [drm:i915_reset] *ERROR* Failed to reset chip.

Revision history for this message

jrierab (jrierab) wrote on 2012-05-01:

#7

I am suffering this bug and similar ones related to what seems to be a regression in kernel.

I have 3 similar systems with the following configurations:

1- Work PC, with Precise updated from Oneiric since Beta 2, i5-2400. Does not seems to be affected by the bug.

2- Main home PC, with Precise updated from Oneiric since Beta 2, i5-2500K. The bug and completely system hangs occurs so often that it is nearly unusable. This happens also with the latest mainline kernel 3.4-rc4-precise (http://kernel.ubuntu.com/~kernel-ppa/mainline/). However, if I start with kernel 3.0.17 from Oneiric, the bug does not appear. I am working with it for nearly two days without a single hang. So, this may be a workaround.

3- Same home PC, fresh Ubuntu Precise distribution installed in a clean partition, same i5-2500K. I use it as a test platform. The bug has occured, and the dmesg attached belongs to this clean system. Nothing more is installed from default, safe the precise updates.

The bug produces several 2-3 seconds black screen and back to normal for 10-20 seconds, in sequence, normally followed by a completely desktop hang which requires a full reset. Sometimes, the hang does not occur, but all windows decorations and the unity bar dissapears (like the window manager is dead).

The bug appears randomly, but more often if switching from desktop spaces, and with firefox navigator open.

Revision history for this message

jrierab (jrierab) wrote on 2012-05-01:

#8

Just after filling the comment above, the windows decorations haver dessapeared. As I had a terminal open, here it is the dmesg just after.

Revision history for this message

jrierab (jrierab) wrote on 2012-05-01:

#9

And a screen capture...

jrierab (jrierab) on 2012-05-01

tags:

removed: kernel-request-3.2.0-18.28 needs-upstream-testing

Revision history for this message

Joseph Salisbury (jsalisbury) wrote on 2012-05-01:

#10

This issue appears to be an upstream bug, since you tested the latest upstream kernel. Would it be possible for you to open an upstream bug report at bugzilla.kernel.org [1]? That will allow the upstream Developers to examine the issue, and may provide a quicker resolution to the bug.

If you are comfortable with opening a bug upstream, It would be great if you can report back the upstream bug number in this bug report. That will allow us to link this bug to the upstream report.

[1] https://wiki.ubuntu.com/Bugs/Upstream/kernel

tags:

added: kernel-bug-exists-upstream

Revision history for this message

Greg Stovall (78luphr0rnk2nuqimstywepozxn9kl19tqh0tx66b5dki1xxsh5mkz9gl21a5rlwfnr8jn6ln0m3jxne2k9x1ohg85w3jabxlrqbgsz-launchpad-a811i2i3ytqlsztthjth0svbccw8inm65tmkqp9sarr553jq53in4xm1m8wn3o4rlwaer06ogwvqwv9mrqoku2x334n7di44o65qze6) wrote on 2012-05-13:

#11

Just an update on the behavior of this bug.

About a week ago, there was an update installed by Update Manager that resolved (for me) the issue of Unity crashing. The problem with the screen resolution spontaneously changing was still present, but at least it was usable.

Yesterday, Update Manager installed a new set of updates, and the regression is back. It's gone back to nearly unusable.

I'm going to have to break down and reinstall 11.10 and hope that the video issues with 12.04 get resolved pretty soon.

While running Firefox makes the breakage happen extremely quickly, I have seen in the last day the breakage happen even with nothing but Thunderbird running. In this scenario, the Unit decoration and launcher bar went away, AND all menus became empty. The menu would still create a box, but the box was empty.

Revision history for this message

jrierab (jrierab) wrote on 2012-05-19:

#12

As suggested, I reported the bug upstream. It is still present in kernel v3.4-rc7-precise and also in the latest drm-intel-experimental (http://kernel.ubuntu.com/~kernel-ppa/mainline/drm-intel-experimental/2012-05-14-precise/).

You can follow it from https://bugzilla.kernel.org/show_bug.cgi?id=43267

Sorry for the delay, but I have been quite busy recently and did not had the time to test the latest kernel versions until today (and didn't want to report without being sure the bug still was present upstream).

Bug Watch Updater (bug-watch-updater) on 2012-05-19

Changed in linux:
importance:	Unknown → High
status:	Unknown → Incomplete

Joseph Salisbury (jsalisbury) on 2012-05-21

Changed in linux (Ubuntu):
status:	Confirmed → Triaged

Revision history for this message

Anton Anikin (anton-anikin) wrote on 2012-06-03:

#13

Same problem with Ubuntu 12.10 and 3.4 kernel

Revision history for this message

jrierab (jrierab) wrote on 2012-06-14:

#14

Good news at last !!! It seems that the source of the problem has been identified upstream.

Until it is fully solved, hovewer, you can try a workaround. Just edit the config file (1st option for burg, 2nd for grub):

$ gksudo gedit /etc/default/burg &
$ gksudo gedit /etc/default/grub &

and add the "i915.i915_enable_rc6=0 option to GRUB_CMDLINE_LINUX_DEFAULT. Should be something like:

GRUB_CMDLINE_LINUX_DEFAULT="i915.i915_enable_rc6=0 quiet splash"

Then, update boot files with (1st option for burg, 2nd for grub):

$ sudo update-burg
$ sudo update-grub

That's it. Reboot and enjoy your new kernels!

Bug Watch Updater (bug-watch-updater) on 2012-06-14

Changed in linux:
status:	Incomplete → In Progress

Revision history for this message

Simon Kingsley (scjk) wrote on 2012-06-22:

#15

This is still an issue for me. Very annoying.

Revision history for this message

Donald Johnson (donald-johnson) wrote on 2012-08-21:

#16

I have seven new Network monitoring machines just built with 12.04 LTS that are impacted by this bug, a solution would be great

Revision history for this message

versus167 (wingdvd-2008) wrote on 2012-09-14:

#17

This issue affects me to. :(

Revision history for this message

Michael Vorburger (mike-vorburger) wrote on 2012-09-14:

#18

+1; also hit this, on a fresh 12.04.1 install. Will try workaround given above.

Bug Watch Updater (bug-watch-updater) on 2012-09-17

Changed in linux:
status:	In Progress → Incomplete

Bug Watch Updater (bug-watch-updater) on 2012-10-04

Changed in linux:
status:	Incomplete → Fix Released

Bug Watch Updater (bug-watch-updater) on 2012-10-08

Changed in linux:
status:	Fix Released → Confirmed

Bug Watch Updater (bug-watch-updater) on 2013-01-16

Changed in linux:
status:	Confirmed → Incomplete

Revision history for this message

Pieter (diepes) wrote on 2013-01-31:

#19

on intel opensource site

https://01.org/linuxgraphics/documentation/how-get-last-batch-buffer-gpu-hang

They suggest capturing the output of /sys/kernel/debug/dri/0/i915_error_state
cat /sys/kernel/debug/dri/0/i915_error_state > i915_error_state

I attached it here in the hope that it could help resolve the problem.

Revision history for this message

Fabio Marconi (fabiomarconi) wrote on 2013-02-01:

#20

reproduced something similar on 3.8.0-3, see bug 1112871
---
Ubuntu Bug Squad volunteer triager
http://wiki.ubuntu.com/BugSquad

tags:

added: raring

Revision history for this message

Pieter (diepes) wrote on 2013-02-20:

#21

$ uname -a
Linux t420 3.8.0-030800-generic #201302181935 SMP Tue Feb 19 00:36:19 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

from kernel ppa
Still hangs

Feb 20 11:10:37 t420 kernel: [ 9795.543780] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung

Attached
$ sudo cat /sys/kernel/debug/dri/0/i915_error_state > i915_error_state-20130220

Revision history for this message

Robert Hughes (robert-4) wrote on 2013-02-22:

#22

Read somewhere that this bug was fixed, so checked out CHANGES in the new kernel builds, Chris Wilson and Daniel Vetter among others are working with a lot of fixes for this driver. Hoping it was fixed somewhere among all those contributions, my system and logs related to the drm:i915_hangcheck_hung ERROR is printed below. However, upgrading reduced the *rate* of the error, but came back today. When working with Blender or Eclipse, programs using a lot of memory, system suddenly freeze. The mouse cursor is still movable, but the rest is frozen.

uname -a:
Linux robtu 3.8.0-030800rc7-generic #201302081635 SMP Fri Feb 8 21:57:43 UTC 2013 i686 i686 i686 GNU/Linux

syslog:
Feb 21 23:34:41 robtu kernel: [233139.352461] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung

kern.log:
Feb 21 23:34:41 robtu kernel: [233139.352461] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
Feb 21 23:34:41 robtu kernel: [233139.352468] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state

cat /debug/dri/0/i915_error_state:
no error state collected

dmesg:
[ 16.946953] i915 0000:00:02.0: setting latency timer to 64
[ 16.947317] i915 0000:00:02.0: irq 42 for MSI/MSI-X
[ 17.934259] i915 0000:00:02.0: fb0: inteldrmfb frame buffer device
[ 17.934261] i915 0000:00:02.0: registered panic notifier
[ 18.234714] [drm] Initialized i915 1.6.0 20080730 for 0000:00:02.0 on minor 0

lspci:
00:02.0 VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09)

lshw:
             description: VGA compatible controller
             product: 2nd Generation Core Processor Family Integrated Graphics Controller
             vendor: Intel Corporation
             physical id: 2
             bus info: pci@0000:00:02.0
             version: 09
             width: 64 bits
             clock: 33MHz
             capabilities: msi pm vga_controller bus_master cap_list rom
             configuration: driver=i915 latency=0
             resources: irq:42 memory:c0000000-c03fffff memory:b0000000-bfffffff ioport:2000(size=64)

BTW, this guy has claimed he has fixed the problem, anyone with "kernelpowers" here want to take a look?
http://www.quineloop.com/2012/05/26/intel-i915-gpu-hung-linux.html

Read somewhere that this bug was fixed, so checked out CHANGES in the new kernel builds, Chris Wilson and Daniel Vetter among others are working with a lot of fixes for this driver. Hoping it was fixed somewhere among all those contributions, my system and logs related to the drm:i915_hangcheck_hung ERROR is printed below. However, upgrading reduced the *rate* of the error, but came back today. When working with Blender or Eclipse, programs using a lot of memory, system suddenly freeze. The mouse cursor is still movable, but the rest is frozen.

uname -a:
Linux robtu 3.8.0-030800rc7-generic #201302081635 SMP Fri Feb 8 21:57:43 UTC 2013 i686 i686 i686 GNU/Linux

syslog:
Feb 21 23:34:41 robtu kernel: [233139.352461] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung

kern.log:
Feb 21 23:34:41 robtu kernel: [233139.352461] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
Feb 21 23:34:41 robtu kernel: [233139.352468] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state

cat /debug/dri/0/i915_error_state:
no error state collected

dmesg:
[ 16.946953] i915 0000:00:02.0: setting latency timer to 64
[ 16.947317] i915 0000:00:02.0: irq 42 for MSI/MSI-X
[ 17.934259] i915 0000:00:02.0: fb0: inteldrmfb frame buffer device
[ 17.934261] i915 0000:00:02.0: registered panic notifier
[ 18.234714] [drm] Initialized i915 1.6.0 20080730 for 0000:00:02.0 on minor 0

lspci:
00:02.0 VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09)

lshw:
             description: VGA compatible controller
             product: 2nd Generation Core Processor Family Integrated Graphics Controller
             vendor: Intel Corporation
             physical id: 2
             bus info: pci@0000:00:02.0
             version: 09
             width: 64 bits
             clock: 33MHz
             capabilities: msi pm vga_controller bus_master cap_list rom
             configuration: driver=i915 latency=0
             resources: irq:42 memory:c0000000-c03fffff memory:b0000000-bfffffff ioport:2000(size=64)

BTW, this guy has claimed he has fixed the problem, anyone with "kernelpowers" here want to take a look?
http://www.quineloop.com/2012/05/26/intel-i915-gpu-hung-linux.html

Revision history for this message

Tristan BENOIST (tristan-benoist) wrote on 2013-03-04:

#23

Hello,

I have the same problem here with a freeze of all the screen except the cursor.

I have nvidia optimus handling a Geforce 520M and an intel video chip.

uname -a :
3.5.0-26-generic #40-Ubuntu SMP Tue Feb 26 19:57:24 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

dmesg :
[drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[drm] Enabling RC6 states: RC6 on, RC6p off, RC6pp off
atl1c 0000:07:00.0: vpd r/w failed. This is likely a firmware bug on this device. Contact the card vendor for a firmware update.

attachement : /sys/kernel/debug/dri/0/i915_error_state

Revision history for this message

Alexander Adam (7ql6) wrote on 2013-03-11:

#24

Had just the same problem on

LSB Version: core-2.0-amd64:core-2.0-noarch:core-3.0-amd64:core-3.0-noarch:core-3.1-amd64:core-3.1-noarch:core-3.2-amd64:core-3.2-noarch:core-4.0-amd64:core-4.0-noarch
Description: Ubuntu 12.10

In dmesg I found:

[29668.344498] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[29668.344505] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state

but in my case no /debug/dri/0/i915_error_state could be found (while the bug-gui-tool asked wether to send the data to launchpad this could be reason for the missing file I think - does it remove the file(s) afterwards?).

The interesting point is that I got in this "state" when trying to watch a railscast episode in the browser and pressed the fullscreen-button. The screen became black so I minimized again and everythink was fine again. Doing this a few times and the exception window came up.
GUI operations are very slow now and and many element won't repaint properly (ie when scrolling).

I used chromium 24.0.1312.56-0ubuntu0.12.10.3 (if it is somehow relevant).

Xorg.log seems to be interesting ((EE) [mi] EQ overflowing…) you can find it attached.

While I have a ASUS Zenbook UX31A my graphics card is

00:02.0 VGA compatible controller [0300]: Intel Corporation 3rd Gen Core processor Graphics Controller [8086:0166] (rev 09) (prog-if 00 [VGA controller])
Subsystem: ASUSTeK Computer Inc. Device [1043:1517]
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
Latency: 0
Interrupt: pin A routed to IRQ 44
Region 0: Memory at f7800000 (64-bit, non-prefetchable) [size=4M]
Region 2: Memory at e0000000 (64-bit, prefetchable) [size=256M]
Region 4: I/O ports at f000 [size=64]
Expansion ROM at <unassigned> [disabled]
Capabilities: <access denied>
Kernel driver in use: i915
Kernel modules: i915

Had just the same problem on

LSB Version:	core-2.0-amd64:core-2.0-noarch:core-3.0-amd64:core-3.0-noarch:core-3.1-amd64:core-3.1-noarch:core-3.2-amd64:core-3.2-noarch:core-4.0-amd64:core-4.0-noarch
Description:	Ubuntu 12.10

In dmesg I found:

[29668.344498] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[29668.344505] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state

but in my case no /debug/dri/0/i915_error_state could be found (while the bug-gui-tool asked wether to send the data to launchpad this could be reason for the missing file I think - does it remove the file(s) afterwards?).

The interesting point is that I got in this "state" when trying to watch a railscast episode in the browser and pressed the fullscreen-button. The screen became black so I minimized again and everythink was fine again. Doing this a few times and the exception window came up.
GUI operations are very slow now and and many element won't repaint properly (ie when scrolling).

I used chromium 24.0.1312.56-0ubuntu0.12.10.3 (if it is somehow relevant).

Xorg.log seems to be interesting ((EE) [mi] EQ overflowing…) you can find it attached.

While I have a ASUS Zenbook UX31A my graphics card is

00:02.0 VGA compatible controller [0300]: Intel Corporation 3rd Gen Core processor Graphics Controller [8086:0166] (rev 09) (prog-if 00 [VGA controller])
	Subsystem: ASUSTeK Computer Inc. Device [1043:1517]
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
	Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0
	Interrupt: pin A routed to IRQ 44
	Region 0: Memory at f7800000 (64-bit, non-prefetchable) [size=4M]
	Region 2: Memory at e0000000 (64-bit, prefetchable) [size=256M]
	Region 4: I/O ports at f000 [size=64]
	Expansion ROM at <unassigned> [disabled]
	Capabilities: <access denied>
	Kernel driver in use: i915
	Kernel modules: i915

Revision history for this message

Hans (old-man999) wrote on 2013-03-12:

#25

Seems related to this: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1135759

I tried all the versions before this bug happening 3.5.24 and earlier with no success.
"915.i915_enable_rc6=0" does not make a change really here although gpu isn't hanging that often.

Revision history for this message

Stan Schymanski (schymans) wrote on 2013-03-13:

#26

Same here, tried "i915.i915_enable_rc6=0", even upgraded to Kernel 3.8.2, but the crash reports just keep coming, with or without freezes and hard shutdowns.

Revision history for this message

Stan Schymanski (schymans) wrote on 2013-03-13:

#27

Download full text (3.7 KiB)

Just to give an idea of the frequency of the GPU hangs, I pasted below a part from my syslog. After the 3rd hang in a row, the system became unresponsive and I had to do an emergency shut-down (Alt+SysRq, R, E, I, S, U , B). Hope this will help with the troubleshooting.

Mar 11 21:45:03 sppc26 kernel: [ 0.000000] Kernel command line: BOOT_IMAGE=/vmlinuz-3.5.0-26-generic root=UUID=5083e04c-1bad-44bf-a241-c839914a697a ro crashkernel=384M-2G:64M,2G-:128M quiet splash i915.i915_enable_rc6=0
.
.
.
Mar 11 21:47:07 sppc26 kernel: [ 159.875052] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
Mar 11 21:47:07 sppc26 kernel: [ 159.875056] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
Mar 11 21:47:07 sppc26 kernel: [ 159.878208] [drm] Enabling RC6 states: RC6 off, RC6p off, RC6pp off
Mar 11 21:47:20 sppc26 kernel: [ 172.694397] CPU2: Package power limit notification (total events = 1)
Mar 11 21:47:20 sppc26 kernel: [ 172.694399] CPU3: Package power limit notification (total events = 1)
Mar 11 21:47:20 sppc26 kernel: [ 172.694400] CPU1: Package power limit notification (total events = 1)
Mar 11 21:47:20 sppc26 kernel: [ 172.694401] CPU0: Package power limit notification (total events = 1)
Mar 11 21:47:20 sppc26 kernel: [ 172.705373] CPU3: Package power limit normal
Mar 11 21:47:20 sppc26 kernel: [ 172.705374] CPU1: Package power limit normal
Mar 11 21:47:20 sppc26 kernel: [ 172.705393] CPU2: Package power limit normal
Mar 11 21:47:20 sppc26 kernel: [ 172.705394] CPU0: Package power limit normal
Mar 11 21:57:19 sppc26 kernel: [ 770.979630] CPU1: Package power limit notification (total events = 1068)
Mar 11 21:57:19 sppc26 kernel: [ 770.979633] CPU3: Package power limit notification (total events = 1068)
Mar 11 21:57:19 sppc26 kernel: [ 770.979635] CPU2: Package power limit notification (total events = 1068)
Mar 11 21:57:19 sppc26 kernel: [ 770.979637] CPU0: Package power limit notification (total events = 1068)
Mar 11 21:57:19 sppc26 kernel: [ 770.990639] CPU1: Package power limit normal
Mar 11 21:57:19 sppc26 kernel: [ 770.990641] CPU2: Package power limit normal
Mar 11 21:57:19 sppc26 kernel: [ 770.990642] CPU3: Package power limit normal
Mar 11 21:57:19 sppc26 kernel: [ 770.990643] CPU0: Package power limit normal
Mar 11 21:58:51 sppc26 kernel: [ 863.575435] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
Mar 11 21:58:51 sppc26 kernel: [ 863.575750] [drm] Enabling RC6 states: RC6 off, RC6p off, RC6pp off
Mar 11 22:05:38 sppc26 kernel: [ 1270.493341] CPU2: Package power limit notification (total events = 1578)
Mar 11 22:05:38 sppc26 kernel: [ 1270.493344] CPU1: Package power limit notification (total events = 1578)
Mar 11 22:05:38 sppc26 kernel: [ 1270.493346] CPU3: Package power limit notification (total events = 1578)
Mar 11 22:05:38 sppc26 kernel: [ 1270.493347] CPU0: Package power limit notification (total events = 1578)
Mar 11 22:05:38 sppc26 kernel: [ 1270.504379] CPU2: Package power limit normal
Mar 11 22:05:38 sppc26 kernel: [ 1270.504380] CPU0: Package power limit normal
Mar 11 22:05:38 sppc26 kernel: [ 1270.504399] CPU1: Package power ...

Just to give an idea of the frequency of the GPU hangs, I pasted below a part from my syslog. After the 3rd hang in a row, the system became unresponsive and I had to do an emergency shut-down (Alt+SysRq, R, E, I, S, U , B). Hope this will help with the troubleshooting.

Mar 11 21:45:03 sppc26 kernel: [    0.000000] Kernel command line: BOOT_IMAGE=/vmlinuz-3.5.0-26-generic root=UUID=5083e04c-1bad-44bf-a241-c839914a697a ro crashkernel=384M-2G:64M,2G-:128M quiet splash i915.i915_enable_rc6=0
.
.
.
Mar 11 21:47:07 sppc26 kernel: [  159.875052] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
Mar 11 21:47:07 sppc26 kernel: [  159.875056] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
Mar 11 21:47:07 sppc26 kernel: [  159.878208] [drm] Enabling RC6 states: RC6 off, RC6p off, RC6pp off
Mar 11 21:47:20 sppc26 kernel: [  172.694397] CPU2: Package power limit notification (total events = 1)
Mar 11 21:47:20 sppc26 kernel: [  172.694399] CPU3: Package power limit notification (total events = 1)
Mar 11 21:47:20 sppc26 kernel: [  172.694400] CPU1: Package power limit notification (total events = 1)
Mar 11 21:47:20 sppc26 kernel: [  172.694401] CPU0: Package power limit notification (total events = 1)
Mar 11 21:47:20 sppc26 kernel: [  172.705373] CPU3: Package power limit normal
Mar 11 21:47:20 sppc26 kernel: [  172.705374] CPU1: Package power limit normal
Mar 11 21:47:20 sppc26 kernel: [  172.705393] CPU2: Package power limit normal
Mar 11 21:47:20 sppc26 kernel: [  172.705394] CPU0: Package power limit normal
Mar 11 21:57:19 sppc26 kernel: [  770.979630] CPU1: Package power limit notification (total events = 1068)
Mar 11 21:57:19 sppc26 kernel: [  770.979633] CPU3: Package power limit notification (total events = 1068)
Mar 11 21:57:19 sppc26 kernel: [  770.979635] CPU2: Package power limit notification (total events = 1068)
Mar 11 21:57:19 sppc26 kernel: [  770.979637] CPU0: Package power limit notification (total events = 1068)
Mar 11 21:57:19 sppc26 kernel: [  770.990639] CPU1: Package power limit normal
Mar 11 21:57:19 sppc26 kernel: [  770.990641] CPU2: Package power limit normal
Mar 11 21:57:19 sppc26 kernel: [  770.990642] CPU3: Package power limit normal
Mar 11 21:57:19 sppc26 kernel: [  770.990643] CPU0: Package power limit normal
Mar 11 21:58:51 sppc26 kernel: [  863.575435] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
Mar 11 21:58:51 sppc26 kernel: [  863.575750] [drm] Enabling RC6 states: RC6 off, RC6p off, RC6pp off
Mar 11 22:05:38 sppc26 kernel: [ 1270.493341] CPU2: Package power limit notification (total events = 1578)
Mar 11 22:05:38 sppc26 kernel: [ 1270.493344] CPU1: Package power limit notification (total events = 1578)
Mar 11 22:05:38 sppc26 kernel: [ 1270.493346] CPU3: Package power limit notification (total events = 1578)
Mar 11 22:05:38 sppc26 kernel: [ 1270.493347] CPU0: Package power limit notification (total events = 1578)
Mar 11 22:05:38 sppc26 kernel: [ 1270.504379] CPU2: Package power limit normal
Mar 11 22:05:38 sppc26 kernel: [ 1270.504380] CPU0: Package power limit normal
Mar 11 22:05:38 sppc26 kernel: [ 1270.504399] CPU1: Package power limit normal
Mar 11 22:05:38 sppc26 kernel: [ 1270.504400] CPU3: Package power limit normal
Mar 11 22:09:56 sppc26 kernel: [ 1527.956133] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
Mar 11 22:12:00 sppc26 kernel: [ 1651.830636] SysRq : This sysrq operation is disabled.
Mar 11 22:12:01 sppc26 kernel: [ 1653.365819] SysRq : This sysrq operation is disabled.
Mar 11 22:12:03 sppc26 kernel: [ 1654.552128] SysRq : This sysrq operation is disabled.
Mar 11 22:12:04 sppc26 kernel: [ 1655.658691] SysRq : Emergency Sync

Revision history for this message

Stan Schymanski (schymans) wrote on 2013-03-13:

#28

For completeness, below an example from the syslog under Kernel 3.8.2 (again needed emergency reset). I also tried Kernel 2.6.38-13, which did not result in the same "GPU hung" message, but lots of other crash reports, so I gave up quite quickly.

Mar 12 23:01:48 sppc26 kernel: [ 0.000000] Linux version 3.8.2-030802-generic (root@gomeisa) (gcc version 4.6.3 (Ubuntu/Linaro 4.6.3-1ubuntu5) ) #201303031906 SMP Mon Mar 4 00:07:09 UTC 2013
Mar 12 23:01:48 sppc26 kernel: [ 0.000000] Command line: BOOT_IMAGE=/vmlinuz-3.8.2-030802-generic root=UUID=5083e04c-1bad-44bf-a241-c839914a697a ro crashkernel=384M-2G:64M,2G-:128M quiet splash i915.i915_enable_rc6=0
.
.
.
Mar 12 23:09:37 sppc26 kernel: [ 505.650930] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
Mar 12 23:09:37 sppc26 kernel: [ 505.650935] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
Mar 12 23:12:46 sppc26 kernel: [ 695.152169] SysRq : This sysrq operation is disabled.

Revision history for this message

Pieter (diepes) wrote on 2013-03-13:

#30

Is this kernel bug the same i915 problem ?
https://bugs.freedesktop.org/show_bug.cgi?id=54226

Revision history for this message

Alessio (alessio) wrote on 2013-03-20:

#31

Ubuntu 12.04.2LTS on Intel i3-2120 with integrated hd2000 gpu

I never had this problem before, but after yesterday kernel update (3.5.0-26-generic) after few minutes dmesg start to show this error messages

[ 175.345408] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[ 175.345412] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
[ 175.348230] [drm] Enabling RC6 states: RC6 off, RC6p off, RC6pp off
[ 200.244562] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[ 200.244844] [drm] Enabling RC6 states: RC6 off, RC6p off, RC6pp off

after about an hour Xorg hangs I can only move the mouse pointer

I attached /sys/kernel/debug/dri/0/i915_error_state

Revision history for this message

Andrew Seguin (aseguo) wrote on 2013-03-20:

#32

Just a small "happens to me to" comment, with one small extra detail about kernel versions:

After Ubuntu 12.04 with automatic updates picked up kernel 3.2.0-39-generic, all computers on our campus (20+ with the problem) with Pentium G840 for CPU (Intel i915 graphics) started having that problem.

We resolved the problem temporarily by installing linux-image-3.2.0-38-generic and removing linux-image-3.2.0-39-generic.

The old fix described in the kernel bug report was not working for us (kernel command line option i915.i915_enable_rc6=0)

Revision history for this message

Marc-André Laverdière (marc-andre-atc-deactivatedaccount) wrote on 2013-03-21:

#33

I have experienced the same today. I am on a Linux Mint 14 system, with the latests kernel update installed.

I get the following in kern.log
[drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
[drm] Enabling RC6 states: RC6 on, RC6p off, RC6pp off

Output of uname -a
Linux redemption 3.5.0-26-generic #42-Ubuntu SMP Fri Mar 8 23:18:20 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

Revision history for this message

Andrew Inishev (inish777) wrote on 2013-03-21:

#34

Ubuntu 12.10. Display randomly freezes for few seconds.

Mar 21 22:20:55 laptop kernel: [ 8539.287011] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
Mar 21 22:20:55 laptop kernel: [ 8539.287232] [drm] Enabling RC6 states: RC6 on, RC6p off, RC6pp off

uname -a:

Linux laptop 3.5.0-26-generic #42-Ubuntu SMP Fri Mar 8 23:18:20 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

Also I have ATI RADEON HD 6300 as a discrete videocard.

/kernel/debug/vgaswitcheroo/switch:
0:IGD:+:Pwr:0000:00:02.0
1:DIS: :Pwr:0000:01:00.0

Revision history for this message

Ben (bhubu) wrote on 2013-03-22:

#35

Same problem: display freezes for few seconds since a few days.

Ubuntu 12.10

Mar 22 18:22:02 bhnbu kernel: [28761.486703] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung

Linux bhnbu 3.5.0-26-generic #42-Ubuntu SMP Fri Mar 8 23:18:20 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

Revision history for this message

Ben (bhubu) wrote on 2013-03-23:

#36

Looks like a regression in 3.5.0-26. Booting 3.5.0-25 (Linux bhnbu 3.5.0-25-generic #39-Ubuntu SMP Mon Feb 25 18:26:58 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux) does not show the problematic behaviour on my system.

Revision history for this message

Ben (bhubu) wrote on 2013-03-23:

#38

I am taking back my last comment.

Mar 23 17:10:59 bhnbu kernel: [ 3477.463901] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung

:(

Pedro Villavicencio (pedro) on 2013-03-24

Changed in linux (Ubuntu):
importance:	Medium → High

Revision history for this message

Cristian Aravena Romero (caravena) wrote on 2013-03-24:

#39

Welcome Pedro Villavicencio to Ubuntu! - (¡Re-bienvenido Pedro Villavicencio a Ubuntu!) :D

Revision history for this message

Acid 303 (acid-303) wrote on 2013-03-25:

#40

I have the same error with an "Sony Vaio SVE1512C6EW" (Ubuntu 12.10 - 64 Bit).

See my german postion on ubuntuusers.de

http://forum.ubuntuusers.de/topic/error-hangcheck-timer-elapsed-gpu-hung-sony-va/

Is there a workaround for this problem to fix it?

thx,
Acid

Revision history for this message

Aymeric (mulx) wrote on 2013-03-25:

#41

Same here on a ThinkPad-Edge-E330 with a kernel 3.5.0-26-generic.

Without adding "915.i915_enable_rc6=0" to boot param graphic freeze, accessing trough ssh is still possible.
By adding this option the system graphic hang for few second and still almost usable (I've notice some text missing, probably memory corruption, but we are still able to "click" and reboot graphically).

A quick step to produce the bug: run glxgears and wait about 1 minute.

Note that I'm still able to login to the laptop with ssh, and since this bug is marked as "incomplete", tell me what information you need (like memory dump) and the way to get them, I will provided.

Regards,
Aymeric.

Revision history for this message

mycroes (mycroes) wrote on 2013-03-26:

#42

Having the same issue. Most of the time I can continue working and only get something like 'System problem detected' where I can report an issue (it's in Dutch, but I guess it's clear what window this is), sometimes (happened twice I think) the display locks completely. I tried (at least once) to do ctrl-alt-f1 and ctrl-alt-del, didn't work, but I guess it's mostly the vt switch that doesn't work when the display locks.

Now while I'm typing this I'm getting the 'System problem detected' dialog about every minute, but without the lockups.
Regards,

Michael

Revision history for this message

allartk (allartk) wrote on 2013-03-30:

#43

#1140716 is the same?

Revision history for this message

Otto Kekäläinen (otto) wrote on 2013-03-31:

#44

I've been running Ubuntu 12.04 for over a year now without problems on my home computer, but since about a week ago when I ran the update installer, the graphical system has started to crash randomly as described in this bug report.

Symptoms (appear randomly, only one at the time):
- System freezes completely during late stages of startup. Reboot recovers.
- System might start and everything work, but Apport dialogs appear one after another. Even if you complete the the Apport report, new dialogs about the exact same Intel GPU problem reapper. Reboot recovers the insane Apport loop.
- During use window decorations disappear. Window contents partially respond to mouse, but as the window manager seems dead, system must be rebooted to recover.

In kernel and syslog I've found the lines like these appear:
Mar 31 12:04:39 htpc kernel: [ 9026.908141] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
Mar 31 12:04:39 htpc kernel: [ 9026.908150] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
Mar 31 12:04:39 htpc kernel: [ 9026.910800] [drm:i915_wait_request] *ERROR* i915_wait_request returns -11 (awaiting 2211040 at 2211035, next 2211041)
Mar 31 12:04:39 htpc kernel: [ 9026.916260] HDMI hot plug event: Codec=3 Pin=7 Presence_Detect=0 ELD_Valid=1
Mar 31 12:04:39 htpc kernel: [ 9026.916309] HDMI status: Codec=3 Pin=7 Presence_Detect=0 ELD_Valid=0
Mar 31 12:04:40 htpc kernel: [ 9027.115958] atl1c 0000:07:00.0: vpd r/w failed. This is likely a firmware bug on this device. Contact the card vendor for a firmware update.

The HDMI hot plug event is part of the bug. All cables are connected at all times and the driver should not get any hotplug event.

The main problem is the GPU hang.

Kernel:
Linux htpc 3.2.0-39-generic #62-Ubuntu SMP Thu Feb 28 00:28:53 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

Graphics card:
00:02.0 VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09) (prog-if 00 [VGA controller])
Subsystem: ASUSTeK Computer Inc. Device 844d
Flags: bus master, fast devsel, latency 0, IRQ 53
Memory at fb400000 (64-bit, non-prefetchable) [size=4M]
Memory at d0000000 (64-bit, prefetchable) [size=256M]
I/O ports at f000 [size=64]
Expansion ROM at <unassigned> [disabled]
Capabilities: <access denied>
Kernel driver in use: i915
Kernel modules: i915

It is interesting that this bug seems to have appeared to both 3.2, 3.5 and 3.8 -version kernels. Maybe it is related to some security fix applied recently? Or perhaps the root cause is not in the kernel package, but in some compiz or X update?

I've been running Ubuntu 12.04 for over a year now without problems on my home computer, but since about a week ago when I ran the update installer, the graphical system has started to crash randomly as described in this bug report.

Symptoms (appear randomly, only one at the time):
- System freezes completely during late stages of startup. Reboot recovers.
- System might start and everything work, but Apport dialogs appear one after another. Even if you complete the the Apport report, new dialogs about the exact same Intel GPU problem reapper. Reboot recovers the insane Apport loop.
- During use window decorations disappear. Window contents partially respond to mouse, but as the window manager seems dead, system must be rebooted to recover.

In kernel and syslog I've found the lines like these appear:
Mar 31 12:04:39 htpc kernel: [ 9026.908141] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
Mar 31 12:04:39 htpc kernel: [ 9026.908150] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
Mar 31 12:04:39 htpc kernel: [ 9026.910800] [drm:i915_wait_request] *ERROR* i915_wait_request returns -11 (awaiting 2211040 at 2211035, next 2211041)
Mar 31 12:04:39 htpc kernel: [ 9026.916260] HDMI hot plug event: Codec=3 Pin=7 Presence_Detect=0 ELD_Valid=1
Mar 31 12:04:39 htpc kernel: [ 9026.916309] HDMI status: Codec=3 Pin=7 Presence_Detect=0 ELD_Valid=0
Mar 31 12:04:40 htpc kernel: [ 9027.115958] atl1c 0000:07:00.0: vpd r/w failed.  This is likely a firmware bug on this device.  Contact the card vendor for a firmware update.

The HDMI hot plug event is part of the bug. All cables are connected at all times and the driver should not get any hotplug event.

The main problem is the GPU hang.

Kernel:
Linux htpc 3.2.0-39-generic #62-Ubuntu SMP Thu Feb 28 00:28:53 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

Graphics card:
00:02.0 VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09) (prog-if 00 [VGA controller])
	Subsystem: ASUSTeK Computer Inc. Device 844d
	Flags: bus master, fast devsel, latency 0, IRQ 53
	Memory at fb400000 (64-bit, non-prefetchable) [size=4M]
	Memory at d0000000 (64-bit, prefetchable) [size=256M]
	I/O ports at f000 [size=64]
	Expansion ROM at <unassigned> [disabled]
	Capabilities: <access denied>
	Kernel driver in use: i915
	Kernel modules: i915

It is interesting that this bug seems to have appeared to both 3.2, 3.5 and 3.8 -version kernels. Maybe it is related to some security fix applied recently? Or perhaps the root cause is not in the kernel package, but in some compiz or X update?

Revision history for this message

nldquy (nguyenledinhquy-yahoo) wrote on 2013-04-01:

#45

I got exactly the same problem and now the system always crash after a few minutes running.

Ubuntu 12.10 3.5.0-26 generic

CPU intel core i7 2600

Ram 3x4GB DDRIII

Revision history for this message

Gerry Finlay (fingau) wrote on 2013-04-02:

#46

Same thing here. A few weeks ago I did a normal upgrade and GUI now always hangs with the messages in dmesg the same as those listed in previous posts. drm: i915_hangcheck_elapsed, etc.
Ubuntu 12.04 3.2.0-39-generic
To have a usuable system I must boot the previous linux version 3.2.0-38-generic.

Revision history for this message

Otto Kekäläinen (otto) wrote on 2013-04-02:

#47

I downgraded to kernel 3.2.0-38 and the issue is disappeared. I reinstalled -39 and the issue re-appeared and then again returned to -38 and everything works again. So clearly at least for me the bug is a regression in the 3.2.0-39 kernel.

Revision history for this message

Niklas Edmundsson (niklas-edmundsson) wrote on 2013-04-03:

#48

Same here on precise amd64. Booting with 3.2.0-39 X hangs shortly after login with the earlier mentioned hangcheck messages in the kernel log (machine is remotely login:able). Reverting to 3.2.0-38 yields a stable machine.

This is on a machine with an Intel i5 CPU on an ASUS P8H67-V motherboard, with the latest&greatest BIOS (upgrading the BIOS made no difference).

tags:

added: regression-update

Revision history for this message

Cedric Schieli (cschieli) wrote on 2013-04-03: Re: [Bug 946899] Re: [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung

#49

Same here, but the couple of working/non-working kernel is 3.5.0-25/3.5.0-26.

My hardware is a Dell Inspiron N7110 with a i7-2670QM CPU and a NVIDIA
GeForce GT 540M disabled using bbswitch :

[ 46.492299] bbswitch: version 0.6
[ 46.492305] bbswitch: Found integrated VGA device 0000:00:02.0:
\_SB_.PCI0.GFX0
[ 46.492309] bbswitch: Found discrete VGA device 0000:01:00.0:
\_SB_.PCI0.PEG0.PEGP
[ 46.492508] bbswitch: detected an Optimus _DSM function
[ 46.492513] bbswitch: disabling discrete graphics
[ 46.610983] bbswitch: Succesfully loaded. Discrete card 0000:01:00.0 is off

2013/4/3 Niklas Edmundsson <email address hidden>:
> Same here on precise amd64. Booting with 3.2.0-39 X hangs shortly after
> login with the earlier mentioned hangcheck messages in the kernel log
> (machine is remotely login:able). Reverting to 3.2.0-38 yields a stable
> machine.
>
> This is on a machine with an Intel i5 CPU on an ASUS P8H67-V
> motherboard, with the latest&greatest BIOS (upgrading the BIOS made no
> difference).
>
> ** Tags added: regression-update
>
> --
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/946899
>
> Title:
> [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU
> hung
>
> To manage notifications about this bug go to:
> https://bugs.launchpad.net/linux/+bug/946899/+subscriptions

Revision history for this message

Miklos Juhasz (mjuhasz) wrote on 2013-04-03: Re: [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung

#50

I reverted "drm/i915: GFX_MODE Flush TLB Invalidate Mode must be '1' for scanline waits" and rebuilt kernel 3.2.0-39. I haven't had the problem ever since.

Revision history for this message

Miro Justice (mirojustice) wrote on 2013-04-03:

#51

same issue here

[ 1941.972506] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[ 1941.972511] [drm] capturing error event; look for more information in /debug/dri/1/i915_error_state
[ 1941.983583] [drm] Enabling RC6 states: RC6 on, RC6p off, RC6pp off

Linux lenore 3.5.0-26-generic #42-Ubuntu SMP Fri Mar 8 23:18:20 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

Thinkpad T520 / NVIDIA Quadro NVS 4200M

Revision history for this message

Miro Justice (mirojustice) wrote on 2013-04-03:

#52

buggy fonts Edit (344.2 KiB, image/png)

I also get buggy fonts after this error, maybe it helps

Revision history for this message

Pieter (diepes) wrote on 2013-04-03:

#53

Ok . Tried www.kernel.org kernel's.
3.8.4 worked the best so far, occasional gui freezes, but switching to terminal and back kept me going.
3.8.5 had a solid freeze after hibernate.

Revision history for this message

AlexD (droid-develope) wrote on 2013-04-05:

#54

I've started expectiong this issue several times a day AFTER kernel upgrade to 3.2.0-39. When I was on 3.2.0-38 there are no such an issue at all.

uname:
3.2.0-39-generic #62-Ubuntu SMP Thu Feb 28 00:28:53 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

lspci:
00:02.0 VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09)

grep drm /var/log/syslog:
[11702.675347] [drm:i915_hangcheck_elapsed] *ERROR* hung
[11702.675407] [drm:i915_wait_request] *ERROR* awaiting 1514592 at 1514585, next 1514593)
[11767.328956] [drm:i915_hangcheck_elapsed] *ERROR* hung
[11767.329027] [drm:i915_wait_request] *ERROR* awaiting 1514606 at 1514585, next 1514607)
[ 23.837492] [drm] Initialized drm 1.1.0 20060810
[ 23.905900] [drm] Supports vblank timestamp
[ 23.905901] [drm] Driver supports precise vblank
[ 23.905918] [drm] applying RC6 disable quirk
[ 24.262876] fbcon: inteldrmfb (fb0) is primary
[ 24.262958] fb0: inteldrmfb frame buffer device
[ 24.262959] drm: registered panic notifier
[ 24.263797] [drm] Initialized i915 1.6.0 nor 0
[ 3226.155882] [drm:i915_hangcheck_elapsed] *ERROR* hung
[ 3226.155892] [drm] capturing error event; look dri/0/i915_error_state
[ 3226.158898] [drm:i915_wait_request] *ERROR* awaiting 599218 at 599216, next 599219)
[ 25.254664] [drm] Initialized drm 1.1.0 20060810
[ 25.314701] [drm] Supports vblank timestamp
[ 25.314702] [drm] Driver supports precise vblank
[ 25.314721] [drm] applying RC6 disable quirk
[ 25.658319] fbcon: inteldrmfb (fb0) is primary
[ 25.658402] fb0: inteldrmfb frame buffer device
[ 25.658403] drm: registered panic notifier
[ 25.659293] [drm] Initialized i915 1.6.0 nor 0
[ 1729.983644] [drm:i915_hangcheck_elapsed] *ERROR* hung
[ 1729.983653] [drm] capturing error event; look dri/0/i915_error_state
[ 1729.986633] [drm:i915_wait_request] *ERROR* awaiting 489704 at 489696, next 489705)
[ 23.307751] [drm] Initialized drm 1.1.0 20060810
[ 23.477669] [drm] Supports vblank timestamp
[ 23.477670] [drm] Driver supports precise vblank
[ 23.477688] [drm] applying RC6 disable quirk
[ 23.803226] fbcon: inteldrmfb (fb0) is primary
[ 23.803311] fb0: inteldrmfb frame buffer device
[ 23.803312] drm: registered panic notifier
[ 23.804133] [drm] Initialized i915 1.6.0 nor 0

I've started expectiong this issue several times a day AFTER kernel upgrade to 3.2.0-39. When I was on 3.2.0-38 there are no such an issue at all.

uname:
3.2.0-39-generic #62-Ubuntu SMP Thu Feb 28 00:28:53 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

lspci:
00:02.0 VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09)

grep drm /var/log/syslog:
[11702.675347] [drm:i915_hangcheck_elapsed] *ERROR* hung
[11702.675407] [drm:i915_wait_request] *ERROR* awaiting 1514592 at 1514585, next 1514593)
[11767.328956] [drm:i915_hangcheck_elapsed] *ERROR* hung
[11767.329027] [drm:i915_wait_request] *ERROR* awaiting 1514606 at 1514585, next 1514607)
[   23.837492] [drm] Initialized drm 1.1.0 20060810
[   23.905900] [drm] Supports vblank timestamp 
[   23.905901] [drm] Driver supports precise vblank 
[   23.905918] [drm] applying RC6 disable quirk
[   24.262876] fbcon: inteldrmfb (fb0) is primary 
[   24.262958] fb0: inteldrmfb frame buffer device
[   24.262959] drm: registered panic notifier
[   24.263797] [drm] Initialized i915 1.6.0 nor 0
[ 3226.155882] [drm:i915_hangcheck_elapsed] *ERROR* hung
[ 3226.155892] [drm] capturing error event; look dri/0/i915_error_state
[ 3226.158898] [drm:i915_wait_request] *ERROR* awaiting 599218 at 599216, next 599219)
[   25.254664] [drm] Initialized drm 1.1.0 20060810
[   25.314701] [drm] Supports vblank timestamp 
[   25.314702] [drm] Driver supports precise vblank 
[   25.314721] [drm] applying RC6 disable quirk
[   25.658319] fbcon: inteldrmfb (fb0) is primary 
[   25.658402] fb0: inteldrmfb frame buffer device
[   25.658403] drm: registered panic notifier
[   25.659293] [drm] Initialized i915 1.6.0 nor 0
[ 1729.983644] [drm:i915_hangcheck_elapsed] *ERROR* hung
[ 1729.983653] [drm] capturing error event; look dri/0/i915_error_state
[ 1729.986633] [drm:i915_wait_request] *ERROR* awaiting 489704 at 489696, next 489705)
[   23.307751] [drm] Initialized drm 1.1.0 20060810
[   23.477669] [drm] Supports vblank timestamp 
[   23.477670] [drm] Driver supports precise vblank 
[   23.477688] [drm] applying RC6 disable quirk
[   23.803226] fbcon: inteldrmfb (fb0) is primary 
[   23.803311] fb0: inteldrmfb frame buffer device
[   23.803312] drm: registered panic notifier
[   23.804133] [drm] Initialized i915 1.6.0 nor 0

Revision history for this message

Martin Wildam (mwildam) wrote on 2013-04-05: Re: [Bug 946899] Re: [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung

#55

On Fri, Apr 5, 2013 at 11:32 AM, AlexD <email address hidden> wrote:
> I've started expectiong this issue several times a day AFTER kernel
> upgrade to 3.2.0-39. When I was on 3.2.0-38 there are no such an issue
> at all.

I also have the i915 (on a Dell) and here with kernel 3.5.0-25 and
everything fine here and I only a very few hangups after my first 3.5
upgrade (don't remember which subversion of 3.5 it was) and since a
few updates ago I never had any hangup again. BTW: The 3.5 is already
available for 12.04 in the repositories.

Revision history for this message

Joseph Salisbury (jsalisbury) wrote on 2013-04-08: Re: [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung

#56

This may also be related to bug 1140716

Revision history for this message

Alex Shkop (a-v-shkop) wrote on 2013-04-09:

#57

Just updated to 3.5.0-27 - no effect. Same issue as with 3.5.0-26.

Revision history for this message

Miro Justice (mirojustice) wrote on 2013-04-10:

#58

upgrade to 3.5.0-27 halts the boot routine of T530 and T520
(tested with mintlinux)

Revision history for this message

Gerry Finlay (fingau) wrote on 2013-04-11:

#59

I just did apt-get dist-upgrade. The hang still occurs.

Apr 11 17:40:17 xxxx kernel: [ 0.000000] Linux version 3.2.0-40-generic (buildd@allspice) (gcc version 4.6.3 (Ubuntu/Linaro 4.6.3-1ubuntu5) ) #64-Ubuntu SMP Mon Mar 25 21:22:10 UTC 2013 (Ubuntu 3.2.0-40.64-generic 3.2.40)
Apr 11 17:40:17 xxxx kernel: [ 13.167716] [drm] Initialized i915 1.6.0 20080730 for 0000:00:02.0 on minor 0

Apr 11 17:41:44 xxxx kernel: [ 102.530057] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
Apr 11 17:41:44 xxxx kernel: [ 102.530094] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
Apr 11 17:41:44 xxxx kernel: [ 102.533106] [drm:i915_wait_request] *ERROR* i915_wait_request returns -11 (awaiting 6846 at 6842, next 6847)

To have a usuable system I must boot previous linux version 3.2.0-38-generic.

Revision history for this message

Marc-André Laverdière (marc-andre-atc-deactivatedaccount) wrote on 2013-04-15:

#60

I had this occuring again after upgrading to 3.5.0-27-generic #46-Ubuntu SMP Mon Mar 25 19:58:17 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

Revision history for this message

Marco Biscaro (marcobiscaro2112) wrote on 2013-04-15:

#61

It looks like this is https://bugs.freedesktop.org/show_bug.cgi?id=54226 which apparently has a fix committed in linux-next git repository (http://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git/commit/?id=a24a11e6b4e96bca817f854e0ffcce75d3eddd13).

Could someone confirm the commit above fix this issue?

Revision history for this message

herman.vierendeels (herman-vierendeels) wrote on 2013-04-16:

#62

also same problem in
3.5.0-27-generic #46-Ubuntu SMP Mon Mar 25 19:58:17 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

Revision history for this message

AntonioK (rc5hack) wrote on 2013-04-17:

#63

Same problem with Kubuntu 12.04.2 LTS / 3.2.0-40-generic / x86_64.

i915.i915_enable_rc6=0 option doesn't helps.

I discovered that hangs occurs more often in first minutes after starting X, and if there are few hours passed without problems, hang likely won't occur untill reboot.

Revision history for this message

Ara Pulido (ara) wrote on 2013-04-18:

#65

This is happening to me with a sandybridge GPU with 3.8.0-18 at least once a day.

Revision history for this message

Pieter (diepes) wrote on 2013-04-19:

#66

Dam happened on 3.8.8-030808-generic #201304170248 SMP Wed Apr 17 06:49:45 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux
Apr 19 16:12:04 t420 kernel: [69253.048784] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung

Fabio Marconi (fabiomarconi) on 2013-04-21

Changed in linuxmint:
status:	New → Confirmed

Revision history for this message

Olivier Febwin (febcrash) wrote on 2013-04-25:

#67

i915_error_state Edit (2.2 MiB, text/plain)

Same problem on Ubuntu 13.04

[ 837.379197] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[ 837.379203] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state

crash@Dell-Latitude-E6520:~$ uname -a
Linux Dell-Latitude-E6520 3.8.0-19-generic #29-Ubuntu SMP Wed Apr 17 18:16:28 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

Revision history for this message

Antonios Hadjigeorgalis (antonioshadji) wrote on 2013-04-27:

#68

cat /var/log/syslog |grep *ERROR*
kernel: [89652.881261] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
kernel: [89658.951763] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
kernel: [89665.860399] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
kernel: [90567.443620] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
kernel: [90573.514121] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung

lsb_release -a
No LSB modules are available.
Distributor ID: Ubuntu
Description: Ubuntu 12.04.2 LTS
Release: 12.04
Codename: precise

uname -a
3.5.0-27-generic #46~precise1-Ubuntu SMP Tue Mar 26 19:33:21 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

I recently enable the LTS Hardware enablement stack:
sudo apt-get install --install-recommends linux-generic-lts-quantal xserver-xorg-lts-quantal libgl1-mesa-glx-lts-quantal

This error is the first major issue I've had since a clean install of 12.04 was done in sept 2012.

Revision history for this message

Gerry Finlay (fingau) wrote on 2013-05-02:

#69

I just did apt-get dist-upgrade. The hang no longer occurs.

#uname -a
Linux XXX 3.2.0-41-generic #66-Ubuntu SMP Thu Apr 25 03:27:11 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

Ubuntu linux version history:
3.2.0-38-generic No problems
3.2.0-39-generic System hang almost immediately after login
3.2.0-40-generic System hang almost immediately after login
3.2.0-41-generic No problems (so far)

I have used my system for a few hours now with no problems. If that continues I will not add any more comments to this page.

Revision history for this message

Niklas Edmundsson (niklas-edmundsson) wrote on 2013-05-02:

#70

As suggested earlier, this issue seems to be tracked in bug 1140716.

That bug recently logs "Fix Released".

In hindsight, I'm guessing that the original issue of this bug was something completely different given its age. But since this was what popped up when searching...

Revision history for this message

Pimou (pmhoudry) wrote on 2013-05-07:

#71

Same story for me on a thinkpad x220, sandy bridge, core i5, hd 3000. When using chrome, (but not only : last time it was syslog gui), and even under normal cpu load, the system freezes 2-3 times a day. The sound works, the mouse also, but you can't do nothing (even ctrl-alt-fx) but hard reset.

I got this issue on Debian Wheezy, Ubuntu 13.04, but not xubuntu 13.04. And I've experienced nearly two years of perfectly stable system on Fedora 17 (same gnome version as debian wheezy, 3.4)

Revision history for this message

Fake51 (peter-e-lind) wrote on 2013-05-09:

#72

I have this problem as well, after upgrading to 13.04 (did not experience it before). I've tried upgrading to the 3.9 kernel, but no difference (if anything it crashes more often). There is nothing as such that seems to set it off - about every two-three hours (at most) the screen just freezes up, and then after a few seconds I can use the mouse but all windows are unresponsive.

Interestingly, it's not a complete freeze - I can ctrl+alt+f1 to another terminal, killall lightdm, then start it again and get a new session running. Still, the upgrade to 13.04 has left my work computer in a rather bad shape and if there was an easy way of downgrading I'd chuck this version out as fast as I could.

Revision history for this message

peterbo (peterbo) wrote on 2013-05-11:

#73

Same here, complete new install of Ubuntu 13.04 on Asus U46SV i5-2410M 2.3 ghz, Sandy Bridge. GPU Hang especially happens frequently when using dual screen setup. Ctrl + Alt + F-something and then switching back again makes the desktop responsive again most times, although occasionally I have to rebbot.

Revision history for this message

Fake51 (peter-e-lind) wrote on 2013-05-13:

#74

Just tried peterbo's suggestion and that works for me as well - switching to another terminal, then switching back, makes the desktop responsive again.

Revision history for this message

Olivier Febwin (febcrash) wrote on 2013-05-13:

#75

As Fake51, I can get back my desktop when I switch on tty1 and then tty7

Revision history for this message

peterbo (peterbo) wrote on 2013-05-15:

#76

Just an update, the problem is completely the same with kernel 3.9 in Ubuntu 13.04, same machine as in my comment #73.

Revision history for this message

In freedesktop.org Bugzilla #65273, Murks (murks) wrote on 2013-06-02:

#112

Chipset is whatever is in the i3-2350M processor, HD3000 or whatever it's called.
system architecture: i686
libdrm: 2.4.45
mesa: 9.1.3
xf86-video-intel: 2.21.8
X.Org X Server: 1.14.1, Build Operating System: Linux 3.8.7
uname -r: 3.9.4-1-ARCH
Linux distribution: Arch Linux
Reproducable: not really, so far it only happened to me while playing games. It happens rarely and without obvious triggering event.

Last entry:
...
Jun 02 20:49:41 eeyore devmon[1128]: partition: [1]
Jun 02 23:12:45 eeyore kernel: [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
Jun 02 23:12:45 eeyore kernel: [drm] capturing error event; look for more information in/sys/kernel/debug/dri/0/i915_error_state
Jun 02 23:12:57 eeyore systemd-logind[1132]: Power key pressed.
Jun 02 23:12:57 eeyore systemd-logind[1132]: Powering Off...
...

$cat /sys/kernel/debug/dri/0/i915_error_state
no error state collected

What happens is quite simple, the whole system seems to be frozen, almost nothing reacts, certainly nothing that could help me to collect further information. The problem seems to be present since a while, it caused me to report this: https://bugs.freedesktop.org/show_bug.cgi?id=61411.
However, the first few times I played relatively demanding games and the laptop got rather warm, this time the game was far less demanding and the machine was not warmer than usual.

If there's any further information I can provide to help fix this, please tell me.

Revision history for this message

CuteChaps (sh-senthilkumar) wrote on 2013-06-12:

#77

I am running Ubuntu 13.04 X64 on 3.9 Kernel, facing same issue. Can some one get this fixed or any work around atleast would do.

[125646.132414] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[125646.132422] [drm] capturing error event; look for more information in/sys/kernel/debug/dri/0/i915_error_state
[125648.145662] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[125648.145901] [drm:i915_reset] *ERROR* GPU hanging too fast, declaring wedged!
[125648.145907] [drm:i915_reset] *ERROR* Failed to reset chip.

$ uname -ar
Linux Senthil-IN 3.9.0-030900-generic #201304291257 SMP Mon Apr 29 16:58:15 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

Revision history for this message

Max Liebkies (mliebkies) wrote on 2013-06-19:

#78

I have been trying the latest mainline kernel (3.10-rc6) and haven't had any problems since 2 days.

Revision history for this message

Pieter (diepes) wrote on 2013-06-19:

#79

I am running 3.9.6 (it had a lot of i915 fix's) and also running for 3 days.
Laptop with lot of hibernate, single dual screen dock/un-docking.

Install from http://kernel.ubuntu.com/~kernel-ppa/mainline/

Revision history for this message

Pieter (diepes) wrote on 2013-06-22:

#80

and on the 4th day, GPU hung. :(

Revision history for this message

Max Liebkies (mliebkies) wrote on 2013-06-24:

#81

I haven't had any problems since my last post.

[4 days is fine given the current state in 3.8 by the way ;-)]

Revision history for this message

Anthony Rogliano (anthonysr) wrote on 2013-07-26:

#82

This still happens on Ubuntu 13.04 patched as of 7/26/2013 on a Lenovo T420.

kernel: 3.8.0-26-generic #38-Ubuntu SMP Mon Jun 17 21:43:33 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux
lspci: 00:02.0 VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09)

Please prioritize as it's extremely frustrating. I can't seem to easily capture the error_state because it's cleared out upon reboot (unable to switch TTYs etc).

Revision history for this message

Anthony Rogliano (anthonysr) wrote on 2013-07-30:

#83

I've tried the 3.9 kernel and while it still crashes this now at least allows me to switch to tty1 and back to tty7 to re-gain control of the UI.

Revision history for this message

Alexandre Ligo (alexandrekl) wrote on 2013-08-04:

#84

Same problem here after upgrading to Ubuntu 13.04. X freezes just after starting any flash video.
My system is a Sony Vaio VPCSB with dual graphics (Intel i915 and Radeon HD 6400M/7400M).

Revision history for this message

In freedesktop.org Bugzilla #65273, Brian Baligad (brianbaligad) wrote on 2013-08-16:

#113

Created attachment 84132
event capture

These are the contents of /sys/kernel/debug/dri/0/i915_error_state after the gpu hung event occurred.

Revision history for this message

In freedesktop.org Bugzilla #65273, Brian Baligad (brianbaligad) wrote on 2013-08-16:

#114

Created attachment 84134
xrandr --verbose

Revision history for this message

In freedesktop.org Bugzilla #65273, Brian Baligad (brianbaligad) wrote on 2013-08-16:

#115

Bug description:
I seem to be having the same issue. This occurred several times in a short period while using chromium on a second monitor. I set the secondary monitor up with: xrandr --output eDP1 --right-of HDMI1

System environment:
-- chipset: i7-4700MQ with the HD 4600 gpu
-- system architecture: 64-bit (x86_64)
-- xf86-video-intel: 2.21.14-2
-- xserver: 1.14.2-2
-- mesa: 9.1.6-1
-- libdrm: 2.4.46-2
-- kernel: 3.10.6-2-ARCH
-- Linux distribution: Arch
-- Machine or mobo model: Toshiba Satellite P70-A [PSPLNU-01Q006]
-- Display connector: hdmi

dmesg:
[drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[drm] capturing error event; look for more information in /sys/kernel/debug/dri/0/i915_error_state
[drm:kick_ring] *ERROR* Kicking stuck wait on blitter ring

I attached the contents of i915_error_state

Revision history for this message

fsando (fsando) wrote on 2013-08-16:

#85

2013-08-16-120746-gpu-hang.txt.zip Edit (336.1 KiB, application/zip)

Will it help to upload log files etc?

I take a snaptshot of dmesg, i915_error_state and Xorg.0.log every time it happens. I have a ton of those files.

I put this bash script in my home folder and run it:

#!/bin/bash
# create a date for filenames
datestr=$(date '+%F-%H%M%S')
# snapshot of files
sudo cat /sys/kernel/debug/dri/0/i915_error_state > zz-hung-gpu/$datestr-i915_error_state.txt
dmesg > zz-hung-gpu/$datestr-dmesg-gpu-hang.txt
cat /var/log/Xorg.0.log > zz-hung-gpu/$datestr-Xorg.0.log-gpu-hang.txt
# write to screen the create files
echo zz-hung-gpu/$datestr-i915_error_state.txt
echo zz-hung-gpu/$datestr-dmesg-gpu-hang.txt
echo zz-hung-gpu/$datestr-Xorg.0.log-gpu-hang.txt

Revision history for this message

peterbo (peterbo) wrote on 2013-08-16:

#86

Hi guys, please try to install the newest kernel from ubuntu mainline repos. I have been running this for a week now and have gotten no hang since.

http://kernel.ubuntu.com/~kernel-ppa/mainline/v3.11-rc5-saucy/

Download 3 deb packages, the one ending in _all and the two others for your architecture.
Then run a terminal "sudo dpkg -i linux-*" and reboot.

Revision history for this message

In freedesktop.org Bugzilla #65273, Brian Baligad (brianbaligad) wrote on 2013-08-20:

#116

Dual head display seemed to work fine earlier this month when I installed this system. In the last week or so I run into the same issues anytime the second display is connected.
[ 356.614203] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[ 356.614208] [drm] capturing error event; look for more information in /sys/kernel/debug/dri/0/i915_error_state
[ 356.621884] [drm:kick_ring] *ERROR* Kicking stuck wait on blitter ring
[ 452.710408] Watchdog[966]: segfault at 0 ip 00007ff58f002938 sp 00007ff57ca5f010 error 6 in chromium[7ff58e229000+503a000]
[ 458.656344] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[ 458.656380] [drm:kick_ring] *ERROR* Kicking stuck wait on blitter ring
[ 470.681128] Watchdog[1146]: segfault at 0 ip 00007fc67a291938 sp 00007fc667cee010 error 6 in chromium[7fc6794b8000+503a000]
[ 473.721812] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[ 483.709854] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[ 483.709868] [drm:kick_ring] *ERROR* Kicking stuck wait on blitter ring
[ 541.776618] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[ 541.776654] [drm:kick_ring] *ERROR* Kicking stuck wait on blitter ring
[ 553.800507] Watchdog[1201]: segfault at 0 ip 00007f83ca2c6938 sp 00007f83b7d23010 error 6 in chromium[7f83c94ed000+503a000]
[ 556.808672] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[ 556.808693] [drm:kick_ring] *ERROR* Kicking stuck wait on blitter ring

Revision history for this message

In freedesktop.org Bugzilla #65273, Brian Baligad (brianbaligad) wrote on 2013-08-20:

#117

Correction: I just realized that it's fine as long as I don't put Chromium on the second display. My statement "seemed to work fine earlier this month" was actually due to that.

Revision history for this message

Stan Schymanski (schymans) wrote on 2013-08-20:

#87

Thanks, peterbo, for the link incl. installation instructions!
Unfortunately after two days, I had another crash with the new kernel on Ubuntu 12.10. The mouse and keyboard stopped responding and before I could try to do CTRL+ALT+F4 or CTRL+ALT+SysRq+REISUB, the laptop just turned itself off. No traces left in syslog. Very annoying.

Revision history for this message

monomakh (monomakh) wrote on 2013-08-21:

#88

Hi guys, i download and install kernel from this mainline
http://kernel.ubuntu.com/~kernel-ppa/mainline/drm-intel-nightly/current/
and install this kernel
http://kernel.ubuntu.com/~kernel-ppa/mainline/drm-intel-nightly/2013-08-15-saucy/linux-headers-3.11.0-994_3.11.0-994.201308150404_all.deb
No hangs for a week.

Revision history for this message

peterbo (peterbo) wrote on 2013-08-21:

#89

schymans: I am running 13.04 so have not tested on 12.10. The crash you describe sounds a bit worse than the gpu hang but I obviously cannot be sure. My laptop never rebooted when the hang occurred though.

I must admit that I had one hang with rc5 after 7 days, however the error message was different than the usual "Hangcheck timer elapsed". It was something with a "ring" - cannot remember any more of it. It happened after having dual monitors connected, suspending my laptop and then opening it without any monitors connected. Am on rc6 now and no problems yet.

I cannot seem to force the hang, neither by running glxspheres, dragging windows fast across the desktop or playing with compiz effetcs/settings.

Revision history for this message

Stan Schymanski (schymans) wrote on 2013-08-22: Re: [Bug 946899] Re: [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung

#90

Thanks, peterbo. Sorry, got confused. Of course, I'm running it on
13.04, too. I have had these really weird crashes before (rebooting by
itself) and they went away after one of the kernel upgrades, maybe even
when upgrading from 12.10 to 13.04. Got the Hangcheck timer elapsed ones
instead. That is, sometimes, when I return to my laptop, it is turned
off even when I'm pretty sure I left it on, so maybe they have just been
happening while idle. Now with the rc5 kernel I got the original crash
(without error log) within hours again, so I got discouraged from trying
any further. It's my work laptop, running dual monitors, and I really
can't afford to experiment too much as I need it to run and do its job.
I still can't believe that this is such a persistent bug (several
bugs?). The Linux kernel used to be considered bomb-proof! Apparently
its bugs, too...

On 21/08/13 16:04, peterbo wrote:
> schymans: I am running 13.04 so have not tested on 12.10. The crash you
> describe sounds a bit worse than the gpu hang but I obviously cannot be
> sure. My laptop never rebooted when the hang occurred though.
>
> I must admit that I had one hang with rc5 after 7 days, however the
> error message was different than the usual "Hangcheck timer elapsed". It
> was something with a "ring" - cannot remember any more of it. It
> happened after having dual monitors connected, suspending my laptop and
> then opening it without any monitors connected. Am on rc6 now and no
> problems yet.
>
> I cannot seem to force the hang, neither by running glxspheres, dragging
> windows fast across the desktop or playing with compiz effetcs/settings.
>

Revision history for this message

Chas Emerick (chas-d) wrote on 2013-08-23: Re: [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung

#91

i915_error_state-20130822 Edit (1.0 MiB, application/octet-stream)

Another data point; I had posted over at http://ubuntuforums.org/showthread.php?t=2168780, but recently found this bug based on kernel crash syslog entries.

Anyway, I'm on a Macbook 4,2 FWIW:

$ uname -a
Linux cle-mba 3.8.0-29-generic #42-Ubuntu SMP Tue Aug 13 19:40:39 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

partial syslog:
Aug 22 05:26:55 cle-mba kernel: [ 0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-3.8.0-29-generic root=UUID=748df707-f110-49e0-b603-7ca1439341d8 ro quiet splash resume=/dev/sda3 vt.handoff=7
Aug 22 13:17:49 cle-mba kernel: [78499.977886] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
Aug 22 13:17:49 cle-mba kernel: [78499.977892] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state

Attached i915_error_state from the same event.

Note that I used to have i915.i915_enable_rc6=0 in my grub options, but removed it in an attempt to eliminate the hangs/crashes based on recommendations on a random forum thread. :-P Behaviour has been unaffected; I get a GPU hang or window manager crash (i.e. have to "recover" via tty1) ~every other day.

Revision history for this message

Stan Schymanski (schymans) wrote on 2013-08-28:

#92

Just confirming that the kernel under http://kernel.ubuntu.com/~kernel-ppa/mainline/v3.11-rc5-saucy/ is not usable to me. My laptop crashes unrecoverably within hours and does not leave any traces in syslog. About to remove it again from the kernel list.

3.11.0-031100rc5-generic from http://kernel.ubuntu.com/~kernel-ppa/mainline/drm-intel-nightly/current/ works slightly better for me, but I still get similar crashes every couple of days.

Revision history for this message

Stan Schymanski (schymans) wrote on 2013-08-28:

#93

Oops, wrong copy and paste, sorry!
3.11.0-031100rc5-generic from http://kernel.ubuntu.com/~kernel-ppa/mainline/v3.11-rc5-saucy/ crashes several times a day
3.11.0-994-generic from http://kernel.ubuntu.com/~kernel-ppa/mainline/drm-intel-nightly/current/ crashes every couple of days on my machine

Revision history for this message

Stan Schymanski (schymans) wrote on 2013-08-28:

#94

i915_error_state Edit (2.0 MiB, text/plain)

After half a day of working on 3.8.13-030813-generic, got the "[drm:i915_hangcheck_hung]" again. At least it was sort-of recoverable, as I was able to log into another tty and backup i915_error_state (attached). Hope it helps someone for diagnosing the problem. When returning to tty7, Screen was still flackering whenever I moved the mouse, and it didn't seem to respond to any clicks, so I had to logout by CTRL+ALT+DEL and then I could log back in without problems.

Revision history for this message

Olivier Febwin (febcrash) wrote on 2013-08-30:

#95

[24033.443586] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[24033.443592] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state

crash@Dell-Latitude-E6520:~$ uname -a
Linux Dell-Latitude-E6520 3.8.0-30-generic #43-Ubuntu SMP Wed Aug 21 21:07:22 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

Revision history for this message

rb.eng (rb-engch) wrote on 2013-09-11:

#96

Perhaps this is a related issue:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/946899

As I noted in the comments for that issue:

This issue impacts me multiple times a week. Sometimes the mouse continues to work but other times the mouse freezes also. I have not yet tried to open a teminal as suggested by bduncan (comment #13) but I am usually able to switch to the first terminal and kill compiz.

My freeze often occurs when switching workspaces. I often have thunderbird and chromium running laterally in desktop 1 and 2 respectively (of four total) and use an external drive on either a laptop or a netbook with an external display attached. The problem exists regardless of the hardware I am using.

When the freeze occurs and I can get to tty1, killing compiz is my work around. I usually use sudo top and find the compiz pid and kill it with signal 3. When I switch back to tty7 the screen will trigger the clearing of the freeze after I switch to it (tty7). Then compiz restarts and after a few moments (perhaps a minute) I am back working with the application which were open. When the freeze occurs and the keyboard becomes unresponsive (unable to switch to tty1) I must do a hard reboot to continue working from a restart.

I have looked into the log files and searched for any instance of compiz logging the freeze. All I have found is the indication I have killed the process. Compiz does not seem to be tied into the apport system for reporting. I expect there are several more users with this problem but the freezes go unreported as users have no idea what to do about it.

Unfortunately these unknown freezes (unknown to development community) will continue until these problems are given priority and addressed. Bottom line is that such freezes are like the blue screen of death and give potential adopters of ubuntu a crummy experience and all the more reason to switch back to something more familiar or "stable".

Compiz has made my user experience a challenging one. To that end I have enjoyed lighter interfaces including crunchbang and trisquel. I have been reluctant to go beyond the 12.04 LTS version as I am concerned my hardware is not up to the task. Compiz IMO provides me an awful experience and these problems give me pause to encourage others to the platform.

I implore those developers with more understanding of the freeze issue to put some heat on this problem and investigate what is going on and fix this. To ignore this further may render any other work on the ubuntu project to be moot if the user experience continues to suffer, IMO. I am happy to contribute what I can to help address this but I do not have deep knowledge of the compiz system.

Perhaps this is a related issue:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/946899

As I noted in the comments for that issue:

This issue impacts me multiple times a week. Sometimes the mouse continues to work but other times the mouse freezes also. I have not yet tried to open a teminal as suggested by bduncan (comment #13) but I am usually able to switch to the first terminal and kill compiz.

My freeze often occurs when switching workspaces. I often have thunderbird and chromium running laterally in desktop 1 and 2 respectively (of four total) and use an external drive on either a laptop or a netbook with an external display attached. The problem exists regardless of the hardware I am using.

When the freeze occurs and I can get to tty1, killing compiz is my work around. I usually use sudo top and find the compiz pid and kill it with signal 3. When I switch back to tty7 the screen will trigger the clearing of the freeze after I switch to it (tty7). Then compiz restarts and after a few moments (perhaps a minute) I am back working with the application which were open. When the freeze occurs and the keyboard becomes unresponsive (unable to switch to tty1) I must do a hard reboot to continue working from a restart.

I have looked into the log files and searched for any instance of compiz logging the freeze. All I have found is the indication I have killed the process. Compiz does not seem to be tied into the apport system for reporting. I expect there are several more users with this problem but the freezes go unreported as users have no idea what to do about it.

Unfortunately these unknown freezes (unknown to development community) will continue until these problems are given priority and addressed. Bottom line is that such freezes are like the blue screen of death and give potential adopters of ubuntu a crummy experience and all the more reason to switch back to something more familiar or "stable".

Compiz has made my user experience a challenging one. To that end I have enjoyed lighter interfaces including crunchbang and trisquel. I have been reluctant to go beyond the 12.04 LTS version as I am concerned my hardware is not up to the task. Compiz IMO provides me an awful experience and these problems give me pause to encourage others to the platform.

I implore those developers with more understanding of the freeze issue to put some heat on this problem and investigate what is going on and fix this. To ignore this further may render any other work on the ubuntu project to be moot if the user experience continues to suffer, IMO. I am happy to contribute what I can to help address this but I do not have deep knowledge of the compiz system.

Revision history for this message

rb.eng (rb-engch) wrote on 2013-09-11:

#97

Correct link in comment #96 should read

Perhaps this is a related issue:
https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-ati/+bug/987498

rb

Revision history for this message

monomakh (monomakh) wrote on 2013-09-16:

#98

Bug 987498 is really duplicate of this. On my home PC no problems with nvidia drivers, but on my work compiz freezes several times on day with intel.

Revision history for this message

Olivier Febwin (febcrash) wrote on 2013-10-07:

#100

Same problem on Ubuntu 13.10 Saucy

[14026.619116] [drm] capturing error event; look for more information in /sys/kernel/debug/dri/0/i915_error_state
[14026.635119] [drm:i915_set_reset_status] *ERROR* render ring hung inside bo (0x1dd7000 ctx 1) at 0x1dd71d8
crash@Dell-Latitude-E6520:~$ uname -a
Linux Dell-Latitude-E6520 3.11.0-11-generic #17-Ubuntu SMP Tue Oct 1 19:42:04 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

Revision history for this message

André Panisson (panisson) wrote on 2013-10-10:

#101

i915_error_state Edit (333.3 KiB, application/octet-stream)

Same problem on my Dell XPS 15. The interesting thing is that it happened while navigating Google Maps in Chrome.
Switched to a terminal using CTRL+ALT+F1 and back. When switched back, the interface responded for a few seconds, and hanged again. Before hanging again, Chrome was able to show the message "Rats! WebGL hit a snag".

dmesg:
[84044.751322] Watchdog[8060]: segfault at 0 ip 00007fa0e8b11f2e sp 00007fa0d8f7b4e0 error 6 in chrome[7fa0e5406000+56da000]
[84046.087148] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
[84046.087153] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state

uname -a
Linux xxx 3.8.0-31-generic #46-Ubuntu SMP Tue Sep 10 20:03:44 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

Revision history for this message

André Panisson (panisson) wrote on 2013-10-10:

#102

By the way, switching to the terminal, killing compiz and switching back again was able to restore the interface without having to restart lightdm and all my applications.

Revision history for this message

Martin Wildam (mwildam) wrote on 2013-10-12: Re: [Bug 946899] Re: [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung

#103

Download full text (6.9 KiB)

I also have sometimes hangups where CTRL+ALT+F1 and afterwards
CTRL+ALT+F7 brings back GUI response, but very rarely. I am using
Chrome only for Hangouts, for everything else I am using Firefox.

On Thu, Oct 10, 2013 at 4:23 PM, André Panisson
<email address hidden> wrote:
> By the way, switching to the terminal, killing compiz and switching back
> again was able to restore the interface without having to restart
> lightdm and all my applications.
>
> --
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/946899
>
> Title:
> [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU
> hung
>
> Status in The Linux Kernel:
> Incomplete
> Status in The Linux Mint Distribution:
> Confirmed
> Status in “linux” package in Ubuntu:
> Triaged
> Status in “linux” package in Fedora:
> Unknown
>
> Bug description:
> Since upgrading to 12.04 beta, I've seen this happen twice. The
> symptoms are:
>
> - The screen freezes
> - The backlight turns off
>
> At that point I have to reboot to get my display back. In syslog, i
> see:
>
>
> Mar 4 23:09:18 perseus kernel: [ 3751.612064] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
> Mar 4 23:09:18 perseus kernel: [ 3751.612076] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
> Mar 4 23:09:18 perseus kernel: [ 3751.613658] [drm:i915_wait_request] *ERROR* i915_wait_request returns -11 (awaiting 114040 at 114004, next 114118)
> Mar 4 23:09:18 perseus kernel: [ 3751.637049] [drm:init_ring_common] *ERROR* render ring initialization failed ctl 00000000 head 00000000 tail 00000000 start 00000000
>
> followed by a lot of things like:
>
>
> Mar 4 23:09:18 perseus kernel: [ 3751.700662] WARNING: at /build/buildd/linux-3.2.0/drivers/gpu/drm/i915/intel_display.c:793 intel_enable_pipe+0x14a/0x150 [i915]()
> Mar 4 23:09:18 perseus kernel: [ 3751.700667] Hardware name: 6465CTO
> Mar 4 23:09:18 perseus kernel: [ 3751.700670] PLL state assertion failure (expected on, current off)
>
> Mar 4 23:09:18 perseus kernel: [ 3751.812158] WARNING: at /build/buildd/linux-3.2.0/drivers/gpu/drm/i915/intel_display.c:930 assert_pipe+0x75/0x80 [i915]()
> Mar 4 23:09:18 perseus kernel: [ 3751.812165] Hardware name: 6465CTO
> Mar 4 23:09:18 perseus kernel: [ 3751.812170] pipe B assertion failure (expected on, current off)
>
> then:
>
>
> Mar 4 23:09:19 perseus kernel: [ 3753.044086] [drm:intel_lvds_enable] *ERROR* timed out waiting for panel to power on
> Mar 4 23:09:20 perseus kernel: [ 3753.671451] [drm:i915_wait_request] *ERROR* something (likely vbetool) disabled interrupts, re-enabling
> Mar 4 23:09:20 perseus kernel: [ 3753.684603] [drm:i915_wait_request] *ERROR* something (likely vbetool) disabled interrupts, re-enabling
> Mar 4 23:09:20 perseus kernel: [ 3753.704594] [drm:i915_wait_request] *ERROR* something (likely vbetool) disabled interrupts, re-enabling
> Mar 4 23:09:20 perseus kernel: [ 3753.724594] [drm:i915_wait_request] *ERROR* something (likely vbetool) disabled interrupts, re-enabling
> Mar 4 23:09:20 perseus ke...

Changed in xserver-xorg-video-intel:
importance:	Unknown → Medium
status:	Unknown → Confirmed

Changed in linux (Fedora):
importance:	Unknown → Undecided
status:	Unknown → Won't Fix

Linux Mint

8086:2a02 [drm:i915_hangcheck_elapsed] ERROR Hangcheck timer elapsed... GPU hung

Bug Description

Other bug subscribers

Related questions

Bug attachments

Remote bug watches

	Status	Importance	Assigned to
Linux	Incomplete	High	linux-kernel-bugs #43267
Linux Mint	Incomplete	Undecided	Unassigned
xf86-video-intel	Unknown	Medium	freedesktop-bugs #65273
linux (Fedora)	Won't Fix	Undecided	redhat-bugs #879823
linux (Ubuntu)	Incomplete	High	Unassigned

Changed in xserver-xorg-video-intel:
status:	Confirmed → Unknown

Linux Mint

8086:2a02 [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung

Bug Description

Other bug subscribers

Related questions

Bug attachments

Remote bug watches

8086:2a02 [drm:i915_hangcheck_elapsed] ERROR Hangcheck timer elapsed... GPU hung