drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung

Bug #737972 reported by mic
80
This bug affects 14 people
Affects Status Importance Assigned to Milestone
xserver-xorg-video-intel (Ubuntu)
Fix Released
Undecided
Unassigned

Bug Description

Binary package hint: xserver-xorg-video-intel

This happens usualy in 10 minutes after login. Graphics freezes, but ctrl-alt-f1 usualy works (sometimes not) and I am able to run metacity from console to continue with work (and to send this bug report). Interesting parts from dmesg are these:
  741.124066] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 741.125460] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -11 (awaiting 140699 at 140697, next 140700)
[ 741.125594] [drm:i915_reset] *ERROR* Failed to reset chip.
[ 743.470831] compiz[1590]: segfault at 0 ip 00f7c5c6 sp bf9cd0a8 error 6 in libc-2.13.so[f07000+15a000]

Sometimes computer feezes totally (I can onlymove with mouse, keyboard is not working). Usualy computer freezes in 20 minuts after running metacity --replace.

I am able to provide further informations, or run some test, justl et me know.

ProblemType: Bug
DistroRelease: Ubuntu 11.04
Package: xserver-xorg-video-intel 2:2.14.0-4ubuntu4
ProcVersionSignature: Ubuntu 2.6.38-7.35-generic 2.6.38
Uname: Linux 2.6.38-7-generic i686
Architecture: i386
DRM.card0.LVDS.1:
 status: connected
 enabled: enabled
 dpms: On
 modes: 1280x800 1280x800
 edid-base64: AP///////wAkTXMjAAAAAAAPAQOAIRV4Cof1lFdPjCcnUFQAAAABAQEBAQEBAQEBAQEBAQEBxxsAoFAgFzAwIDYAS88QAAAZJhcAoFAgFzAwIDYAS88QAAAZAAAADwCBCjKBCigUAQBMo1gzAAAA/gBMVE4xNTRYMy1MMDQKAA==
DRM.card0.VGA.1:
 status: disconnected
 enabled: disabled
 dpms: Off
 modes:
 edid-base64:
Date: Fri Mar 18 22:30:52 2011
DistUpgraded: Log time: 2011-03-03 21:39:53.327059
DistroCodename: natty
DistroVariant: ubuntu
GraphicsCard:
 Intel Corporation Mobile 915GM/GMS/910GML Express Graphics Controller [8086:2592] (rev 03) (prog-if 00 [VGA controller])
   Subsystem: IBM Device [1014:058c]
   Subsystem: IBM Device [1014:058c]
InstallationMedia: Ubuntu-Netbook 10.10 "Maverick Meerkat" - Release Candidate i386 (20100928.2)
MachineType: IBM 2529ETG
PccardctlIdent:
 Socket 0:
   no product info available
PccardctlStatus:
 Socket 0:
   no card
ProcEnviron:
 LANG=en_US.UTF-8
 SHELL=/bin/bash
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.38-7-generic root=UUID=e613ec73-66b6-4a53-8659-07e132f0c679 ro quiet splash vt.handoff=7
SourcePackage: xserver-xorg-video-intel
UpgradeStatus: Upgraded to natty on 2011-03-04 (14 days ago)
dmi.bios.date: 10/28/2005
dmi.bios.vendor: IBM
dmi.bios.version: 77ET41WW (1.04 )
dmi.board.name: 2529ETG
dmi.board.vendor: IBM
dmi.board.version: Not Available
dmi.chassis.asset.tag: No Asset Information
dmi.chassis.type: 10
dmi.chassis.vendor: IBM
dmi.chassis.version: Not Available
dmi.modalias: dmi:bvnIBM:bvr77ET41WW(1.04):bd10/28/2005:svnIBM:pn2529ETG:pvrThinkPadZ60m:rvnIBM:rn2529ETG:rvrNotAvailable:cvnIBM:ct10:cvrNotAvailable:
dmi.product.name: 2529ETG
dmi.product.version: ThinkPad Z60m
dmi.sys.vendor: IBM
version.compiz: compiz 1:0.9.4-0ubuntu7
version.libdrm2: libdrm2 2.4.23-1ubuntu3
version.libgl1-mesa-glx: libgl1-mesa-glx 7.10.1-0ubuntu3
version.xserver-xorg: xserver-xorg 1:7.6~3ubuntu11
version.xserver-xorg-video-ati: xserver-xorg-video-ati 1:6.14.0-0ubuntu4
version.xserver-xorg-video-intel: xserver-xorg-video-intel 2:2.14.0-4ubuntu4
version.xserver-xorg-video-nouveau: xserver-xorg-video-nouveau 1:0.0.16+git20110107+b795ca6e-0ubuntu6

Revision history for this message
mic (mic-launchpad) wrote :
Revision history for this message
Bryce Harrington (bryce) wrote :

Hi mic, this sounds like a fix we just put in for 915/945 chips available in Ubuntu's linux 2.6.38-7.36 kernel, so I think once you update to that kernel you should find this resolved.

Changed in xserver-xorg-video-intel (Ubuntu):
status: New → Fix Released
Revision history for this message
mic (mic-launchpad) wrote :

I have 2.6.38-7.37 now and the problems remains (at least yesterday it was). Now I am doing update to recent natty (i update daily), that have kernel 2.6.38-7.38. After update and restart I will report the result.

Revision history for this message
mic (mic-launchpad) wrote :

After update the problem remains (now I am using kernel 2.6.38-7.38). Unity last for minute or two after login, then it freezes. I logged in to the console, tried to run metacity --replace, or compiz --replace, but nothing helps. Restart of gdm "worked", X was restarted, but screen was black. I was able to login (it was verified by login sound), but black screen is useless:-)

Running ubuntu with Classic session (no effects) seems stable, if metacity crashed, I am able to restart it from console. After that opengl application do not work.

Last part of my dmesg log:

[ 447.193579] lib80211_crypt: registered algorithm 'TKIP'
[ 501.865230] exe (2183): /proc/2183/oom_adj is deprecated, please use /proc/2183/oom_score_adj instead.
[ 518.848029] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 518.849439] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -11 (awaiting 29941 at 29938, next 29948)
[ 518.849539] [drm:i915_reset] *ERROR* Failed to reset chip.
[ 519.208698] show_signal_msg: 15 callbacks suppressed
[ 519.208706] compiz[1616]: segfault at 0 ip 00fea5c6 sp bff25f18 error 6 in libc-2.13.so[f75000+15a000]
[ 622.112540] compiz[2662]: segfault at 0 ip 00959ed1 sp bfacd060 error 6 in i915_dri.so[942000+64000]

What information do you need? I should be able to provide it.

Revision history for this message
Bryce Harrington (bryce) wrote :

From while the system is hung, we need:
  * Output of 'sudo intel_gpu_dump'
  * Copy of your /sys/kernel/debug/dri/0/i915_error_state
  * Output of 'dmesg'
  * Your /var/log/Xorg.0.log

Revision history for this message
mic (mic-launchpad) wrote :

Hi, here are the requested files (i915_error_state is twice, one for dri/0/intel_... and one for 64). Let me know if you need something additional.

Revision history for this message
Bryce Harrington (bryce) wrote : Re: [Bug 737972] Re: drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung

On Wed, Mar 23, 2011 at 11:12:20PM -0000, mic wrote:
> Hi, here are the requested files (i915_error_state is twice, one for
> dri/0/intel_... and one for 64). Let me know if you need something
> additional.
>
> ** Attachment added: "Requested files"
> https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-intel/+bug/737972/+attachment/1934799/+files/debug.tar.gz

Thanks; for future reference please attach each file separately rather
than in a tarball; tarballs add extra steps for developers to look at
them.

Revision history for this message
mic (mic-launchpad) wrote :

Today I updated (to kernel 2.6.38-7.39), the problem still remains. But I have interesting observation. The freeze always (as I remember) occurs while I was using Google Chrome. After switching to firefox (after update), Unity seems stable (so far, nothing crashed).

Revision history for this message
Will Bickerstaff (willbickerstaff) wrote :

mic I see the same, typically when I bring up the find box in chromium (ctrl+f), but not always. If I notice the display freeze quick enough and switch to a VT wait a few seconds and switch back before my display gets garbled I can occasionally rescue it. I'm not sure this is particular to chrome/chromium though as I've had similar experiences when using inkscape.

Dmesg has:
[35115.968171] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[35115.969488] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -11 (awaiting 797622 at 797620, next 797623)
[35115.969634] [drm:i915_reset] *ERROR* Failed to reset chip.

I'll catch those other logs next time it happens.

Revision history for this message
Will Bickerstaff (willbickerstaff) wrote :

My logs attached. Is this the same bug (in which case fix released needs changing) otherwise I'll create a new bug. Again this was with chromium and the find box.

Revision history for this message
Will Bickerstaff (willbickerstaff) wrote :

Using Kernel 2.6.38-8-generic

Revision history for this message
Lukas Koranda (lkoranda) wrote :

This issue is definitely not fixed. System is really unstable under higher load. Please switch status of this bug.

Kernel 2.6.38-10-generic

[ 4625.020042] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 4625.021019] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -11 (awaiting 2064049 at 2064043, next 2064057)
[ 4627.340018] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 4627.850054] [drm:i915_reset] *ERROR* Failed to reset chip.
[ 5407.764217] [drm:i915_gem_object_bind_to_gtt] *ERROR* Attempting to bind a purgeable object
[ 5410.721187] show_signal_msg: 30 callbacks suppressed
[ 5410.721191] compiz[9217]: segfault at 0 ip 00007f0597e20afe sp 00007fff29be9230 error 4 in i965_dri.so[7f0597db2000+ac000]
[ 5412.680959] compiz[9240]: segfault at 0 ip 00007f8aa94baafe sp 00007fff81a1eed0 error 4 in i965_dri.so[7f8aa944c000+ac000]
[ 5414.470902] compiz[9257]: segfault at 0 ip 00007f618d8dfafe sp 00007fffffb39d40 error 4 in i965_dri.so[7f618d871000+ac000]
[ 5416.271635] compiz[9276]: segfault at 0 ip 00007f751325bafe sp 00007fff4ed6f320 error 4 in i965_dri.so[7f75131ed000+ac000]
[ 5418.147339] compiz[9295]: segfault at 0 ip 00007f9320509afe sp 00007fff4b3fe8f0 error 4 in i965_dri.so[7f932049b000+ac000]
[ 5419.941019] compiz[9314]: segfault at 0 ip 00007f7120bf6afe sp 00007fff156c1c60 error 4 in i965_dri.so[7f7120b88000+ac000]
[ 5421.773596] compiz[9333]: segfault at 0 ip 00007fa350e80afe sp 00007fffa659f650 error 4 in i965_dri.so[7fa350e12000+ac000]
...

Revision history for this message
mic (mic-launchpad) wrote :

I can confirm that. System is very unstable with unity (bug occurs in 10-15 minutes after launching unity ), less unstable without compiz (using metacity it manifest itself "just" few times a day). Currently I am using Xmonad, and the system is relavivelly stable (this error occurs once in a weak).

Revision history for this message
Balázs Póka (p-balazs) wrote :

Using Natty, with Linux 2.6.38-11 and Metacity, I get freezes several times a week, in a very random fashion. Usually without much load.

Revision history for this message
wadkar (wadkar) wrote :

Can someone at least link it to duplicate/related bug ?
It says fix released, and I did apt-get update && upgrade, yet this bug still occurs on my laptop
Do you guys need more logs ?

Can someone be kind enough to assign it to appropriate dev/team ?

I have enabled all notifications for this bug, feel free to ask for more information

# uname -a
Linux rednaxela 2.6.38-11-generic #50-Ubuntu SMP Mon Sep 12 21:17:25 UTC 2011 x86_64 x86_64 x86_64 GNU/Linux

xserver-xorg-video-intel Version: 2:2.14.0-4ubuntu7.1

Thanks

Revision history for this message
computermacgyver (computermacgyver) wrote :

This problem just recently started happening on my system although I haven't made any configuration changes aside from updating packages. I think there might have been a fix released at one time, but it looks to me that recent updates have reintroduced the problem. I have this issue even running "Classic Ubuntu (No Effects)" (i.e. metacity with no composting) which used to be very stable.

uname -a
Linux 2.6.38-11-generic #50-Ubuntu SMP Mon Sep 12 21:17:25 UTC 2011 x86_64 x86_64 x86_64 GNU/Linux

xserver-xorg-video-intel Version: 2:2.14.0-4ubuntu7.1

Revision history for this message
TheShadow (theshadow-shadowpedia) wrote :

Same as computermacgyver It started happening recently and the OS is now really unstable. It crashes somewhat randomly and my CTRL-ALT-F1 doesn't work. This is seriously affecting my workstation.

Revision history for this message
TheShadow (theshadow-shadowpedia) wrote :

Linux xander-laptop 2.6.38-12-generic #51-Ubuntu SMP Wed Sep 28 14:27:32 UTC 2011 x86_64 x86_64 x86_64 GNU/Linux

ii xserver-xorg-video-intel 2:2.14.0-4ubuntu7.2 X.Org X server -- Intel i8xx, i9xx display driver

Copy of data posted on another similar bug

https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-intel/+bug/727594
https://launchpadlibrarian.net/84217757/data.log

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.