i915 Hangcheck timer elapsed... GPU hung

Bug #805586 reported by Akshay Bhat
60
This bug affects 11 people
Affects Status Importance Assigned to Milestone
xserver-xorg-video-intel (Ubuntu)
Fix Released
Undecided
Unassigned

Bug Description

Machine details:
Lenovo Thinkpad T400
Intel® CoreTM2 Duo processor P8400
Integrated Intel® GMA 4500M HD
Ubuntu 2.6.38-8-generic-pae (32bit)
uname -m i686

Symptoms when the problem occurs:
Display stops responding. There are 3 types of failures noticed:
- Display may randomly flicker (not garbage but toggles between desktop and couple open windows)
- Mouse moves but everything else is forzen (including computer time)
- Display is frozen, includes mouse pointer not moving
When the display is frozen, we are able to ssh into the machine. Further we are able to play music etc using the hot keys.

Problem has occured:
Can not correlate to any particular program or task. Seems to be random in nature.
- Has happened when actively using the laptop on AC power without the laptop ever going to sleep or screen saver mode after power up (happened within 10 minutes of powerup). Google Chrome was open along with Clementine music player at the time of failure.
- Has happened after 4 days of actively using laptop (during the 4 days laptop was put to sleep and woken up multiple times

[drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 1467.867755] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -11 (awaiting 788345 at 788325, next 788372)
[ 1468.376145] [drm:i915_reset] *ERROR* Failed to reset chip.
[ 1468.431602] show_signal_msg: 24 callbacks suppressed
[ 1468.431607] compiz[1490]: segfault at 0 ip b6fcd64d sp bff88998 error 6 in libc-2.13.so[b6eb8000+15a000]

modinfo i915
filename: /lib/modules/2.6.38-8-generic-pae/kernel/drivers/gpu/drm/i915/i915.ko
license: GPL and additional rights
description: Intel Graphics
author: Tungsten Graphics, Inc.
license: GPL and additional rights
srcversion: 0974A24E53B65781A91250E

modinfo drm
filename: /lib/modules/2.6.38-8-generic-pae/kernel/drivers/gpu/drm/drm.ko
license: GPL and additional rights
description: DRM shared core routines
author: Gareth Hughes, Leif Delgass, José Fonseca, Jon Smirl
srcversion: CC0F28C4FDAC8FEAD451313

ProblemType: Bug
DistroRelease: Ubuntu 11.04
Package: xorg 1:7.6+4ubuntu3.1
ProcVersionSignature: Ubuntu 2.6.38-8.42-generic-pae 2.6.38.2
Uname: Linux 2.6.38-8-generic-pae i686
Architecture: i386
CompizPlugins: [core,bailer,detection,composite,opengl,decor,mousepoll,vpswitch,regex,animation,snap,expo,move,compiztoolbox,place,grid,imgpng,gnomecompat,wall,ezoom,workarounds,staticswitcher,resize,fade,unitymtgrabhandles,scale,session,unityshell]
CompositorRunning: compiz
Date: Mon Jul 4 12:49:31 2011
DistUpgraded: Fresh install
DistroCodename: natty
DistroVariant: ubuntu
DkmsStatus:
 vboxhost, 4.0.6, 2.6.38-8-generic-pae, i686: installed
 fglrx, 8.841, 2.6.38-8-generic-pae, i686: installed
 tp-smapi, 0.40, 2.6.38-8-generic-pae, i686: installed
GraphicsCard:
 Intel Corporation Mobile 4 Series Chipset Integrated Graphics Controller [8086:2a42] (rev 07) (prog-if 00 [VGA controller])
   Subsystem: Lenovo Device [17aa:2112]
   Subsystem: Lenovo Device [17aa:2112]
InstallationMedia: Ubuntu 11.04 "Natty Narwhal" - Release i386 (20110427.1)
MachineType: LENOVO 2764CTO
PccardctlIdent:
 Socket 0:
   no product info available
PccardctlStatus:
 Socket 0:
   no card
ProcEnviron:
 SHELL=/bin/bash
 LANG=en_US.UTF-8
 LANGUAGE=en_US:en
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.38-8-generic-pae root=UUID=e929480c-e5e9-4ac2-9372-93e541c5c3e6 ro quiet splash vt.handoff=7
Renderer: Unknown
SourcePackage: xorg
Symptom: display
Title: Xorg freeze
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 12/06/2010
dmi.bios.vendor: LENOVO
dmi.bios.version: 7UET91WW (3.21 )
dmi.board.name: 2764CTO
dmi.board.vendor: LENOVO
dmi.board.version: Not Available
dmi.chassis.asset.tag: No Asset Information
dmi.chassis.type: 10
dmi.chassis.vendor: LENOVO
dmi.chassis.version: Not Available
dmi.modalias: dmi:bvnLENOVO:bvr7UET91WW(3.21):bd12/06/2010:svnLENOVO:pn2764CTO:pvrThinkPadT400:rvnLENOVO:rn2764CTO:rvrNotAvailable:cvnLENOVO:ct10:cvrNotAvailable:
dmi.product.name: 2764CTO
dmi.product.version: ThinkPad T400
dmi.sys.vendor: LENOVO
version.compiz: compiz 1:0.9.4+bzr20110606-0ubuntu1~natty1
version.libdrm2: libdrm2 2.4.23-1ubuntu6
version.libgl1-mesa-dri: libgl1-mesa-dri 7.10.2-0ubuntu2
version.libgl1-mesa-dri-experimental: libgl1-mesa-dri-experimental N/A
version.libgl1-mesa-glx: libgl1-mesa-glx 7.10.2-0ubuntu2
version.xserver-xorg: xserver-xorg 1:7.6+4ubuntu3.1
version.xserver-xorg-video-ati: xserver-xorg-video-ati N/A
version.xserver-xorg-video-intel: xserver-xorg-video-intel 2:2.14.0-4ubuntu7.1
version.xserver-xorg-video-nouveau: xserver-xorg-video-nouveau 1:0.0.16+git20110107+b795ca6e-0ubuntu7

Revision history for this message
Akshay Bhat (nodeax-deactivatedaccount) wrote :
Revision history for this message
Akshay Bhat (nodeax-deactivatedaccount) wrote :
Revision history for this message
Akshay Bhat (nodeax-deactivatedaccount) wrote :
Revision history for this message
Akshay Bhat (nodeax-deactivatedaccount) wrote :
Revision history for this message
Akshay Bhat (nodeax-deactivatedaccount) wrote :
Revision history for this message
RedSingularity (redsingularity) wrote :

Assigning package.

affects: ubuntu → xserver-xorg-video-intel (Ubuntu)
Revision history for this message
exactt (giesbert) wrote :

Saw this with sympthom no. 1 after installing the latest kernel from Natty-porposed. Previously I suffered from https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-intel/+bug/761065

Changed in xserver-xorg-video-intel (Ubuntu):
status: New → Confirmed
Revision history for this message
Lenbok (lenbok) wrote :

I can repeatably produce this bug on my Intel Sandybridge Core i7-2600. Symptoms exactly as described. I can often switch to linux console and kill the process using GL to regain control. Example dmesg:

Sep 18 22:00:10 tron kernel: [ 318.861649] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
Sep 18 22:00:10 tron kernel: [ 318.863118] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -11 (awaiting 57983 at 57980, next 57984)
Sep 18 22:00:16 tron kernel: [ 325.158236] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
Sep 18 22:00:16 tron kernel: [ 325.158313] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -11 (awaiting 58725 at 58722, next 58726)
Sep 18 22:00:17 tron kernel: [ 325.209242] show_signal_msg: 24 callbacks suppressed
Sep 18 22:00:17 tron kernel: [ 325.209245] compiz[1676]: segfault at 0 ip 00007fd4b2684be8 sp 00007ffffd764ad0 error 6 in i965_dri.so[7fd4b2613000+ac000]
Sep 18 22:00:19 tron kernel: [ 327.357035] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
Sep 18 22:00:25 tron kernel: [ 333.633635] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
Sep 18 22:00:25 tron kernel: [ 333.633673] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -11 (awaiting 59197 at 59194, next 59198)
Sep 18 22:00:31 tron kernel: [ 340.190074] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
Sep 18 22:00:31 tron kernel: [ 340.190113] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -11 (awaiting 60627 at 60624, next 60628)
Sep 18 22:00:38 tron kernel: [ 346.446685] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
Sep 18 22:00:38 tron kernel: [ 346.446720] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -11 (awaiting 62241 at 62238, next 62242)
Sep 18 22:00:42 tron kernel: [ 351.184113] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
Sep 18 22:00:42 tron kernel: [ 351.184174] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -11 (awaiting 62791 at 62783, next 62792)
Sep 18 22:00:42 tron kernel: [ 351.184401] [drm:i915_reset] *ERROR* GPU hanging too fast, declaring wedged!
Sep 18 22:00:42 tron kernel: [ 351.184405] [drm:i915_reset] *ERROR* Failed to reset chip.
Sep 18 22:00:42 tron kernel: [ 351.185368] openscad[2095]: segfault at 0 ip 00007f8003bb07d1 sp 00007fffefdc2de0 error 6 in i965_dri.so[7f8003b3f000+ac000]
Sep 18 22:00:43 tron kernel: [ 352.039066] compiz[2109]: segfault at 0 ip 00007f2ff2cb3be8 sp 00007fffac6e2530 error 6 in i965_dri.so[7f2ff2c42000+ac000]
Sep 18 22:00:45 tron kernel: [ 353.387006] compiz[2146]: segfault at 0 ip 00007f19bafb3be8 sp 00007fffc900e210 error 6 in i965_dri.so[7f19baf42000+ac000]
Sep 18 22:00:46 tron kernel: [ 354.646491] compiz[2163]: segfault at 0 ip 00007f181d840be8 sp 00007ffff170e9d0 error 6 in i965_dri.so[7f181d7cf000+ac000]

Sing out if you need any information to help debug.

Revision history for this message
adrien (adriendidio) wrote :
Download full text (4.1 KiB)

Same problem on my notebook(Clevo w150hrm) with natty and kernel 2.6.38-11
ubuntu freeze, sometime i can move the mouse, sometime not.

Sep 20 14:10:59 ubuntu-W150HRM kernel: [17893.207876] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
Sep 20 14:10:59 ubuntu-W150HRM kernel: [17893.210396] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -11 (awaiting 11397783 at 11397777, next 11397784)
Sep 20 14:11:43 ubuntu-W150HRM kernel: [17937.853667] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
Sep 20 14:11:43 ubuntu-W150HRM kernel: [17937.853767] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -11 (awaiting 11476838 at 11476828, next 11476839)
Sep 20 14:38:14 ubuntu-W150HRM kernel: [19527.947376] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
Sep 20 14:38:14 ubuntu-W150HRM kernel: [19527.947505] [drm:i915_do_wait_request] *ERROR* i915_do_wait_request returns -11 (awaiting 12776623 at 12776606, next 12776624)
Sep 20 14:38:14 ubuntu-W150HRM kernel: [19528.020401] show_signal_msg: 6 callbacks suppressed
Sep 20 14:38:14 ubuntu-W150HRM kernel: [19528.020405] compiz[1615]: segfault at 0 ip 00007f70f9c4fbe8 sp 00007ffff5c81960 error 6 in i965_dri.so[7f70f9bde000+ac000]
Sep 20 14:38:17 ubuntu-W150HRM kernel: [19531.147260] compiz[7480]: segfault at 0 ip 00007f9c8e535be8 sp 00007fffb0942130 error 6 in i965_dri.so[7f9c8e4c4000+ac000]
Sep 20 14:38:19 ubuntu-W150HRM kernel: [19533.508949] compiz[7500]: segfault at 0 ip 00007fa89b0d0be8 sp 00007fff201b2a00 error 6 in i965_dri.so[7fa89b05f000+ac000]
Sep 20 14:38:22 ubuntu-W150HRM kernel: [19535.900321] compiz[7518]: segfault at 0 ip 00007fdb08ec9be8 sp 00007fff52f22490 error 6 in i965_dri.so[7fdb08e58000+ac000]
Sep 20 14:38:24 ubuntu-W150HRM kernel: [19538.297708] compiz[7535]: segfault at 0 ip 00007f17a945ebe8 sp 00007fff5d514380 error 6 in i965_dri.so[7f17a93ed000+ac000]
Sep 20 14:38:27 ubuntu-W150HRM kernel: [19540.752142] compiz[7552]: segfault at 0 ip 00007f5d834f4be8 sp 00007fff7fd5e440 error 6 in i965_dri.so[7f5d83483000+ac000]
Sep 20 14:38:29 ubuntu-W150HRM kernel: [19543.047402] compiz[7569]: segfault at 0 ip 00007feeb1c5abe8 sp 00007fff79ff9000 error 6 in i965_dri.so[7feeb1be9000+ac000]
Sep 20 14:38:31 ubuntu-W150HRM kernel: [19545.564338] compiz[7586]: segfault at 0 ip 00007f94b4cecbe8 sp 00007fffc297c740 error 6 in i965_dri.so[7f94b4c7b000+ac000]
Sep 20 14:38:34 ubuntu-W150HRM kernel: [19547.933439] compiz[7603]: segfault at 0 ip 00007f70e1e22be8 sp 00007fff0d5529f0 error 6 in i965_dri.so[7f70e1db1000+ac000]
Sep 20 14:38:36 ubuntu-W150HRM kernel: [19550.153363] compiz[7620]: segfault at 0 ip 00007fcf2acb1be8 sp 00007fffbde900c0 error 6 in i965_dri.so[7fcf2ac40000+ac000]
Sep 20 14:38:39 ubuntu-W150HRM kernel: [19552.979779] compiz[7637]: segfault at 0 ip 00007fedeeaa7be8 sp 00007fffb39ba790 error 6 in i965_dri.so[7fedeea36000+ac000]
Sep 20 14:38:42 ubuntu-W150HRM kernel: [19555.896434] compiz[7655]: segfault at 0 ip 00007f57314b2be8 sp 00007fff42067f10 error 6 in i965_dri.so[7f5731441000+ac000]
Sep 20 14:38:44 ubuntu-W150HRM kernel: [19558.297323] compiz[7673]: segfault at 0 ip 00007fb1f8388be8...

Read more...

Revision history for this message
Lenbok (lenbok) wrote :

Does anyone know if this still occurs in 11.10?

Revision history for this message
Henrik Sölver (henrik-solver) wrote :

Yes it does.

Revision history for this message
Bryce Harrington (bryce) wrote :

Hey nodeax,

You filed this bug report against natty, but I see it's still open and
doesn't appear to have much activity recently. So, now that oneiric
is released and stable, this may be a good point for you to upgrade
and re-test if this issue is still present there.

If it's solved in the new release and you think it's worth backporting
the fix, please indicate that. Or if having the fix in the new release
is good enough, feel free to close out the bug (or let us know and we'll
close it.)

If it's not solved, leave the bug report open. I can't promise we'll
get to it (we get way more bugs filed than we can usually get to), but
your testing and feedback can help out if and when we do.

Changed in xserver-xorg-video-intel (Ubuntu):
status: Confirmed → Incomplete
Revision history for this message
Jens Kehne (jkehne) wrote :

I'm having the same problem on oneiric. The screen randomly freezes for a few seconds (the mouse still moves, but everything else is dead), then everything turns black for a second, and then it all goes back to normal. Afterwards, I find this in my syslog:

[ 200.319085] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
[ 200.319094] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
[ 200.330227] [drm:i915_wait_request] *ERROR* i915_wait_request returns -11 (awaiting 31970 at 31965, next 31971)
udevd[3721]: failed to execute '/usr/share/apport/apport-gpu-error-intel.py' '/usr/share/apport/apport-gpu-error-intel.py': No such file or directory
[ 200.405176] [drm:ironlake_update_pch_refclk] *ERROR* enabling SSC on PCH
[ 200.700351] [drm:ironlake_update_pch_refclk] *ERROR* enabling SSC on PCH

The "enabling SSC on PCH"-error also appears randomly from time to time, but I don't notice it otherwise.

Also, it only seems to occur when I have my second screen connected.

Revision history for this message
Henrik Sölver (henrik-solver) wrote :
Revision history for this message
Henrik Sölver (henrik-solver) wrote :

After I have installed Intel drivers from this ppa:glasen/intel-driver (https://launchpad.net/~glasen/+archive/intel-driver) I can not trigger this bug anymore. Before it always happened after a few minutes running a webgl demo in chromium browser. E.g. http://dl-developer.sonyericsson.com/demo/webgl/blog/webgl_materials_normalmap.html or http://www.ambiera.com/coppercube/demo.php?demo=dynamiclight&mode=webgl

Bryce Harrington (bryce)
Changed in xserver-xorg-video-intel (Ubuntu):
status: Incomplete → Fix Released
Revision history for this message
TroubleMakerDV (grayman2000) wrote :

Unexpectedly ran into this on 12.10.
m/b ASUS P67M8-PRO, integrated Intel video.

After hot unplug some device from PSU, getting same error: only REIUSB works.

Revision history for this message
mycroes (mycroes) wrote :

Having similar issues in 12.10. Everything is frozen for a few seconds and after that Ubuntu wants to send a bug report. dmesg shows hangcheck timer for i915 elapsed.

Revision history for this message
DiegoV (diegofcviegas) wrote :

This is happening with me on quantal.

Revision history for this message
Akshay Bhat (nodeax-deactivatedaccount) wrote :

This bug has re-appeared recently. Running Ubuntu 12.04.

Apr 7 17:05:23 Sweet kernel: [31293.206494] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... GPU hung
Apr 7 17:05:23 Sweet kernel: [31293.206504] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
Apr 7 17:05:23 Sweet kernel: [31293.217351] [drm:i915_wait_request] *ERROR* i915_wait_request returns -11 (awaiting 2596982 at 2596974, next 2596983)

Revision history for this message
Lenbok (lenbok) wrote :

Yes, I started getting this too after a recent update. Very annoying. I wasn't sure which package update caused the problem but I rebooted into grub and selected the previous kernel (3.2.0-38) and it seems fine so far. Something in that update has reintroduced the problem. Can we re-open this or is there already a new bug?

Revision history for this message
Chris Wilson (ickle) wrote :

Lenbook, you have a different bug, see 1140716.

To post a comment you must log in.