System will periodically lockup with [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... render ring idle
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
Incomplete
|
Medium
|
Unassigned |
Bug Description
I have noticed the lockups periodically since 09/13/2014. I can't remember what kernel version was in use then possibly kernel 3.13.0-35-generic or -36. It has continued periodically through versions -39; -40 and now to -43. The last time this has happened was today Dec 13 14:40:57 localhost kernel: [154858.820009] [drm:i915_
ProblemType: Bug
DistroRelease: Ubuntu 14.04
Package: linux-image-
ProcVersionSign
Uname: Linux 3.13.0-43-generic x86_64
ApportVersion: 2.14.1-0ubuntu3.6
Architecture: amd64
AudioDevicesInUse:
USER PID ACCESS COMMAND
/dev/snd/
CurrentDesktop: GNOME
Date: Sat Dec 13 20:22:51 2014
HibernationDevice: RESUME=
InstallationDate: Installed on 2014-10-24 (50 days ago)
InstallationMedia: Ubuntu 14.04 LTS "Trusty Tahr" - Release amd64 (20140417)
IwConfig:
eth0 no wireless extensions.
lo no wireless extensions.
MachineType: Dell Inc. OptiPlex 780
ProcEnviron:
TERM=xterm
PATH=(custom, no user)
XDG_RUNTIME_
LANG=en_US.UTF-8
SHELL=/bin/bash
ProcFB: 0 inteldrmfb
ProcKernelCmdLine: BOOT_IMAGE=
RelatedPackageV
linux-
linux-
linux-firmware 1.127.10
RfKill:
SourcePackage: linux
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 02/13/2010
dmi.bios.vendor: Dell Inc.
dmi.bios.version: A03
dmi.board.name: 0C27VV
dmi.board.vendor: Dell Inc.
dmi.board.version: A01
dmi.chassis.
dmi.chassis.type: 6
dmi.chassis.vendor: Dell Inc.
dmi.modalias: dmi:bvnDellInc.
dmi.product.name: OptiPlex 780
dmi.sys.vendor: Dell Inc.
chris pollock (cpollock) wrote : | #1 |
- AlsaInfo.txt Edit (27.8 KiB, text/plain; charset="utf-8")
- BootDmesg.txt Edit (58.8 KiB, text/plain; charset="utf-8")
- CRDA.txt Edit (322 bytes, text/plain; charset="utf-8")
- CurrentDmesg.txt Edit (2.1 KiB, text/plain; charset="utf-8")
- Dependencies.txt Edit (3.2 KiB, text/plain; charset="utf-8")
- Lspci.txt Edit (12.4 KiB, text/plain; charset="utf-8")
- Lsusb.txt Edit (1000 bytes, text/plain; charset="utf-8")
- ProcCpuinfo.txt Edit (1.6 KiB, text/plain; charset="utf-8")
- ProcInterrupts.txt Edit (1.7 KiB, text/plain; charset="utf-8")
- ProcModules.txt Edit (2.5 KiB, text/plain; charset="utf-8")
- PulseList.txt Edit (18.8 KiB, text/plain; charset="utf-8")
- UdevDb.txt Edit (136.6 KiB, text/plain; charset="utf-8")
- UdevLog.txt Edit (278.5 KiB, text/plain; charset="utf-8")
- WifiSyslog.txt Edit (108.1 KiB, text/plain; charset="utf-8")
Brad Figg (brad-figg) wrote : Status changed to Confirmed | #3 |
Changed in linux (Ubuntu): | |
status: | New → Confirmed |
tags: | added: bios-outdated-a15 |
Changed in linux (Ubuntu): | |
importance: | Undecided → Low |
status: | Confirmed → Incomplete |
chris pollock (cpollock) wrote : | #5 |
Since I've never updated a bios before I'm in the process of reading the links you sent. As soon as I've made the update I'll run the commands you requested and add to the report.
chris pollock (cpollock) wrote : | #6 |
After doing more reading and asking questions in the #ubuntu IRC channel it was suggested I update to a newer kernel version instead. So yesterday after I updated to - Linux localhost 3.14.0-
If this solves the problem I will post a comment
chris pollock (cpollock) wrote : | #7 |
It seems that this has been fixed by updating to a newer kernel. It's been over five days without any issues.
chris pollock (cpollock) wrote : | #8 |
- dmesg output drm.debug=0x06 Edit (77.2 KiB, text/plain)
Apparently this is not a bug that is only related to Ubuntu but to other flavors as well, see https:/
Christian Egger (ce4) wrote : | #9 |
I own a Dell Optiplex 760 from 2009 and I have the exact same problem on Debian Jessie (running kernel 3.16.0-4-amd64).
I have been running BIOS version A04 (from 2009) and just upgraded to the latest A16 BIOS version.
I can keep you informed if this fixes the ugly bug.
FYI, I did the following to upgrade the BIOS, took me around 5mins:
- copied an USB FreeDOS 1.1 image onto an empty USB drive. I used this one (don't forget to bunzip2):
http://
- copied the latest O760-A15.exe BIOS update file for my Dell 760 onto the thumbdrive
- booted into FreeDOS and ran the bios update
chris pollock (cpollock) wrote : Re: [Bug 1402331] Re: System will periodically lockup with [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer elapsed... render ring idle | #10 |
On Thu, 2015-01-15 at 11:49 +0000, johnnys wrote:
> I own a Dell Optiplex 760 from 2009 and I have the exact same problem on
> Debian Jessie (running kernel 3.16.0-4-amd64).
>
> I have been running BIOS version A04 (from 2009) and just upgraded to
> the latest A16 BIOS version.
>
> I can keep you informed if this fixes the ugly bug.
>
> FYI, I did the following to upgrade the BIOS, took me around 5mins:
>
> - copied an USB FreeDOS 1.1 image onto an empty USB drive. I used this one (don't forget to bunzip2):
> http://
> - copied the latest O760-A15.exe BIOS update file for my Dell 760 onto the thumbdrive
> - booted into FreeDOS and ran the bios update
>
Johnny, I need to follow up with you on the above. I've now got
'command.com', 'kernel.sys', and '0780-A15.exe' on an empty thumb drive.
I assume the next step is to boot into it? FYI as I was looking for an
empty thumb drive in my collection this morning the thing locked up
again. Any help would be appreciated as I've never updated a BIOS before
in all my years.
Thanks
Chris
--
Chris
KeyID 0xE372A7DA98E6705C
31.11°N 97.89°W (Elev. 1092 ft)
08:02:06 up 9 min, 1 user, load average: 0.33, 0.64, 0.46
Ubuntu 14.04.1 LTS, kernel 3.13.0-44-generic
Christian Egger (ce4) wrote : | #11 |
Hi Chris,
I just came back to bring bad news: My machine just crashed again (running with the newly flashed BIOS A16).
I assume you got the copy process right (probably also using the 'dd' command). Otherwise you wouldn't have command.com kernel.sys and the bios update file "0780-A15.exe' on it). Next step would be to boot into this newly created FreeDOS environment, i.e.:
- shutdown the PC
- insert the thumb drive
- be sure that your system has "boot to USB" enabled
otherwise hit F12 (or so) to "select device to boot from"
- set the current date (FreeDOS will prompt you)
- start the BIOS update by typing "0780-A15.exe"
- follow the steps on screen...
- reboot
cheers,
John
chris pollock (cpollock) wrote : | #12 |
On Thu, 2015-01-15 at 14:23 +0000, johnnys wrote:
> Hi Chris,
>
> I just came back to bring bad news: My machine just crashed again
> (running with the newly flashed BIOS A16).
>
> I assume you got the copy process right (probably also using the 'dd'
> command). Otherwise you wouldn't have command.com kernel.sys and the
> bios update file "0780-A15.exe' on it). Next step would be to boot into
> this newly created FreeDOS environment, i.e.:
>
> - shutdown the PC
> - insert the thumb drive
> - be sure that your system has "boot to USB" enabled
> otherwise hit F12 (or so) to "select device to boot from"
> - set the current date (FreeDOS will prompt you)
> - start the BIOS update by typing "0780-A15.exe"
> - follow the steps on screen...
> - reboot
>
> cheers,
>
> John
>
Thanks John, been so long since I ran a DOS program I forgot how to
execute a file :( had to Google it. Would have been ok if I'd made the
first character an 'O' instead of putting a '0' in. Finally got it right
though. Since I already had the A15 update I installed that one. That's
screwed up that you already had a lockup, check comment #4 where it was
suggested that I update the BIOS. Since both of us have already done
that you may want to go to the bug report and leave a comment that 'well
that didn't work, what next' but I guess be nicer. I'm also working on
this bug report - https:/
may want to get involved there also. I'm going to make a note of when I
upgraded the BIOS and follow up with comments on both bugs when (note I
didn't say if) it locks up/crashes again. I'd probably run 'sudo lshw'
and attach the output to you comment to show that you have in fact
updated the BIOS that's what I'm going to do and since I'm on A15 and
you're on A16 and just had a lockup that should tell them something.
This is getting absurd, something needs to be done. If you notice the
first comment in the link above you'll see that person is running 'Arch
Linux' and a completely different type of hardware so this bug is
affecting a lot of systems. I've already upgraded to a 3.14* kernel from
a Ubuntu PPA which did not help so I went back to the 3.13* version.
Hopefully someone, someday will get this fixed.
Chris
--
Chris
KeyID 0xE372A7DA98E6705C
31.11°N 97.89°W (Elev. 1092 ft)
08:40:03 up 6 min, 2 users, load average: 1.14, 1.32, 0.71
Ubuntu 14.04.1 LTS, kernel 3.13.0-44-generic
chris pollock (cpollock) wrote : | #13 |
- lshw output after upgrading BIOS to A15 Edit (19.1 KiB, text/plain)
I have now upgraded my BIOS to version A15, the output of sudo lshw is attached.
penalvch (penalvch) wrote : | #14 |
chris pollock, now that you have updated your BIOS, have you experienced a lockup?
tags: |
added: latest-bios-a15 removed: bios-outdated-a15 |
chris pollock (cpollock) wrote : | #15 |
Not yet, however it's only been a bit over two days since the update and this is a very intermittent issue. I will most definitely post an update if/when the lockup happens. If not and I get to where I'm sure the system is stable I'll post. I've also added the following to my /etc/default/grub configuration file:
GRUB_CMDLINE_
and will post the output of 'dmesg' if another lockup happens.
Christian Egger (ce4) wrote : | #16 |
I have experienced another lockup since updating the BIOS to the latest version.
However, I'm using Debian Jessie (3.16 kernel) and a slightly different Dell model (Optiplex 760 instead of an Optiplex 780).
chris pollock (cpollock) wrote : | #17 |
- lshw output after upgrading BIOS to A15 Edit (19.1 KiB, text/plain)
Well, it lasted a bit over two days this time, just experienced another lockup
Jan 17 20:14:43 localhost kernel: [214866.808010] [drm:i915_
This is after updating to the A15 BIOS on my Dell 780
chris pollock (cpollock) wrote : | #18 |
chris pollock (cpollock) wrote : | #19 |
I neglected to add this but whether it makes a difference the lockup occurred while I was scrolling down posts in Facebook using Firefox.
Christian Egger (ce4) wrote : | #20 |
@Chris: You can try to untick "use hardware acceleration, if available" and "smooth scrolling" in Firefox' settings (navigate to Advanced tab=>General sub tab=>Browsing). At least for me, this reduced the frequency of lockups.
chris pollock (cpollock) wrote : | #21 |
On Sun, 2015-01-18 at 11:31 +0000, Christian Egger wrote:
> @Chris: You can try to untick "use hardware acceleration, if available"
> and "smooth scrolling" in Firefox' settings (navigate to Advanced
> tab=>General sub tab=>Browsing). At least for me, this reduced the
> frequency of lockups.
>
Done, I'm curious as to why on Launchpad this bug is low importance and
status still shows incomplete and assigned to no one while here at
freedesktop.org https:/
marked as high importance and critical. I believe that since we've done
what Chris Penalver requested by updating our BIOS, which BTW did not
fix the issue, I'll be posting my info to the bug report at
freedesktop.org above.
--
Chris
KeyID 0xE372A7DA98E6705C
31.11°N 97.89°W (Elev. 1092 ft)
07:14:04 up 10:55, 1 user, load average: 1.58, 1.35, 0.90
Ubuntu 14.04.1 LTS, kernel 3.13.0-44-generic
tags: | added: regression-release |
tags: |
added: regression-update removed: regression-release |
penalvch (penalvch) wrote : | #22 |
chris pollock, just to clarify, reporters who have an outdated BIOS can find the Importance Low as the BIOS should have already been updated before reporting it on Launchpad. Now that it is updated, a different Importance now applies. As well, freedesktop.org Severity/Priority is not restricted, so anyone can adjust it to whatever they want, whether or not it applies, so it is largely meaningless. Also, a bug report being assigned in Launchpad is up to a developer assigning themselves, versus being forced/auto assigned.
Despite this, could you please test the latest upstream kernel available from the very top line at the top of the page (the release names are irrelevant for testing, and please do not test the daily folder) following https:/
If the test did not allow you to test to the issue (ex. you couldn't boot into the OS) please make a comment in your report about this, and continue to test the next most recent kernel version until you can test to the issue. Once you've tested the upstream kernel, please comment on which kernel version specifically you tested. If this bug is fixed in the mainline kernel, please add the following tags:
kernel-
kernel-
where VERSION-NUMBER is the version number of the kernel you tested exactly shown as:
kernel-
This can be done by clicking on the yellow circle with a black pencil icon next to the word Tags located at the bottom of the bug description.
If the mainline kernel does not fix this bug, please add the following tags:
kernel-
kernel-
Once testing of the upstream kernel is complete, please mark this bug's Status as Confirmed. Please let us know your results. Thank you for your understanding.
Changed in linux (Ubuntu): | |
importance: | Low → Medium |
description: | updated |
chris pollock (cpollock) wrote : | #23 |
Will do Chris, so on the page you linked the latest non RC candidate is 3.18.3-vivid or do you want me to attempt to install 3.19-rc5-vivid?
penalvch (penalvch) wrote : | #24 |
chris pollock, 3.19-rc5.
chris pollock (cpollock) wrote : | #25 |
Thanks Chris, booted fine saw no issues on install or during boot. Output of uname -a Linux localhost 3.19.0-
chris pollock (cpollock) wrote : | #26 |
Whether this means anything or not I have no idea however I will post it in case it does. I get via a small script an hourly snippet of my syslog. When things are going on I have a habit of scanning through it intermittently for anything that, to me, looks abnormal. I happened to take a close look at the output of one of today's and noticed:
kernel: [87987.468212] [drm:intel_
kernel: [87987.468219] [drm:g4x_
kernel: [87987.468221] [drm:g4x_
kernel: [87987.468223] [drm:intel_
kernel: [87987.468225] [drm:g4x_update_wm] Setting FIFO watermarks - A: plane=40, cursor=2, B: plane=2, cursor=2, SR: plane=0, cursor=0
kernel: [87989.117248] [drm:i915_gem_open]
This has shown up multiple times since the 14th of Jan and not sure of exactly how many times since I updated to the 3.19 kernel however it is in 14 hourly log snippets. As I said I don't know if this means anything or not, probably doesn't. Even though it's only been a bit over 28hrs I haven't experienced a lockup as of yet.
chris pollock (cpollock) wrote : | #27 |
tags: | added: kernel-bug-exists-upstream kernel-bug-exists-upstream-3.19-rc5 needs-bisect |
chris pollock (cpollock) wrote : | #29 |
Chris, first question of the morning, when I experienced the lockup last nigh the 'Hangcheck .......' error was not anywhere in my syslog, does this mean anything? Did my dmesg attachment show anything? Secondly, as I'm reading through the 'bisect' instructions you sent it mentions building a kernel from scratch, I take it that that is a requirement to do this? If so this may take me a few days as 1)I've never built a kernel from scratch before and 2)medications I take prevent me from thinking very clearly until late in the day. However, if in fact building a kernel is required I'll throw myself into it and see what happens.
penalvch (penalvch) wrote : | #30 |
chris pollock:
>" when I experienced the lockup last nigh the 'Hangcheck .......' error was not anywhere in my syslog, does this mean anything?"
If one does not attach a log from /var/log , or captured through alternate methods, that contains either a kernel call trace, xorg backtrace, or at least the hangcheck originally reported, then it's largely useless unfortunately.
>"Did my dmesg attachment show anything?"
Unfortunately, it did not show any of the three things above. It is not terribly uncommon for nothing of value to be printed in any of the logs during a complete system lock. You may have better success capturing helpful information utilizing the capture methods noted in https:/
>"it mentions building a kernel from scratch, I take it that that is a requirement to do this?"
Yes.
chris pollock (cpollock) wrote : | #31 |
Thank you Chris, I'll start working on this tomorrow.
chris pollock (cpollock) wrote : | #32 |
Although I'm still working on digesting the 'kernel bisection' I have noticed several things. Firstly I've had four 'lockups' since 17 Jan, none of which contain the [drm:i915_
kernel: [ 1326.412487] [drm:intel_
kernel: [ 1326.412493] [drm:g4x_
kernel: [ 1326.412495] [drm:g4x_
kernel: [ 1326.412497] [drm:intel_
kernel: [ 1326.412499] [drm:g4x_update_wm] Setting FIFO watermarks - A: plane=40, cursor=2, B: plane=2, cursor=2, SR: plane=0, cursor=0
I might add that though the system 'appears' to be 'locked up' it is only the video that is locked, however the mouse cursor will continue to move, if that makes any sense. The system continues to fetch and process mail with fetchmail and procmail, SpamAssassin and ClamAv continue to run as well as scripts I have to report spam and so forth. Whether this change will make any difference in this bug report I'm not sure however I will continue to work on the kernel bisection as requested.
chris pollock (cpollock) wrote : | #33 |
Since the 17th of Jan I've had seven of the 'lockups' none of which show [drm:i915_
chris pollock (cpollock) wrote : | #34 |
Christopher, I worked on the bisection tonight however when I ran the command earlier this evening and built the kernel it built the latest *-45 instead of *-35. I just tried the command I saw in:
Build Environment
If you've not built a kernel on your system before, there are some packages needed before you can successfully build. You can get these installed with:
sudo apt-get build-dep linux-image-$(uname -r)
I ran the command again and got the below, where am I screwing up? I'm really wanting to get this done as I have several other projects going also and the 'freezes' are getting more and more frequent with the *-45 kernel.
chris@localhost:~$ sudo apt-get build-dep linux-image-
Reading package lists... Done
Building dependency tree
Reading state information... Done
Picking 'linux' as source package instead of 'linux-
0 upgraded, 0 newly installed, 0 to remove and 0 not upgraded.
15 not fully installed or removed.
After this operation, 0 B of additional disk space will be used.
Do you want to continue? [Y/n] n
Abort.
chris pollock (cpollock) wrote : | #36 |
On Wed, 2015-02-04 at 11:34 +0000, Christopher M. Penalver wrote:
> chris pollock, it would be best to switch over to bisecting the mainline
> kernel following the article.
>
Working on it, are these the three files I need to work with? And the
same for 3.13.0-36-generic which is where I believe the issue started. I
noticed when I installed the 3.19 version I had a file called
(linux-
don't see a _all.deb file in the 3.13.0-35 branch.
linux-headers-
linux-image-
linux-image-
--
Chris
KeyID 0xE372A7DA98E6705C
31.11°N 97.89°W (Elev. 1092 ft)
20:15:11 up 9:32, 2 users, load average: 0.23, 0.36, 0.39
Ubuntu 14.04.1 LTS, kernel 3.13.0-45-generic
chris pollock (cpollock) wrote : | #37 |
I just had another 'lockup' after about 11 days with the same error message - Feb 13 19:05:22 localhost kernel: [807775.808019] [drm:i915_
chris pollock (cpollock) wrote : | #38 |
I just experienced another 'lockup' at 19:06:06 however this time as sometimes in the past the 'Hangcheck....' error was not present in my syslog. All that was noted was this:
Feb 14 19:04:53 localhost kernel: [10683.755797] systemd-
Feb 14 19:04:53 localhost dbus[382]: [system] Successfully activated service 'org.freedeskto
Feb 14 19:09:48 localhost kernel: [10978.591143] [drm:intel_
chris pollock (cpollock) wrote : | #39 |
Another lockup this afternoon, black screen, mouse cursor was present and could be moved. When moving around (black) screen hand would appear as if I was hovering over something of course I have no idea what it was. Could not CTRL>ALT>F* to terminal log-in.
Feb 17 17:18:01 localhost kernel: [252433.820010] [drm:i915_
chris pollock (cpollock) wrote : | #40 |
I forgot to add the kernel version I'm running - Linux localhost 3.13.0-45-generic #74-Ubuntu SMP Tue Jan 13 19:36:28 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux
chris pollock (cpollock) wrote : | #41 |
On 18 Feb, 2015 I installed kernel 3.19.0-
Feb 19 07:18:54 localhost kernel: [49604.616334] [drm:i915_gem_open]
Feb 19 07:19:59 localhost kernel: [49669.989779] [drm:intel_
Feb 19 07:19:59 localhost kernel: [49669.989786] [drm:g4x_
Feb 19 07:19:59 localhost kernel: [49669.989789] [drm:g4x_
Feb 19 07:19:59 localhost kernel: [49669.989791] [drm:intel_
Feb 19 07:19:59 localhost kernel: [49669.989794] [drm:g4x_update_wm] Setting FIFO watermarks - A: plane=40, cursor=2, B: plane=2, cursor=2, SR: plane=0, cursor=0
Feb 19 07:20:00 localhost kernel: [49670.747858] [drm:intel_
Feb 19 07:20:00 localhost kernel: [49670.747862] [drm:intel_
Feb 19 07:20:00 localhost kernel: [49670.747865] [drm:intel_
Feb 19 07:20:00 localhost kernel: [49670.747867] [drm:intel_
Feb 19 07:20:00 localhost kernel: [49670.747870] [drm:connected_
Feb 19 07:20:00 localhost kernel: [49670.747872] [drm:intel_
Feb 19 07:20:00 localhost kernel: [49670.747874] [drm:intel_
Feb 19 07:20:00 localhost kernel: [49670.747875] [drm:intel_
Feb 19 07:20:00 localhost kernel: [49670.747876] [drm:intel_
Feb 19 07:20:00 localhost kernel: [49670.747878] [drm:intel_
Feb 19 07:20:00 localhost kernel: [49670.747879] [drm:intel_
Feb 19 07:20:00 localhost kernel: [49670.747881] [drm:intel_
Feb 19 07:20:00 localhost kernel: [49670.747882] [drm:intel_
Feb 19 07:20:00 localhost kernel: [49670.747882] [drm:intel_
Feb 19 07:20:00 localhost kernel: [49670.747885] [drm:drm_
chris pollock (cpollock) wrote : | #42 |
chris pollock (cpollock) wrote : | #43 |
chris pollock (cpollock) wrote : | #44 |
Again another lockup this evening at 19:16:40, 20 Feb 2015. And again the 'Hangcheck' error did not make itself known, what my syslog shows for this time period before the CTRL>ALT>F1 is:
Feb 20 19:17:01 localhost CRON[29189]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Feb 20 19:17:28 localhost kernel: [129187.759473] [drm:intel_
Feb 20 19:17:28 localhost kernel: [129187.759479] [drm:g4x_
Feb 20 19:17:28 localhost kernel: [129187.759481] [drm:g4x_
Feb 20 19:17:28 localhost kernel: [129187.759483] [drm:intel_
Feb 20 19:17:28 localhost kernel: [129187.759485] [drm:g4x_update_wm] Setting FIFO watermarks - A: plane=40, cursor=2, B: plane=2, cursor=2, SR: plane=0, cursor=0
Feb 20 19:17:29 localhost kernel: [129189.173796] [drm:intel_
Feb 20 19:17:29 localhost kernel: [129189.173800] [drm:intel_
Feb 20 19:17:29 localhost kernel: [129189.173803] [drm:intel_
Feb 20 19:17:29 localhost kernel: [129189.173805] [drm:intel_
Feb 20 19:17:29 localhost kernel: [129189.173808] [drm:connected_
Feb 20 19:17:29 localhost kernel: [129189.173810] [drm:intel_
Feb 20 19:17:29 localhost kernel: [129189.173812] [drm:intel_
Feb 20 19:17:29 localhost kernel: [129189.173813] [drm:intel_
Feb 20 19:17:29 localhost kernel: [129189.173814] [drm:intel_
Feb 20 19:17:29 localhost kernel: [129189.173816] [drm:intel_
Feb 20 19:17:29 localhost kernel: [129189.173817] [drm:intel_
Feb 20 19:17:29 localhost kernel: [129189.173819] [drm:intel_
Feb 20 19:17:29 localhost kernel: [129189.173820] [drm:intel_
Feb 20 19:17:29 localhost kernel: [129189.173820] [drm:intel_
Feb 20 19:17:29 localhost kernel: [129189.173823] [drm:drm_
Feb 20 19:17:29 localhost kernel: [129189.173824] [drm:intel_
Feb 20 19:17:29 localhost kernel: [129189.173827] [drm:drm_
Feb 20 19:17:29 localhost kernel: [129189.173829] [drm:intel_
chris pollock (cpollock) wrote : | #45 |
chris pollock (cpollock) wrote : | #46 |
This is now getting into the realm of the absurd. No matter what kernel version I run in the 3.13.* series the system will lockup after a day or so with the above error. Today it locked up at 9:09am CST - Mar 1 09:09:00 localhost kernel: [182499.820012] [drm:i915_
chris pollock (cpollock) wrote : | #47 |
And again this happens less than 24hrs later still with kernel 3.13.0-46-generic #76-Ubuntu SMP Thu Feb 26 18:52:13 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux. I was able to drop to a terminal with CTRL-ALT-F1 this time as it locked up in Firefox.
Mar 2 07:35:38 localhost kernel: [71138.820009] [drm:i915_
Portion of kern.log when lockup happened:
Mar 2 07:35:38 localhost kernel: [71138.820009] [drm:i915_
Mar 2 07:36:23 localhost kernel: [71184.699897] [drm:intel_
Mar 2 07:36:24 localhost kernel: [71185.780637] [drm:intel_
Mar 2 07:36:24 localhost kernel: [71185.780641] [drm:intel_
Mar 2 07:36:24 localhost kernel: [71185.780644] [drm:intel_
Mar 2 07:36:24 localhost kernel: [71185.780656] [drm:i9xx_
Mar 2 07:36:24 localhost kernel: [71185.788019] [drm:intel_
Mar 2 07:36:24 localhost kernel: [71185.788021] [drm:intel_
Mar 2 07:36:24 localhost kernel: [71185.788023] [drm:intel_
Mar 2 07:36:24 localhost kernel: [71185.788026] [drm:intel_
Mar 2 07:36:24 localhost kernel: [71185.788029] [drm:intel_
Mar 2 07:36:24 localhost kernel: [71185.788031] [drm:intel_
Mar 2 07:36:24 localhost kernel: [71185.788042] [drm:intel_
Mar 2 07:36:24 localhost kernel: [71185.788044] [drm:intel_
Mar 2 07:36:24 localhost kernel: [71185.788046] [drm:intel_
chris pollock (cpollock) wrote : | #48 |
- kern2mar.log Edit (3.8 MiB, text/plain)
Attached is the kern.log showing four of the lockups and what happened before and after them
chris pollock (cpollock) wrote : | #49 |
Lockup happened again today with the hangcheck error. It has happened since my last post and now however without the hangcheck error. All I would see is a black screen with a movable mouse cursor. Today I disabled X-Screensaver and it locked up while using FireFox running kernel 3.13.0-46-generic #77-Ubuntu SMP Mon Mar 2 18:23:39 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux.
chris pollock (cpollock) wrote : | #50 |
As a test on 9 March I booted to 3.13.0-
Mar 17 13:57:59 localhost kernel: [93991.808012] [drm:i915_
I'm still running the 3.13.0 kernel however have not enabled X-Screensaver just as a test. Is there anything else I can check or do?
penalvch (penalvch) wrote : | #51 |
chris pollock, testing the latest mainline kernel (4.0-rc4) would be helpful.
chris pollock (cpollock) wrote : | #52 |
I've booted into 4.0.0-040000rc4
at 7:49 this evening. Also reactivated X-Screensaver to see how things go. Will advise if/when anything out of the ordinary happens.
chris pollock (cpollock) wrote : | #53 |
Christopher, since booting into the kernel in comment #52 I have seen this probably over a thousand times in my hourly syslog snippet:
Mar 17 20:02:34 localhost kernel: [ 824.137147] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.145104] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.153100] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.161098] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.169102] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.177100] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.185100] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.193139] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.201099] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.209097] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.217098] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.225095] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.233092] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.241092] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.249099] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.257114] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.265091] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.273095] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.281094] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.289092] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.289565] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.297106] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.305090] [drm:drm_
Mar 17 20:02:34 localhost kernel: [ 824.313092] [drm:drm_
penalvch (penalvch) wrote : | #54 |
chris pollock, could you please provide a yes or no answer to the following question:
When using the 4.0-rc4 kernel, is the problem this bug is scoped to "System will periodically lockup" reproducible?
chris pollock (cpollock) wrote : | #55 |
Yes, between 10pm on 18 Mar 2015 and 7am on 19 Mar 2015 the system locked up though I can't find any evidence of the 'hangcheck.....' error
chris pollock (cpollock) wrote : | #56 |
Christopher, I made a grave error in my statement about not seeing the 'hangcheck' error this morning. I just did a body search in my syslog snippets and there it was
Mar 19 07:34:14 localhost kernel: [128723.820044] [drm:i915_
Now this is very odd because during this time I was trying to get the system to drop to a terminal with CTRL>ALT>F1, after some period of time when it didn't drop to the terminal login I was trying to blindly login when suddenly the terminal did appear. I logged in and it was at 7:36am when I rebooted so the above 'hangcheck' appeared while I was either trying to login or I had also been looking at the size of my syslog which BTW is over 20Mb. So in regards to your question from last night, a definite yes, but I have no idea what the reason is as usual.
penalvch (penalvch) wrote : | #57 |
chris pollock, just to clarify, did this issue occur, or not occur with kernel 3.13.0-35 (just yes it did, or no it did not)?
tags: |
added: kernel-bug-exists-upstream-4.0-rc4 removed: kernel-bug-exists-upstream-3.19-rc5 |
chris pollock (cpollock) wrote : | #58 |
Yes
chris pollock (cpollock) wrote : | #59 |
At approximately 7:20pm, 19 March 2015 I noticed that again all I saw on my monitor was a black screen with the mouse cursor that was movable. I did a CTRL>ALT>F1 and left the room and came back about 5 minutes later when I was able to log-in and run 'sudo reboot'. This time I ensured that I did a message body check on the hourly syslog snippets for 'hangcheck' and there was none associated with this lockup. Now I do have another question associated with my comment #53. Since what I've shown in that comment is in being written to my syslog it is now over 32mb in size and that is only for today. Is this another bug I need to report in association with the 4.0 kernel since it didn't start happening until I started running it or what? I checked earlier today for an rc5 but didn't see one.
Thanks
penalvch (penalvch) wrote : | #60 |
chris pollock, did this problem not occur in a release prior to Trusty?
tags: |
added: regression-potential removed: needs-bisect regression-update |
chris pollock (cpollock) wrote : | #61 |
I can't provide that information since I didn't begin to run Trusty until around late July of last year when my other Linux box finally died and I decided to run Ubuntu. Prior to that I ran Mandriva.
chris pollock (cpollock) wrote : | #62 |
Christopher, I asked this question, After boot kernel: [ 1228.531419] usblp0: removed kernel: [ 1228.532703] usblp 1-3.5:1.0: usblp0: USB Bidirectional printer dev 6 if 0 alt 0 proto 2 vid 0x03F0 pid 0x2B17 is added to syslog every 6 seconds here - https:/
Ubuntu 14.04.2 LTS, kernel 4.0.0-040000rc4
Mar 20 11:23:13 localhost kernel: [ 1757.100062] usblp0: removed
Mar 20 11:23:13 localhost kernel: [ 1757.101389] usblp 1-3.5:1.0: usblp0: USB Bidirectional printer dev 6 if 0 alt 0 proto 2 vid 0x03F0 pid 0x2B17
Mar 20 11:23:19 localhost kernel: [ 1763.104667] usblp0: removed
Mar 20 11:23:19 localhost kernel: [ 1763.105807] usblp 1-3.5:1.0: usblp0: USB Bidirectional printer dev 6 if 0 alt 0 proto 2 vid 0x03F0 pid 0x2B17
Mar 20 11:23:25 localhost kernel: [ 1769.110688] usblp0: removed
Mar 20 11:23:25 localhost kernel: [ 1769.111851] usblp 1-3.5:1.0: usblp0: USB Bidirectional printer dev 6 if 0 alt 0 proto 2 vid 0x03F0 pid 0x2B17
However as soon as I print something, in this case a test page from the CUPS web interface, it stops:
Mar 20 11:24:19 localhost kernel: [ 1823.166289] usblp0: removed
Mar 20 11:24:19 localhost kernel: [ 1823.167583] usblp 1-3.5:1.0: usblp0: USB Bidirectional printer dev 6 if 0 alt 0 proto 2 vid 0x03F0 pid 0x2B17
Mar 20 11:24:20 localhost hp[8064]: io/hpmud/model.c 108: unable to open /etc/hp/hplip.conf: No such file or directory
Mar 20 11:24:20 localhost hp[8064]: io/hpmud/model.c 532: no HP_LaserJet_1020 attributes found in /data/models/
Mar 20 11:24:20 localhost hp[8064]: io/hpmud/model.c 543: no HP_LaserJet_1020 attributes found in /data/models/
Mar 20 11:24:21 localhost foo2zjs-wrapper: foo2zjs-wrapper -z1 -P -L0 -r1200x600 -p1 -T3 -m1 -s7 -n1
Mar 20 11:24:22 localhost kernel: [ 1826.056212] usblp0: removed
Mar 20 11:24:22 localhost foo2zjs-wrapper: gs -sPAPERSIZE=letter -g10200x6600 -r1200x600 -sDEVICE=pbmraw -dCOLORSCREEN -dMaxBitmap=
Mar 20 11:24:22 localhost foo2zjs-wrapper: foo2zjs -r1200x600 -g10200x6600 -p1 -m1 -n1 -d1 -s7 -z1 -u 192x96 -l 192x96 -L 0 -T3 -P
Notice it's every 6 seconds and goes away after I print something.
penalvch (penalvch) wrote : | #63 |
chris pollock, just to clarify Launchpad is not the correct venue for reporting issues with the upstream kernel.
However, Launchpad is the correct venue for your problem, as it is reproducible with the Ubuntu kernel, and as a part of the debugging process, the upstream kernel is tested.
Despite this, for regression testing purposes, could you please test for this via http://
chris pollock (cpollock) wrote : | #64 |
I take it you just want me to run via the live DVD and not do a full install correct?
chris pollock (cpollock) wrote : | #65 |
I'd like to ask you another question Christopher, am I the only person in the whole world running Ubuntu that has gone through all the kernel versions we've been testing that has this problem?
chris pollock (cpollock) wrote : | #66 |
So, I come into the computer room this morning, turn on the monitor, move the mouse and of course I see just the black screen and mouse cursor. This was 6:49am. I do a CTRL>ALT>F1 and walk away to take care of a few things and at 7:02 I came back, did a log-in via the terminal and entered the 'sudo reboot' command. After the boot was completed I did a 'body' search of my syslog hourly snippets for 'hangcheck' and there were none found from last night when I left the system until this morning when I went and found the black screen. What I did see in my 7am (which is really from 6am to 6:59am) snippet is in the attachment. Have no idea what it means but as I said above 6:49 was when I move the cursor as I usually do to get out of the screensaver. Could X-Screensaver have anything to do with this issue? I notice that there's a spamd run in this file, please ignore it. I just captured the whole area that looked like the trace.
penalvch (penalvch) wrote : | #67 |
chris pollock:
>"I take it you just want me to run via the live DVD and not do a full install correct?"
If the issue has been reproducible in a live environment, then testing that would be fine.
chris pollock (cpollock) wrote : | #68 |
How long do you think I should run the live DVD? I can probably only run it during the night from say 10pm my time until about 8am my time as I have lots of other work going on my system during the day and I don't have another system to do the testing on. Also did you have a chance to see my comment #65?
chris pollock (cpollock) wrote : | #70 |
Christopher, some things of note:
1. I went a bit over two days on 4.0.0-040000rc4
chris pollock (cpollock) wrote : | #71 |
A new kernel came down from Ubuntu - 3.13.0-48-generic #80-Ubuntu SMP Thu Mar 12 11:16:15 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux the other day so I've been running it. It ran for 3d 6h 07m before locking up at 21:26:56 on 26 March 2015. This time there was no sign of the 'Hangcheck.....' error anywhere in my syslog. I have discovered these kernels - Index of /~kernel-
chris pollock (cpollock) wrote : | #72 |
I'd like to post an update to this bug report. For about two weeks now I've been running kernel 4.0.0-997-generic #201503310205 SMP Tue Mar 31 02:07:04 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux which I got here http://
jippie (jph4dotcom) wrote : | #73 |
Similar, probalby same problem here.
- I've been running Kubuntu 14.04 for about a year with no problems. Problem started about 3 months ago or so, not entirely sure about the period.
- The screen usually locks up when using Firefox, today it happen when I switched windows from Firefox to Konsole (the GUI app). The screen locked half way switching the window borders from blue to grey. (Could it be the screen candy thingy that also does the wobbly windows, what's the name again?). The konsole window border on my right monitor is was grey, is now locked faint blue and should have turned to full blue. The Firefox window on the left monitor was full blue, is now slightly going to grey and locked right there and then.
- I don't use a Dell computer, but some Intel mother board based system (DG43GT with a Q9550).
- When the system is locked, which happens every two or three weeks, I can sometimes change to text console but not always. Currently I cannot access console, but I can log in to the box through SSH.
- Screen blanking (power save) doesn't work when the screen locked.
Linux diablo 3.13.0-49-generic #83-Ubuntu SMP Fri Apr 10 20:11:33 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux
When my box locks up, I usually run full patching before power cycle, some messages in log files may be caused by that. I attached log files and I believe in the past I have some debug headers installed on my system which seems to list somewhat useful information.
[vr mei 15 23:12:46 2015] [drm:i915_
[vr mei 15 23:15:36 2015] INFO: task Xorg:2243 blocked for more than 120 seconds.
[vr mei 15 23:15:36 2015] Tainted: G OX 3.13.0-49-generic #83-Ubuntu
[vr mei 15 23:15:36 2015] "echo 0 > /proc/sys/
[vr mei 15 23:15:36 2015] Xorg D ffff88023bd134c0 0 2243 2230 0x00400000
[vr mei 15 23:15:36 2015] ffff8800b29d7a18 0000000000000086 ffff88022a6d1800 ffff8800b29d7fd8
[vr mei 15 23:15:36 2015] 00000000000134c0 00000000000134c0 ffff88022a6d1800 ffff880035523000
[vr mei 15 23:15:36 2015] ffff880035472ad8 ffff880035eda800 ffff880035eda800 ffff8800032da780
[vr mei 15 23:15:36 2015] Call Trace:
[vr mei 15 23:15:36 2015] [<ffffffff81725
[vr mei 15 23:15:36 2015] [<ffffffffa015e
[vr mei 15 23:15:36 2015] [<ffffffff810ab
[vr mei 15 23:15:36 2015] [<ffffffffa0170
[vr mei 15 23:15:36 2015] [<ffffffffa0026
[vr mei 15 23:15:36 2015] [<ffffffffa00a4
[vr mei 15 23:15:36 2015] [<ffffffff813cd
[vr mei 15 23:15:36 2015] [<ffffffff810a2
[vr mei 15 23:15:36 2015] [<ffffffff813da
[vr mei 15 23:15:36 2015] [<ffffffff81462
[vr mei 15 23:15:36 2015] [<ffffffff81458
[vr mei 15 23:15:36 2015] [<ffffffff81459
jippie (jph4dotcom) wrote : | #74 |
Quick question:
I can probably figure out from back ups which kernel I was running about 6 months ago, would it be possible to roll back to an older kernel version?
20140601/
20140601/
20140706/
20140706/
20140803/
20140907/
20141005/
20141102/
20141207/
20150104/
20150201/
20150301/
20150405/
20150405/
20150412/
20150412/
chris pollock (cpollock) wrote : | #75 |
I've had several video lockups since running the kernel and driver that I mentioned in comment 72. I've been trying to figure out a way to get a back trace. All I've been able to come up with so far is this from the /var/log/kern.log tied to the latest lockup:
May 13 17:59:19 localhost kernel: [216520.292010] ------------[ cut here ]------------
May 13 17:59:19 localhost kernel: [216520.292037] WARNING: CPU: 1 PID: 1157 at /home/kernel/
May 13 17:59:19 localhost kernel: [216520.292039] vblank wait timed out on crtc 0
May 13 17:59:19 localhost kernel: [216520.292041] Modules linked in: btrfs xor raid6_pq ufs qnx4 hfsplus hfs minix ntfs msdos jfs xfs libcrc32c ses enclosure coretemp snd_hda_
May 13 17:59:19 localhost kernel: [216520.292093] CPU: 1 PID: 1157 Comm: Xorg Not tainted 4.0.0-997-generic #201503310205
May 13 17:59:19 localhost kernel: [216520.292095] Hardware name: Dell Inc. OptiPlex 780 /0C27VV, BIOS A15 08/06/2013
May 13 17:59:19 localhost kernel: [216520.292097] 0000000000000476 ffff8800d6c1fa08 ffffffff817e3106 0000000000000007
May 13 17:59:19 localhost kernel: [216520.292100] ffff8800d6c1fa58 ffff8800d6c1fa48 ffffffff810791b7 0000000000000286
May 13 17:59:19 localhost kernel: [216520.292103] 0000000000000000 ffff8800d6f3b800 0000000000000000 0000000000000000
May 13 17:59:19 localhost kernel: [216520.292107] Call Trace:
May 13 17:59:19 localhost kernel: [216520.292114] [<ffffffff817e3
May 13 17:59:19 localhost kernel: [216520.292119] [<ffffffff81079
May 13 17:59:19 localhost kernel: [216520.292122] [<ffffffff81079
May 13 17:59:19 localhost kernel: [216520.292127] [<ffffffff810bb
May 13 17:59:19 localhost kernel: [216520.292140] [<ffffffffc0371
May 13 17:59:19 localhost kernel: [216520.292144] [<ffffffff810bb
May 13 17:59:19 localhost kernel: [216520.292157] [<ffffffffc0371
May 13 17:59:19 localhost kernel: [216520.292167] [<ffffffffc03cd
May 13 17:59:19 localhost kernel: [216520.292175] [<ffffffffc03cd
May 13 17:59:19 localhost kernel: [216520.292193] [<ffffffffc037b
May 13 17:59:19 localhost kernel: [216520.292203] [<ffffffffc037b
May 13 17:59:19 localhost kernel: [216520.292206] [<ffffffff817ed
jippie (jph4dotcom) wrote : | #76 |
For what it is worth: X is waiting for I/O, hence the D state:
$ ps aux | grep X
root 2243 0.9 0.1 552832 14336 tty7 Ds+ mei03 176:23 /usr/bin/X -core :0 -seat seat0 -auth /var/run/
Not sure how I can figure out which file X is waiting for, lsof spitrs out over 500 lines.
penalvch (penalvch) wrote : | #77 |
jippie, it will help immensely if you filed a new report via a terminal:
ubuntu-bug linux
Please feel free to subscribe me to it.
jippie (jph4dotcom) wrote : | #78 |
Filed a new report under: https:/
Josh Rosenberg (7-launchpad-desh-info) wrote : | #79 |
I have been experiencing the same problem periodically (the "[drm:i915_
My syslog snippet is attached.
Robert Hrovat (robi-hipnos) wrote : | #80 |
It happens multiple times a day on almost every machine at work. All machines has intel built in graphics and the only common thing about this bug is that there was always web browser running some page with flash.
penalvch (penalvch) wrote : | #81 |
Josh Rosenberg / Robert Hrovat, it will help immensely if you filed a new report via a terminal:
ubuntu-bug linux
Please feel free to subscribe me to it.
Robert Hrovat (robi-hipnos) wrote : | #82 |
Christopher, I think I might find workaround by uninstalling compiz and use gnome fallback. Which is default on all machines in company. It's the second day when none of machine crashed.
Josh Rosenberg (7-launchpad-desh-info) wrote : | #83 |
Unfortunately I haven't been able to use ubuntu-bug to open a new report, though I still experience the problem regularly (including twice within the past hour).
chris pollock (cpollock) wrote : | #84 |
Josh, here's another bug report I filed on this - https:/
penalvch (penalvch) wrote : | #85 |
Josh Rosenberg, in order to be most helpful, you will want to file a new report (not add anything to an already existing report) via http://
For more on why this is helpful please see https:/
Josh Rosenberg (7-launchpad-desh-info) wrote : | #86 |
I just filed a new bug at https:/
This change was made by a bot.