Bug #1097178 “nouveau failed to idle channel 0xcccc0000” : Bugs : linux-lts-raring package : Ubuntu

Poil (poil) on 2013-01-08

description:

updated

Revision history for this message

Hein van Dam (h-t-vandam) wrote on 2013-01-14:

#1

With kernel 3.8 I have the same problem, with kernel 3.7 I can still boot. It must be the nvidia card as the 3.8 kernel works fine with my msi wind netbook.

Revision history for this message

Launchpad Janitor (janitor) wrote on 2013-01-18:

#2

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux-lts-quantal (Ubuntu):
status:	New → Confirmed

Revision history for this message

xxx (xgddghxx-deactivatedaccount-deactivatedaccount) wrote on 2013-02-14:

#3

I've run a bisect on Linus' master branch, and narrowed the issue down to this one line change, which isn't much to go on:

commit 7707b701ebfea64afa6bfb23aa318fd687892754
Author: Marcin Slusarz <email address hidden>
Commit: Ben Skeggs <email address hidden>

drm/nv40/mpeg: fix context handling

It slipped in thanks to typeless API.

Signed-off-by: Marcin Slusarz <email address hidden>
Signed-off-by: Ben Skeggs <email address hidden>

diff --git a/drivers/gpu/drm/nouveau/core/engine/mpeg/nv40.c b/drivers/gpu/drm/n
index 1241857..f7c581a 100644
--- a/drivers/gpu/drm/nouveau/core/engine/mpeg/nv40.c
+++ b/drivers/gpu/drm/nouveau/core/engine/mpeg/nv40.c
@@ -38,7 +38,7 @@ struct nv40_mpeg_priv {
};

struct nv40_mpeg_chan {
- struct nouveau_mpeg base;
+ struct nouveau_mpeg_chan base;
};

Revision history for this message

xxx (xgddghxx-deactivatedaccount-deactivatedaccount) wrote on 2013-02-15:

#4

Bug also exists in Nouveau driver bugzilla, which I've also updated with the bisect result:

https://bugs.freedesktop.org/show_bug.cgi?id=54786

Revision history for this message

xxx (xgddghxx-deactivatedaccount-deactivatedaccount) wrote on 2013-02-16:

#5

Nouveau driver bug comment states that this issue is no longer present in the latest kernel - after updating to latest from Linus' tree (commit 323a72d83c9) and running, I can confirm that this is the case.

xxx (xgddghxx-deactivatedaccount-deactivatedaccount) on 2013-02-18

tags:

added: kernel-fixed-upstream

Revision history for this message

Jan K. (jan-launchpad-kantert) wrote on 2013-05-24:

#6

Will there be a kernel update to 3.9 with this fix for 13.04? Also affects Thinkpad T420s with Nvidia Optimus.

Revision history for this message

Launchpad Janitor (janitor) wrote on 2013-05-31:

#7

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux-lts-raring (Ubuntu):
status:	New → Confirmed

Revision history for this message

FuzzyQ (atomicfuzzyq) wrote on 2013-05-31:

#8

I'm experiencing this bug on a MSI GT70 with a GeForce GTX 675MX (Optimus).

Revision history for this message

Mohan (dr-mohan) wrote on 2013-10-09:

#9

This bug affects me too. Details as follows
Kernel 3.8.0-31-generic #46-Ubuntu SMP Tue Sep 10 20:03:44 UTC 2013 x86_64

Revision history for this message

giri (bollsg) wrote on 2013-12-14:

#10

I see the nouveau failed due to idle channel error on installing 12.10.

Revision history for this message

Marc Quinton (mquinton) wrote on 2014-04-04:

#11

Hello.

I have this bug, with ubuntu-trusty, in early stage, when I ready some video with VLC.
- my video card : NVIDIA Corporation GT218 [NVS 300]
- xserver-xorg-video-nouveau, 1:1.0.10-1ubuntu2

best regards.

Revision history for this message

Marc Quinton (mquinton) wrote on 2014-04-04:

#12

redhat bugzilla point out to a kernel patch for 3.14 : https://bugzilla.redhat.com/show_bug.cgi?id=918732

Revision history for this message

Vova U (uwl) wrote on 2014-04-23:

#13

after the upgrade from 12.04 to 14.04 the systemis not useable any more

Revision history for this message

Marian Krause (mkdugi) wrote on 2014-04-27:

#14

Hi,

This bug affects me :-(.

Up to date Ubuntu trusty 14.04 (today 2014.04.27).
01:00.0 VGA compatible controller: NVIDIA Corporation G71GL [Quadro FX 3500] (rev a1)
xserver-xorg-video-nouveau 1:1.0.10-1ubuntu2

Syslog:
Apr 27 17:27:05 cacko kernel: [ 852.982767] nouveau E[Xorg[3900]] failed to idle channel 0xcccc0001 [Xorg[3900]]
Apr 27 17:27:20 cacko kernel: [ 867.981445] nouveau E[Xorg[3900]] failed to idle channel 0xcccc0001 [Xorg[3900]]
Apr 27 17:27:35 cacko kernel: [ 882.980118] nouveau E[Xorg[3900]] failed to idle channel 0xcccc0000 [Xorg[3900]]
Apr 27 17:27:50 cacko kernel: [ 897.978806] nouveau E[Xorg[3900]] failed to idle channel 0xcccc0000 [Xorg[3900]]

X freezes. I can change to text terminal, do some things and reboot (reboot takes long).

This appear in my system when Adobe Flash Player trys to play flash in Firefox (didn't check with other browsers).
Maby not every flash.

Revision history for this message

Jakub Liška (liska-jakub) wrote on 2014-05-01:

#15

I got it too, X freezes right after this :

May 1 16:05:56 lisak kernel: [ 1588.032792] nouveau E[Xorg[1323]] failed to idle channel 0xcccc0000 [Xorg[1323]]
May 1 16:05:57 lisak gnome-session[1879]: Gdk-WARNING: gnome-session: Fatal IO error 11 (Resource temporarily unavailable) on X server :0.#012
May 1 16:05:57 lisak colord: device removed: xrandr-BenQ-BenQ G2411HD-D4910151SL0
May 1 16:05:57 lisak colord: Profile removed: icc-d63583c0a76a513bac920519a90532e8
May 1 16:06:12 lisak kernel: [ 1603.182708] nouveau E[compiz[2063]] failed to idle channel 0xcccc0000 [compiz[2063]]

VGA compatible controller: NVIDIA Corporation C77 [GeForce 8200] (rev a2)

dpkg -l | grep nouveau
ii libdrm-nouveau2:amd64 2.4.52-1 amd64 Userspace interface to nouveau-specific kernel DRM services -- runtime
ii xserver-xorg-video-nouveau 1:1.0.10-1ubuntu2 amd64 X.Org X server -- Nouveau display driver

uname -a
Linux lisak 3.13.0-24-generic #46-Ubuntu SMP Thu Apr 10 19:11:08 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux

Revision history for this message

Jakub Liška (liska-jakub) wrote on 2014-05-01:

#16

Btw I don't know if it is related to this issue, but sometimes I also get this error :

411.265798] nouveau E[ PFIFO][0000:02:00.0] DMA_PUSHER - ch 2 [Xorg[1108]] get 0x00200358c4 put 0x00200358e0 ib_get 0x000000cc ib_put 0x000000cf state 0x80000000 (err: INVALID_CMD) push 0x00400040

After that my thinkpad usb keyboard stops working. There is nothing logged about it though except for this nouveau error.

Revision history for this message

Sven Arnold (sven-internetallee) wrote on 2014-05-16:

#17

I see this error also repeatedly since upgrading from 13.10 to 14.04. The system is unusable since then.
I tried to use different version of the nvidia proprietary drivers but not of them did run stable either. Currently I use updated drivers from oibaf ppa.

Kernel:
3.13.0-24-generic #47-Ubuntu SMP Fri May 2 23:30:00 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux

Graphics Device:
VGA compatible controller: NVIDIA Corporation C79 [GeForce 9400] (rev b1)

xserver-xorg-video-nouveau 1:1.0.10+git1405091930.8604a7~gd~t
libdrm-nouveau2:amd64 2.4.54+git1405131830.305478~gd~t
libdrm2:amd64 2.4.54+git1405131830.305478~gd~t

[ 4657.804009] nouveau E[Xorg[1120]] failed to idle channel 0xcccc0001 [Xorg[1120]]
[ 4657.804035] nouveau E[ PFB][0000:02:00.0] trapped write at 0x010028e3a8 on channel 0x0001fee0 [unknown] BAR/PFIFO_WRITE/IN reason: PAGE_NOT_PRESENT
[ 4672.804008] nouveau E[Xorg[1120]] failed to idle channel 0xcccc0001 [Xorg[1120]]
[ 4672.804036] nouveau E[ PFB][0000:02:00.0] trapped write at 0x010028e398 on channel 0x0001fee0 [unknown] BAR/PFIFO_WRITE/IN reason: PAGE_NOT_PRESENT
[ 4672.804063] nouveau E[ PFB][0000:02:00.0] trapped write at 0x010028e3a0 on channel 0x0001fee0 [unknown] BAR/PFIFO_WRITE/IN reason: PAGE_NOT_PRESENT
[ 4672.804078] nouveau E[ PFB][0000:02:00.0] trapped write at 0x010028e390 on channel 0x0001fee0 [unknown] BAR/PFIFO_WRITE/IN reason: PAGE_NOT_PRESENT
[ 4672.804093] nouveau E[ PFB][0000:02:00.0] trapped write at 0x010028e388 on channel 0x0001fee0 [unknown] BAR/PFIFO_WRITE/IN reason: PAGE_NOT_PRESENT
[ 4672.804126] nouveau E[ PFB][0000:02:00.0] trapped write at 0x01003f9020 on channel 0x0001fee0 [unknown] BAR/PFIFO_WRITE/IN reason: PAGE_NOT_PRESENT
[ 4672.804151] nouveau E[ PFB][0000:02:00.0] trapped write at 0x0100000000 on channel 0x0001fee0 [unknown] BAR/PFIFO_WRITE/IN reason: PAGE_NOT_PRESENT
[ 4676.506881] nouveau W[ PFIFO][0000:02:00.0] unknown intr 0x06080000, ch 127

above line is repeated about 100 times, then:

May 16 22:12:33 comet kernel: [ 4676.507823] nouveau E[ PFIFO][0000:02:00.0] still angry after 101 spins, halt

I see this error also repeatedly since upgrading from 13.10 to 14.04. The system is unusable since then.
I tried to use different version of the nvidia proprietary drivers but not of them did run stable either. Currently I use updated drivers from oibaf ppa.

Kernel:
3.13.0-24-generic #47-Ubuntu SMP Fri May 2 23:30:00 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux

Graphics Device:
VGA compatible controller: NVIDIA Corporation C79 [GeForce 9400] (rev b1)

xserver-xorg-video-nouveau    1:1.0.10+git1405091930.8604a7~gd~t
libdrm-nouveau2:amd64           2.4.54+git1405131830.305478~gd~t
libdrm2:amd64                            2.4.54+git1405131830.305478~gd~t

[ 4657.804009] nouveau E[Xorg[1120]] failed to idle channel 0xcccc0001 [Xorg[1120]]
[ 4657.804035] nouveau E[     PFB][0000:02:00.0] trapped write at 0x010028e3a8 on channel 0x0001fee0 [unknown] BAR/PFIFO_WRITE/IN reason: PAGE_NOT_PRESENT
[ 4672.804008] nouveau E[Xorg[1120]] failed to idle channel 0xcccc0001 [Xorg[1120]]
[ 4672.804036] nouveau E[     PFB][0000:02:00.0] trapped write at 0x010028e398 on channel 0x0001fee0 [unknown] BAR/PFIFO_WRITE/IN reason: PAGE_NOT_PRESENT
[ 4672.804063] nouveau E[     PFB][0000:02:00.0] trapped write at 0x010028e3a0 on channel 0x0001fee0 [unknown] BAR/PFIFO_WRITE/IN reason: PAGE_NOT_PRESENT
[ 4672.804078] nouveau E[     PFB][0000:02:00.0] trapped write at 0x010028e390 on channel 0x0001fee0 [unknown] BAR/PFIFO_WRITE/IN reason: PAGE_NOT_PRESENT
[ 4672.804093] nouveau E[     PFB][0000:02:00.0] trapped write at 0x010028e388 on channel 0x0001fee0 [unknown] BAR/PFIFO_WRITE/IN reason: PAGE_NOT_PRESENT
[ 4672.804126] nouveau E[     PFB][0000:02:00.0] trapped write at 0x01003f9020 on channel 0x0001fee0 [unknown] BAR/PFIFO_WRITE/IN reason: PAGE_NOT_PRESENT
[ 4672.804151] nouveau E[     PFB][0000:02:00.0] trapped write at 0x0100000000 on channel 0x0001fee0 [unknown] BAR/PFIFO_WRITE/IN reason: PAGE_NOT_PRESENT
[ 4676.506881] nouveau W[   PFIFO][0000:02:00.0] unknown intr 0x06080000, ch 127

above line is repeated about 100 times, then:

May 16 22:12:33 comet kernel: [ 4676.507823] nouveau E[   PFIFO][0000:02:00.0] still angry after 101 spins, halt

Revision history for this message

Juan (elkato) wrote on 2014-05-31:

#18

Sven,

Are you still using 14.04 and nvidia?
How you solved?
I'm at that point. My card is optimus nvidia.

Thanks in advance!

Revision history for this message

Sven Arnold (sven-internetallee) wrote on 2014-06-01:

#19

Juan,

meanwhile I noted that I have a potential hardware problem (Mainboard or RAM) which added confusion:

Strangely, my system works with one DDR2 module of 2GB but crashes when using two modules. The RAM itself seems ok (tried each of the four modules separately tried every memory bank). While this could be induced by a problem on the mainboard and/or power supply it is still confusing that the problem occured exactly when upgrading to 14.04.

Anyways: Currently, with one 2GB DIMM in use I have a probably working setup:

kernel 3.13.0-27
xserver-xorg-video-nouveau 1:1.0.10+git1405261930.4a18dd~gd~t
libdrm-nouveau2:amd64 2.4.54+git1405200630.8fc62c~gd~t
nouveau-firmware 20091212-0ubuntu1

graphics devices drivers are from oibaf ppa

Best regards,

Sven

Revision history for this message

penalvch (penalvch) wrote on 2014-06-02:

#21

Poil, thank you for taking the time to report this bug and helping to make Ubuntu better. Please execute the following command, as it will automatically gather debugging information, in a terminal:
apport-collect 1097178
When reporting bugs in the future please use apport by using 'ubuntu-bug' and the name of the package affected. You can learn more about this functionality at https://wiki.ubuntu.com/ReportingBugs.

tags:	added: regression-release
tags:	removed: kernel-fixed-upstream
no longer affects:	linux-lts-quantal (Ubuntu)
Changed in linux-lts-raring (Ubuntu):
importance:	Undecided → Low
status:	Confirmed → Incomplete
tags:	added: raring

Revision history for this message

John Small (jds340) wrote on 2014-06-05:

#22

Why is this listed as low importance. For people who have the problem it's high importance. I can't use my laptop becauses of it.

I can't wait for a fix. I'm using 13.10, so I'll wipe it and try 14.04. It's that serious I have to wipe my setup and do a complete re-install

Revision history for this message

Poil (poil) wrote on 2014-06-05:

#23

@Christopher M. Penalver (penalvch)

Sorry I'm no more using an Nvidia cards on my computer; I can give my old card if someone want to debug ...

Revision history for this message

penalvch (penalvch) wrote on 2014-06-05:

#24

Poil, this bug report is being closed due to your last comment https://bugs.launchpad.net/ubuntu/+source/linux-lts-raring/+bug/1097178/comments/23 regarding you are no longer using the hardware. For future reference you can manage the status of your own bugs by clicking on the current status in the yellow line and then choosing a new status in the revealed drop down box. You can learn more about bug statuses at https://wiki.ubuntu.com/Bugs/Status. Thank you again for taking the time to report this bug and helping to make Ubuntu better. Please submit any future bugs you may find.

John Small, thank you for your comment. So your hardware and problem may be tracked, could you please file a new report with Ubuntu by executing the following in a terminal while booted into the default Ubuntu kernel (not a mainline one) via:
ubuntu-bug linux

For more on this, please read the official Ubuntu documentation:
Ubuntu Bug Control and Ubuntu Bug Squad: https://wiki.ubuntu.com/Bugs/BestPractices#X.2BAC8-Reporting.Focus_on_One_Issue
Ubuntu Kernel Team: https://wiki.ubuntu.com/KernelTeam/KernelTeamBugPolicies#Filing_Kernel_Bug_reports
Ubuntu Community: https://help.ubuntu.com/community/ReportingBugs#Bug_reporting_etiquette

When opening up the new report, please feel free to subscribe me to it.

Thank you for your understanding.

Helpful bug reporting tips:
https://wiki.ubuntu.com/ReportingBugs

Changed in linux-lts-raring (Ubuntu):
status:	Incomplete → Invalid

Revision history for this message

John Small (jds340) wrote on 2014-06-07:

#25

Ok will do. It'll be a while yet. I've been through many cycles of install-wipe-install trying to sort this one out.

Perham (perham-x) on 2014-07-03

Changed in linux-lts-raring (Ubuntu):
status:	Invalid → Confirmed

penalvch (penalvch) on 2014-07-04

Changed in linux-lts-raring (Ubuntu):
status:	Confirmed → Invalid

Revision history for this message

Zerosith (zerosith) wrote on 2014-07-07:

#26

It also happens to me, I don't think this bug should be closed. I have an asus n53g with optimus support and haven't been able to fix this with the nvidia propietary drivers.

Can someone provide more assistance?

Thanks

Revision history for this message

penalvch (penalvch) wrote on 2014-07-07:

#27

Zerosith, thank you for your comment. So your hardware and problem may be tracked, could you please file a new report with Ubuntu by executing the following in a terminal while booted into the default Ubuntu kernel (not a mainline one) via:
ubuntu-bug linux

For more on this, please read the official Ubuntu documentation:
Ubuntu Bug Control and Ubuntu Bug Squad: https://wiki.ubuntu.com/Bugs/BestPractices#X.2BAC8-Reporting.Focus_on_One_Issue
Ubuntu Kernel Team: https://wiki.ubuntu.com/KernelTeam/KernelTeamBugPolicies#Filing_Kernel_Bug_reports
Ubuntu Community: https://help.ubuntu.com/community/ReportingBugs#Bug_reporting_etiquette

When opening up the new report, please feel free to subscribe me to it.

Thank you for your understanding.

Helpful bug reporting tips:
https://wiki.ubuntu.com/ReportingBugs

Revision history for this message

Jakub Liška (liska-jakub) wrote on 2014-08-10:

#28

So, it seems I'm stuck with Raring for good on this hardware :-)

Revision history for this message

penalvch (penalvch) wrote on 2014-08-11:

#29

Jakub Liška, unfortunately as this bug report is closed, it has nothing to do with you, your problem, or your hardware. So your hardware and problem may be tracked, could you please file a new report with Ubuntu by executing the following in a terminal while booted into the default Ubuntu kernel (not a mainline one) via:
ubuntu-bug linux

For more on this, please read the official Ubuntu documentation:
Ubuntu Bug Control and Ubuntu Bug Squad: https://wiki.ubuntu.com/Bugs/BestPractices#X.2BAC8-Reporting.Focus_on_One_Issue
Ubuntu Kernel Team: https://wiki.ubuntu.com/KernelTeam/KernelTeamBugPolicies#Filing_Kernel_Bug_reports
Ubuntu Community: https://help.ubuntu.com/community/ReportingBugs#Bug_reporting_etiquette

When opening up the new report, please feel free to subscribe me to it.

Thank you for your understanding.

Helpful bug reporting tips:
https://wiki.ubuntu.com/ReportingBugs

Revision history for this message

linas (linasvepstas) wrote on 2014-08-31:

#30

FWIW, hit this just now, with brand new kernel 3.16.1 and ubuntu precise LTS 12.04

about 30-60 seconds into boot, after X comes up, before loging in:
nouveau failed to idle channel 0xcccc0000
then monitor shuts off, and the system hangs hard, cannot alt-f1 to get to a tty

Revision history for this message

linas (linasvepstas) wrote on 2014-08-31:

#31

The same bug hits redhat too, and seems to affect *all* kernels after 3.7 See https://bugzilla.redhat.com/show_bug.cgi?id=918732 for details. That bug report also suggests a hacky kernel patch that appears to avoid a race condition, and is claimed to fix the problem. Will try it out shortly.

Revision history for this message

penalvch (penalvch) wrote on 2014-08-31:

#32

linas, unfortunately as this bug report is closed, it has nothing to do with you, your problem, or your hardware. So your hardware and problem may be tracked, could you please file a new report with Ubuntu by executing the following in a terminal while booted into the default Ubuntu kernel (not a mainline one) via:
ubuntu-bug linux

For more on this, please read the official Ubuntu documentation:
Ubuntu Bug Control and Ubuntu Bug Squad: https://wiki.ubuntu.com/Bugs/BestPractices#X.2BAC8-Reporting.Focus_on_One_Issue
Ubuntu Kernel Team: https://wiki.ubuntu.com/KernelTeam/KernelTeamBugPolicies#Filing_Kernel_Bug_reports
Ubuntu Community: https://help.ubuntu.com/community/ReportingBugs#Bug_reporting_etiquette

When opening up the new report, please feel free to subscribe me to it.

Thank you for your understanding.

Helpful bug reporting tips:
https://wiki.ubuntu.com/ReportingBugs

Revision history for this message

linas (linasvepstas) wrote on 2014-08-31:

#33

The kernel patches there are not sufficent to resolve the hang for me.

Revision history for this message

linas (linasvepstas) wrote on 2014-08-31:

#34

Harrumph. The indicated kernel patch does make the "failed to idle channel 0xcccc0000" message go away!

specifically, this:

diff --git a/drivers/gpu/drm/nouveau/core/subdev/mc/base.c
b/drivers/gpu/drm/nouveau/core/subdev/mc/base.c
index b4b9943..719db60 100644
--- a/drivers/gpu/drm/nouveau/core/subdev/mc/base.c
+++ b/drivers/gpu/drm/nouveau/core/subdev/mc/base.c
@@ -49,6 +49,8 @@ nouveau_mc_intr(int irq, void *arg)
if (pmc->use_msi)
oclass->msi_rearm(pmc);

+ udelay(1);
+
        if (intr) {
                u32 stat = intr = nouveau_mc_intr_mask(pmc);
                while (map->stat) {

It does NOT fix the X11 crash/hang, and appears to maybe never have been the root cause of the crash/hang. Why? Because system can still be ssh'ed into. Only the keyboard is unresponsive (and the monitor is self-powers off) (can't alt-f1 switch to tty)

The root cause appears to be this: in /var/log/Xorg.0.log:
33.188] [mi] EQ overflowing. Additional events will be discarded until existing events are processed.
followed by dozen or more stack traces

ps aux shows that X server is in D state (uninterruptible sleep) and of course kill -9 does not work on it, as a result.
top shows bizarre stuff -- a high loadavg, but idle system . Hrmmm Go figure. That's wrong.

top - 22:03:52 up 19 min, 1 user, load average: 3.00, 2.99, 2.22
Tasks: 133 total, 1 running, 132 sleeping, 0 stopped, 0 zombie
Cpu(s): 0.0%us, 0.0%sy, 0.0%ni,100.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st

Revision history for this message

linas (linasvepstas) wrote on 2014-08-31:

#35

Download full text (29.2 KiB)

some more uninterruptible sleep stuff, from /var/log:

Aug 30 21:48:43 blackspot kernel: [ 240.160069] kworker/1:2 D 0000000000000001 0 64 2 0x00000000
Aug 30 21:48:43 blackspot kernel: [ 240.160088] Workqueue: pm pm_runtime_work
Aug 30 21:50:42 blackspot kernel: [ 360.160069] kworker/0:1 D 0000000000000001 0 26 2 0x00000000
Aug 30 21:50:42 blackspot kernel: [ 360.160102] Workqueue: events output_poll_execute [drm_kms_helper]
Aug 30 21:50:46 blackspot kernel: [ 360.160397] kworker/1:2 D 0000000000000001 0 64 2 0x00000000
Aug 30 21:50:46 blackspot kernel: [ 360.160407] Workqueue: pm pm_runtime_work
Aug 30 21:50:51 blackspot kernel: [ 360.160678] Xorg D 0000000000000001 0 1946 1136 0x00000000
Aug 30 21:52:43 blackspot kernel: [ 480.160069] kworker/0:1 D 0000000000000001 0 26 2 0x00000000
Aug 30 21:52:43 blackspot kernel: [ 480.160101] Workqueue: events output_poll_execute [drm_kms_helper]
Aug 30 21:52:47 blackspot kernel: [ 480.160392] kworker/1:2 D 0000000000000001 0 64 2 0x00000000
Aug 30 21:52:47 blackspot kernel: [ 480.160402] Workqueue: pm pm_runtime_work
Aug 30 21:52:52 blackspot kernel: [ 480.160674] Xorg D 0000000000000001 0 1946 1136 0x00000000
Aug 30 21:54:43 blackspot kernel: [ 600.160071] kworker/0:1 D 0000000000000001 0 26 2 0x00000000
Aug 30 21:54:43 blackspot kernel: [ 600.160105] Workqueue: events output_poll_execute [drm_kms_helper]
Aug 30 21:54:47 blackspot kernel: [ 600.160396] kworker/1:2 D 0000000000000001 0 64 2 0x00000000
Aug 30 21:54:47 blackspot kernel: [ 600.160405] Workqueue: pm pm_runtime_work
Aug 30 21:54:53 blackspot kernel: [ 600.160677] Xorg D 0000000000000001 0 1946 1136 0x00000000

and this:

Aug 30 21:46:04 blackspot kernel: [ 8.731718] nouveau E[ DISPLAY][0000:02:00.0] 01:0130: func 08 lookup failed, -2
Aug 30 21:46:05 blackspot kernel: [ 8.731750] nouveau W[ DRM] TMDS table script pointers not stubbed
Aug 30 21:46:06 blackspot kernel: [ 9.320395] EXT4-fs (md4): mounting with "discard" option, but the device does not support discard
Aug 30 21:46:08 blackspot kernel: [ 17.034452] nouveau E[ PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x00010000 put 0x00010090 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:08 blackspot kernel: [ 17.287691] nouveau E[ PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x00010090 put 0x000100a0 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:08 blackspot kernel: [ 21.764994] nouveau E[ PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x000100a0 put 0x000100b0 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:09 blackspot kernel: [ 21.796431] NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory
Aug 30 21:46:09 blackspot kernel: [ 21.963345] nouveau E[ PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x000100b0 put 0x000100c0 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:09 blackspot kernel: [ 42.381349] nouveau E[ PFIFO][0000:01:06.0] DMA_PUSHER ...

some more uninterruptible sleep stuff,  from /var/log:

Aug 30 21:48:43 blackspot kernel: [  240.160069] kworker/1:2     D 0000000000000001     0    64      2 0x00000000
Aug 30 21:48:43 blackspot kernel: [  240.160088] Workqueue: pm pm_runtime_work
Aug 30 21:50:42 blackspot kernel: [  360.160069] kworker/0:1     D 0000000000000001     0    26      2 0x00000000
Aug 30 21:50:42 blackspot kernel: [  360.160102] Workqueue: events output_poll_execute [drm_kms_helper]
Aug 30 21:50:46 blackspot kernel: [  360.160397] kworker/1:2     D 0000000000000001     0    64      2 0x00000000
Aug 30 21:50:46 blackspot kernel: [  360.160407] Workqueue: pm pm_runtime_work
Aug 30 21:50:51 blackspot kernel: [  360.160678] Xorg            D 0000000000000001     0  1946   1136 0x00000000
Aug 30 21:52:43 blackspot kernel: [  480.160069] kworker/0:1     D 0000000000000001     0    26      2 0x00000000
Aug 30 21:52:43 blackspot kernel: [  480.160101] Workqueue: events output_poll_execute [drm_kms_helper]
Aug 30 21:52:47 blackspot kernel: [  480.160392] kworker/1:2     D 0000000000000001     0    64      2 0x00000000
Aug 30 21:52:47 blackspot kernel: [  480.160402] Workqueue: pm pm_runtime_work
Aug 30 21:52:52 blackspot kernel: [  480.160674] Xorg            D 0000000000000001     0  1946   1136 0x00000000
Aug 30 21:54:43 blackspot kernel: [  600.160071] kworker/0:1     D 0000000000000001     0    26      2 0x00000000
Aug 30 21:54:43 blackspot kernel: [  600.160105] Workqueue: events output_poll_execute [drm_kms_helper]
Aug 30 21:54:47 blackspot kernel: [  600.160396] kworker/1:2     D 0000000000000001     0    64      2 0x00000000
Aug 30 21:54:47 blackspot kernel: [  600.160405] Workqueue: pm pm_runtime_work
Aug 30 21:54:53 blackspot kernel: [  600.160677] Xorg            D 0000000000000001     0  1946   1136 0x00000000

and this:

Aug 30 21:46:04 blackspot kernel: [    8.731718] nouveau E[ DISPLAY][0000:02:00.0] 01:0130: func 08 lookup failed, -2
Aug 30 21:46:05 blackspot kernel: [    8.731750] nouveau W[     DRM] TMDS table script pointers not stubbed
Aug 30 21:46:06 blackspot kernel: [    9.320395] EXT4-fs (md4): mounting with "discard" option, but the device does not support discard
Aug 30 21:46:08 blackspot kernel: [   17.034452] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x00010000 put 0x00010090 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:08 blackspot kernel: [   17.287691] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x00010090 put 0x000100a0 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:08 blackspot kernel: [   21.764994] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x000100a0 put 0x000100b0 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:09 blackspot kernel: [   21.796431] NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory
Aug 30 21:46:09 blackspot kernel: [   21.963345] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x000100b0 put 0x000100c0 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:09 blackspot kernel: [   42.381349] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x75725f64 put 0x000100d0 state 0xc0000000 (err: MEM_FAULT) push 0x00000000
Aug 30 21:46:10 blackspot kernel: [   57.391648] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x000100d0 put 0x000100e0 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:10 blackspot kernel: [   57.506303] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x000100e0 put 0x000100f0 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:10 blackspot kernel: [   57.565812] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x000100f0 put 0x00010100 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:11 blackspot kernel: [   57.578416] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x00010100 put 0x00010110 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:11 blackspot kernel: [   57.829133] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x00010110 put 0x00010120 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:11 blackspot kernel: [   72.504485] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x70006c60 put 0x00010130 state 0xc0000000 (err: MEM_FAULT) push 0x00000000
Aug 30 21:46:12 blackspot kernel: [   72.504950] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x00010130 put 0x00010140 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:12 blackspot kernel: [   75.691799] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x20000000 put 0x00010250 state 0xc0020000 (err: MEM_FAULT) push 0x00000000
Aug 30 21:46:12 blackspot kernel: [   87.524577] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x00010250 put 0x00010260 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:13 blackspot kernel: [   87.525214] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x44495f50 put 0x00010270 state 0xc0000000 (err: MEM_FAULT) push 0x00000000
Aug 30 21:46:13 blackspot kernel: [   87.530339] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x5f434e58 put 0x00010280 state 0xc0000000 (err: MEM_FAULT) push 0x00000000
Aug 30 21:46:13 blackspot kernel: [   87.537539] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x00010280 put 0x00010290 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:13 blackspot kernel: [   87.538001] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x00010290 put 0x000102a0 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:14 blackspot kernel: [   87.538942] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x000102a0 put 0x000102b0 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:14 blackspot kernel: [   87.542842] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x000102b0 put 0x000102c0 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:14 blackspot kernel: [   90.783374] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x65725f64 put 0x000102c8 state 0xc0000000 (err: MEM_FAULT) push 0x00000000
Aug 30 21:46:27 blackspot kernel: [  105.780011] nouveau E[Xorg[1247]] failed to idle channel 0xcccc0000 [Xorg[1247]]
Aug 30 21:46:28 blackspot kernel: [  105.780056] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x000102c8 put 0x000102d0 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:42 blackspot kernel: [  120.780015] nouveau E[Xorg[1247]] failed to idle channel 0xcccc0000 [Xorg[1247]]
Aug 30 21:48:41 blackspot kernel: [  240.160046] INFO: task kworker/1:2:64 blocked for more than 120 seconds.
Aug 30 21:48:42 blackspot kernel: [  240.160060]       Not tainted 3.16.1-linas #1
Aug 30 21:48:42 blackspot kernel: [  240.160064] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 30 21:48:43 blackspot kernel: [  240.160093]  ffff880235e13ba8 0000000000000046 000000000000003a ffff8800bf209ec0
Aug 30 21:48:43 blackspot kernel: [  240.160100]  0000000000000400 ffff880235f063f0 0000000000011800 ffff880235e13fd8
Aug 30 21:48:43 blackspot kernel: [  240.160106]  0000000000011800 ffff880235f063f0 ffff880235e13bb8 ffff8802362d2098
Aug 30 21:48:43 blackspot kernel: [  240.160112] Call Trace:
Aug 30 21:48:43 blackspot kernel: [  240.160128]  [<ffffffff814d9de4>] schedule+0x24/0x60
Aug 30 21:48:43 blackspot kernel: [  240.160134]  [<ffffffff81312ffb>] rpm_resume+0x16b/0x4f0
Aug 30 21:48:44 blackspot kernel: [  240.160147]  [<ffffffff810786a0>] ? __wake_up_sync+0x10/0x10
Aug 30 21:48:44 blackspot kernel: [  240.160153]  [<ffffffff813133bd>] pm_runtime_forbid+0x3d/0x50
Aug 30 21:48:44 blackspot kernel: [  240.160281]  [<ffffffffa050a466>] nouveau_pmops_runtime_suspend+0xc6/0xe0 [nouveau]
Aug 30 21:48:44 blackspot kernel: [  240.160290]  [<ffffffff8127cb45>] pci_pm_runtime_suspend+0x75/0x150
Aug 30 21:48:44 blackspot kernel: [  240.160296]  [<ffffffff8127cad0>] ? pci_pm_runtime_resume+0xb0/0xb0
Aug 30 21:48:45 blackspot kernel: [  240.160303]  [<ffffffff81312244>] __rpm_callback+0x24/0x70
Aug 30 21:48:45 blackspot kernel: [  240.160309]  [<ffffffff813122ba>] rpm_callback+0x2a/0x90
Aug 30 21:48:45 blackspot kernel: [  240.160315]  [<ffffffff81312780>] rpm_suspend+0xf0/0x4c0
Aug 30 21:48:45 blackspot kernel: [  240.160324]  [<ffffffff8104ced3>] ? add_timer+0x13/0x20
Aug 30 21:48:45 blackspot kernel: [  240.160334]  [<ffffffff81058398>] ? __queue_delayed_work+0x68/0x150
Aug 30 21:48:45 blackspot kernel: [  240.160339]  [<ffffffff8131398a>] pm_runtime_work+0x9a/0xa0
Aug 30 21:48:46 blackspot kernel: [  240.160346]  [<ffffffff81058674>] process_one_work+0x144/0x3e0
Aug 30 21:48:46 blackspot kernel: [  240.160354]  [<ffffffff81058fd2>] worker_thread+0x112/0x510
Aug 30 21:48:46 blackspot kernel: [  240.160361]  [<ffffffff81058ec0>] ? create_and_start_worker+0x50/0x50
Aug 30 21:48:46 blackspot kernel: [  240.160369]  [<ffffffff8105f4c4>] kthread+0xc4/0xe0
Aug 30 21:48:46 blackspot kernel: [  240.160375]  [<ffffffff8105f400>] ? flush_kthread_worker+0xa0/0xa0
Aug 30 21:48:47 blackspot kernel: [  240.160382]  [<ffffffff814dd12c>] ret_from_fork+0x7c/0xb0
Aug 30 21:48:47 blackspot kernel: [  240.160388]  [<ffffffff8105f400>] ? flush_kthread_worker+0xa0/0xa0
Aug 30 21:50:41 blackspot kernel: [  360.160044] INFO: task kworker/0:1:26 blocked for more than 120 seconds.
Aug 30 21:50:42 blackspot kernel: [  360.160059]       Not tainted 3.16.1-linas #1
Aug 30 21:50:42 blackspot kernel: [  360.160063] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 30 21:50:43 blackspot kernel: [  360.160108]  ffff880236307c28 0000000000000046 ffff880236307bd8 ffff8800bf20d490
Aug 30 21:50:43 blackspot kernel: [  360.160116]  ffff88022b4f1c20 ffff880236169710 0000000000011800 ffff880236307fd8
Aug 30 21:50:43 blackspot kernel: [  360.160122]  0000000000011800 ffff880236169710 ffff880236307c38 ffff8802362d2098
Aug 30 21:50:43 blackspot kernel: [  360.160129] Call Trace:
Aug 30 21:50:43 blackspot kernel: [  360.160146]  [<ffffffff814d9de4>] schedule+0x24/0x60
Aug 30 21:50:43 blackspot kernel: [  360.160156]  [<ffffffff81312ffb>] rpm_resume+0x16b/0x4f0
Aug 30 21:50:43 blackspot kernel: [  360.160168]  [<ffffffff810786a0>] ? __wake_up_sync+0x10/0x10
Aug 30 21:50:44 blackspot kernel: [  360.160174]  [<ffffffff813135e7>] __pm_runtime_resume+0x57/0x90
Aug 30 21:50:44 blackspot kernel: [  360.160291]  [<ffffffffa051d5e2>] nouveau_connector_detect+0x62/0x3d0 [nouveau]
Aug 30 21:50:44 blackspot kernel: [  360.160300]  [<ffffffff8104ced3>] ? add_timer+0x13/0x20
Aug 30 21:50:44 blackspot kernel: [  360.160310]  [<ffffffff81058398>] ? __queue_delayed_work+0x68/0x150
Aug 30 21:50:44 blackspot kernel: [  360.160323]  [<ffffffffa02c93e0>] output_poll_execute+0xb0/0x180 [drm_kms_helper]
Aug 30 21:50:44 blackspot kernel: [  360.160331]  [<ffffffff81058674>] process_one_work+0x144/0x3e0
Aug 30 21:50:45 blackspot kernel: [  360.160339]  [<ffffffff81058fd2>] worker_thread+0x112/0x510
Aug 30 21:50:45 blackspot kernel: [  360.160347]  [<ffffffff81058ec0>] ? create_and_start_worker+0x50/0x50
Aug 30 21:50:45 blackspot kernel: [  360.160355]  [<ffffffff8105f4c4>] kthread+0xc4/0xe0
Aug 30 21:50:45 blackspot kernel: [  360.160362]  [<ffffffff8105f400>] ? flush_kthread_worker+0xa0/0xa0
Aug 30 21:50:45 blackspot kernel: [  360.160369]  [<ffffffff814dd12c>] ret_from_fork+0x7c/0xb0
Aug 30 21:50:45 blackspot kernel: [  360.160376]  [<ffffffff8105f400>] ? flush_kthread_worker+0xa0/0xa0
Aug 30 21:50:45 blackspot kernel: [  360.160386] INFO: task kworker/1:2:64 blocked for more than 120 seconds.
Aug 30 21:50:46 blackspot kernel: [  360.160391]       Not tainted 3.16.1-linas #1
Aug 30 21:50:46 blackspot kernel: [  360.160394] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 30 21:50:47 blackspot kernel: [  360.160410]  ffff880235e13ba8 0000000000000046 000000000000003a ffff8800bf209ec0
Aug 30 21:50:47 blackspot kernel: [  360.160417]  0000000000000400 ffff880235f063f0 0000000000011800 ffff880235e13fd8
Aug 30 21:50:47 blackspot kernel: [  360.160423]  0000000000011800 ffff880235f063f0 ffff880235e13bb8 ffff8802362d2098
Aug 30 21:50:47 blackspot kernel: [  360.160429] Call Trace:
Aug 30 21:50:47 blackspot kernel: [  360.160437]  [<ffffffff814d9de4>] schedule+0x24/0x60
Aug 30 21:50:47 blackspot kernel: [  360.160444]  [<ffffffff81312ffb>] rpm_resume+0x16b/0x4f0
Aug 30 21:50:48 blackspot kernel: [  360.160452]  [<ffffffff810786a0>] ? __wake_up_sync+0x10/0x10
Aug 30 21:50:48 blackspot kernel: [  360.160458]  [<ffffffff813133bd>] pm_runtime_forbid+0x3d/0x50
Aug 30 21:50:48 blackspot kernel: [  360.160522]  [<ffffffffa050a466>] nouveau_pmops_runtime_suspend+0xc6/0xe0 [nouveau]
Aug 30 21:50:48 blackspot kernel: [  360.160533]  [<ffffffff8127cb45>] pci_pm_runtime_suspend+0x75/0x150
Aug 30 21:50:48 blackspot kernel: [  360.160540]  [<ffffffff8127cad0>] ? pci_pm_runtime_resume+0xb0/0xb0
Aug 30 21:50:49 blackspot kernel: [  360.160547]  [<ffffffff81312244>] __rpm_callback+0x24/0x70
Aug 30 21:50:49 blackspot kernel: [  360.160553]  [<ffffffff813122ba>] rpm_callback+0x2a/0x90
Aug 30 21:50:49 blackspot kernel: [  360.160560]  [<ffffffff81312780>] rpm_suspend+0xf0/0x4c0
Aug 30 21:50:49 blackspot kernel: [  360.160568]  [<ffffffff8104ced3>] ? add_timer+0x13/0x20
Aug 30 21:50:49 blackspot kernel: [  360.160575]  [<ffffffff81058398>] ? __queue_delayed_work+0x68/0x150
Aug 30 21:50:49 blackspot kernel: [  360.160581]  [<ffffffff8131398a>] pm_runtime_work+0x9a/0xa0
Aug 30 21:50:50 blackspot kernel: [  360.160589]  [<ffffffff81058674>] process_one_work+0x144/0x3e0
Aug 30 21:50:50 blackspot kernel: [  360.160597]  [<ffffffff81058fd2>] worker_thread+0x112/0x510
Aug 30 21:50:50 blackspot kernel: [  360.160605]  [<ffffffff81058ec0>] ? create_and_start_worker+0x50/0x50
Aug 30 21:50:50 blackspot kernel: [  360.160612]  [<ffffffff8105f4c4>] kthread+0xc4/0xe0
Aug 30 21:50:50 blackspot kernel: [  360.160619]  [<ffffffff8105f400>] ? flush_kthread_worker+0xa0/0xa0
Aug 30 21:50:50 blackspot kernel: [  360.160626]  [<ffffffff814dd12c>] ret_from_fork+0x7c/0xb0
Aug 30 21:50:50 blackspot kernel: [  360.160632]  [<ffffffff8105f400>] ? flush_kthread_worker+0xa0/0xa0
Aug 30 21:50:51 blackspot kernel: [  360.160668] INFO: task Xorg:1946 blocked for more than 120 seconds.
Aug 30 21:50:51 blackspot kernel: [  360.160672]       Not tainted 3.16.1-linas #1
Aug 30 21:50:51 blackspot kernel: [  360.160675] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 30 21:50:51 blackspot kernel: [  360.160685]  ffff88022fe73d58 0000000000000086 ffff88022fe73d28 ffff8802360dc530
Aug 30 21:50:52 blackspot kernel: [  360.160692]  ffff880200000000 ffff8800bfb09710 0000000000011800 ffff88022fe73fd8
Aug 30 21:50:52 blackspot kernel: [  360.160698]  0000000000011800 ffff8800bfb09710 ffff88022fe73d68 ffff8802362d2098
Aug 30 21:50:52 blackspot kernel: [  360.160704] Call Trace:
Aug 30 21:50:52 blackspot kernel: [  360.160713]  [<ffffffff814d9de4>] schedule+0x24/0x60
Aug 30 21:50:52 blackspot kernel: [  360.160719]  [<ffffffff8131247b>] __pm_runtime_barrier+0x8b/0x160
Aug 30 21:50:52 blackspot kernel: [  360.160726]  [<ffffffff810786a0>] ? __wake_up_sync+0x10/0x10
Aug 30 21:50:53 blackspot kernel: [  360.160733]  [<ffffffff81313528>] pm_runtime_barrier+0x48/0xb0
Aug 30 21:50:53 blackspot kernel: [  360.160740]  [<ffffffff8127b2af>] pci_config_pm_runtime_get+0x3f/0x70
Aug 30 21:50:53 blackspot kernel: [  360.160748]  [<ffffffff8127ef94>] pci_read_config+0x84/0x250
Aug 30 21:50:53 blackspot kernel: [  360.160758]  [<ffffffff81174b05>] sysfs_kf_bin_read+0x45/0x70
Aug 30 21:50:53 blackspot kernel: [  360.160765]  [<ffffffff81173fcf>] kernfs_fop_read+0xaf/0x160
Aug 30 21:50:53 blackspot kernel: [  360.160773]  [<ffffffff81112176>] vfs_read+0xa6/0x170
Aug 30 21:50:54 blackspot kernel: [  360.160779]  [<ffffffff8111258a>] SyS_pread64+0x8a/0xa0
Aug 30 21:50:54 blackspot kernel: [  360.160786]  [<ffffffff814dd1d6>] system_call_fastpath+0x1a/0x1f
Aug 30 21:52:41 blackspot kernel: [  480.160045] INFO: task kworker/0:1:26 blocked for more than 120 seconds.
Aug 30 21:52:42 blackspot kernel: [  480.160059]       Not tainted 3.16.1-linas #1
Aug 30 21:52:42 blackspot kernel: [  480.160064] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 30 21:52:43 blackspot kernel: [  480.160106]  ffff880236307c28 0000000000000046 ffff880236307bd8 ffff8800bf20d490
Aug 30 21:52:43 blackspot kernel: [  480.160114]  ffff88022b4f1c20 ffff880236169710 0000000000011800 ffff880236307fd8
Aug 30 21:52:43 blackspot kernel: [  480.160120]  0000000000011800 ffff880236169710 ffff880236307c38 ffff8802362d2098
Aug 30 21:52:43 blackspot kernel: [  480.160127] Call Trace:
Aug 30 21:52:44 blackspot kernel: [  480.160143]  [<ffffffff814d9de4>] schedule+0x24/0x60
Aug 30 21:52:44 blackspot kernel: [  480.160153]  [<ffffffff81312ffb>] rpm_resume+0x16b/0x4f0
Aug 30 21:52:44 blackspot kernel: [  480.160164]  [<ffffffff810786a0>] ? __wake_up_sync+0x10/0x10
Aug 30 21:52:44 blackspot kernel: [  480.160170]  [<ffffffff813135e7>] __pm_runtime_resume+0x57/0x90
Aug 30 21:52:44 blackspot kernel: [  480.160287]  [<ffffffffa051d5e2>] nouveau_connector_detect+0x62/0x3d0 [nouveau]
Aug 30 21:52:44 blackspot kernel: [  480.160296]  [<ffffffff8104ced3>] ? add_timer+0x13/0x20
Aug 30 21:52:45 blackspot kernel: [  480.160306]  [<ffffffff81058398>] ? __queue_delayed_work+0x68/0x150
Aug 30 21:52:45 blackspot kernel: [  480.160318]  [<ffffffffa02c93e0>] output_poll_execute+0xb0/0x180 [drm_kms_helper]
Aug 30 21:52:45 blackspot kernel: [  480.160326]  [<ffffffff81058674>] process_one_work+0x144/0x3e0
Aug 30 21:52:45 blackspot kernel: [  480.160334]  [<ffffffff81058fd2>] worker_thread+0x112/0x510
Aug 30 21:52:45 blackspot kernel: [  480.160342]  [<ffffffff81058ec0>] ? create_and_start_worker+0x50/0x50
Aug 30 21:52:45 blackspot kernel: [  480.160350]  [<ffffffff8105f4c4>] kthread+0xc4/0xe0
Aug 30 21:52:45 blackspot kernel: [  480.160357]  [<ffffffff8105f400>] ? flush_kthread_worker+0xa0/0xa0
Aug 30 21:52:46 blackspot kernel: [  480.160364]  [<ffffffff814dd12c>] ret_from_fork+0x7c/0xb0
Aug 30 21:52:46 blackspot kernel: [  480.160371]  [<ffffffff8105f400>] ? flush_kthread_worker+0xa0/0xa0
Aug 30 21:52:46 blackspot kernel: [  480.160381] INFO: task kworker/1:2:64 blocked for more than 120 seconds.
Aug 30 21:52:46 blackspot kernel: [  480.160386]       Not tainted 3.16.1-linas #1
Aug 30 21:52:46 blackspot kernel: [  480.160389] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 30 21:52:47 blackspot kernel: [  480.160406]  ffff880235e13ba8 0000000000000046 000000000000003a ffff8800bf209ec0
Aug 30 21:52:47 blackspot kernel: [  480.160412]  0000000000000400 ffff880235f063f0 0000000000011800 ffff880235e13fd8
Aug 30 21:52:47 blackspot kernel: [  480.160418]  0000000000011800 ffff880235f063f0 ffff880235e13bb8 ffff8802362d2098
Aug 30 21:52:47 blackspot kernel: [  480.160424] Call Trace:
Aug 30 21:52:48 blackspot kernel: [  480.160433]  [<ffffffff814d9de4>] schedule+0x24/0x60
Aug 30 21:52:48 blackspot kernel: [  480.160439]  [<ffffffff81312ffb>] rpm_resume+0x16b/0x4f0
Aug 30 21:52:48 blackspot kernel: [  480.160447]  [<ffffffff810786a0>] ? __wake_up_sync+0x10/0x10
Aug 30 21:52:48 blackspot kernel: [  480.160453]  [<ffffffff813133bd>] pm_runtime_forbid+0x3d/0x50
Aug 30 21:52:48 blackspot kernel: [  480.160517]  [<ffffffffa050a466>] nouveau_pmops_runtime_suspend+0xc6/0xe0 [nouveau]
Aug 30 21:52:48 blackspot kernel: [  480.160529]  [<ffffffff8127cb45>] pci_pm_runtime_suspend+0x75/0x150
Aug 30 21:52:49 blackspot kernel: [  480.160536]  [<ffffffff8127cad0>] ? pci_pm_runtime_resume+0xb0/0xb0
Aug 30 21:52:49 blackspot kernel: [  480.160543]  [<ffffffff81312244>] __rpm_callback+0x24/0x70
Aug 30 21:52:49 blackspot kernel: [  480.160549]  [<ffffffff813122ba>] rpm_callback+0x2a/0x90
Aug 30 21:52:49 blackspot kernel: [  480.160556]  [<ffffffff81312780>] rpm_suspend+0xf0/0x4c0
Aug 30 21:52:49 blackspot kernel: [  480.160564]  [<ffffffff8104ced3>] ? add_timer+0x13/0x20
Aug 30 21:52:50 blackspot kernel: [  480.160571]  [<ffffffff81058398>] ? __queue_delayed_work+0x68/0x150
Aug 30 21:52:50 blackspot kernel: [  480.160578]  [<ffffffff8131398a>] pm_runtime_work+0x9a/0xa0
Aug 30 21:52:50 blackspot kernel: [  480.160585]  [<ffffffff81058674>] process_one_work+0x144/0x3e0
Aug 30 21:52:50 blackspot kernel: [  480.160593]  [<ffffffff81058fd2>] worker_thread+0x112/0x510
Aug 30 21:52:50 blackspot kernel: [  480.160601]  [<ffffffff81058ec0>] ? create_and_start_worker+0x50/0x50
Aug 30 21:52:50 blackspot kernel: [  480.160608]  [<ffffffff8105f4c4>] kthread+0xc4/0xe0
Aug 30 21:52:51 blackspot kernel: [  480.160615]  [<ffffffff8105f400>] ? flush_kthread_worker+0xa0/0xa0
Aug 30 21:52:51 blackspot kernel: [  480.160622]  [<ffffffff814dd12c>] ret_from_fork+0x7c/0xb0
Aug 30 21:52:51 blackspot kernel: [  480.160628]  [<ffffffff8105f400>] ? flush_kthread_worker+0xa0/0xa0
Aug 30 21:52:51 blackspot kernel: [  480.160664] INFO: task Xorg:1946 blocked for more than 120 seconds.
Aug 30 21:52:51 blackspot kernel: [  480.160668]       Not tainted 3.16.1-linas #1
Aug 30 21:52:52 blackspot kernel: [  480.160671] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 30 21:52:52 blackspot kernel: [  480.160681]  ffff88022fe73d58 0000000000000086 ffff88022fe73d28 ffff8802360dc530
Aug 30 21:52:52 blackspot kernel: [  480.160687]  ffff880200000000 ffff8800bfb09710 0000000000011800 ffff88022fe73fd8
Aug 30 21:52:52 blackspot kernel: [  480.160694]  0000000000011800 ffff8800bfb09710 ffff88022fe73d68 ffff8802362d2098
Aug 30 21:52:52 blackspot kernel: [  480.160700] Call Trace:
Aug 30 21:52:53 blackspot kernel: [  480.160708]  [<ffffffff814d9de4>] schedule+0x24/0x60
Aug 30 21:52:53 blackspot kernel: [  480.160714]  [<ffffffff8131247b>] __pm_runtime_barrier+0x8b/0x160
Aug 30 21:52:53 blackspot kernel: [  480.160722]  [<ffffffff810786a0>] ? __wake_up_sync+0x10/0x10
Aug 30 21:52:53 blackspot kernel: [  480.160729]  [<ffffffff81313528>] pm_runtime_barrier+0x48/0xb0
Aug 30 21:52:53 blackspot kernel: [  480.160736]  [<ffffffff8127b2af>] pci_config_pm_runtime_get+0x3f/0x70
Aug 30 21:52:54 blackspot kernel: [  480.160743]  [<ffffffff8127ef94>] pci_read_config+0x84/0x250
Aug 30 21:52:54 blackspot kernel: [  480.160753]  [<ffffffff81174b05>] sysfs_kf_bin_read+0x45/0x70
Aug 30 21:52:54 blackspot kernel: [  480.160760]  [<ffffffff81173fcf>] kernfs_fop_read+0xaf/0x160
Aug 30 21:52:54 blackspot kernel: [  480.160767]  [<ffffffff81112176>] vfs_read+0xa6/0x170
Aug 30 21:52:54 blackspot kernel: [  480.160773]  [<ffffffff8111258a>] SyS_pread64+0x8a/0xa0
Aug 30 21:52:55 blackspot kernel: [  480.160779]  [<ffffffff814dd1d6>] system_call_fastpath+0x1a/0x1f
Aug 30 21:54:41 blackspot kernel: [  600.160044] INFO: task kworker/0:1:26 blocked for more than 120 seconds.
Aug 30 21:54:42 blackspot kernel: [  600.160061]       Not tainted 3.16.1-linas #1
Aug 30 21:54:42 blackspot kernel: [  600.160066] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 30 21:54:43 blackspot kernel: [  600.160111]  ffff880236307c28 0000000000000046 ffff880236307bd8 ffff8800bf20d490
Aug 30 21:54:43 blackspot kernel: [  600.160118]  ffff88022b4f1c20 ffff880236169710 0000000000011800 ffff880236307fd8
Aug 30 21:54:43 blackspot kernel: [  600.160124]  0000000000011800 ffff880236169710 ffff880236307c38 ffff8802362d2098
Aug 30 21:54:44 blackspot kernel: [  600.160131] Call Trace:
Aug 30 21:54:44 blackspot kernel: [  600.160150]  [<ffffffff814d9de4>] schedule+0x24/0x60
Aug 30 21:54:44 blackspot kernel: [  600.160160]  [<ffffffff81312ffb>] rpm_resume+0x16b/0x4f0
Aug 30 21:54:44 blackspot kernel: [  600.160171]  [<ffffffff810786a0>] ? __wake_up_sync+0x10/0x10
Aug 30 21:54:44 blackspot kernel: [  600.160177]  [<ffffffff813135e7>] __pm_runtime_resume+0x57/0x90
Aug 30 21:54:45 blackspot kernel: [  600.160290]  [<ffffffffa051d5e2>] nouveau_connector_detect+0x62/0x3d0 [nouveau]
Aug 30 21:54:45 blackspot kernel: [  600.160299]  [<ffffffff8104ced3>] ? add_timer+0x13/0x20
Aug 30 21:54:45 blackspot kernel: [  600.160309]  [<ffffffff81058398>] ? __queue_delayed_work+0x68/0x150
Aug 30 21:54:45 blackspot kernel: [  600.160322]  [<ffffffffa02c93e0>] output_poll_execute+0xb0/0x180 [drm_kms_helper]
Aug 30 21:54:45 blackspot kernel: [  600.160329]  [<ffffffff81058674>] process_one_work+0x144/0x3e0
Aug 30 21:54:45 blackspot kernel: [  600.160337]  [<ffffffff81058fd2>] worker_thread+0x112/0x510
Aug 30 21:54:45 blackspot kernel: [  600.160345]  [<ffffffff81058ec0>] ? create_and_start_worker+0x50/0x50
Aug 30 21:54:46 blackspot kernel: [  600.160353]  [<ffffffff8105f4c4>] kthread+0xc4/0xe0
Aug 30 21:54:46 blackspot kernel: [  600.160361]  [<ffffffff8105f400>] ? flush_kthread_worker+0xa0/0xa0
Aug 30 21:54:46 blackspot kernel: [  600.160368]  [<ffffffff814dd12c>] ret_from_fork+0x7c/0xb0
Aug 30 21:54:46 blackspot kernel: [  600.160374]  [<ffffffff8105f400>] ? flush_kthread_worker+0xa0/0xa0
Aug 30 21:54:46 blackspot kernel: [  600.160385] INFO: task kworker/1:2:64 blocked for more than 120 seconds.
Aug 30 21:54:47 blackspot kernel: [  600.160390]       Not tainted 3.16.1-linas #1
Aug 30 21:54:47 blackspot kernel: [  600.160392] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 30 21:54:47 blackspot kernel: [  600.160409]  ffff880235e13ba8 0000000000000046 000000000000003a ffff8800bf209ec0
Aug 30 21:54:48 blackspot kernel: [  600.160416]  0000000000000400 ffff880235f063f0 0000000000011800 ffff880235e13fd8
Aug 30 21:54:48 blackspot kernel: [  600.160421]  0000000000011800 ffff880235f063f0 ffff880235e13bb8 ffff8802362d2098
Aug 30 21:54:48 blackspot kernel: [  600.160428] Call Trace:
Aug 30 21:54:48 blackspot kernel: [  600.160436]  [<ffffffff814d9de4>] schedule+0x24/0x60
Aug 30 21:54:48 blackspot kernel: [  600.160442]  [<ffffffff81312ffb>] rpm_resume+0x16b/0x4f0
Aug 30 21:54:49 blackspot kernel: [  600.160450]  [<ffffffff810786a0>] ? __wake_up_sync+0x10/0x10
Aug 30 21:54:49 blackspot kernel: [  600.160456]  [<ffffffff813133bd>] pm_runtime_forbid+0x3d/0x50
Aug 30 21:54:49 blackspot kernel: [  600.160520]  [<ffffffffa050a466>] nouveau_pmops_runtime_suspend+0xc6/0xe0 [nouveau]
Aug 30 21:54:49 blackspot kernel: [  600.160531]  [<ffffffff8127cb45>] pci_pm_runtime_suspend+0x75/0x150
Aug 30 21:54:49 blackspot kernel: [  600.160537]  [<ffffffff8127cad0>] ? pci_pm_runtime_resume+0xb0/0xb0
Aug 30 21:54:50 blackspot kernel: [  600.160545]  [<ffffffff81312244>] __rpm_callback+0x24/0x70
Aug 30 21:54:50 blackspot kernel: [  600.160551]  [<ffffffff813122ba>] rpm_callback+0x2a/0x90
Aug 30 21:54:50 blackspot kernel: [  600.160558]  [<ffffffff81312780>] rpm_suspend+0xf0/0x4c0
Aug 30 21:54:50 blackspot kernel: [  600.160566]  [<ffffffff8104ced3>] ? add_timer+0x13/0x20
Aug 30 21:54:50 blackspot kernel: [  600.160573]  [<ffffffff81058398>] ? __queue_delayed_work+0x68/0x150
Aug 30 21:54:51 blackspot kernel: [  600.160580]  [<ffffffff8131398a>] pm_runtime_work+0x9a/0xa0
Aug 30 21:54:51 blackspot kernel: [  600.160587]  [<ffffffff81058674>] process_one_work+0x144/0x3e0
Aug 30 21:54:51 blackspot kernel: [  600.160595]  [<ffffffff81058fd2>] worker_thread+0x112/0x510
Aug 30 21:54:51 blackspot kernel: [  600.160603]  [<ffffffff81058ec0>] ? create_and_start_worker+0x50/0x50
Aug 30 21:54:51 blackspot kernel: [  600.160610]  [<ffffffff8105f4c4>] kthread+0xc4/0xe0
Aug 30 21:54:52 blackspot kernel: [  600.160617]  [<ffffffff8105f400>] ? flush_kthread_worker+0xa0/0xa0
Aug 30 21:54:52 blackspot kernel: [  600.160623]  [<ffffffff814dd12c>] ret_from_fork+0x7c/0xb0
Aug 30 21:54:52 blackspot kernel: [  600.160630]  [<ffffffff8105f400>] ? flush_kthread_worker+0xa0/0xa0
Aug 30 21:54:52 blackspot kernel: [  600.160666] INFO: task Xorg:1946 blocked for more than 120 seconds.
Aug 30 21:54:53 blackspot kernel: [  600.160670]       Not tainted 3.16.1-linas #1
Aug 30 21:54:53 blackspot kernel: [  600.160674] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 30 21:54:53 blackspot kernel: [  600.160683]  ffff88022fe73d58 0000000000000086 ffff88022fe73d28 ffff8802360dc530
Aug 30 21:54:53 blackspot kernel: [  600.160689]  ffff880200000000 ffff8800bfb09710 0000000000011800 ffff88022fe73fd8
Aug 30 21:54:54 blackspot kernel: [  600.160696]  0000000000011800 ffff8800bfb09710 ffff88022fe73d68 ffff8802362d2098
Aug 30 21:54:54 blackspot kernel: [  600.160702] Call Trace:
Aug 30 21:54:54 blackspot kernel: [  600.160710]  [<ffffffff814d9de4>] schedule+0x24/0x60
Aug 30 21:54:54 blackspot kernel: [  600.160717]  [<ffffffff8131247b>] __pm_runtime_barrier+0x8b/0x160
Aug 30 21:54:54 blackspot kernel: [  600.160724]  [<ffffffff810786a0>] ? __wake_up_sync+0x10/0x10
Aug 30 21:54:54 blackspot kernel: [  600.160730]  [<ffffffff81313528>] pm_runtime_barrier+0x48/0xb0
Aug 30 21:54:55 blackspot kernel: [  600.160737]  [<ffffffff8127b2af>] pci_config_pm_runtime_get+0x3f/0x70
Aug 30 21:54:55 blackspot kernel: [  600.160745]  [<ffffffff8127ef94>] pci_read_config+0x84/0x250
Aug 30 21:54:55 blackspot kernel: [  600.160755]  [<ffffffff81174b05>] sysfs_kf_bin_read+0x45/0x70
Aug 30 21:54:55 blackspot kernel: [  600.160762]  [<ffffffff81173fcf>] kernfs_fop_read+0xaf/0x160
Aug 30 21:54:55 blackspot kernel: [  600.160770]  [<ffffffff81112176>] vfs_read+0xa6/0x170
Aug 30 21:54:56 blackspot kernel: [  600.160776]  [<ffffffff8111258a>] SyS_pread64+0x8a/0xa0
Aug 30 21:54:56 blackspot kernel: [  600.160783]  [<ffffffff814dd1d6>] system_call_fastpath+0x1a/0x1f

Revision history for this message

penalvch (penalvch) wrote on 2014-08-31:

#36

linas, please again see https://bugs.launchpad.net/ubuntu/+source/linux-lts-raring/+bug/1097178/comments/32 .

Revision history for this message

linas (linasvepstas) wrote on 2014-08-31:

#37

All the kernel stak traces are in power-management code, which should not be tripping.

Note also the timestampes in this intersting sequence:

Aug 30 21:46:14 blackspot kernel: [ 90.783374] nouveau E[ PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x65725f64 put 0x000102c8 state 0xc0000000 (err: MEM_FAULT) push 0x00000000
Aug 30 21:46:27 blackspot kernel: [ 105.780011] nouveau E[Xorg[1247]] failed to idle channel 0xcccc0000 [Xorg[1247]]
Aug 30 21:46:28 blackspot kernel: [ 105.780056] nouveau E[ PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[1247]] get 0x000102c8 put 0x000102d0 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 30 21:46:42 blackspot kernel: [ 120.780015] nouveau E[Xorg[1247]] failed to idle channel 0xcccc0000 [Xorg[1247]]
Aug 30 21:48:41 blackspot kernel: [ 240.160046] INFO: task kworker/1:2:64 blocked for more than 120 seconds.

the DMA push, then *exactly* 15 seconds later, the idle channel, then exctly 15 seconds later, another, then exactly 120 seconds later, the deadlock warning.

Revision history for this message

linas (linasvepstas) wrote on 2014-08-31:

#38

Since mine is a desktop system, and I don't need power-management or suspend, I make menuconfig and unset CONFIG_PM to disable power management. The subroutines in the stack trace: rpm_suspend is in ./base/power and pci_pm_runtime_resume pci_pm_runtime_suspend etc. are in drivers/pci/pci-driver.c and are built only if CONFIG_PM_RUNTIME is set.

Recompiled, rebooted. Its .. sort of better. X no longer hung in uninterruptible sleep. The /var/log/Xorg.0.log messages " [mi] EQ overflowing. Additional events will be discarded until..." went away too, because X can now run.

Anyway, X seems to run now. Still getting junk like this though:

Aug 31 00:02:56 blackspot kernel: [ 247.134014] nouveau E[ PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[2061]] get 0x000102b0 put 0x000102c0 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 31 00:02:56 blackspot kernel: [ 247.134014] nouveau E[ PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[2061]] get 0x000102b0 put 0x000102c0 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 31 00:03:11 blackspot kernel: [ 262.312013] nouveau E[Xorg[2061]] failed to idle channel 0xcccc0001 [Xorg[2061]]
Aug 31 00:03:11 blackspot kernel: [ 262.312013] nouveau E[Xorg[2061]] failed to idle channel 0xcccc0001 [Xorg[2061]]
Aug 31 00:03:11 blackspot kernel: [ 262.312013] nouveau E[Xorg[2061]] failed to idle channel 0xcccc0001 [Xorg[2061]]
Aug 31 00:03:26 blackspot kernel: [ 277.312016] nouveau E[Xorg[2061]] failed to idle channel 0xcccc0001 [Xorg[2061]]
Aug 31 00:03:26 blackspot kernel: [ 277.312016] nouveau E[Xorg[2061]] failed to idle channel 0xcccc0001 [Xorg[2061]]
Aug 31 00:03:26 blackspot kernel: [ 277.312016] nouveau E[Xorg[2061]] failed to idle channel 0xcccc0001 [Xorg[2061]]
Aug 31 00:03:26 blackspot kernel: [ 277.312790] nouveau E[ PFIFO][0000:01:06.0] CACHE_ERROR - ch 1 [Xorg[2061]] subc 0 mthd 0x1130 data 0x00000000
Aug 31 00:03:26 blackspot kernel: [ 277.312790] nouveau E[ PFIFO][0000:01:06.0] CACHE_ERROR - ch 1 [Xorg[2061]] subc 0 mthd 0x1130 data 0x00000000
Aug 31 00:03:26 blackspot kernel: [ 277.312790] nouveau E[ PFIFO][0000:01:06.0] CACHE_ERROR - ch 1 [Xorg[2061]] subc 0 mthd 0x1130 data 0x00000000
Aug 31 00:03:41 blackspot kernel: [ 292.312012] nouveau E[Xorg[2061]] failed to idle channel 0xcccc0000 [Xorg[2061]]
Aug 31 00:03:41 blackspot kernel: [ 292.312012] nouveau E[Xorg[2061]] failed to idle channel 0xcccc0000 [Xorg[2061]]
Aug 31 00:03:41 blackspot kernel: [ 292.312012] nouveau E[Xorg[2061]] failed to idle channel 0xcccc0000 [Xorg[2061]]
Aug 31 00:03:41 blackspot kernel: [ 292.312027] nouveau E[ PFIFO][0000:01:06.0] CACHE_ERROR - ch 1 [Xorg[2061]] subc 0 mthd 0x1134 data 0x0046d1d7
Aug 31 00:03:41 blackspot kernel: [ 292.312027] nouveau E[ PFIFO][0000:01:06.0] CACHE_ERROR - ch 1 [Xorg[2061]] subc 0 mthd 0x1134 data 0x0046d1d7

so not all is well, not just yet. But for now X is up, for me.

Since mine is a desktop system, and I don't need power-management or suspend, I make menuconfig and unset CONFIG_PM to disable power management.  The subroutines in the stack trace: rpm_suspend  is in ./base/power and pci_pm_runtime_resume pci_pm_runtime_suspend etc. are in drivers/pci/pci-driver.c and are built only if CONFIG_PM_RUNTIME is set.

Recompiled, rebooted.  Its .. sort of better. X no longer hung in uninterruptible sleep. The /var/log/Xorg.0.log messages " [mi] EQ overflowing. Additional events will be discarded until..." went away too, because X can now run.

Anyway, X seems to  run now.  Still getting junk like this though:

Aug 31 00:02:56 blackspot kernel: [  247.134014] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[2061]] get 0x000102b0 put 0x000102c0 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 31 00:02:56 blackspot kernel: [  247.134014] nouveau E[   PFIFO][0000:01:06.0] DMA_PUSHER - ch 1 [Xorg[2061]] get 0x000102b0 put 0x000102c0 state 0x80000000 (err: INVALID_CMD) push 0x00000000
Aug 31 00:03:11 blackspot kernel: [  262.312013] nouveau E[Xorg[2061]] failed to idle channel 0xcccc0001 [Xorg[2061]]
Aug 31 00:03:11 blackspot kernel: [  262.312013] nouveau E[Xorg[2061]] failed to idle channel 0xcccc0001 [Xorg[2061]]
Aug 31 00:03:11 blackspot kernel: [  262.312013] nouveau E[Xorg[2061]] failed to idle channel 0xcccc0001 [Xorg[2061]]
Aug 31 00:03:26 blackspot kernel: [  277.312016] nouveau E[Xorg[2061]] failed to idle channel 0xcccc0001 [Xorg[2061]]
Aug 31 00:03:26 blackspot kernel: [  277.312016] nouveau E[Xorg[2061]] failed to idle channel 0xcccc0001 [Xorg[2061]]
Aug 31 00:03:26 blackspot kernel: [  277.312016] nouveau E[Xorg[2061]] failed to idle channel 0xcccc0001 [Xorg[2061]]
Aug 31 00:03:26 blackspot kernel: [  277.312790] nouveau E[   PFIFO][0000:01:06.0] CACHE_ERROR - ch 1 [Xorg[2061]] subc 0 mthd 0x1130 data 0x00000000
Aug 31 00:03:26 blackspot kernel: [  277.312790] nouveau E[   PFIFO][0000:01:06.0] CACHE_ERROR - ch 1 [Xorg[2061]] subc 0 mthd 0x1130 data 0x00000000
Aug 31 00:03:26 blackspot kernel: [  277.312790] nouveau E[   PFIFO][0000:01:06.0] CACHE_ERROR - ch 1 [Xorg[2061]] subc 0 mthd 0x1130 data 0x00000000
Aug 31 00:03:41 blackspot kernel: [  292.312012] nouveau E[Xorg[2061]] failed to idle channel 0xcccc0000 [Xorg[2061]]
Aug 31 00:03:41 blackspot kernel: [  292.312012] nouveau E[Xorg[2061]] failed to idle channel 0xcccc0000 [Xorg[2061]]
Aug 31 00:03:41 blackspot kernel: [  292.312012] nouveau E[Xorg[2061]] failed to idle channel 0xcccc0000 [Xorg[2061]]
Aug 31 00:03:41 blackspot kernel: [  292.312027] nouveau E[   PFIFO][0000:01:06.0] CACHE_ERROR - ch 1 [Xorg[2061]] subc 0 mthd 0x1134 data 0x0046d1d7
Aug 31 00:03:41 blackspot kernel: [  292.312027] nouveau E[   PFIFO][0000:01:06.0] CACHE_ERROR - ch 1 [Xorg[2061]] subc 0 mthd 0x1134 data 0x0046d1d7

so not all is well, not just yet. But for now X is up, for me.

Revision history for this message

linas (linasvepstas) wrote on 2014-08-31:

#39

Summary/wrap-up report: After above changes, the X server took 8 tries to come up, each time hanging when it went to paint the lightdm pane. Each try took about 3 minutes (almost 1/2 hour elapsed), after which some failsafe tries to restart X. Each try is corellated with the "failed to idle channel" messages and/or the DMA_PUSHER errors. The 8th time it suceeded; at which point I was able to log in and use X as normal. There have been no further X disruptions after that point: the problem, whatever it is, is transient. (I have not yet tried to watch any youtube videos, though...) BTW, the 5th attempt Xorg.5.log file is filled with dozens of "[mi] EQ overflowing. Additional events will be discarded" error messages, and dozens of corresponding stack traces. None of the other failed attempts have this.

Since this is the very latest linux kernel, and what appears to be the latest libdrm, I'll try to pursue this with the kernel devs directly. The above report is an FYI for anyone else enountering this issue, since this bug is the #1 hit that google currently provides for these error messages, and its the ONLY bug that provides real, actionable information on how to resolve the issue. (i.e. simply marking this bug as invalid and telling everyone to take a flying leap doesn't actually decrease the relevance of the bug report for real-world users).

Revision history for this message

penalvch (penalvch) wrote on 2014-08-31:

#40

linas:

>"The above report is an FYI for anyone else enountering this issue,"

Launchpad is a development platform, not an FYI forum. If you want to help on Launchpad, file a bug report as already previously requested of you on multiple occasions. To do otherwise is unhelpful noise on a closed report.

>"...since this bug is the #1 hit that google currently provides for these error messages,"

This being a #1 hit on a search engine is irrelevant. Finding information that just repeats an error message one encounters doesn't help anyone in getting their bug resolved.

"and its the ONLY bug that provides real, actionable information on how to resolve the issue."

There is nothing actionable on this closed report, as it's not about you, your hardware, or your problem. If you want to provide real, actionable information, submit a commit upstream that addresses this problem. Anything else is just a hack, or WORKAROUND, that again, would need to be on a new report, as already fully detailed to you previously on multiple occasions.

"(i.e. simply marking this bug as invalid and telling everyone to take a flying leap"

I've never said that. Please stop making false accusations.

"...doesn't actually decrease the relevance of the bug report for real-world users)."

One wants to make a bug report on Launchpad, a development platform, relevant to developers (i.e. the people actually providing fixes). By not filing a bug report, you are further delaying developers from addressing your issue.

Thank you for your understanding.

Helpful bug reporting tips:
https://wiki.ubuntu.com/ReportingBugs

Revision history for this message

linas (linasvepstas) wrote on 2014-09-03:

#41

Solution (for me): see https://bugs.freedesktop.org/show_bug.cgi?id=70388

boot the kernel with vram_pushbuf=1

if that does not work, try agpmode=0

Revision history for this message

linas (linasvepstas) wrote on 2014-09-03:

#42

Users with AGP graphics cards and VIA pcie chipsets might find some luck. with agpmode=2 as described here: https://www.libreoffice.org/bugzilla/show_bug.cgi?id=20341

Revision history for this message

rubberducky (rubber-ducky170) wrote on 2014-09-29:

#43

Had the same problem with computer Freezing after Log In. The mouse and wallpaper were displayed, but did not go any further than that, except to black screen with the "nouveau E[Xorg[2061]] failed to idle channel 0xcccc0001" Error repeated.
I've tried a lot of tricks, including whats on this page, to no avail.

Finally found one that worked:
https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-nouveau/+bug/1313402

And here's the code that worked for me:

sudo apt-get install nvidia-current
sudu reboot

Revision history for this message

penalvch (penalvch) wrote on 2014-10-01:

#44

rubberducky, thank you for your comment. Unfortunately, as this bug report is closed, this bug report is not scoped to you, your hardware, or your problem. So your hardware and problem may be tracked, could you please file a new report with Ubuntu by executing the following in a terminal while booted into the default Ubuntu kernel (not a mainline one) via:
ubuntu-bug linux

For more on this, please read the official Ubuntu documentation:
Ubuntu Bug Control and Ubuntu Bug Squad: https://wiki.ubuntu.com/Bugs/BestPractices#X.2BAC8-Reporting.Focus_on_One_Issue
Ubuntu Kernel Team: https://wiki.ubuntu.com/KernelTeam/KernelTeamBugPolicies#Filing_Kernel_Bug_reports
Ubuntu Community: https://help.ubuntu.com/community/ReportingBugs#Bug_reporting_etiquette

When opening up the new report, please feel free to subscribe me to it.

As well, please do not announce in this report you created a new bug report.

Thank you for your understanding.

Helpful bug reporting tips:
https://wiki.ubuntu.com/ReportingBugs

Revision history for this message

cagancelik (cagancelik) wrote on 2015-03-01:

#45

This bug effects me as well. Let alone installation, I can't even use Live CD due to this bug. Ubuntu is completely unusable. See more in this thread (with screen shots). SuSe Linux boots and Works just fine but Ubuntu hangs all the time.

My laptop has GTX880M 8GB GPU. Obviously this is an Nvidia related bug and no importance is given to fix it. Shame for Linux community not to support a top of the line graphics card in 2015.

Revision history for this message

cagancelik (cagancelik) wrote on 2015-03-01:

#46

Sorry, forget to include my screenshot.

Revision history for this message

penalvch (penalvch) wrote on 2015-03-01:

#47

cagancelik, as this report is closed, it doesn't affect you. If you want your problem addressed, it would help immensely if you filed a new report via a terminal:
ubuntu-bug linux

Please feel free to subscribe me to it.

Revision history for this message

mik047 (mik047) wrote on 2015-06-09:

#48

affects me too.. I have NVIDIA® Quadro® K2100M Graphics 2GB GDDR5 on my config. While installing ubuntu, it complains of following..

nouveau E [DRM] failed to idle channel 0xcccc0000 [DRM]
xhci_hcd 0000:00:14.0: HC died: cleaning up
INFO: rcu_sched detected stails on CPUs/tasks: { 2} (detected by 0, t=150002 jiffies, g=324, c=323, q=0)
BUG: soft lockup - CPU#0 stuck for 22s! [khubd:74]
BUG: soft lockup - CPU#0 stuck for 22s! [scsi_id:1042]
...series of the last two lines....
INFO: task kworker/2:1:83 blocked for more than 120 seconds.
Tainted: G W 3.16.0.30-generic #40~04.1-Ubuntu
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message
...again the above message continues randomly starting from INFO: rcu_sched .......

Revision history for this message

Andrew (am-public-o) wrote on 2015-07-03:

#49

Another one affected here. Nvidia GTX 250? Cannot even access shell commands to fix anything. Supposedly the proprietary drivers fix this. I have now had at least three PC's with varying hardware affected by driver issues on a clean install. Bugs have been present since sometime after 12.04 LTS, in the 13.x releases and is still present in 14.04.02LTS.

Feature Request: Install Proprietary drivers at OS install time by default

Revision history for this message

penalvch (penalvch) wrote on 2015-07-04:

#50

Andrew, as this report is closed, you wouldn't be affected by it.

However, if you would like your issue addressed, please file a new report via a terminal:
ubuntu-bug linux

Please feel free to subscribe me to it.

Ubuntu
linux-lts-raring package

nouveau failed to idle channel 0xcccc0000

Bug Description

Other bug subscribers

Remote bug watches

Ubuntulinux-lts-raring package

nouveau failed to idle channel 0xcccc0000

Bug Description

Other bug subscribers

Remote bug watches

Ubuntu
linux-lts-raring package