nvidia: Xorg hangs at 100% CPU when playing video: "fallen off the bus"
This is likely a dupe of bug #1001007, but apport-collect says it's better to file a new bug, so here I go. If they are a dupe, it may not be just video playing that causes it, or just 3D usage, but maybe VDPAU or a combination thereof. I do use 3D compositing with KDM in KDE 4.8.3, and I have VDPAU installed.
I have been running Kubuntu on this Dell XPS M1330 laptop with NVIDIA 8400M GS since 2008 with no problems. Suddenly, after upgrading from 11.10 Oneiric to 12.04 Precise, I am regularly experiencing X hanging with 100% CPU. So far, every time it has happened, it's been while watching YouTube videos in Flash in Firefox. It may be minutes, hours, or days between crashes.
When it happens, sometimes the cursor is movable, and sometimes the cursor disappears. The rest of the screen freezes, but sound continues playing. I can SSH in and see X at 100% CPU. I cannot change VTs. I can sometimes SAK+K several times to kill X and then KDM and X will restart--other times I have to power off.
I just downgraded to 295.20 to see if the bug exists in this version--I've read in some places that people find 295.20 to be stable. Note that even though I am filing this bug and ran apport under 295.20, I have experienced the bug in 295.40 and 295.49, the versions in precise and precise-updates.
When the hang happens, I see this in dmesg every time:
[ 2140.551099] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
[ 2140.551107] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
[ 2140.551133] NVRM: os_pci_init_handle: invalid context!
[ 2140.551139] NVRM: os_pci_init_handle: invalid context!
[ 2140.551206] NVRM: os_pci_init_handle: invalid context!
[ 2140.551224] NVRM: os_pci_init_handle: invalid context!
[ 2140.551229] NVRM: os_pci_init_handle: invalid context!
[ 2142.079045] irq 16: nobody cared (try booting with the "irqpoll" option)
[ 2142.079051] Pid: 0, comm: BFS/0 Tainted: P C O 3.3.6-pf-adp+ #6
[ 2142.079054] Call Trace:
[ 2142.079061] [<c154ddf6>] ? printk+0x2d/0x2f
[ 2142.079067] [<c10b4139>] __report_
[ 2142.079070] [<c10b43ae>] note_interrupt+
[ 2142.079174] [<f9bad083>] ? nv_kern_
[ 2142.079178] [<c10b21ee>] handle_
[ 2142.079182] [<c10b4cf0>] ? handle_
[ 2142.079186] [<c15569e0>] ? nmi_stack_
[ 2142.079190] [<c10269e8>] ? default_
[ 2142.079194] [<c101df4d>] ? __io_apic_
[ 2142.079198] [<c10b238b>] handle_
[ 2142.079201] [<c10b4c20>] ? unmask_
[ 2142.079204] [<c10b4c6e>] handle_
[ 2142.079206] <IRQ> [<c155d3b2>] ? do_IRQ+0x42/0xc0
[ 2142.079213] [<c1008638>] ? sched_clock+
[ 2142.079217] [<c1060ffb>] ? sched_clock_
[ 2142.079221] [<c155d2f0>] ? common_
[ 2142.079225] [<c10600d8>] ? build_sched_
[ 2142.079230] [<c1315fef>] ? arch_local_
[ 2142.079233] [<c13169cb>] ? acpi_idle_
[ 2142.079237] [<c1448d9d>] ? cpuidle_
[ 2142.079241] [<c100174c>] ? cpu_idle+0x9c/0xe0
[ 2142.079244] [<c1531825>] ? rest_init+0x5d/0x68
[ 2142.079249] [<c17f5745>] ? start_kernel+
[ 2142.079252] [<c17f517f>] ? loglevel+0x2b/0x2b
[ 2142.079255] [<c17f5078>] ? i386_start_
[ 2142.079257] handlers:
[ 2142.079340] [<f9bad050>] nv_kern_isr
[ 2142.079342] Disabling IRQ #16
Again, this never, ever happened in all these years until I "upgraded" from Oneiric to Precise. Now my laptop is completely unreliable--or, at least, I risk a hang and having to kill all processes whenever I watch a video. Not much of an upgrade. :(
I don't know what to do now. If 295.20 is not stable, should I keep downgrading? I assume I'll eventually run into a version that won't install on Precise, whether due to kernel incompatibilities or other issues. I don't think downgrading the entire system to Oneiric (reinstalling, at that) is a good option, either, as I'll be stuck with older versions of other software, including all of KDE.
Here are some other links that may be relevant:
Thread on nvnews that I think is this same bug:
Probably not the same bug, but perhaps related in that it may be a regression in 295.33+. The performance regressions may be fixed, but this hang is not:
I will do whatever I can to help debug this. This is very frustrating!
DistroRelease: Ubuntu 12.04
Package: nvidia-current 295.20-0ubuntu1
Uname: Linux 3.2.0-24-generic i686
Date: Sat May 26 19:01:42 2012
PATH=(custom, no user)
UpgradeStatus: Upgraded to precise on 2012-05-06 (20 days ago)
|Changed in nvidia-graphics-drivers (Ubuntu):|
|status:||New → Confirmed|
|Adam Porter (alphapapa) wrote : Re: [Bug 1005028] Re: nvidia: Xorg hangs at 100% CPU when playing video: "fallen off the bus"||#4|
|Adam Porter (alphapapa) wrote : Re: [Bug 1005028] [NEW] nvidia: Xorg hangs at 100% CPU when playing video: "fallen off the bus"||#5|