Similar issue here. I'm fairly sure I have seen other errors as well, i.e. not only from nvidia.
After the example below, it's dead. It always concerns IRQs. I enclose some examples from the rich gallery I have in /var/log.
name [ 1377.796013] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
name [ 1377.796088] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
name [ 1377.796013] NVRM: os_pci_init_handle: invalid context!
name last message repeated 4 times
name [ 1380.200095] irq 16: nobody cared (try booting with the "irqpoll" option)
name [ 1380.200102] Pid: 3019, comm: mplayer Tainted: P O 3.2.0-35-generic #55-Ubuntu
name [ 1380.200105] Call Trace:
name [ 1380.200113] [<c1560f2c>] ? printk+0x2d/0x2f
name [ 1380.200119] [<c10b1259>] __report_bad_irq+0x29/0xd0
name [ 1380.200123] [<c10b14b4>] note_interrupt+0x104/0x150
name [ 1380.200127] [<c10af38f>] handle_irq_event_percpu+0x9f/0x1f0
name [ 1380.200132] [<c1027518>] ? default_spin_lock_flags+0x8/0x10
name [ 1380.200136] [<c157614d>] ? _raw_spin_lock_irqsave+0x2d/0x40
name [ 1380.200140] [<c1027518>] ? default_spin_lock_flags+0x8/0x10
name [ 1380.200144] [<c101e56d>] ? __io_apic_modify_irq+0x7d/0x90
name [ 1380.200148] [<c10af51b>] handle_irq_event+0x3b/0x60
name [ 1380.200152] [<c10b1cc0>] ? unmask_irq+0x30/0x30
name [ 1380.200156] [<c10b1d0e>] handle_fasteoi_irq+0x4e/0xd0
name [ 1380.200158] <IRQ> [<c157d832>] ? do_IRQ+0x42/0xc0
name [ 1380.200165] [<c157d770>] ? common_interrupt+0x30/0x38
name [ 1380.200169] [<c1570000>] ? add_i2c_device+0x126/0x166
name [ 1380.200172] handlers:
name [ 1380.200281] [<f9833050>] nv_kern_isr
name [ 1380.200284] Disabling IRQ #16
[12639.702668] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
[12639.702675] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
[12639.702693] NVRM: os_pci_init_handle: invalid context!
[12639.702697] NVRM: os_pci_init_handle: invalid context!
[12639.702750] NVRM: os_pci_init_handle: invalid context!
[12639.702764] NVRM: os_pci_init_handle: invalid context!
[12639.702768] NVRM: os_pci_init_handle: invalid context!
[12641.590004] irq 16: nobody cared (try booting with the "irqpoll" option)
[12641.590010] Pid: 1332, comm: Xorg Tainted: P O 3.2.0-35-generic #55-Ubuntu
[12641.590013] Call Trace:
[12641.590022] [<c1560f2c>] ? printk+0x2d/0x2f
[12641.590028] [<c10b1259>] __report_bad_irq+0x29/0xd0
[12641.590032] [<c10b14b4>] note_interrupt+0x104/0x150
[12641.590036] [<c10af38f>] handle_irq_event_percpu+0x9f/0x1f0
[12641.590041] [<c10b1d90>] ? handle_fasteoi_irq+0xd0/0xd0
[12641.590045] [<c15776f1>] ? do_nmi+0x61/0x80
[12641.590050] [<c1027518>] ? default_spin_lock_flags+0x8/0x10
[12641.590054] [<c101e56d>] ? __io_apic_modify_irq+0x7d/0x90
[12641.590058] [<c10af51b>] handle_irq_event+0x3b/0x60
[12641.590061] [<c10b1cc0>] ? unmask_irq+0x30/0x30
[12641.590065] [<c10b1d0e>] handle_fasteoi_irq+0x4e/0xd0
[12641.590067] <IRQ> [<c157d832>] ? do_IRQ+0x42/0xc0
[12641.590074] [<c1002de3>] ? sys_sigreturn+0x93/0xa0
[12641.590078] [<c157d770>] ? common_interrupt+0x30/0x38
[12641.590081] handlers:
[12641.590189] [<f95bb050>] nv_kern_isr
[12641.590192] Disabling IRQ #16
[19994.912076] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
[19994.912079] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
[19994.912095] NVRM: os_pci_init_handle: invalid context!
[19994.912098] NVRM: os_pci_init_handle: invalid context!
[19994.912148] NVRM: os_pci_init_handle: invalid context!
[19994.912159] NVRM: os_pci_init_handle: invalid context!
[19994.912161] NVRM: os_pci_init_handle: invalid context!
[19997.209478] irq 16: nobody cared (try booting with the "irqpoll" option)
[19997.209485] Pid: 7751, comm: kwin Tainted: P O 3.2.0-35-generic #55-Ubuntu
[19997.209488] Call Trace:
[19997.209496] [<c1560f2c>] ? printk+0x2d/0x2f
[19997.209503] [<c10b1259>] __report_bad_irq+0x29/0xd0
[19997.209507] [<c10b14b4>] note_interrupt+0x104/0x150
[19997.209512] [<c10249c3>] ? read_hpet+0x13/0x20
[19997.209516] [<c10af38f>] handle_irq_event_percpu+0x9f/0x1f0
[19997.209520] [<c10b1d90>] ? handle_fasteoi_irq+0xd0/0xd0
[19997.209525] [<c1027518>] ? default_spin_lock_flags+0x8/0x10
[19997.209529] [<c157614d>] ? _raw_spin_lock_irqsave+0x2d/0x40
[19997.209533] [<c10af51b>] handle_irq_event+0x3b/0x60
[19997.209536] [<c10b1cc0>] ? unmask_irq+0x30/0x30
[19997.209540] [<c10b1d0e>] handle_fasteoi_irq+0x4e/0xd0
[19997.209542] <IRQ> [<c157d832>] ? do_IRQ+0x42/0xc0
[19997.209550] [<c103a198>] ? pick_next_task_fair+0x48/0x50
[19997.209554] [<c157d770>] ? common_interrupt+0x30/0x38
[19997.209558] [<c10249c3>] ? read_hpet+0x13/0x20
[19997.209563] [<c1074597>] ? ktime_get_ts+0x67/0x120
[19997.209567] [<c106860f>] ? posix_ktime_get_ts+0xf/0x20
[19997.209571] [<c1069544>] ? sys_clock_gettime+0x24/0x70
[19997.209574] [<c15762f4>] ? syscall_call+0x7/0xb
[19997.209579] [<c1570000>] ? add_i2c_device+0x126/0x166
[19997.209581] handlers:
[19997.209694] [<f94f3050>] nv_kern_isr
[19997.209697] Disabling IRQ #16
[ 4750.600194] Pid: 0, comm: swapper/0 Tainted: P O 3.2.0-35-generic #55-Ubuntu
[ 4750.600197] Call Trace:
[ 4750.600205] [<c1560f2c>] ? printk+0x2d/0x2f
[ 4750.600211] [<c10b1259>] __report_bad_irq+0x29/0xd0
[ 4750.600215] [<c10b14b4>] note_interrupt+0x104/0x150
[ 4750.600220] [<c1316736>] ? arch_local_irq_enable+0x7/0xb
[ 4750.600224] [<c10af38f>] handle_irq_event_percpu+0x9f/0x1f0
[ 4750.600229] [<c1027518>] ? default_spin_lock_flags+0x8/0x10
[ 4750.600233] [<c157614d>] ? _raw_spin_lock_irqsave+0x2d/0x40
[ 4750.600236] [<c1027518>] ? default_spin_lock_flags+0x8/0x10
[ 4750.600240] [<c157614d>] ? _raw_spin_lock_irqsave+0x2d/0x40
[ 4750.600244] [<c10af51b>] handle_irq_event+0x3b/0x60
[ 4750.600248] [<c10b1cc0>] ? unmask_irq+0x30/0x30
[ 4750.600251] [<c10b1d0e>] handle_fasteoi_irq+0x4e/0xd0
[ 4750.600254] <IRQ> [<c157d832>] ? do_IRQ+0x42/0xc0
[ 4750.600261] [<c1008bc8>] ? sched_clock+0x8/0x10
[ 4750.600266] [<c10701bb>] ? sched_clock_local+0xcb/0x1c0
[ 4750.600269] [<c157d770>] ? common_interrupt+0x30/0x38
[ 4750.600273] [<c10700d8>] ? profiling_store+0x48/0x60
[ 4750.600276] [<c1316736>] ? arch_local_irq_enable+0x7/0xb
[ 4750.600280] [<c1317155>] ? acpi_idle_enter_simple+0xf3/0x133
[ 4750.600285] [<c144b4dd>] ? cpuidle_idle_call+0xad/0x250
[ 4750.600289] [<c100180c>] ? cpu_idle+0x9c/0xe0
[ 4750.600293] [<c15455a5>] ? rest_init+0x5d/0x68
[ 4750.600298] [<c1834771>] ? start_kernel+0x34d/0x353
[ 4750.600302] [<c18343b5>] ? pass_bootoption.constprop.2+0xe2/0xe2
[ 4750.600306] [<c18340a9>] ? i386_start_kernel+0xa9/0xaf
[ 4750.600308] handlers:
[ 4750.600420] [<f94f3050>] nv_kern_isr
[ 4750.600423] Disabling IRQ #16
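
In case anyone wants to compare their own logs against the excerpts above, here is a rough Python sketch that tallies the lines that matter: which IRQ was reported as "nobody cared", which task the trace names (the `comm:` field), and which IRQ got disabled. The function names `summarize` and `summarize_file` are made up for this illustration, and the log path in the usage note is an assumption; use whichever file under /var/log holds your kernel messages.

```python
import re
from collections import Counter

# Patterns for the three kinds of lines seen in every excerpt above.
NOBODY_CARED = re.compile(r"irq (\d+): nobody cared")
COMM = re.compile(r"Pid: \d+, comm: (\S+)")
DISABLING = re.compile(r"Disabling IRQ #(\d+)")


def summarize(lines):
    """Tally 'nobody cared' reports per IRQ, task names, and disabled IRQs."""
    cared, comms, disabled = Counter(), Counter(), Counter()
    for line in lines:
        match = NOBODY_CARED.search(line)
        if match:
            cared[int(match.group(1))] += 1
            continue
        match = COMM.search(line)
        if match:
            comms[match.group(1)] += 1
            continue
        match = DISABLING.search(line)
        if match:
            disabled[int(match.group(1))] += 1
    return cared, comms, disabled


def summarize_file(path):
    """Run summarize() over a log file; undecodable bytes are replaced."""
    with open(path, errors="replace") as handle:
        return summarize(handle)
```

Calling something like `summarize_file("/var/log/kern.log")` (path assumed, not confirmed for every setup) returns three counters. Note that the `comm:` counter also picks up traces from unrelated kernel warnings, so it is only a rough guide to what was running when the IRQ was disabled.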