Hardy pm-hibernate oops with nvidia restricted driver

Bug #217938 reported by Mozg
8
This bug affects 1 person
Affects Status Importance Assigned to Milestone
nvidia-kernel-common (Ubuntu)
Expired
Undecided
Unassigned
pm-utils (Ubuntu)
Expired
Undecided
Unassigned

Bug Description

Binary package hint: pm-utils

When resuming from hibernation to disk I constantly get several oopses in pm-hibernate and system stops responding. The oops is related to proprietary nvidia module. Below is the output of the kernel.log file (with a lot of repeats):

[ 88.631519] BUG: soft lockup - CPU#1 stuck for 11s! [pm-hibernate:3192]
[ 88.631520] CPU 1:
[ 88.631521] Modules linked in: ndiswrapper ecryptfs tpm_tis tpm tpm_bios snd_rtctimer hci_usb af_packet binfmt_misc rfcomm l2cap bluetooth ppdev ipv6 acpi_cpufreq cpufreq_userspace cpufreq_stats cpufreq_powersave cpufreq_ondemand freq_table cpufreq_conservative sbs sbshc container bay dock iptable_filter ip_tables x_tables ext3 jbd mbcache input_polldev tp_smapi thinkpad_ec uinput sbp2 parport_pc lp parport loop arc4 ecb serial_cs snd_hda_intel snd_pcm_oss snd_mixer_oss snd_pcm snd_page_alloc snd_hwdep snd_seq_dummy snd_seq_oss snd_seq_midi pcmcia snd_rawmidi snd_seq_midi_event snd_seq joydev nvidia(P) ac sdhci battery psmouse i2c_core snd_timer video output mmc_core serio_raw snd_seq_device ricoh_mmc yenta_socket rsrc_nonstatic pcmcia_core wmi_acer thinkpad_acpi snd button nvram iTCO_wdt iTCO_vendor_support evdev pcspkr shpchp pci_hotplug soundcore xfs sha256_generic aes_x86_64 cbc blkcipher sr_mod cdrom pata_acpi sg sd_mod ata_piix dm_crypt ahci ata_generic libata scsi_mod ohci1394 ieee1394 ehci_hcd uhci_hcd usbcore dm_mirror dm_snapshot dm_mod thermal processor fan fuse vga16fb vgastate vesafb fbcon tileblit font bitblit softcursor
[ 88.631553] Pid: 3192, comm: pm-hibernate Tainted: P 2.6.24-16-generic #1
[ 88.631554] RIP: 0010:[<ffffffff886e9a63>] [<ffffffff886e9a63>] :nvidia:os_io_read_byte+0x3/0x10
[ 88.631628] RSP: 0018:ffff8100a9bc3c60 EFLAGS: 00000292
[ 88.631629] RAX: 0000000000000063 RBX: ffff8101062d2f08 RCX: ffffffff88b79420
[ 88.631631] RDX: 00000000000003d5 RSI: 00000000000003d5 RDI: ffff81012bdc0000
[ 88.631632] RBP: 0000000000000001 R08: 0000000000000000 R09: 0000000000000001
[ 88.631633] R10: 0000000000000001 R11: 0000000000000000 R12: 0000000000000001
[ 88.631634] R13: 0000000000000000 R14: ffff8101062d2ef0 R15: 0000000000000000
[ 88.631636] FS: 00007f989d8606e0(0000) GS:ffff810137401800(0000) knlGS:0000000000000000
[ 88.631637] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[ 88.631638] CR2: 0000000000000000 CR3: 00000000a9580000 CR4: 00000000000006a0
[ 88.631639] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 88.631641] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[ 88.631642]
[ 88.631642] Call Trace:
[ 88.631718] [<ffffffff886c68a3>] :nvidia:_nv000016rm+0x13/0x1b
[ 88.631808] [<ffffffff8862d565>] :nvidia:_nv000217rm+0x11/0x26
[ 88.631888] [<ffffffff8841c8fe>] :nvidia:_nv000310rm+0x83/0xa4
[ 88.631964] [<ffffffff886c6ee9>] :nvidia:_nv004530rm+0x1b1/0x1f8
[ 88.632057] [<ffffffff88632e99>] :nvidia:_nv003881rm+0xc7/0x35b
[ 88.632151] [<ffffffff88637531>] :nvidia:_nv002862rm+0x2e5/0x550
[ 88.632241] [<ffffffff886374ed>] :nvidia:_nv002862rm+0x2a1/0x550
[ 88.632334] [<ffffffff88637db4>] :nvidia:_nv002999rm+0x46e/0x6ff
[ 88.632428] [<ffffffff886349dc>] :nvidia:_nv002857rm+0x71/0x79
[ 88.632519] [<ffffffff88634c07>] :nvidia:_nv002869rm+0x223/0x23e
[ 88.632612] [<ffffffff88639acd>] :nvidia:rm_power_management+0x1cd/0x269
[ 88.632691] [<ffffffff886e6b44>] :nvidia:nv_power_management+0x164/0x200
[ 88.632698] [<ffffffff803bde1a>] dpm_resume+0x6a/0x1a0
[ 88.632702] [<ffffffff803bdf6a>] device_resume+0x1a/0x30
[ 88.632705] [<ffffffff80267c1d>] hibernation_snapshot+0x9d/0x120
[ 88.632709] [<ffffffff80267e58>] hibernate+0xe8/0x1e0
[ 88.632714] [<ffffffff80266670>] state_store+0xe0/0xf0
[ 88.632721] [<ffffffff802fd97e>] sysfs_write_file+0xfe/0x160
[ 88.632729] [<ffffffff802adacd>] vfs_write+0xed/0x190
[ 88.632734] [<ffffffff802ae1d3>] sys_write+0x53/0x90
[ 88.632740] [<ffffffff8020c37e>] system_call+0x7e/0x83
[ 88.632752]

lspci output:
 lspci
00:00.0 Host bridge: Intel Corporation Mobile PM965/GM965/GL960 Memory Controller Hub (rev 0c)
00:01.0 PCI bridge: Intel Corporation Mobile PM965/GM965/GL960 PCI Express Root Port (rev 0c)
00:19.0 Ethernet controller: Intel Corporation 82566MM Gigabit Network Connection (rev 03)
00:1a.0 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #4 (rev 03)
00:1a.1 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #5 (rev 03)
00:1a.7 USB Controller: Intel Corporation 82801H (ICH8 Family) USB2 EHCI Controller #2 (rev 03)
00:1b.0 Audio device: Intel Corporation 82801H (ICH8 Family) HD Audio Controller (rev 03)
00:1c.0 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 1 (rev 03)
00:1c.1 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 2 (rev 03)
00:1c.2 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 3 (rev 03)
00:1c.3 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 4 (rev 03)
00:1c.4 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 5 (rev 03)
00:1d.0 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #1 (rev 03)
00:1d.1 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #2 (rev 03)
00:1d.2 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #3 (rev 03)
00:1d.7 USB Controller: Intel Corporation 82801H (ICH8 Family) USB2 EHCI Controller #1 (rev 03)
00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev f3)
00:1f.0 ISA bridge: Intel Corporation 82801HBM (ICH8M-E) LPC Interface Controller (rev 03)
00:1f.1 IDE interface: Intel Corporation 82801HBM/HEM (ICH8M/ICH8M-E) IDE Controller (rev 03)
00:1f.2 SATA controller: Intel Corporation 82801HBM/HEM (ICH8M/ICH8M-E) SATA AHCI Controller (rev 03)
00:1f.3 SMBus: Intel Corporation 82801H (ICH8 Family) SMBus Controller (rev 03)
01:00.0 VGA compatible controller: nVidia Corporation Quadro FX 570M (rev a1)
03:00.0 Network controller: Intel Corporation PRO/Wireless 4965 AG or AGN Network Connection (rev 61)
15:00.0 CardBus bridge: Ricoh Co Ltd RL5c476 II (rev ba)
15:00.1 FireWire (IEEE 1394): Ricoh Co Ltd R5C832 IEEE 1394 Controller (rev 04)
15:00.2 SD Host controller: Ricoh Co Ltd R5C822 SD/SDIO/MMC/MS/MSPro Host Adapter (rev 21)
15:00.3 System peripheral: Ricoh Co Ltd R5C843 MMC Host Controller (rev 11)
15:00.4 System peripheral: Ricoh Co Ltd R5C592 Memory Stick Bus Host Adapter (rev 11)
15:00.5 System peripheral: Ricoh Co Ltd xD-Picture Card Controller (rev 11)

uname -a
Linux finka 2.6.24-16-generic #1 SMP Thu Apr 10 12:47:45 UTC 2008 x86_64 GNU/Linux

lsb_release -rd
Description: Ubuntu 8.04
Release: 8.04

 apt-cache policy pm-utils
pm-utils:
  Installed: 0.99.2-3ubuntu9
  Candidate: 0.99.2-3ubuntu9
  Version table:
 *** 0.99.2-3ubuntu9 0
        500 http://gb.archive.ubuntu.com hardy/main Packages
        100 /var/lib/dpkg/status

 apt-cache policy nvidia-glx-new
nvidia-glx-new:
  Installed: 169.12+2.6.24.500-500.23~envy
  Candidate: 169.12+2.6.24.500-500.23~envy
  Version table:
 *** 169.12+2.6.24.500-500.23~envy 0
        500 http://ppa.launchpad.net hardy/main Packages
        100 /var/lib/dpkg/status
     169.12+2.6.24.12-16.34 0
        500 http://gb.archive.ubuntu.com hardy/restricted Packages

Revision history for this message
vlowther (victor-lowther) wrote :

What happens if you use suspend as opposed to hibernate?

Revision history for this message
vlowther (victor-lowther) wrote :

Added nvidia-kernel-common because their input will be required to resolve issues of this sort.

Revision history for this message
Mozg (andrei-arhont) wrote :

The same happens regardless of suspend or hibernate.

Revision history for this message
Mozg (andrei-arhont) wrote :

I have managed to fix the suspend issue by using proper acpi-suspend configuration ( http://launchpadlibrarian.net/14044012/acpi-support ). However hibernation still produce the oops. I would be very happy to help debigging process as hibernation is very important to me and i would like to get this fixed.

Revision history for this message
DickeyWang (hwang313000) wrote :

    I have exactly the same problem with both of the Nvidia 169.12 and the 173.14.05 driver. In additional to the oops, occasionally the integrated bluetooth would stop working after resume from hibernate/suspend. I am running the 2.6.24-18 kernel from the repo.

Revision history for this message
Kamil Páral (kamil.paral) wrote :

I have similar problem on my ThinkPad R61. Nvidia 140m card with proprietary driver. With nv driver all works. With proprietary driver suspend and resume works, but resume from hibernation produces output similar to mentioned above and system freezes. Hibernation from console with gdm shut down works though.

Revision history for this message
jonie (jonie) wrote :

I can confirm identical behaviour on desktop system, based on Nforce3. Latest drivers (as of today 177.68) don't change a thing. Video card hangs on resume, for those 11s display goes into standby, after that it wakes again but stays black. When hibernate or suspend is commenced from console, display wakes up but hangs as soon as it's switched to vt7 (X). System is not frozen completely, still can be rebooted by ctrl+alt+del, but is also very unresponsive. Fiddling with acpi settings has no effect.

Revision history for this message
Thomas Hotz (thotz-deactivatedaccount) wrote :

Can you reproduce this bug with a newer Ubuntu version? Thank you!

Changed in pm-utils (Ubuntu):
status: New → Incomplete
Changed in nvidia-kernel-common (Ubuntu):
status: New → Incomplete
Revision history for this message
Launchpad Janitor (janitor) wrote :

[Expired for pm-utils (Ubuntu) because there has been no activity for 60 days.]

Changed in pm-utils (Ubuntu):
status: Incomplete → Expired
Revision history for this message
Launchpad Janitor (janitor) wrote :

[Expired for nvidia-kernel-common (Ubuntu) because there has been no activity for 60 days.]

Changed in nvidia-kernel-common (Ubuntu):
status: Incomplete → Expired
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.