Broken suspend on 2.6.20-16.29

Bug #123002 reported by David Jaša
6
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Won't Fix
Medium
Unassigned
linux-source-2.6.20 (Ubuntu)
Won't Fix
High
Unassigned
linux-source-2.6.22 (Ubuntu)
Won't Fix
Medium
Unassigned

Bug Description

Binary package hint: linux-source-2.6.20

I've got continous problems with suspend to both RAM and disk in feisty final on IBM T40 2374-MU7. I installed system as festy alpha 5 (last alpha) where both worked reliably. I upgraded to feisty final pretty late and a trouble begun. Some kernels since then sometime resume flawlessly (unfortunately I don't know, if apt has some packages history), but most times system just hangs in "black screen". I use now linux-image-2.6.20-16.29 and I followed Debugging Kernel Suspend article from wiki with these results:

$ dmesg |grep -i 'magic\|hash'
[ 2.369427] PID hash table entries: 4096 (order: 12, 16384 bytes)
[ 2.371134] Dentry cache hash table entries: 131072 (order: 7, 524288 bytes)
[ 2.371824] Inode-cache hash table entries: 65536 (order: 6, 262144 bytes)
[ 2.493524] Mount-cache hash table entries: 512
[ 2.957476] IP route cache hash table entries: 32768 (order: 5, 131072 bytes)
[ 2.957640] TCP established hash table entries: 131072 (order: 8, 1048576 bytes)
[ 2.959229] TCP bind hash table entries: 65536 (order: 7, 524288 bytes)
[ 2.960344] TCP: Hash tables configured (established 131072 bind 65536)
[ 3.668758] Dquot-cache hash table entries: 1024 (order 0, 4096 bytes)
[ 4.096020] Magic number: 0:798:917
[ 4.096023] hash matches drivers/base/power/resume.c:46

Revision history for this message
David Jaša (dejv) wrote :
Revision history for this message
David Jaša (dejv) wrote :
Revision history for this message
Jérôme Guelfucci (jerome-guelfucci-deactivatedaccount) wrote :

Thank you for your bug report.

Please include the following additional information, if you have not already done so (please pay attention to lspci's additional options), as required by the Ubuntu Kernel Team:
1. Please include the output of the command "uname -a" in your next response. It should be one, long line of text which includes the exact kernel version you're running, as well as the CPU architecture.
2. Please run the command "dmesg > dmesg.log" and attach the resulting file "dmesg.log" to this bug report.
3. Please run the command "lspci -vvnn > lspci-vvnn.log" and attach the resulting file "lspci-vvnn.log" to this bug report.

For your reference, the full description of procedures for kernel-related bug reports is available at [WWW] http://wiki.ubuntu.com/KernelTeamBugPolicies. Thanks in advance!

Changed in linux-source-2.6.20:
importance: Undecided → High
status: New → Incomplete
Revision history for this message
David Jaša (dejv) wrote :

$ uname -a
Linux cihla 2.6.20-16-generic #2 SMP Thu Jun 7 20:19:32 UTC 2007 i686 GNU/Linux

Output of lspci -vvnn and dmesg had been attached before your reply.

Revision history for this message
Jérôme Guelfucci (jerome-guelfucci-deactivatedaccount) wrote :

Thank you, I'm marking this as confirmed.

Changed in linux-source-2.6.20:
assignee: nobody → ubuntu-kernel-team
status: Incomplete → Confirmed
Revision history for this message
David Jaša (dejv) wrote :

Just a question: what extra information provides 'uname -a' compared to first line of dmesg output?

Revision history for this message
David Jaša (dejv) wrote :

Another symptoms: resume from STR seems all right in the beginning, but after a while a system freezes. If I manage to switch to terminal, I could read some message about scsi error. I tried afterwards SysRq + u, s, b to remount ro and sync discs, but all I could recover after reboot is this part of log files:

/var/log/messages:
Jul 15 22:42:49 cihla gnome-power-manager: (david) Počítač se uspává do paměti, protože víko bylo uzavřeno při napájení z elektrické sítě [(david) Computer suspends to RAM, because lid was closed while powered from AC]
Jul 15 22:42:51 cihla kernel: [40351.004000] ADDRCONF(NETDEV_UP): eth0: link is not ready
Jul 15 22:42:52 cihla irattach: got SIGTERM or SIGINT
Jul 15 22:42:52 cihla irattach: Stopping device irda0
Jul 15 22:42:52 cihla irattach: exiting ...
Jul 15 22:42:52 cihla kernel: [40352.012000] e1000: eth0: e1000_watchdog: NIC Link is Up 100 Mbps Full Duplex
Jul 15 22:42:52 cihla kernel: [40352.016000] ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
Jul 15 22:42:55 cihla kernel: [40354.668000] ACPI: PCI interrupt for device 0000:02:01.0 disabled
Jul 15 22:42:55 cihla kernel: [40354.992000] ACPI: PCI interrupt for device 0000:02:02.0 disabled
Jul 15 23:38:38 cihla syslogd 1.4.1#20ubuntu4: restart.
Jul 15 23:38:38 cihla kernel: Inspecting /boot/System.map-2.6.20-16-generic
Jul 15 23:38:38 cihla kernel: Loaded 24977 symbols from /boot/System.map-2.6.20-16-generic.
Jul 15 23:38:38 cihla kernel: Symbols match kernel version 2.6.20.
Jul 15 23:38:38 cihla kernel: No module symbols loaded - kernel modules not enabled.

/var/log/kern.log:
Jul 15 22:42:51 cihla kernel: [40351.004000] ADDRCONF(NETDEV_UP): eth0: link is not ready
Jul 15 22:42:52 cihla kernel: [40352.012000] e1000: eth0: e1000_watchdog: NIC Link is Up 100 Mbps Full Duplex
Jul 15 22:42:52 cihla kernel: [40352.016000] ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
Jul 15 22:42:55 cihla kernel: [40354.668000] ACPI: PCI interrupt for device 0000:02:01.0 disabled
Jul 15 22:42:55 cihla kernel: [40354.992000] ACPI: PCI interrupt for device 0000:02:02.0 disabled
Jul 15 22:42:55 cihla kernel: [40355.040000] ieee80211_crypt: unregistered algorithm 'NULL'

Revision history for this message
David Jaša (dejv) wrote :

I worked it around by this how-to: http://blog.paulbetts.org/index.php/2007/02/11/fixing-software-suspend-hibernate-with-uswsusp-in-ubuntu-feisty-and-edgy/

(uswsusp from stock packages with modified scripts hal-system-power-{suspend,hibernate}-linux in /usr/lib/hal/scripts/linux (I keep backup of original ones)

Please tell me if you will work on this bug report, I'm tempted to use development branch :-] but I won't do so if I can help solve this issue.

Revision history for this message
David Jaša (dejv) wrote :

Actually, uswsusp makes work only suspend to disk. Resume from suspend to RAM is still broken in the same way: if I don't do nothing, it shows black screen with mouse cursor but after a while it stops responding and even swithing to terminal is impossible. If I manage to switch to console before X freeze, I can see these two kernel messages altering on terminal:

scsi 0:0:0:0: rejecting I/O to dead device
metapage_read_end_io: I/O error

and once appears this message, around 5 secs from waking computer up:

Buffer I/O error on device sda3, logical block 5

(sometimes is sda3 altered with sda4, sda 3 is / partiton with jfs and sda4 is data partition with ntfs).

Revision history for this message
David Jaša (dejv) wrote :

Source package changed to -2.6.24, this probem repeats in Hardy again (and in -2.6.22 also wasn't fixed).

Revision history for this message
Leann Ogasawara (leannogasawara) wrote :

Hi David,

Can you include which version of the Hardy kernel you are running. You can find this via:

cat /proc/version_signature

Also, after an attempted suspend/resume cycle, care to reattach an updated dmesg output as outlined in https://wiki.ubuntu.com/DebuggingKernelSuspend

Thanks.

Changed in linux:
status: New → Incomplete
Revision history for this message
David Jaša (dejv) wrote :

# cat /proc/version_signature
Ubuntu 2.6.24-5.8-generic

Revision history for this message
David Jaša (dejv) wrote :

Running kernel package is "linux-image-2.6.24-5-generic" in version "2.6.24-5.8".

Revision history for this message
David Jaša (dejv) wrote :
Changed in linux:
assignee: nobody → ubuntu-kernel-team
importance: Undecided → Medium
status: Incomplete → Triaged
Changed in linux-source-2.6.22:
importance: Undecided → Medium
status: New → Triaged
Revision history for this message
Leann Ogasawara (leannogasawara) wrote :

Hi David,

Would you be able to test the latest Hardy Alpha release: http://www.ubuntu.com/testing and just add a comment that the issue still exists or not (and include your dmesg output as you have been doing). I'll see if I can nudge the kernel team to take a peek. Also, note that we'll keep this report open against the actively developed kernel bug against 2.6.20 and 2.6.22 this will be closed. Thanks.

Changed in linux-source-2.6.20:
status: Confirmed → Won't Fix
Changed in linux-source-2.6.22:
status: Triaged → Won't Fix
Revision history for this message
David Jaša (dejv) wrote : Re: [Bug 123002] Re: Broken suspend on 2.6.20-16.29
Download full text (61.3 KiB)

On Po, 2008-03-10 at 17:38 +0000, Leann Ogasawara wrote:
> Hi David,
>
> Would you be able to test the latest Hardy Alpha release:
> http://www.ubuntu.com/testing and just add a comment that the issue
> still exists or not (and include your dmesg output as you have been
> doing).
I've tested with 2.6.24-11 and -12 (the one with fixed snd issues) and
both are broken (dmesg outputs attached).

> I'll see if I can nudge the kernel team to take a peek.
Thanks

[ 0.000000] Linux version 2.6.24-11-generic (buildd@rothera) (gcc version 4.2.3 (Ubuntu 4.2.3-2ubuntu1)) #1 SMP Fri Feb 29 22:08:31 UTC 2008 (Ubuntu 2.6.24-11.17-generic)
[ 0.000000] BIOS-provided physical RAM map:
[ 0.000000] BIOS-e820: 0000000000000000 - 000000000009f000 (usable)
[ 0.000000] BIOS-e820: 000000000009f000 - 00000000000a0000 (reserved)
[ 0.000000] BIOS-e820: 00000000000d2000 - 00000000000d4000 (reserved)
[ 0.000000] BIOS-e820: 00000000000dc000 - 0000000000100000 (reserved)
[ 0.000000] BIOS-e820: 0000000000100000 - 000000003ff60000 (usable)
[ 0.000000] BIOS-e820: 000000003ff60000 - 000000003ff77000 (ACPI data)
[ 0.000000] BIOS-e820: 000000003ff77000 - 000000003ff79000 (ACPI NVS)
[ 0.000000] BIOS-e820: 000000003ff80000 - 0000000040000000 (reserved)
[ 0.000000] BIOS-e820: 00000000ff800000 - 0000000100000000 (reserved)
[ 0.000000] 127MB HIGHMEM available.
[ 0.000000] 896MB LOWMEM available.
[ 0.000000] Entering add_active_range(0, 0, 261984) 0 entries of 256 used
[ 0.000000] Zone PFN ranges:
[ 0.000000] DMA 0 -> 4096
[ 0.000000] Normal 4096 -> 229376
[ 0.000000] HighMem 229376 -> 261984
[ 0.000000] Movable zone start PFN for each node
[ 0.000000] early_node_map[1] active PFN ranges
[ 0.000000] 0: 0 -> 261984
[ 0.000000] On node 0 totalpages: 261984
[ 0.000000] DMA zone: 32 pages used for memmap
[ 0.000000] DMA zone: 0 pages reserved
[ 0.000000] DMA zone: 4064 pages, LIFO batch:0
[ 0.000000] Normal zone: 1760 pages used for memmap
[ 0.000000] Normal zone: 223520 pages, LIFO batch:31
[ 0.000000] HighMem zone: 254 pages used for memmap
[ 0.000000] HighMem zone: 32354 pages, LIFO batch:7
[ 0.000000] Movable zone: 0 pages used for memmap
[ 0.000000] DMI present.
[ 0.000000] ACPI: RSDP signature @ 0xC00F6D70 checksum 0
[ 0.000000] ACPI: RSDP 000F6D70, 0024 (r2 IBM )
[ 0.000000] ACPI: XSDT 3FF6A6BD, 004C (r1 IBM TP-1R 3210 LTP 0)
[ 0.000000] ACPI: FACP 3FF6A800, 00F4 (r3 IBM TP-1R 3210 IBM 1)
[ 0.000000] ACPI Warning (tbfadt-0442): Optional field "Gpe1Block" has zero address or length: 000000000000102C/0 [20070126]
[ 0.000000] ACPI: DSDT 3FF6A9E7, C4E5 (r1 IBM TP-1R 3210 MSFT 100000E)
[ 0.000000] ACPI: FACS 3FF78000, 0040
[ 0.000000] ACPI: SSDT 3FF6A9B4, 0033 (r1 IBM TP-1R 3210 MSFT 100000E)
[ 0.000000] ACPI: ECDT 3FF76ECC, 0052 (r1 IBM TP-1R 3210 IBM 1)
[ 0.000000] ACPI: TCPA 3FF76F1E, 0032 (r1 IBM TP-1R 3210 PTL 1)
[ 0.000000] ACPI: BOOT 3FF76FD8, 0028...

Revision history for this message
David Jaša (dejv) wrote :

Problem still exists on 2.6.24-12.17

Revision history for this message
Leann Ogasawara (leannogasawara) wrote :

Hi David,

Just curious what graphics driver you are using? If you unload that driver prior to suspending does that make a difference? Just curious.

Thanks.

Revision history for this message
David Jaša (dejv) wrote :
Download full text (59.8 KiB)

Leann Ogasawara napsal(a):
> Hi David,
>
> Just curious what graphics driver you are using? If you unload that
> driver prior to suspending does that make a difference? Just curious.
>
> Thanks.
>

xserver-xorg-video-ati

[ 0.000000] Linux version 2.6.24-12-generic (buildd@palmer) (gcc version 4.2.3 (Ubuntu 4.2.3-2ubuntu4)) #1 SMP Wed Mar 12 23:01:54 UTC 2008 (Ubuntu 2.6.24-12.22-generic)
[ 0.000000] BIOS-provided physical RAM map:
[ 0.000000] BIOS-e820: 0000000000000000 - 000000000009f000 (usable)
[ 0.000000] BIOS-e820: 000000000009f000 - 00000000000a0000 (reserved)
[ 0.000000] BIOS-e820: 00000000000d2000 - 00000000000d4000 (reserved)
[ 0.000000] BIOS-e820: 00000000000dc000 - 0000000000100000 (reserved)
[ 0.000000] BIOS-e820: 0000000000100000 - 000000003ff60000 (usable)
[ 0.000000] BIOS-e820: 000000003ff60000 - 000000003ff77000 (ACPI data)
[ 0.000000] BIOS-e820: 000000003ff77000 - 000000003ff79000 (ACPI NVS)
[ 0.000000] BIOS-e820: 000000003ff80000 - 0000000040000000 (reserved)
[ 0.000000] BIOS-e820: 00000000ff800000 - 0000000100000000 (reserved)
[ 0.000000] 127MB HIGHMEM available.
[ 0.000000] 896MB LOWMEM available.
[ 0.000000] Entering add_active_range(0, 0, 261984) 0 entries of 256 used
[ 0.000000] Zone PFN ranges:
[ 0.000000] DMA 0 -> 4096
[ 0.000000] Normal 4096 -> 229376
[ 0.000000] HighMem 229376 -> 261984
[ 0.000000] Movable zone start PFN for each node
[ 0.000000] early_node_map[1] active PFN ranges
[ 0.000000] 0: 0 -> 261984
[ 0.000000] On node 0 totalpages: 261984
[ 0.000000] DMA zone: 32 pages used for memmap
[ 0.000000] DMA zone: 0 pages reserved
[ 0.000000] DMA zone: 4064 pages, LIFO batch:0
[ 0.000000] Normal zone: 1760 pages used for memmap
[ 0.000000] Normal zone: 223520 pages, LIFO batch:31
[ 0.000000] HighMem zone: 254 pages used for memmap
[ 0.000000] HighMem zone: 32354 pages, LIFO batch:7
[ 0.000000] Movable zone: 0 pages used for memmap
[ 0.000000] DMI present.
[ 0.000000] ACPI: RSDP signature @ 0xC00F6D70 checksum 0
[ 0.000000] ACPI: RSDP 000F6D70, 0024 (r2 IBM )
[ 0.000000] ACPI: XSDT 3FF6A6BD, 004C (r1 IBM TP-1R 3210 LTP 0)
[ 0.000000] ACPI: FACP 3FF6A800, 00F4 (r3 IBM TP-1R 3210 IBM 1)
[ 0.000000] ACPI Warning (tbfadt-0442): Optional field "Gpe1Block" has zero address or length: 000000000000102C/0 [20070126]
[ 0.000000] ACPI: DSDT 3FF6A9E7, C4E5 (r1 IBM TP-1R 3210 MSFT 100000E)
[ 0.000000] ACPI: FACS 3FF78000, 0040
[ 0.000000] ACPI: SSDT 3FF6A9B4, 0033 (r1 IBM TP-1R 3210 MSFT 100000E)
[ 0.000000] ACPI: ECDT 3FF76ECC, 0052 (r1 IBM TP-1R 3210 IBM 1)
[ 0.000000] ACPI: TCPA 3FF76F1E, 0032 (r1 IBM TP-1R 3210 PTL 1)
[ 0.000000] ACPI: BOOT 3FF76FD8, 0028 (r1 IBM TP-1R 3210 LTP 1)
[ 0.000000] ACPI: PM-Timer IO Port: 0x1008
[ 0.000000] Allocating PCI resources starting at 50000000 (gap: 40000000:bf800000)
[ 0.000000] swsusp: Registered nosave memory region: 000000000009f000 - 00000000000a0000
[ 0.00...

Revision history for this message
David Jaša (dejv) wrote :

LP seems not to recognize attachments by email, so once more:

I'm using xserver-xorg-video-ati

Revision history for this message
David Jaša (dejv) wrote :
Revision history for this message
David Jaša (dejv) wrote :
Revision history for this message
David Jaša (dejv) wrote :

It still doesn't work.

Revision history for this message
David Jaša (dejv) wrote :
Revision history for this message
David Jaša (dejv) wrote :

I found out another things, that could help:

1) If I run just pm-hibernate, system hangs on resume with console cursor in top left corner of screen. Dmesg's above are from this case

2) If I change /usr/lib/pm-utils/functions according to description of bug #207002, pm-utils use s2disk instead of default way, but it freezes at some similar point, before or at restoring of X (attached dmesg is from this case)

3) If I run s2disk directly, system resumes fine

Is there some "dry run" option for pm-utils to find out what does it exactly do?

Revision history for this message
Leann Ogasawara (leannogasawara) wrote :

The Ubuntu Kernel Team is planning to move to the 2.6.27 kernel for the upcoming Intrepid Ibex 8.10 release. As a result, the kernel team would appreciate it if you could please test this newer 2.6.27 Ubuntu kernel. There are one of two ways you should be able to test:

1) If you are comfortable installing packages on your own, the linux-image-2.6.27-* package is currently available for you to install and test.

--or--

2) The upcoming Alpha5 for Intrepid Ibex 8.10 will contain this newer 2.6.27 Ubuntu kernel. Alpha5 is set to be released Thursday Sept 4. Please watch http://www.ubuntu.com/testing for Alpha5 to be announced. You should then be able to test via a LiveCD.

Please let us know immediately if this newer 2.6.27 kernel resolves the bug reported here or if the issue remains. More importantly, please open a new bug report for each new bug/regression introduced by the 2.6.27 kernel and tag the bug report with 'linux-2.6.27'. Also, please specifically note if the issue does or does not appear in the 2.6.26 kernel. Thanks again, we really appreicate your help and feedback.

Revision history for this message
Launchpad Janitor (janitor) wrote : Kernel team bugs

Per a decision made by the Ubuntu Kernel Team, bugs will longer be assigned to the ubuntu-kernel-team in Launchpad as part of the bug triage process. The ubuntu-kernel-team is being unassigned from this bug report. Refer to https://wiki.ubuntu.com/KernelTeamBugPolicies for more information. Thanks.

Revision history for this message
Leann Ogasawara (leannogasawara) wrote :

*This is an automated response*

This bug report is being closed because we received no response to the previous request for information. Please reopen this if it is still an issue in the actively developed pre-release of Jaunty Jackalope 9.04 - http://cdimage.ubuntu.com/releases/jaunty . To reopen the bug report simply change the Status of the "linux" task back to "New".

Changed in linux:
status: Triaged → Won't Fix
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.