[Acer Aspire 2920] Notebook crashes after suspend with new Kernel 2.6.27-11-generic

Bug #322886 reported by datenteiler
66
This bug affects 7 people
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Expired
High
Unassigned
Intrepid
Invalid
High
Unassigned

Bug Description

Binary package hint: linux-image-2.6.27-11-generic

My notebook Acer Aspire 2920 crashes when it wakes up from suspend. This happens after today's new upgrade from kernel linux-image-generic 2.6.27.9.13 to 2.6.27.11.14. When I boot the old kernel linux-image-generic 2.6.27.9.13 suspend works perfect. So I think I can blame it to the kernel upgrade.

christian@Jandraligeli:~$ lspci
00:00.0 Host bridge: Intel Corporation Mobile PM965/GM965/GL960 Memory Controller Hub (rev 03)
00:02.0 VGA compatible controller: Intel Corporation Mobile GM965/GL960 Integrated Graphics Controller (rev 03)
00:02.1 Display controller: Intel Corporation Mobile GM965/GL960 Integrated Graphics Controller (rev 03)
00:1a.0 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #4 (rev 03)
00:1a.1 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #5 (rev 03)
00:1a.7 USB Controller: Intel Corporation 82801H (ICH8 Family) USB2 EHCI Controller #2 (rev 03)
00:1b.0 Audio device: Intel Corporation 82801H (ICH8 Family) HD Audio Controller (rev 03)
00:1c.0 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 1 (rev 03)
00:1c.1 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 2 (rev 03)
00:1c.2 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 3 (rev 03)
00:1d.0 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #1 (rev 03)
00:1d.1 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #2 (rev 03)
00:1d.2 USB Controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #3 (rev 03)
00:1d.7 USB Controller: Intel Corporation 82801H (ICH8 Family) USB2 EHCI Controller #1 (rev 03)
00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev f3)
00:1f.0 ISA bridge: Intel Corporation 82801HEM (ICH8M) LPC Interface Controller (rev 03)
00:1f.1 IDE interface: Intel Corporation 82801HBM/HEM (ICH8M/ICH8M-E) IDE Controller (rev 03)
00:1f.2 SATA controller: Intel Corporation 82801HBM/HEM (ICH8M/ICH8M-E) SATA AHCI Controller (rev 03)
00:1f.3 SMBus: Intel Corporation 82801H (ICH8 Family) SMBus Controller (rev 03)
02:00.0 Ethernet controller: Broadcom Corporation NetLink BCM5787M Gigabit Ethernet PCI Express (rev 02)
04:00.0 Network controller: Intel Corporation PRO/Wireless 4965 AG or AGN [Kedron] Network Connection (rev 61)

Revision history for this message
datenteiler (datenteiler) wrote :
Revision history for this message
datenteiler (datenteiler) wrote :

This is the output of lspci -vvn

Revision history for this message
datenteiler (datenteiler) wrote :

This is the output of lsmod

description: updated
Revision history for this message
Leann Ogasawara (leannogasawara) wrote :

Just adding a note that bug 322914 is likely a duplicate of this one. They have slightly different hardware but are experiencing the same regression from 2.6.27-9 to 2.6.27-11.

Changed in linux:
importance: Undecided → High
status: New → Triaged
Revision history for this message
Leann Ogasawara (leannogasawara) wrote :

Hi Christian,

Can you also attach your dmesg output as well as tell us which specific version of the -11 kernel you are running? The specific version of the kernel can be found by looking at the output of 'cat /proc/version_signature'. Thanks.

Revision history for this message
Michael Schwartzkopff (misch) wrote :

hi,

My original bug was marked duplicate of this one. So I try to help here:
$ cat /proc/version_signature
Ubuntu 2.6.27-11.27-server

dmesg, lscpi and lsmod attached below.

Michael.
---

Revision history for this message
Michael Schwartzkopff (misch) wrote :
Revision history for this message
Michael Schwartzkopff (misch) wrote :
Revision history for this message
datenteiler (datenteiler) wrote :

Hi,

the specific version of the -11 kernel I am running is

christian@Jandraligeli:~$ cat /proc/version_signature
Ubuntu 2.6.27-11.27-generic

dmesg is attached below.

Thank you very much for your help.

Regards,
Christian

Andy Whitcroft (apw)
Changed in linux:
assignee: nobody → apw
status: Triaged → In Progress
Revision history for this message
Andy Whitcroft (apw) wrote :

@christian -- can you confirm how you knew that the machine crashed here. I do not see a specific crash message in those logged here. Did the machine just stop, if so could you describe its condition when it did. Also can you confirm that the machine correctly went to sleep and only once you tried to resume it did things go wrong?

If you have the ability to connect to the machine from outside, it is worth installing openssh-server and recording the machines ip address. Once the machine has resumed its worth then trying to login and see if the machine is still up under the covers.

Also it is worth swithing to VT1 with ctrl-alt-f1 and then logging in there, and using pm-suspend to try and suspend the machine, then if there is a panic we may see something on the VT on resume.

@Michael -- if yours is breaking even before you get all the way to sleep then its possible this is a different symptom and may not be a duplicate. Can you confirm that the machine does indeed never make it as far as sleeping.

Revision history for this message
Peter Bui (pnutzh4x0r) wrote :

@Andy -- I wasn't addressed in your questions but my bug was marked a duplicate of this one, so I figured I'd add my input. As noted in bug 322914, my laptop did not make it as far as sleeping, so it may not be a duplicate of this one.

Revision history for this message
datenteiler (datenteiler) wrote :

@Andy -- the notebook went to sleep and that works fine. When I press a button to wake it up it cuts out. Its totaly off till I press the power button. Then it starts again and boots Ubuntu 8.10 normally. I'll try to connect the machine with ssh while it has resumed to see if it is still up. I'll report that later.

Revision history for this message
Peter Bui (pnutzh4x0r) wrote :

So I went back to kernel-2.6.27-11.27 to see if I can gather anymore information, and here is what I found:

- If I do a pm-suspend on virtual console 1 before I log into GNOME, suspend and resume will work just fine.

- If I log into GNOME and let NetworkManager connect to a network I can NOT suspend (either using pm-suspend or the suspend key on my laptop).

- I managed to search through my kern.log and I get this message write before my laptop freezes during suspension:

iwl3945: Can't stop Rx DMA.
iwl3945 0000:0c:00.0: PCI INT A disabled

It appears perhaps there is a problem with unloading my wireless network module?

Revision history for this message
datenteiler (datenteiler) wrote :
Download full text (16.2 KiB)

@Andy -- OK, I rebooted with the new Kernel and connected with ssh:

christian@Gobuchul:~$ ssh 192.168.1.33
christian@192.168.1.33's password:
Linux Jandraligeli 2.6.27-11-generic #1 SMP Thu Jan 29 19:24:39 UTC 2009 i686

Then I watched via ssh in /var/log/messages and /var/log/kern.log as the notebook went to sleep. These are the results:

christian@Jandraligeli:~$ tail -f /var/log/messages
Feb 10 21:22:31 Jandraligeli kernel: [ 41.398446] tg3: eth0: Flow control is on for TX and on for RX.
Feb 10 21:22:31 Jandraligeli kernel: [ 41.398652] ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
Feb 10 21:22:33 Jandraligeli kernel: [ 42.966881] [drm] Initialized drm 1.1.0 20060810
Feb 10 21:22:33 Jandraligeli kernel: [ 42.981785] pci 0000:00:02.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
Feb 10 21:22:33 Jandraligeli kernel: [ 42.984666] [drm] Initialized i915 1.6.0 20060119 on minor 0
Feb 10 21:22:40 Jandraligeli pulseaudio[6044]: ltdl-bind-now.c: Failed to find original dlopen loader.
Feb 10 21:22:40 Jandraligeli pulseaudio[6046]: pid.c: Stale PID file, overwriting.
Feb 10 21:22:40 Jandraligeli pulseaudio[6046]: main.c: setrlimit(RLIMIT_NICE, (31, 31)) failed: Operation not permitted
Feb 10 21:22:40 Jandraligeli pulseaudio[6046]: main.c: setrlimit(RLIMIT_RTPRIO, (9, 9)) failed: Operation not permitted

christian@Jandraligeli:~$ tail -f /var/log/kern.log
Feb 10 21:31:48 Jandraligeli kernel: [ 76.440173] CPU1 attaching sched-domain:
Feb 10 21:31:48 Jandraligeli kernel: [ 76.440177] domain 0: span 0-1 level MC
Feb 10 21:31:48 Jandraligeli kernel: [ 76.440182] groups: 1 0
Feb 10 21:31:54 Jandraligeli kernel: [ 84.175932] wlan0: authenticate with AP 00:21:29:66:21:b0
Feb 10 21:31:54 Jandraligeli kernel: [ 84.177860] wlan0: authenticated
Feb 10 21:31:54 Jandraligeli kernel: [ 84.177872] wlan0: associate with AP 00:21:29:66:21:b0
Feb 10 21:31:54 Jandraligeli kernel: [ 84.180300] wlan0: RX AssocResp from 00:21:29:66:21:b0 (capab=0x411 status=0 aid=1)
Feb 10 21:31:54 Jandraligeli kernel: [ 84.180310] wlan0: associated
Feb 10 21:31:54 Jandraligeli kernel: [ 84.190007] ADDRCONF(NETDEV_CHANGE): wlan0: link becomes ready
Feb 10 21:32:05 Jandraligeli kernel: [ 95.032120] wlan0: no IPv6 routers present

When the notebook resumed it cut off/crashed again and I have to boot it pressing the power button.

Then I tried it with WLAN disabled:

christian@Jandraligeli:~$ tail -f /var/log/messages
Feb 10 21:35:00 Jandraligeli pulseaudio[6035]: ltdl-bind-now.c: Failed to find original dlopen loader.
Feb 10 21:35:00 Jandraligeli pulseaudio[6037]: pid.c: Stale PID file, overwriting.
Feb 10 21:35:00 Jandraligeli pulseaudio[6037]: main.c: setrlimit(RLIMIT_NICE, (31, 31)) failed: Operation not permitted
Feb 10 21:35:00 Jandraligeli pulseaudio[6037]: main.c: setrlimit(RLIMIT_RTPRIO, (9, 9)) failed: Operation not permitted
Feb 10 21:35:39 Jandraligeli kernel: [ 103.990529] iwlagn: WARNING: Requesting MAC access during RFKILL wakes up NIC
Feb 10 21:35:40 Jandraligeli kernel: [ 105.596810] iwlagn: WARNING: Requesting MAC access during RFKILL wakes up NIC
Feb 10 21:35:40 Jandraligeli kernel: [ 105.597258] psmouse.c: TouchPad at isa0060/serio...

Revision history for this message
Michael Schwartzkopff (misch) wrote :
Download full text (4.6 KiB)

Hi,

1) Suspend to RAM from KDE
Nothing (!) in /var/log/kern.log
/var/log/syslog:
Feb 11 09:38:17 mucnb004 last message repeated 2 times
Feb 11 09:38:22 mucnb004 NetworkManager: <info> Sleeping...
Feb 11 09:38:22 mucnb004 NetworkManager: <info> (eth1): now unmanaged
Feb 11 09:38:22 mucnb004 NetworkManager: <info> (eth1): device state change: 8 -> 1
Feb 11 09:38:22 mucnb004 NetworkManager: <info> (eth1): deactivating device (reason: 37).
Feb 11 09:38:22 mucnb004 NetworkManager: <info> eth1: canceled DHCP transaction, dhcp client pid 6328
Feb 11 09:38:22 mucnb004 NetworkManager: <info> (eth1): removing resolv.conf from /sbin/resolvconf
Feb 11 09:38:22 mucnb004 NetworkManager: <WARN> check_one_route(): (eth1) error -34 returned from rtnl_route_del(): Sucess
Feb 11 09:38:22 mucnb004 avahi-daemon[4808]: Withdrawing address record for 192.168.144.111 on eth1.
Feb 11 09:38:22 mucnb004 avahi-daemon[4808]: Leaving mDNS multicast group on interface eth1.IPv4 with address 192.168.144.111.
Feb 11 09:38:22 mucnb004 NetworkManager: <info> (eth1): cleaning up...
Feb 11 09:38:22 mucnb004 avahi-daemon[4808]: Interface eth1.IPv4 no longer relevant for mDNS.
Feb 11 09:38:22 mucnb004 NetworkManager: <info> (eth1): taking down device.
Feb 11 09:38:22 mucnb004 avahi-daemon[4808]: Withdrawing address record for fe80::212:f0ff:fe64:4a5f on eth1.
Feb 11 09:38:23 mucnb004 acpid: client has disconnected

I had to reboot hard by switching off the system with long power-off.

2) root enters pm-suspend. Waking up possible, but colors are horrible. Have to reboot to get colors correct.
/var/log/kern.log:
Feb 11 09:48:26 mucnb004 kernel: [ 171.954630] PM: Syncing filesystems ... done.
Feb 11 09:48:26 mucnb004 kernel: [ 171.980468] PM: Preparing system for mem sleep
Feb 11 09:48:26 mucnb004 kernel: [ 171.980472] Freezing user space processes ... (elapsed 0.00 seconds) done.
Feb 11 09:48:26 mucnb004 kernel: [ 171.981534] Freezing remaining freezable tasks ... (elapsed 0.00 seconds) done.
Feb 11 09:48:26 mucnb004 kernel: [ 171.981578] PM: Entering mem sleep
Feb 11 09:48:26 mucnb004 kernel: [ 171.981581] Suspending console(s) (use no_console_suspend to debug)
Feb 11 09:48:26 mucnb004 kernel: [ 171.983781] btusb_intr_complete: hci0 urb f5dff080 failed to resubmit (2)
Feb 11 09:48:26 mucnb004 kernel: [ 172.270059] sd 0:0:0:0: [sda] Synchronizing SCSI cache
Feb 11 09:48:26 mucnb004 kernel: [ 172.270392] sd 0:0:0:0: [sda] Stopping disk
Feb 11 09:48:26 mucnb004 kernel: [ 173.750996] ACPI handle has no context!
Feb 11 09:48:26 mucnb004 kernel: [ 173.752429] parport_pc 00:03: disabled
Feb 11 09:48:26 mucnb004 kernel: [ 173.753277] ACPI handle has no context!
Feb 11 09:48:26 mucnb004 kernel: [ 173.753286] sdhci-pci 0000:02:06.4: PCI INT C disabled
Feb 11 09:48:26 mucnb004 kernel: [ 173.753293] ACPI handle has no context!
Feb 11 09:48:26 mucnb004 kernel: [ 173.770085] ACPI handle has no context!
Feb 11 09:48:26 mucnb004 kernel: [ 173.770092] tifm_7xx1 0000:02:06.3: PCI INT B disabled
Feb 11 09:48:26 mucnb004 kernel: [ 173.770100] ACPI handle has no context!
Feb 11 09:48:26 mucnb004 kernel: [ 173.790111] ACPI handle has no context!
Feb 11 09:48:26 mucnb004 kern...

Read more...

Revision history for this message
Gergely Máté (sportember) wrote :

"When the notebook resumed it cut off/crashed again and I have to boot it pressing the power button."

I have the same symptoms as Christian Imhorst. The computer simply switches itself off on resuming from suspend.

Revision history for this message
Andy Whitcroft (apw) wrote :

@Michael, @Gergely -- can you confirm whether your hardware is the same as the original reporter, now in the bug title.

@all -- could you try and debug this further. Firstly could you boot with no_console_suspend on your kernel command line, and then follow the proceedure in https://wiki.ubuntu.com/DebuggingKernelSuspend. This may well tell us what has failed to suspend/resume. Please report back here.

Changed in linux:
status: In Progress → Incomplete
Revision history for this message
Gergely Máté (sportember) wrote :

Not exactly the same hardware: I use an Acer Extensa 5220.

What component of the system may be most related to the bug? Where happens the failure?

Revision history for this message
datenteiler (datenteiler) wrote :

@Andy: Done. File dmesg.txt ist with "no_console_suspend" option in the kernel command line of my menu.lst and dmesg1.txt is without these option.

Revision history for this message
datenteiler (datenteiler) wrote :
Stefan Bader (smb)
Changed in linux:
assignee: nobody → apw
importance: Undecided → High
status: New → Incomplete
Revision history for this message
endurans (eric-haltsoft) wrote :

Also my notebook Olivetti (most intel based) doesn't finish going to suspend state after kernel update. Screen become black but keeps turned on after that no responses from input devices, not consoles, only option is a hard reboot/shutdown.

Version signarure: Ubuntu 2.6.27-11.27-generic

Revision history for this message
Andy Whitcroft (apw) wrote :

@endurans -- as your machine is different, and fails at a different time it is best to file a separate bug on that one.

Revision history for this message
endurans (eric-haltsoft) wrote :

The problem seems be gone after installing backports pakage, while I don't know/remember now exactly what that means or if it's a common/fair solution, it works now. So please let me know if I should still create a new bug or not.

Revision history for this message
Andy Whitcroft (apw) wrote :

Ok so that has told us very little. Time to turn on some serious debugging and see what that tells us. There is a tracing mechanism described at the URL below:

    https://wiki.ubuntu.com/DebuggingKernelSuspend

If those affected by this could give that a go and report back the dmesg produced.

Revision history for this message
Peter Bui (pnutzh4x0r) wrote :

Attached is my dmesg following the debugging procedure. My machine does not complete resume. It gets to the virtual console and the cursor just blinks. I have to hold the power button down to shut it down and then restart it.

Revision history for this message
Gergely Máté (sportember) wrote :

I have done that.

From the dmesg:

[ 1.776755] Magic number: 1:940:309
[ 1.776782] tty ttyxc: hash matches
[ 1.776846] tty tty38: hash matches

Tried to figure out what those devices may be:

~$ ls -l /dev/ttyxc
crw-rw-rw- 1 root tty 3, 140 2009-03-05 04:20 /dev/ttyxc

~$ ls -l /dev/tty38
crw-rw---- 1 root dialout 4, 38 2009-03-05 04:20 /dev/tty38

The DebuggingKernelSuspend documentation says that tty38 may be the "culprit". How can I figure out more about that tty?

(I have a dialout connection through an usb hdspa modem I have had suspend issues with, but I had had physically disconnected it before "debugging".)

Revision history for this message
Andy Whitcroft (apw) wrote :

We have two different test results here:

    [ 2.668190] Magic number: 1:317:893
    [ 2.668302] tty ptyy3: hash matches

and:

    [ 1.776755] Magic number: 1:940:309
    [ 1.776782] tty ttyxc: hash matches
    [ 1.776846] tty tty38: hash matches

Both list tty devices as the culprit, if we assume that this is not a totally bogus match in either case then the tty device driver is fingered. In both cases the device mentioned is most likely not a real tty. One looks to be a master pty, the other may well be a USB tty device.

The debugging here is a little odd I would have expected to see files listed in addition to the device being hashed. I also see that the userdata is non-zero which implies we completed the resume of something, possibly in error. I have added some more trace markers to the resume side of things to see if we can better tell what is failing if you could try the kernels at the URL below and report back here. You will need to turn on the debugging as you did previously:

    http://people.ubuntu.com/~apw/lp322886-intrepid/

Andy Whitcroft (apw)
Changed in linux:
status: Incomplete → In Progress
status: In Progress → Incomplete
Revision history for this message
Richard Lee (rdlee632) wrote :

I also have a problem with suspend after upgrading the kernel. It worked well before then. Has never worked reliably since.

Revision history for this message
datenteiler (datenteiler) wrote :

For the records: After upgrade to Jaunty with Kernel 2.6.28-11-generic suspend works fine for me.

Revision history for this message
Gergely Máté (sportember) wrote :

For me, 2.6.28-11 fails the same way as 2.6.27-13.

So I went through the debug process with Andy's kernel.

[I'm sorry for doing this so late; I was a bit afraid of installing such a "custom" kernel; later I realized that I was simply paranoid :-(, as Andy had put online the sources of the changes and he is likely to work at Canonical anyway - it's just that thing that if you don't understand sg, you afraid of it - and a matter of general trust. So I am sorry.]

The exact happening is this: the computer suspends (led is blinking), on keypress the computer begins to wake up but immediately, before any video activities turns off or suspends again (led is blinking), after about 1 or 2 seconds it begins to wake again (fans are started), but after another second it remains in a steady state without video output: than turning it off is the only option. I am telling this because I can imagine that the "second wake" corrupts the data in the debug process (but I don't really understand these things).

Changed in linux (Ubuntu):
assignee: Andy Whitcroft (apw) → alf (aldanblues)
Revision history for this message
Alex Valavanis (valavanisalex) wrote :

Intrepid Ibex reached end-of-life on 30 April 2010 so I am closing the
report. Can anyone confirm whether this issue still exists in later versions of Ubuntu?

Changed in linux (Ubuntu Intrepid):
assignee: Andy Whitcroft (apw) → nobody
status: Incomplete → Invalid
Revision history for this message
Jeremy Foshee (jeremyfoshee) wrote :

This bug report was marked as Incomplete and has not had any updated comments for quite some time. As a result this bug is being closed. Please reopen if this is still an issue in the current Ubuntu release http://www.ubuntu.com/getubuntu/download . Also, please be sure to provide any requested information that may have been missing. To reopen the bug, click on the current status under the Status column and change the status back to "New". Thanks.

[This is an automated message. Apologies if it has reached you inappropriately; please just reply to this message indicating so.]

tags: added: kj-expired
Changed in linux (Ubuntu):
status: Incomplete → Expired
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.