wlan connection dies after a few minutes

Bug #555286 reported by Christophe
116
This bug affects 19 people
Affects Status Importance Assigned to Milestone
xserver-xorg-driver-ati
Invalid
Undecided
Unassigned
linux (Ubuntu)
Invalid
Undecided
Unassigned
Nominated for Lucid by Christophe

Bug Description

when I'm watching Zattoo (online tv) my wlan connection dies after a few minutes. Automatic reconnect is not possible, syslog shows the following error messages:

Apr 4 20:21:21 r60 kernel: [ 727.542280] No probe response from AP 00:1f:3f:1e:41:b5 after 500ms, disconnecting.
Apr 4 20:21:21 r60 wpa_supplicant[1048]: CTRL-EVENT-DISCONNECTED - Disconnect event - remove keys
Apr 4 20:21:21 r60 NetworkManager: <info> (wlan0): supplicant connection state: completed -> disconnected
Apr 4 20:21:21 r60 NetworkManager: <info> (wlan0): supplicant connection state: disconnected -> scanning
Apr 4 20:21:22 r60 kernel: [ 728.212555] iwl3945 0000:03:00.0: Error sending REPLY_RXON: time out after 500ms.
Apr 4 20:21:22 r60 kernel: [ 728.212565] iwl3945 0000:03:00.0: Error setting new configuration (-110).
(...)

As a workaround I have to switch wlan off and on.

ProblemType: KernelOops
DistroRelease: Ubuntu 10.04
Package: linux-image-2.6.32-19-generic 2.6.32-19.28
Regression: No
Reproducible: Yes
ProcVersionSignature: Ubuntu 2.6.32-19.28-generic 2.6.32.10+drm33.1
Uname: Linux 2.6.32-19-generic x86_64
AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.21.
Annotation: Your system might become unstable now and might need to be restarted.
Architecture: amd64
ArecordDevices:
 **** List of CAPTURE Hardware Devices ****
 card 0: Intel [HDA Intel], device 0: AD198x Analog [AD198x Analog]
   Subdevices: 1/1
   Subdevice #0: subdevice #0
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC0: christophe 1591 F.... pulseaudio
CRDA: Error: [Errno 2] No such file or directory
Card0.Amixer.info:
 Card hw:0 'Intel'/'HDA Intel at 0xee400000 irq 17'
   Mixer name : 'Analog Devices AD1981'
   Components : 'HDA:11d41981,17aa2025,00100200'
   Controls : 20
   Simple ctrls : 11
Card29.Amixer.info:
 Card hw:29 'ThinkPadEC'/'ThinkPad Console Audio Control at EC reg 0x30, fw 7CHT22WW-1.10'
   Mixer name : 'ThinkPad EC 7CHT22WW-1.10'
   Components : ''
   Controls : 2
   Simple ctrls : 1
Card29.Amixer.values:
 Simple mixer control 'Console',0
   Capabilities: pvolume pvolume-joined pswitch pswitch-joined penum
   Playback channels: Mono
   Limits: Playback 0 - 14
   Mono: Playback 14 [100%] [on]
Date: Sun Apr 4 20:25:12 2010
Failure: oops
HibernationDevice: RESUME=UUID=f0ac1718-219d-4794-b568-07a1df820bfc
MachineType: LENOVO 9461DXG
PccardctlIdent:
 Socket 0:
   no product info available
PccardctlStatus:
 Socket 0:
   no card
ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.32-19-generic root=UUID=76a2ac5d-a68d-4595-8a11-e60177aff67e ro quiet splash
RelatedPackageVersions: linux-firmware 1.33
SourcePackage: linux
Title: WARNING: at /build/buildd/linux-2.6.32/net/wireless/core.c:614 wdev_cleanup_work+0xe9/0x110 [cfg80211]()
dmi.bios.date: 08/28/2009
dmi.bios.vendor: LENOVO
dmi.bios.version: 7CETD3WW (2.23 )
dmi.board.name: 9461DXG
dmi.board.vendor: LENOVO
dmi.board.version: Not Available
dmi.chassis.asset.tag: No Asset Information
dmi.chassis.type: 10
dmi.chassis.vendor: LENOVO
dmi.chassis.version: Not Available
dmi.modalias: dmi:bvnLENOVO:bvr7CETD3WW(2.23):bd08/28/2009:svnLENOVO:pn9461DXG:pvrThinkPadR60:rvnLENOVO:rn9461DXG:rvrNotAvailable:cvnLENOVO:ct10:cvrNotAvailable:
dmi.product.name: 9461DXG
dmi.product.version: ThinkPad R60
dmi.sys.vendor: LENOVO

Revision history for this message
Christophe (christophe-wk3) wrote :
Revision history for this message
quantenemitter (quantenemitter) wrote :

In my case, the problem appears on all encryption methods (unencrypted, WEP, WPA2) and on two differend APs. The more traffic I produce, the sooner the connection breaks up. On heavy usage, it dies after a few seconds.

I'm using Ubuntu Lucid and iwl3945.

summary: - Kernel crash when switching off wlan after wlan hangup (Thinkpad R60,
- iwl3945)
+ Kernel crash when switching off wlan after wlan hangup (iwl3945)
Revision history for this message
manatorg (manatorg) wrote : Re: Kernel crash when switching off wlan after wlan hangup (iwl3945)

Hi,

Same Problem here. Network connection dies when i have much traffic. only thing that helps is disabling wireless network and reenabling it. but only for a few minutes.
problem appeared with update to lucid. all other devices in the net work find. i tried channel 3 und 13.

system is a thinkpad r60. This appears in syslog:
[15196.564183] iwl3945 0000:03:00.0: Error sending REPLY_RXON: enqueue_hcmd failed: -28
[15196.564189] iwl3945 0000:03:00.0: Error setting new configuration (-28).
[15196.564194] iwl3945 0000:03:00.0: No space for Tx

best regards,

Revision history for this message
Tom Morton (tomm) wrote :

I'm seeing the same problem on my Thinkpad T60, iwl3945.

Revision history for this message
Gabe Gorelick (gabegorelick) wrote :

Thank you for taking the time to report this bug and helping to make Ubuntu better. This particular bug has already been reported and is a duplicate of bug 528688, so it is being marked as such. Please look at the other bug report to see if there is any missing information that you can provide, or to see if there is a workaround for the bug. Additionally, any further discussion regarding the bug should occur in the other report. Please continue to report any other bugs you may find.

Changed in linux (Ubuntu):
status: New → Confirmed
Revision history for this message
Eric (weisscoder) wrote :

I don't believe this qualifies as a duplicate of bug 528688 at all. This deals with the network connection dropping during heaving network traffic, specifically using iwl3945. The kernel crash mentioned in the original report only occurs during a workaround involving reloading iwl3945, and while that may be a duplicate of 528688, it is a separate issue from the main problem here.

Revision history for this message
Adrian (urobythos) wrote : Re: [Bug 555286] Re: Kernel crash when switching off wlan after wlan hangup (iwl3945)
Download full text (7.3 KiB)

To be honest, I don't even remember this bug however what happens is that
when a bug is reported using apport, it basically just gives you a list of
bugs with the exact same title to choose from to determine whether to report
it or not. I basically just chose the first one in the list because the
complete title was likely cut off and up to a point they are identical (in
title).
Sorry for the inconvenience.

Cheers

On Mon, Apr 19, 2010 at 4:39 PM, Eric <email address hidden> wrote:

> *** This bug is a duplicate of bug 528688 ***
> https://bugs.launchpad.net/bugs/528688
>
> I don't believe this qualifies as a duplicate of bug 528688 at all.
> This deals with the network connection dropping during heaving network
> traffic, specifically using iwl3945. The kernel crash mentioned in the
> original report only occurs during a workaround involving reloading
> iwl3945, and while that may be a duplicate of 528688, it is a separate
> issue from the main problem here.
>
> --
> Kernel crash when switching off wlan after wlan hangup (iwl3945)
> https://bugs.launchpad.net/bugs/555286
> You received this bug notification because you are a direct subscriber
> of a duplicate bug (via bug 528688).
>
> Status in “linux” package in Ubuntu: Confirmed
>
> Bug description:
> Hi,
>
> when I'm watching Zattoo (online tv) my wlan connection dies after a few
> minutes. Automatic reconnect is not possible, syslog shows the following
> error messages:
>
> Apr 4 20:21:21 r60 kernel: [ 727.542280] No probe response from AP
> 00:1f:3f:1e:41:b5 after 500ms, disconnecting.
> Apr 4 20:21:21 r60 wpa_supplicant[1048]: CTRL-EVENT-DISCONNECTED -
> Disconnect event - remove keys
> Apr 4 20:21:21 r60 NetworkManager: <info> (wlan0): supplicant connection
> state: completed -> disconnected
> Apr 4 20:21:21 r60 NetworkManager: <info> (wlan0): supplicant connection
> state: disconnected -> scanning
> Apr 4 20:21:22 r60 kernel: [ 728.212555] iwl3945 0000:03:00.0: Error
> sending REPLY_RXON: time out after 500ms.
> Apr 4 20:21:22 r60 kernel: [ 728.212565] iwl3945 0000:03:00.0: Error
> setting new configuration (-110).
> (...)
>
> As a workaround I have to switch wlan off and on, while doing this in one
> of ten cases I get the kernel oops I'm reporting here. The Problems are
> probably related.
>
> Apr 4 20:25:06 r60 kernel: [ 953.043563] ------------[ cut here
> ]------------
> Apr 4 20:25:06 r60 kernel: [ 953.043599] WARNING: at
> /build/buildd/linux-2.6.32/net/wireless/core.c:614
> wdev_cleanup_work+0xe9/0x110 [cfg80211]()
> Apr 4 20:25:06 r60 kernel: [ 953.043604] Hardware name: 9461DXG
> Apr 4 20:25:06 r60 kernel: [ 953.043608] Modules linked in: cryptd
> aes_x86_64 aes_generic binfmt_misc rfcomm ppdev sco bridge stp bnep l2cap
> vboxnetadp vboxnetflt vboxdrv joydev dm_crypt snd_hda_codec_analog pcmcia
> usblp arc4 snd_hda_intel snd_hda_codec snd_hwdep snd_pcm_oss snd_mixer_oss
> snd_pcm snd_seq_dummy snd_seq_oss snd_seq_midi snd_rawmidi
> snd_seq_midi_event iwl3945 snd_seq iwlcore snd_timer snd_seq_device
> yenta_socket mac80211 btusb bluetooth thinkpad_acpi led_class nvram
> rsrc_nonstatic lp pcmcia_core psmouse serio_raw cfg80211 snd parport
> soundcore...

Read more...

Revision history for this message
Gabe Gorelick (gabegorelick) wrote : Re: Kernel crash when switching off wlan after wlan hangup (iwl3945)

I originally marked it as a a duplicate since the kernel traces are similar, but since you claim the crash is not the main issue here, I've unmarked it.

Revision history for this message
Øyvind Stegard (oyvindstegard) wrote :

Got some problems with iwl3945 wireless driver on a Lenovo Thinkpad Z61m. Connection drops, kernel log fills up with "No space for Tx" & friends, and the only way to get re-connected is to disable/enable wireless network (I do that through NetworkManager, kill switch works as well). There is no kernel crash, though. Problem seems to be more easily triggered with certain network traffic patterns like streaming video. linux-image-2.6.32-21-generic [2.6.32-21.32].

[...]
[19675.572084] iwl3945 0000:03:00.0: Error setting new configuration (-28).
[19675.572093] iwl3945 0000:03:00.0: No space for Tx
[...]

Revision history for this message
Øyvind Stegard (oyvindstegard) wrote :

This appears in kernel log when connection drops and the problem starts:
[...]
[19523.500338] No probe response from AP <HWADDR> after 500ms, disconnecting.
[...]

Revision history for this message
Øyvind Stegard (oyvindstegard) wrote :

Now a warning related to wireless appeared:

[21614.038862] CE: hpet increasing min_delta_ns to 864956 nsec
[...]
[21794.480386] iwl3945 0000:03:00.0: Error setting new configuration (-110).
[21794.980057] iwl3945 0000:03:00.0: Error sending REPLY_SCAN_ABORT_CMD: time out after 500ms.
[...]
[21794.983524] ------------[ cut here ]------------
[21794.983559] WARNING: at /build/buildd/linux-2.6.32/net/wireless/core.c:614 wdev_cleanup_work+0xa7/0xd0 [cfg80211]()
[21794.983566] Hardware name: 94503HG
[21794.983569] Modules linked in: binfmt_misc rfcomm sco bridge stp ppdev bnep l2cap dm_crypt snd_hda_codec_analog snd_hda_intel snd_hda_codec snd_hwdep thinkpad_acpi arc4 snd_pcm_oss snd_mixer_oss snd_pcm snd_seq_dummy snd_seq_oss snd_seq_midi snd_rawmidi snd_seq_midi_event snd_seq iwl3945 pcmcia joydev snd_timer snd_seq_device yenta_socket iwlcore btusb sdhci_pci snd tifm_7xx1 rsrc_nonstatic bluetooth mac80211 soundcore pcmcia_core sdhci tifm_core lp psmouse led_class snd_page_alloc cfg80211 nvram parport serio_raw fbcon tileblit font bitblit softcursor vga16fb vgastate radeon ttm drm_kms_helper ohci1394 drm intel_agp video ieee1394 i2c_algo_bit tg3 ahci output agpgart
[21794.983687] Pid: 9, comm: events/0 Not tainted 2.6.32-21-generic #32-Ubuntu
[21794.983692] Call Trace:
[21794.983707] [<c014c3d2>] warn_slowpath_common+0x72/0xa0
[21794.983723] [<f8094ea7>] ? wdev_cleanup_work+0xa7/0xd0 [cfg80211]
[21794.983737] [<f8094ea7>] ? wdev_cleanup_work+0xa7/0xd0 [cfg80211]
[21794.983745] [<c014c41a>] warn_slowpath_null+0x1a/0x20
[21794.983760] [<f8094ea7>] wdev_cleanup_work+0xa7/0xd0 [cfg80211]
[21794.983769] [<c016369e>] run_workqueue+0x8e/0x150
[21794.983785] [<f8094e00>] ? wdev_cleanup_work+0x0/0xd0 [cfg80211]
[21794.983793] [<c01637e4>] worker_thread+0x84/0xe0
[21794.983801] [<c0167740>] ? autoremove_wake_function+0x0/0x50
[21794.983809] [<c0163760>] ? worker_thread+0x0/0xe0
[21794.983816] [<c01674b4>] kthread+0x74/0x80
[21794.983823] [<c0167440>] ? kthread+0x0/0x80
[21794.983831] [<c0104087>] kernel_thread_helper+0x7/0x10
[21794.983836] ---[ end trace f5d3de5d7fcb7546 ]---

About the hpet timer messages before, I've never seen it go that high before..

Revision history for this message
Michael Gasperl (michael-gasperl) wrote :

After updating my T60 to Lucid Lynx i have the same problem with lost connections. After a few minutes the network-manager loses the connection and cannot connect again until i deactivate and activate the network.
Is there any solution or workaround? Could this problem be solved with Wicd?

Revision history for this message
Timo Derstappen (teemow) wrote :

Today I installed the mainline kernel 2.6.34 and the problem with iwl3945 disappeared.
http://kernel.ubuntu.com/~kernel-ppa/mainline/v2.6.34-rc6-lucid/

I'm using a Lenovo R60 notebook.

Revision history for this message
Michael Gasperl (michael-gasperl) wrote :

@Timo: Thanks very much, I installed it, too -- and the problem also disappeared on my Thinkpad T60 - everything fine again :-)

Revision history for this message
Mark K (weebyte) wrote :

I have the exact same issue. I am running a Dell Mini 9 and installing the mainline kernel (http://kernel.ubuntu.com/~kernel-ppa/mainline/v2.6.34-rc6-lucid/) did not fix the issue for me.

Revision history for this message
Klaus Layer (klaus-layer) wrote :

Experience the same problem with my T60 with iwl3945 since upgrading to 10.04.

Revision history for this message
Øyvind Stegard (oyvindstegard) wrote :

I've now seen this issue even on 2.6.34-rc7 mainline (not PPA, I compiled it myself, using Ubuntu config and build scripts).

Revision history for this message
In , JasonPorter (jasonporter) wrote :

Several Launchpad bug reports in process report radeon KMS conflicts with Intel wifi and audio output, causing wireless to hang/drop until power cycle and audio to crackle during any high GPU load. Disabling KMS as a workaround returns normal system operation.

Relevant Launchpad reports:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/564376
https://bugs.launchpad.net/ubuntu/+source/alsa-driver/+bug/578342
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/571770
And one on Bugzilla:
https://bugzilla.kernel.org/show_bug.cgi?id=15912

My own hardware is a Thinkpad T60 with Radeon X1400 and Intel wireless (iwl3945 driver) running vanilla Ubuntu 10.04. Users of Radeon X1250, X1300, and Xpress 200M chipsets have also reported the same behavior, on Lenovo, Dell and LG laptops. Using a mainline kernel does not change the behavior.

This may possibly be due to a difference in PCI configuration between KMS and UMS, they use different IRQs for "Pin A" as detailed in the Bugzilla report linked above.

Thanks for any assistance you can provide! And thanks for all your hard work!

Revision history for this message
JasonPorter (jasonporter) wrote : Re: Kernel crash when switching off wlan after wlan hangup (iwl3945)

Does the issue still occur with radeon KMS disabled? I suspect this may be related to similar problems other users are having, the common denominator in those cases seems to be Radeon video with KMS enabled by default. I have linked the bug reports upstream on those issues, but it would be worthwhile to see if this is related.

Set radeon.modeset=0 on kernel load to test.

Revision history for this message
In , Øyvind Stegard (oyvindstegard) wrote :

Confirming this one, Thinkpad Z61m, ATI X1400. However, the IRQ difference does not seem to matter for the audio problems (I've tested with radeon KMS both with and without MSI, which is the difference between the two PCI configs listed in bug at https://bugzilla.kernel.org/show_bug.cgi?id=15912).

Revision history for this message
In , agd5f (agd5f) wrote :

Is there an option in your bios to assign different irqs to different pci devices? Some systems set every device to the same irq.

Revision history for this message
In , Maarten Fonville (maarten-fonville) wrote :

I don't think it is directly an IRQ issue that can be solved in the BIOS.
Because on my girlfriend's laptop which is also hit by this problem the radeon takes IRQ 17 with IO-APIC-fasteoi and hda_intel takes IRQ 24 with PCI-MSI-edge

Revision history for this message
In , Maarten Fonville (maarten-fonville) wrote :

(In reply to comment #3)
> I don't think it is directly an IRQ issue that can be solved in the BIOS.
> Because on my girlfriend's laptop which is also hit by this problem the radeon
> takes IRQ 17 with IO-APIC-fasteoi and hda_intel takes IRQ 24 with PCI-MSI-edge

Actually, before booting the kernel itself (thus it can not be seen in DMESG) there is the message that starts with:
pci 0000:00:00.0: address space collision [..more stuff here..]
Just like in this mail I believe: http://lkml.org/lkml/2010/3/12/92

I don't know whether this could be relevant.

Revision history for this message
Bernhard (bernhard-kornacker) wrote : Re: Kernel crash when switching off wlan after wlan hangup (iwl3945)

radeon.modeset=0 seems to work around this issue

Revision history for this message
Colin (colin-macdonald) wrote :

radeon.modeset=0 works for me as well.

Revision history for this message
In , Øyvind Stegard (oyvindstegard) wrote :

I have no messages about address space collisions in kernel boot log. I'll add my hardware info to this bug tomorrow (interrupts, PCI, dmesg, Xorg, etc).

Revision history for this message
In , JasonPorter (jasonporter) wrote :

This appears to also be causing kernel crashes on some systems when wifi is powered off using the hardware switch. Disabling radeon KMS causes the crash behavior to disappear. I will inform devs in the upstream bug reports on that issue.

Reported in https://bugs.launchpad.net/ubuntu/+source/linux/+bug/555286

Revision history for this message
In , JasonPorter (jasonporter) wrote :

(In reply to comment #6)

Oops, nevermind... I'm already upstream. Too many tabs open, sorry.

Revision history for this message
In , agd5f (agd5f) wrote :

Is there an option in your bios to assign different irqs to different pci
devices? Some systems set every device to the same irq. If so, please try changing the setting to auto, or select different irqs for each device and see if that helps. Also, please try both with and without msi enabled (boot with pci=nomsi).

Revision history for this message
JasonPorter (jasonporter) wrote : Re: Kernel crash when switching off wlan after wlan hangup (iwl3945)

I will let the radeon devs know in the upstream bug report... it's here if anyone is interested: https://bugs.freedesktop.org/show_bug.cgi?id=28106

Revision history for this message
In , Øyvind Stegard (oyvindstegard) wrote :

(In reply to comment #8)
> Is there an option in your bios to assign different irqs to different pci
> devices? Some systems set every device to the same irq. If so, please try
> changing the setting to auto, or select different irqs for each device and see
> if that helps. Also, please try both with and without msi enabled (boot with
> pci=nomsi).

Yes. However, the list looks very uninformative. It's under PCI config, and basically contains just INTA-> 11, INTB -> 11, INTC -> 11, and so on. Tried setting to Auto-select on all entries (instead of 11). System booted OK, but it didn't help (on KMS+audio problems). Tried assigning sequentially from IRQ 3 and up, but then I got a really loud Thinkpad-style alarm beep, system didn't get past POST, and BIOS informing that network controller was missing IRQ. So obviously I switched back to default settings. The /proc/interrupts list didn't really look any different with BIOS-autoconfig for PCI IRQs (IIRC).

Booting with option pci=nomsi does not help at all, even though it definitely affects IRQ config, since /proc/interrupts contains no MSI-entries when booting with this option.

I will now be attaching some info for system running 2.6.34 kernel on Ubuntu Lucid x86 with ATIX1400 (KMS-mode). System has severe audio glitching with KMS, and no glitching at all in UMS mode.

Revision history for this message
In , Øyvind Stegard (oyvindstegard) wrote :

Created an attachment (id=35734)
Kernel boot log

Audio glitching reproduced immediately after logging in to X session.

Revision history for this message
In , Øyvind Stegard (oyvindstegard) wrote :

Created an attachment (id=35735)
Contents of /proc/interrupts

Revision history for this message
In , Øyvind Stegard (oyvindstegard) wrote :

Created an attachment (id=35736)
Contents of /proc/interrupts with MSI disabled

Does not resolve issue.

Revision history for this message
In , Øyvind Stegard (oyvindstegard) wrote :

Created an attachment (id=35737)
PCI device config

Revision history for this message
In , Øyvind Stegard (oyvindstegard) wrote :

Created an attachment (id=35738)
Xorg startup log

Revision history for this message
In , Øyvind Stegard (oyvindstegard) wrote :

The test setup I use to quickly reproduce and verify that the problem is there:

1) Play a pure sound which easily revelase playback glitches:
$ gst-launch-0.10 audiotestsrc ! pulsesink

2) Play some video (doesn't matter what) with MPlayer, using plain x11 output and no sound:
$ mplayer /path/to/some/movie.avi -vo x11 -zoom -nosound

3) Test audio starts glitching and the glitching becomes worse if I put the video in fullscreen.

And some observations about test:

1) There is not much load on system during test (MPlayer uses around 40% CPU, Xorg floats under 12% CPU, for a 1024x576 video with no sound).

2) There is a lot less audio glitching if I run things in a completely composited environment (e.g. Compiz with no unredirection for fullscreen windows). If I *do* unredirect fullscreen windows with Compiz the glitching becomes worse in fullscreen.

3) There is alomst no glitching at all if using XV for video playback in the test, instead of plain old x11. Obviously that's not going to help for apps that don't use XV, like fullscreen Flash video streaming or any affected non-video app.

4) The glitching is worse when video window is full screen.

5) HDA intel driver typically always reports that IRQ timing work-around has been activated. This doesn't happen in UMS-mode.

And some observations not just related to the specific test setup:
1) Flash fullscreen video playback causes more severe audio glitches if *not* running composited in fullscreen, for instance under Compiz with "Unredirect fullscreen windows" enabled. This is what typically also takes down wireless (just happened now with current setup, as I was testing Flash and writing this).

2) Flash fullscreen video causes glitches even when redirected in composited env.

3) Flash doesn't use XV, seems to correspond well with MPlayer -vo x11 being much worse than when using -vo xv.

4) There are no playback issues with video during any of these tests, the video is smooth, system not overloaded. And besides, the test sound generator requires almost no resources at all.

5) [ 1535.437114] CE: hpet increased min_delta_ns to 7500 nsec
Don't really know the meaning of this one, but it typically appears during problems with wireless and/or audio when running under KMS. Kernel compensating for what is seen as accuracy issues with system timing ?

6) It doesn't have to be MPlayer-vo-x11 or Flash that triggers problem, I just use them because they so easily reproduce it. I have heard audio popping when scrolling Firefox pages or moving windows around, and I've managed to take down wireless when launching Neverball (OpenGL-game).

7) No matter what IRQ config or snd-hda-intel options I test, the problems are always there with KMS and disappear with UMS.

Revision history for this message
In , JasonPorter (jasonporter) wrote :

To add my own results... booting with pci=nomsi doesn't seem to have an effect on my system (Thinkpad T60). I have the same PCI options in BIOS as Øyvind reports, all are set to "11" by default but changing to Auto seems to have very little effect on actual behavior after boot. I will test further to see if there are any other differences with MSI off, but so far it doesn't seem to have helped.

Revision history for this message
In , agd5f (agd5f) wrote :

Does booting with radeon.disp_priority=1 help?

Revision history for this message
In , agd5f (agd5f) wrote :

Apply the settings with a cold boot.

Revision history for this message
In , Øyvind Stegard (oyvindstegard) wrote :

(In reply to comment #17)
> Does booting with radeon.disp_priority=1 help?

Does not help on my hardware. Tested with kernel 2.6.34, cold boot.

Revision history for this message
In , JasonPorter (jasonporter) wrote :

(In reply to comment #17)
> Does booting with radeon.disp_priority=1 help?

For some reason, passing that option at boot seems to disable KMS on my system. I've tried it both without a radeon.modeset declaration, and with radeon.modeset=1, and in both cases Xorg.0.log shows KMS to be disabled when radeon.disp_priority=1 is on.

Revision history for this message
In , agd5f (agd5f) wrote :

(In reply to comment #20)
> (In reply to comment #17)
> > Does booting with radeon.disp_priority=1 help?
>
> For some reason, passing that option at boot seems to disable KMS on my system.
> I've tried it both without a radeon.modeset declaration, and with
> radeon.modeset=1, and in both cases Xorg.0.log shows KMS to be disabled when
> radeon.disp_priority=1 is on.

If your kernel is too old, the option is not valid and the module won't load. See modinfo radeon to verify.

Revision history for this message
In , Øyvind Stegard (oyvindstegard) wrote :

I've noticed that the audio glitches become a lot worse when many windows are open and running Compiz (I did an artificial test). I can open lots of windows on one desktop, switch back to an empty desktop, open a single window there and trigger audio drop-outs (simple test sound) just by toggling maximization state of that single window, even though all the other windows are not in view and system load is close to nil. Basically, most Compiz-operations besides simple window movement will cause glitches.

Interestingly, I might have pushed things too far, since Compiz crashed with this message:
drmRadeonCmdBuffer: -12. Kernel failed to parse or rejected command stream. See dmesg for more info.

Kernel log contained this:
[ 543.577306] [drm:radeon_cs_ioctl] *ERROR* Failed to parse relocation -12!
[ 574.532539] [drm:radeon_cs_ioctl] *ERROR* Failed to parse relocation -12!
[ 742.437808] [drm:radeon_cs_ioctl] *ERROR* Failed to parse relocation -12!

* This was all with radeon.disp_priority=1, cannot say whether that mattered or not.
* Don't know the consequences of using the Ubuntu default Xorg-driver/libdrm/DRI-stuff together with 2.6.34 mainline kernel DRM. I haven't noticed any bad things in particular during normal usage (quite the contrary, KMS performance with 2.6.34 seems better), except for the issues at hand of course.
* The version of the radeon module in the default Ubuntu kernel does not have disp_priority option.

Revision history for this message
In , JasonPorter (jasonporter) wrote :

(In reply to comment #21)
> If your kernel is too old, the option is not valid and the module won't load.
> See modinfo radeon to verify.

I'm running the standard 2.6.32-22-generic that is current in Ubuntu Lucid.

Revision history for this message
In , Michel Dänzer (michel-daenzer) wrote :

Has it been considered that this might be due to interrupt latency caused by radeon KMS? E.g. spending too much time in the IRQ handler or unnecessarily running it with other IRQs disabled.

Revision history for this message
In , Øyvind Stegard (oyvindstegard) wrote :

Large blocks of screen updates seems to be most problematic. I can take down wireless by scrolling through a maximized gnome-terminal, for instance.. Same thing will fullscreen Flash and mplayer -fs -vo x11 ..

Revision history for this message
In , Łukasz Krotowski (lukasz-krotowski) wrote :

It seems that problem does not occur in current drm-radeon-testing (9e67e5b1a6fd4bdca48a9c267386afb236d08783). At least sound does not skips.

Revision history for this message
Fabi (fabsi-deactivatedaccount) wrote : Re: Kernel crash when switching off wlan after wlan hangup (iwl3945)

the problem still exist on Maverick + current xorg-edgers (with and without gallium)

Revision history for this message
In , Maarten Fonville (maarten-fonville) wrote :

(In reply to comment #26)
> It seems that problem does not occur in current drm-radeon-testing
> (9e67e5b1a6fd4bdca48a9c267386afb236d08783). At least sound does not skips.

I myself don't have the possibility to test this at the moment. I will only have access to the laptop involved in about 3 weeks.
But maybe Oyvind could this the drm-radeon-testing branch on his laptop?

And if it does solve the problem, we should bisect to find out what does solve this problem.

Revision history for this message
In , Øyvind Stegard (oyvindstegard) wrote :

(In reply to comment #27)
> (In reply to comment #26)
> > It seems that problem does not occur in current drm-radeon-testing
> > (9e67e5b1a6fd4bdca48a9c267386afb236d08783). At least sound does not skips.
>
> I myself don't have the possibility to test this at the moment. I will only
> have access to the laptop involved in about 3 weeks.
> But maybe Oyvind could this the drm-radeon-testing branch on his laptop?
>
> And if it does solve the problem, we should bisect to find out what does solve
> this problem.

Assuming you mean this branch:
http://git.kernel.org/?p=linux/kernel/git/airlied/drm-2.6.git;a=shortlog;h=refs/heads/drm-radeon-testing

I compiled the branch head and gave it a spin. KMS performance is noticeably better compared to 2.6.32(+drm2.6.33)-kernel in Ubuntu. However I only had to open 20-25 windows and do some jiggly Compiz effects to cause:
1. Severe audio skipping/crackling whenever something was moving, changing, minimizing, maximizing, etc.
2. Wireless was extremely unstable, in fact I add to boot back into Lucid kernel to write this comment, because it kept falling down. Of course, that might just be the kernel itself, I don't know.

Conclusion is that nothing is better on my hardware with this kernel, except the KMS performance (Compiz feels somewhat snappier when window count is high).

So it's back to good old UMS and DFS-corruption for me :) :/ ...

Revision history for this message
In , Flockmock (flockmock) wrote :

Hi,
I am affected by this bug as well and tried to bisect it. My bisect log:

git bisect start
# good: [60b341b778cc2929df16c0a504c91621b3c6a4ad] Linux 2.6.33
git bisect good 60b341b778cc2929df16c0a504c91621b3c6a4ad
# bad: [57d54889cd00db2752994b389ba714138652e60c] Linux 2.6.34-rc1
git bisect bad 57d54889cd00db2752994b389ba714138652e60c
# good: [47871889c601d8199c51a4086f77eebd77c29b0b] Merge branch 'master' of /home/davem/src/GIT/linux-2.6/
git bisect good 47871889c601d8199c51a4086f77eebd77c29b0b
# bad: [1154fab73ccbab010cfaa272b6987c624cfd63c6] SLUB: Fix per-cpu merge conflict
git bisect bad 1154fab73ccbab010cfaa272b6987c624cfd63c6
# good: [3e9cc2f3b7ddabbbfc9abd043887030c669380aa] firewire: ohci: add module parameter to activate quirk fixes
git bisect good 3e9cc2f3b7ddabbbfc9abd043887030c669380aa
# good: [5619c28061ff9d2559a93eaba492935530f2a513] x86: Convert i8259_lock to raw_spinlock
git bisect good 5619c28061ff9d2559a93eaba492935530f2a513

so there are ~800 commits left and now every kernel will fail to boot due to some SCSI issues, I tried many kernels "by hand" via 'git visualize'. Can anybody track this further down, perhaps someone with no SCSI devices in their machines? ;)
thanks a lot and we should really get this regression, 2.6.35-rc5 is affected as well.
cheers, florian

Revision history for this message
In , Ben Hutchings (benh-debian) wrote :

I have one user report that this was fixed between Debian kernel versions 2.6.32-15 and 2.6.32-18, which have radeon drivers from 2.6.33.5 and 2.6.33.7 respectively. So one of the changes to output handling in there may have fixed this.

Revision history for this message
In , Ben Hutchings (benh-debian) wrote :

(In reply to comment #30)
> I have one user report that this was fixed between Debian kernel versions
> 2.6.32-15 and 2.6.32-18, which have radeon drivers from 2.6.33.5 and 2.6.33.7
> respectively. So one of the changes to output handling in there may have fixed
> this.

Sorry, the latter version is actually 2.6.33.6. There aren't many radeon changes between these:

commit f417b91c30e84e759d395f45d524eeee95250822
Author: Dave Airlie <email address hidden>
Date: Sat May 29 06:50:37 2010 +1000

    drm/radeon: fix the r100/r200 ums block 0 page fix

commit 1a0c0aa4945dfa8ac3adc2818e166b40eb5dc346
Author: Dave Airlie <email address hidden>
Date: Wed Feb 24 17:17:13 2010 +1000

    drm/radeon: r100/r200 ums: block ability for userspace app to trash 0 page a
nd beyond

commit 7840726875499c2e4b195776f2a0090935d33f39
Author: Alex Deucher <email address hidden>
Date: Tue May 18 00:23:15 2010 -0400

    drm/radeon/kms/atom: fix typo in LVDS panel info parsing

commit 66ff9ff4525f96b24867f734d99950b5d654f76b
Author: Alex Deucher <email address hidden>
Date: Tue May 18 19:26:46 2010 -0400

    drm/radeon/kms: reset ddc_bus in object header parsing

Revision history for this message
In , agd5f (agd5f) wrote :

(In reply to comment #31)
>
> Sorry, the latter version is actually 2.6.33.6. There aren't many radeon
> changes between these:

None of these are really likely to be related to this bug; the first 2 are ums, and the other two were fairly specific bug fixes. I suspect if that kernel update does fix this bug the change is probably in the wifi or sound drivers or possibly something else in the kernel.

Revision history for this message
In , Christian-speckner (christian-speckner) wrote :

I'd like to chime in here: I am on a Thinkpad T60 (core duo -> 32 bit) with a radeon X1300 and switched to KMS + gallium recently and, while everything is working remarkably nice and stable in general (big kudos to the developers), I've got some negative side effects which look like the IRQ issues observed by the others in this thread to me. Symptoms are:

1) SATA is being reconfigured and even reset from time to time. This can be reproduced reliably by switiching VTs which causes

ata1.00: configured for UDMA/133
ata1: EH complete

to appear in dmesg most of the time. Under heavy GPU load, the link will be reset from time to time:

ata1: hard resetting link
ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ata1.00: ACPI cmd ef/02:00:00:00:00:a0 (SET FEATURES) succeeded
ata1.00: ACPI cmd f5/00:00:00:00:00:a0 (SECURITY FREEZE LOCK) filtered out
ata1.00: ACPI cmd ef/10:03:00:00:00:a0 (SET FEATURES) filtered out
ata1.00: ACPI cmd ef/02:00:00:00:00:a0 (SET FEATURES) succeeded
ata1.00: ACPI cmd f5/00:00:00:00:00:a0 (SECURITY FREEZE LOCK) filtered out
ata1.00: ACPI cmd ef/10:03:00:00:00:a0 (SET FEATURES) filtered out
ata1.00: configured for UDMA/133
ata1.00: configured for UDMA/133
ata1: EH complete

Very occasionally, the controller will lock for some seconds before the reset occurs.

2) Under heavy GPU load, sound (intel HDA) has a tendency to crackle. I am not using a sound daemon, just plain alsa. The "snd-hda-intel timing workaround activated, blablabla..." message appears in dmesg.

3) I get NMIs from time to time (however, I don't see how those might be related):

Uhhuh. NMI received for unknown reason b1 on CPU 0.

I am running gentoo (mostly stable) with kernel 2.6.35 (gentoo patchset), xf86-video-ati 6.13.1, mesa bleeding-edge git (master), xorg 1.7.7 and libdrm git (master again). I've been trying different versions of everything (including vanilla 2.6.36-rc1) as well as different preemtion settings without any effect on those symptoms. As I never had such issues before switching to KMS, I stronly suspect KMS to be the culprit (however, SATA reconfiguration messages _have_ appeared before from time to time, so it might be that KMS is aggrevating a timing glitch already present in the hardware). To me, this looks somewhat like interrupts being masked overly long somewhere in the KMS code.

Revision history for this message
In , Christian-speckner (christian-speckner) wrote :

I might add that disabling MSI has no effect for me either.

Revision history for this message
In , Christian-speckner (christian-speckner) wrote :
Download full text (3.4 KiB)

Also adding lspci

00:00.0 Host bridge: Intel Corporation Mobile 945GM/PM/GMS, 943/940GML and 945GT Express Memory Controller Hub (rev 03)
00:01.0 PCI bridge: Intel Corporation Mobile 945GM/PM/GMS, 943/940GML and 945GT Express PCI Express Root Port (rev 03)
00:1b.0 Audio device: Intel Corporation 82801G (ICH7 Family) High Definition Audio Controller (rev 02)
00:1c.0 PCI bridge: Intel Corporation 82801G (ICH7 Family) PCI Express Port 1 (rev 02)
00:1c.1 PCI bridge: Intel Corporation 82801G (ICH7 Family) PCI Express Port 2 (rev 02)
00:1c.2 PCI bridge: Intel Corporation 82801G (ICH7 Family) PCI Express Port 3 (rev 02)
00:1c.3 PCI bridge: Intel Corporation 82801G (ICH7 Family) PCI Express Port 4 (rev 02)
00:1d.0 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI Controller #1 (rev 02)
00:1d.1 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI Controller #2 (rev 02)
00:1d.2 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI Controller #3 (rev 02)
00:1d.3 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI Controller #4 (rev 02)
00:1d.7 USB Controller: Intel Corporation 82801G (ICH7 Family) USB2 EHCI Controller (rev 02)
00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev e2)
00:1f.0 ISA bridge: Intel Corporation 82801GBM (ICH7-M) LPC Interface Bridge (rev 02)
00:1f.1 IDE interface: Intel Corporation 82801G (ICH7 Family) IDE Controller (rev 02)
00:1f.2 SATA controller: Intel Corporation 82801GBM/GHM (ICH7 Family) SATA AHCI Controller (rev 02)
00:1f.3 SMBus: Intel Corporation 82801G (ICH7 Family) SMBus Controller (rev 02)
01:00.0 VGA compatible controller: ATI Technologies Inc M52 [Mobility Radeon X1300]
02:00.0 Ethernet controller: Intel Corporation 82573L Gigabit Ethernet Controller
03:00.0 Network controller: Intel Corporation PRO/Wireless 3945ABG [Golan] Network Connection (rev 02)
15:00.0 CardBus bridge: Texas Instruments PCI1510 PC card Cardbus Controller

and /proc/interrupts

           CPU0 CPU1
  0: 876624 195 IO-APIC-edge timer
  1: 9878 4 IO-APIC-edge i8042
  3: 2 0 IO-APIC-edge
  9: 4703 0 IO-APIC-fasteoi acpi
 12: 550556 0 IO-APIC-edge i8042
 16: 1 0 IO-APIC-fasteoi yenta, uhci_hcd:usb2
 17: 0 0 IO-APIC-fasteoi uhci_hcd:usb3
 18: 0 0 IO-APIC-fasteoi uhci_hcd:usb4
 19: 0 0 IO-APIC-fasteoi ehci_hcd:usb1, uhci_hcd:usb5
 45: 24014 1 PCI-MSI-edge ahci
 46: 1644 1 PCI-MSI-edge eth0
 47: 66832 6 PCI-MSI-edge iwl3945
 48: 138 0 PCI-MSI-edge hda_intel
 49: 104672 0 PCI-MSI-edge radeon
NMI: 1 0 Non-maskable interrupts
LOC: 88899 357836 Local timer interrupts
SPU: 0 0 Spurious interrupts
PMI: 0 0 Performance monitoring interrupts
PND: 0 0 Performance pending work
RES: 258412 350544 Rescheduling interrupts
CAL: 19 24 Function call interrupts
TLB: 1029 ...

Read more...

Changed in xserver-xorg-driver-ati:
importance: Unknown → Medium
status: Unknown → Confirmed
Revision history for this message
d4yw41k3r (christian-drewing) wrote : Re: Kernel crash when switching off wlan after wlan hangup (iwl3945)

I am having the same problem with an NVidia Geforce Go 7300.

Revision history for this message
Mark K (weebyte) wrote : Re: [Bug 555286] Re: Kernel crash when switching off wlan after wlan hangup (iwl3945)
Download full text (6.4 KiB)

Have you tried to change the wireless chanel on your router? That fixed it
for me. Seems interference causes some strange errors.

Good luck

Mark

On 2011-01-01 11:41 AM, "d4yw41k3r" <email address hidden> wrote:

I am having the same problem with an NVidia Geforce Go 7300.

--
You received this bug notification because you are a direct subscriber
of the bug.
https://bugs.launchpad.net/bugs/555286

Title:
 Kernel crash when switching off wlan after wlan hangup (iwl3945)

Status in X.org XServer - ATI gfx chipset driver:
 Confirmed
Status in “linux” package in Ubuntu:
 Confirmed

Bug description:
 Hi,

when I'm watching Zattoo (online tv) my wlan connection dies after a few
minutes. Automatic reconnect is not possible, syslog shows the following
error messages:

Apr 4 20:21:21 r60 kernel: [ 727.542280] No probe response from AP
00:1f:3f:1e:41:b5 after 500ms, disconnecting.
Apr 4 20:21:21 r60 wpa_supplicant[1048]: CTRL-EVENT-DISCONNECTED -
Disconnect event - remove keys
Apr 4 20:21:21 r60 NetworkManager: <info> (wlan0): supplicant connection
state: completed -> disconnected
Apr 4 20:21:21 r60 NetworkManager: <info> (wlan0): supplicant connection
state: disconnected -> scanning
Apr 4 20:21:22 r60 kernel: [ 728.212555] iwl3945 0000:03:00.0: Error
sending REPLY_RXON: time out after 500ms.
Apr 4 20:21:22 r60 kernel: [ 728.212565] iwl3945 0000:03:00.0: Error
setting new configuration (-110).
(...)

As a workaround I have to switch wlan off and on, while doing this in one of
ten cases I get the kernel oops I'm reporting here. The Problems are
probably related.

Apr 4 20:25:06 r60 kernel: [ 953.043563] ------------[ cut here
]------------
Apr 4 20:25:06 r60 kernel: [ 953.043599] WARNING: at
/build/buildd/linux-2.6.32/net/wireless/core.c:614
wdev_cleanup_work+0xe9/0x110 [cfg80211]()
Apr 4 20:25:06 r60 kernel: [ 953.043604] Hardware name: 9461DXG
Apr 4 20:25:06 r60 kernel: [ 953.043608] Modules linked in: cryptd
aes_x86_64 aes_generic binfmt_misc rfcomm ppdev sco bridge stp bnep l2cap
vboxnetadp vboxnetflt vboxdrv joydev dm_crypt snd_hda_codec_analog pcmcia
usblp arc4 snd_hda_intel snd_hda_codec snd_hwdep snd_pcm_oss snd_mixer_oss
snd_pcm snd_seq_dummy snd_seq_oss snd_seq_midi snd_rawmidi
snd_seq_midi_event iwl3945 snd_seq iwlcore snd_timer snd_seq_device
yenta_socket mac80211 btusb bluetooth thinkpad_acpi led_class nvram
rsrc_nonstatic lp pcmcia_core psmouse serio_raw cfg80211 snd parport
soundcore snd_page_alloc fbcon tileblit font bitblit softcursor vga16fb
vgastate usbhid hid radeon ttm ums_cypress ohci1394 usb_storage ieee1394
drm_kms_helper ahci video output drm i2c_algo_bit intel_agp tg3
Apr 4 20:25:06 r60 kernel: [ 953.043716] Pid: 9, comm: events/0 Not
tainted 2.6.32-19-generic #28-Ubuntu
Apr 4 20:25:06 r60 kernel: [ 953.043721] Call Trace:
Apr 4 20:25:06 r60 kernel: [ 953.043736] [<ffffffff81066ceb>]
warn_slowpath_common+0x7b/0xc0
Apr 4 20:25:06 r60 kernel: [ 953.043743] [<ffffffff81066d44>]
warn_slowpath_null+0x14/0x20
Apr 4 20:25:06 r60 kernel: [ 953.043759] [<ffffffffa02a70f9>]
wdev_cleanup_work+0xe9/0x110 [cfg80211]
Apr 4 20:25:06 r60 kernel: [ 953.043774] [<ffffffffa02a7010>] ?
wdev_cleanup_work+...

Read more...

Changed in xserver-xorg-driver-ati:
importance: Medium → Unknown
Changed in xserver-xorg-driver-ati:
importance: Unknown → Medium
Revision history for this message
In , Jantaegert (jantaegert) wrote :

Hi, what is the state of this bug?

I'm using the same hardware like Jason does (Thinkpad T60 with Radeon X1400 and Intel wireless) and running Ubuntu 11.04. Kernel Version is 2.6.38 and the problems with audio glitches and wireless drops are still there. From time to time the system isn't going to hibernate or doesn't wake up after.
All this problems are gone without kms (radeon.modeset=0).

But now the Problem has gotten even worse as without kms the radeon driver only works with software rendering and the unity desktop won't load. Same with gnome 3 running Debian unstable/experimental.

If I can anything contribute to solve this bug, please let me know!
Reading all the comments bisecting the kernel appears the most promising way to me. Anyone who wants to explain me how to do that?

Thanks.

Revision history for this message
In , Michel Dänzer (michel-daenzer) wrote :

(In reply to comment #36)
> Hi, what is the state of this bug?

It's weird, and nobody's had any good ideas what could cause it.

> But now the Problem has gotten even worse as without kms the radeon driver only
> works with software rendering and the unity desktop won't load. Same with gnome
> 3 running Debian unstable/experimental.

That's probably an installation / configuration problem (e.g. firmware not available where the radeon kernel driver initializes) and not directly related to this bug.

Revision history for this message
In , JasonPorter (jasonporter) wrote :

An update, since I posted the original bug report... I'm now running Ubuntu 11.04, and still having the same problems that everyone else is experiencing.

I'm running the full xorg-edgers PPA, so I am running radeon 6.14.99+git20110504 and mesa 7.11.0+git20110504, with the latest standard Natty kernel (64-bit).

I'm in a quandry, because without kms I can't run Unity (which no-one seems to be able to explain), and with kms I get glitchy audio and wifi dropouts.

Please help, devs!

Revision history for this message
In , Fabi (fabsi-deactivatedaccount) wrote :

Its not a fix, but a little workaround.
open terminal
xrandr
look whats your minimum resolution and tha name of the screen/output
now edit the grub linux line with video=[OUTPUTNAME]-1:[MINIMALRES]
for me its:
video=LVDS-1:320x200
now reboot. KMS/VT will now have a res of 320x200, plymouth and X are normal. sound crickeling is gone (or I just can not notice it anymore)

Revision history for this message
In , Christian-speckner (christian-speckner) wrote :

Taking opportunity of the fact that this ticket is coming back to life: this issue has still been accompanying over all mesa, X, ddx driver and kernel updates. I'm on 2.6.38 now, and sound is still stuttering.

> Its not a fix, but a little workaround.
> open terminal
> xrandr
> look whats your minimum resolution and tha name of the screen/output
> now edit the grub linux line with video=[OUTPUTNAME]-1:[MINIMALRES]
> for me its:
> video=LVDS-1:320x200
> now reboot. KMS/VT will now have a res of 320x200, plymouth and X are normal.
> sound crickeling is gone (or I just can not notice it anymore)

I can confirm that this works for me too; I set the VT framebuffer to 640x480, and the crackling is much better. In addition, my desktop feels significantly faster (I'm using KDE 4.6 with kwin compositing) --- in particular, switching desktops with many open, complex windows like firefox sitting is considerably better now.

To the developers (and don't hesitate to beat me down if this is completely out of question): could it be that the issue is caused by too many transfers from system to video memory and the driver blocking interrupts while waiting for them to complete? As I suppose that changing the framebuffer resolution affects the VRAM layout, the impact of the above trick could be explained this way. This could also explain the fact this issue seems to predominantly affect users of laptops which have hardware relatively low on VRAM (64MB for my x1300 mobility). Speaking of which: the driver reports the following about my hardware:

radeon 0000:01:00.0: VRAM: 128M 0x0000000000000000 - 0x0000000007FFFFFF (64M used)
radeon 0000:01:00.0: GTT: 512M 0x0000000008000000 - 0x0000000027FFFFFF
[drm] Supports vblank timestamp caching Rev 1 (10.10.2010).
[drm] Driver supports precise vblank timestamp query.
radeon 0000:01:00.0: irq 49 for MSI/MSI-X
radeon 0000:01:00.0: radeon: using MSI.
[drm] radeon: irq initialized.
[drm] Detected VRAM RAM=128M, BAR=128M
[drm] RAM width 64bits DDR
[TTM] Zone kernel: Available graphics memory: 440368 kiB.
[TTM] Zone highmem: Available graphics memory: 1295860 kiB.

I never gave it much thought, but thinking about it, this seems suspicious to me as I am pretty sure that this device has only 64MB of VRAM.

Revision history for this message
In , agd5f (agd5f) wrote :

(In reply to comment #40)
>
> radeon 0000:01:00.0: VRAM: 128M 0x0000000000000000 - 0x0000000007FFFFFF (64M
> used)
> radeon 0000:01:00.0: GTT: 512M 0x0000000008000000 - 0x0000000027FFFFFF
> [drm] Supports vblank timestamp caching Rev 1 (10.10.2010).
> [drm] Driver supports precise vblank timestamp query.
> radeon 0000:01:00.0: irq 49 for MSI/MSI-X
> radeon 0000:01:00.0: radeon: using MSI.
> [drm] radeon: irq initialized.
> [drm] Detected VRAM RAM=128M, BAR=128M
> [drm] RAM width 64bits DDR
> [TTM] Zone kernel: Available graphics memory: 440368 kiB.
> [TTM] Zone highmem: Available graphics memory: 1295860 kiB.
>
> I never gave it much thought, but thinking about it, this seems suspicious to
> me as I am pretty sure that this device has only 64MB of VRAM.

Your device does have 64 MB of vram. The pci aperture is 128 MB, but there is only 64 MB of vram. That's why is says "64M used".

Revision history for this message
In , JasonPorter (jasonporter) wrote :

(In reply to comment #40)
> I never gave it much thought, but thinking about it, this seems suspicious to
> me as I am pretty sure that this device has only 64MB of VRAM.

I'm using an X1400 here, with 128MB of vram. I'm not sure if that is helpful or not.

Here's some extra info for the fire, and one that surprised me when I discovered it yesterday: there are a few apps that render fullscreen video that do not exhibit the stuttering at all. This is new behavior, previously ANY video would cause the stuttering when fullscreened. This change that I'm seeing may not be the same on your systems, but it's worth testing.

A good example is the Hulu Desktop for Linux application, which streams high quality video fullscreen without any stutter. This uses the video card, the wifi connection, and the sound card at the same time, which is exactly the situation that causes problems for most people. The app specifies that it requires Flash 10.0.32 or higher, which suggests that it uses Flash for the video transport in some fashion. The application can be downloaded from http://www.hulu.com/labs/hulu-desktop-linux

Also, on my system, switching into HTML5 video mode on Youtube (using Google Chrome) allows fullscreen 720p streaming without audio stutter. The same video played in standard Flash video mode (in Chrome or in Firefox) stutters heavily when maximized, in both 480p and 720p formats. In fact, any Flash-based web-embedded video stutters when maximized, on any site that I've tried (Vimeo, CBS, etc), including even Hulu's own web-based player.

I'm not sure what the story is on this, it's confusing. Maybe on my particular hardware the radeon driver is able to cope differently with the particular video rendering method used by Hulu Desktop and the browser HTML5 video implementations. If the Hulu Desktop application is using Flash, it's doing it differently than viewing the same video on Hulu.com in a browser, because the web-embedded player stutters and the Hulu Desktop application doesn't, and both are (theoretically) streaming from the same source.

I'm running very current versions of Mesa, Gallium and the radeon driver, so the particular combination of behaviors that I'm seeing may be a recent change. I also have a relatively fast system (Core 2 Duo with an SSD) and as always, system load seems to have a big impact on this issue appearing or not. So your results may not be the same as mine.

possibly relevant information from glxinfo:
  direct rendering: Yes
  OpenGL vendor string: X.Org R300 Project
  OpenGL renderer string: Gallium 0.4 on ATI RV515
  OpenGL version string: 2.1 Mesa 7.11-devel

Revision history for this message
In , Denk (denk) wrote :

Anything new on this?
I've got a Thinkpad T60 with Radeon X1400, IWL3945 and Intel HDA (AD198x) with Linux 2.6.39, mesa 7.11-devel, xserver 1.10.1-devel, xf86-video-ati 6.14.1-devel and suffer from the already mentioned stuttering/choopy audio and frequent iwlwifi-crashes, too!

Regards
denk

Revision history for this message
In , Jantaegert (jantaegert) wrote :

Created attachment 47928
Kernel log while loading the radeon driver

With kernel 2.6.39 I get the NMI message (see attachment) on every boot while the radeon driver is loaded. With earlier versions this happened only while i switched the console.

Maybe it would help to investigate the module loading?
If anyone can tell me how to do that, i can submit more detailled informations.

Thanks,
jan.

Revision history for this message
In , Maciej Borzecki (maciek-borzecki) wrote :

(In reply to comment #44)
> Created an attachment (id=47928) [details]
> Kernel log while loading the radeon driver
>
> With kernel 2.6.39 I get the NMI message (see attachment) on every boot while
> the radeon driver is loaded. With earlier versions this happened only while i
> switched the console.
>
> Maybe it would help to investigate the module loading?
> If anyone can tell me how to do that, i can submit more detailled informations.
>
> Thanks,
> jan.

This might shed some light:
http://mailman.linux-thinkpad.org/pipermail/linux-thinkpad/2011-February/049418.html

Revision history for this message
In , Maciej Borzecki (maciek-borzecki) wrote :

Created attachment 48007
dmesg from Ubuntu 11.04 - unaffected

Revision history for this message
In , Maciej Borzecki (maciek-borzecki) wrote :

Created attachment 48008
dmesg from Fedora 15 - affected

Revision history for this message
In , Maciej Borzecki (maciek-borzecki) wrote :

I've added dmesg output from Ubuntu 11.04 which is not affected with the bug (at least on my laptop - Thinkpad T60, 2623P2U, X1300) and Fedora 15 which does have the problem. Unless I missed something, the only relevant difference is DRM version reported, which is 2.8 for Ubuntu and 2.10 for Fedora (how come that's possible is another question).
So far, the following does not help:
- disp_priority=1
- agpmode=1
- gartsize=64
- dynclks=0

Revision history for this message
In , Jantaegert (jantaegert) wrote :

(In reply to comment #45)

Dynamic Powermanagement is disabled here (powermanagement profile is fixed to "default", what means, that all pcie lines stay allways enabled).

Revision history for this message
In , Maciej Borzecki (maciek-borzecki) wrote :

I played a bit with power_profile settings and it turns our that whenever profile is set to high, mid, sound stuttering is pronounced. Yet, once set to low, stuttering is gone (or unnoticeable).
If anyone wants to try (adjust path to suit your hardware):
echo low > /sys/class/drm/card0/device/power_profile
Then the actual frequency can be verified by:
cat /sys/kernel/debug/dri/0/radeon_pm_info

Also dynpm power_method does not really work, there was another bug report recenty that the frequency is never lowered if dynpm is used.

Additionally, given the recent fuss about pcie_aspm=force (and possible effect on PCIe), sound stuttering is present regardless of the setting.

Revision history for this message
In , Øyvind Stegard (oyvindstegard) wrote :

(In reply to comment #50)
> I played a bit with power_profile settings and it turns our that whenever
> profile is set to high, mid, sound stuttering is pronounced. Yet, once set to
> low, stuttering is gone (or unnoticeable).
> If anyone wants to try (adjust path to suit your hardware):
> echo low > /sys/class/drm/card0/device/power_profile
> Then the actual frequency can be verified by:
> cat /sys/kernel/debug/dri/0/radeon_pm_info

Confirming this on ATI X1400 mobile, Ubuntu 11.04 x86. When using the "low" power profile, audio stuttering/crackling is much less prevalent (or maybe not even noticable) in Youtube fullscreen vids. Using the "high" setting results in definite audio crackling when switching to fullscreen. This is with the very latest Flashplayer 11 beta for Linux released today.

On my card, low setting results in:
# cat /sys/kernel/debug/dri/0/radeon_pm_info
default engine clock: 392000 kHz
current engine clock: 128250 kHz
default memory clock: 350000 kHz
current memory clock: 135000 kHz
PCIE lanes: 1

High setting gives:
# cat /sys/kernel/debug/dri/0/radeon_pm_info
default engine clock: 392000 kHz
current engine clock: 391500 kHz
default memory clock: 350000 kHz
current memory clock: 342000 kHz
PCIE lanes: 0

There's a difference not only in board frequencies, but also the PCIE lanes number (0 means full throttle, or is "more performant" than 1 I guess ??).

Revision history for this message
In , Maciej Borzecki (maciek-borzecki) wrote :

(In reply to comment #51)
> There's a difference not only in board frequencies, but also the PCIE lanes
> number (0 means full throttle, or is "more performant" than 1 I guess ??).

The PCIe lanes information seems to be read from the card itself (at least that's for RV515), look here:
http://lxr.linux.no/#linux+v2.6.39/drivers/gpu/drm/radeon/r300.c#L553
While 1 is understandable (PCIe x1), 0 value is confusing and I can't tell if that'x x16 or not. Maybe one of the driver authorsa can provide some input.

I've failed to locate any docs that contain information on the registers exposed on the PCI.

Revision history for this message
In , agd5f (agd5f) wrote :

Does adding noapic to the kernel commandline in grub help? See the last few comments in bug 37679.

Revision history for this message
In , Maciej Borzecki (maciek-borzecki) wrote :

(In reply to comment #53)
> Does adding noapic to the kernel commandline in grub help? See the last few
> comments in bug 37679.
No, it seems to have no effect on the problem, at least on my setup.

Revision history for this message
In , Richtigfalsch (richtigfalsch) wrote :

I'd like the importance of this bug being corrected to 'major' because disabling KMS in fact means a major loss of functionality. I'm now on kernel 2.6.40 and sadly this heavy bug still is there.

I skipped form Windows to Linux, mainly because the ATI Driver for the x1400 in my Thinkpad T60 (with iwl3945 od course) is crap.Now having tried many different distributions and kernels, i can confirm the bug still is there, and is making the notebook unusable. There's no 3D acceleration available at all on this GPU with KMS disabled. When playing a flash video it needs about 30 seconds fpr reaction if I clock some control with the mouse. Compiz or DirectX in Wine aren't working at all, and make the display crash. The GPU is wasting much energy and the notebook is running very hot, and overall just sluggish and not enjoyable in any fashion, I'd rather use my old Pentium M notebook, if it wasn't defective.

Please consider creating a solution fot this problem, as there's no single alternative for many Notebook owners, of expecially good notebooks (Thinkpad, Dell and more).

Problems with KMS enabled remain as before:
-iwl3945 WLAN gets slower and slower, until disconnect.
-heavy video (especially fullscreen) make the sound stutter in a fashion that makes it impossible to understand spoken word

Thanks,

Changed in xserver-xorg-driver-ati:
importance: Medium → High
Revision history for this message
In , Øyvind Stegard (oyvindstegard) wrote :

Seems worse than ever on Ubuntu 11.10 just released (kernel 3.0, libdrm 2.4.26, xserver 1.10.4, using the new Unity-interface-thing). Just moving the mouse pointer is enough to disturb audio now, apparently. And moving windows around turns audio into bubbling porridge.

Revision history for this message
In , Øyvind Stegard (oyvindstegard) wrote :

Unity is sluggish on the X1400. Guess it's too old to cope now, with the latest desktop tech. Anyways, audio interruption is more or less constant after a while. Don't even need to move anything. Got these:

[ 931.698537] CE: hpet increased min_delta_ns to 515452 nsec
[ 1019.733804] CE: hpet increased min_delta_ns to 773178 nsec
[ 1023.203405] hrtimer: interrupt took 7398146 ns
[ 1171.028962] CE: hpet increased min_delta_ns to 1159767 nsec

Revision history for this message
In , Maciej Borzecki (maciek-borzecki) wrote :

(In reply to comment #57)
> Unity is sluggish on the X1400. Guess it's too old to cope now, with the latest
> desktop tech.
Not really. Worked great with UMS. Compiz with way more advanced effects than fade in/out was smooth, same for ioquake running at decent framerate.

Revision history for this message
In , Tom Morton (tomm) wrote :

I get this on my Thinkpad T60, Radeon X1300 running debian sid.

As well as crackle on the internal intel audio, I get even worse crackle and popping when using my Logitech V20 USB speakers.

The only thing that resolves the problem for me is:

echo mid > /sys/class/drm/card0/device/power_profile

But then gnome-shell and all 3d apps are really slow.

Revision history for this message
In , Steffen-schloenvoigt (steffen-schloenvoigt) wrote :

Same problem on openSUSE 12.1 with lenovo T60, Radeon Mobility X1400

Revision history for this message
In , stevenb (stevenb) wrote :

Hi

Jan Kouba put me on track to this bug-page. I don't know if "official" developers use this channel as a information or judgment source of bugs. As far as I know Ubuntu works on launchpad to administrate bugs.

For this problem I created https://bugs.launchpad.net/ubuntu/+bug/879790 in order to make Ubuntu know that problem. It may be an idea to "make some noise" there in form of clicking on the button "This bug affects...". And - if you feel like - to reproduce some of your statements from this bug-report-page. Hopefully they will take notice finally.

keep fingers crossed
Quesst

Revision history for this message
In , Neven Klacar (nklacar) wrote :

Looks like this will never get solved ;(

If anyone has ideas on how to debug this issue, I have a t60 that I can use.

Revision history for this message
In , Neven Klacar (nklacar) wrote :

(In reply to comment #62)
> Looks like this will never get solved ;(
>
> If anyone has ideas on how to debug this issue, I have a t60 that I can use.

I tried enabling msi on alsa and its alot better. Still fulscreen is not good, but much sounds like 11.04

Add: options snd-hda-intel enable_msi=1" to the bottom of your /etc/modprobe.d/alsa-base file.

I think radeon is somehow not playing nice with msi. If we could disable it it might fix the problem...I tried doing it with echo 0 > msi_bus in the radeon bridge but it wasnt working

Revision history for this message
In , agd5f (agd5f) wrote :

With a new enough kernel, you can disable MSIs on radeon by setting radeon.msi=0 on the kernel command line in grub or when you load the module. You can disable MSIs globally by setting pci=nomsi on the kernel command line in grub.

Revision history for this message
In , agd5f (agd5f) wrote :
Revision history for this message
In , Neven Klacar (nklacar) wrote :

(In reply to comment #65)
> You might also try this patch:
> http://cgit.freedesktop.org/~airlied/linux/commit/?h=drm-fixes&id=b7f5b7dec3d539a84734f2bcb7e53fbb1532a40b

I tried to git the 11.10 package from ubuntu and recompile with the change you mentioned in rs600.c file and r100.c . It didnt seem to help. I noticed someone tried disabling MSI so this might not be related to MSI at all...

I was thinking of trying the latest radeon src?

Revision history for this message
In , agd5f (agd5f) wrote :

Has anyone tried messing with the audio or wifi drivers? It's possible the issue is on that side. How about messing with the cpufreq governors? Is it still an issue if you force the cpu power state to performance, etc.?

Revision history for this message
penalvch (penalvch) wrote : Re: Kernel crash when switching off wlan after wlan hangup (iwl3945)

Christophe, thank you for reporting this and helping make Ubuntu better. Regarding your bug description:
> "when I'm watching Zattoo (online tv) my wlan connection dies after a few minutes. Automatic reconnect is not possible, syslog shows the following error... As a workaround I have to switch wlan off and on, while doing this in one of ten cases I get the kernel oops I'm reporting..."

Please do not stack multiple issues into one report. Hence, this bug will focus only on the wlan connection dropping off and automatic reconnect not being possible.

If you want the kernel oops addressed, please file a new bug via the Terminal and feel free to subscribe me to it:
ubuntu-bug linux

As well, can you try with the latest development release of Ubuntu? ISO CD images are available from http://cdimage.ubuntu.com/releases/ .

If it remains an issue, could you run the following command from a Terminal (Applications->Accessories->Terminal). It will automatically gather and attach updated debug information to this report.

apport-collect -p linux <replace-with-bug-number>

Also, if you could test the latest upstream kernel available that would be great. It will allow additional upstream developers to examine the issue. Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag. This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text. Please let us know your results.

Thanks in advance.

Changed in linux (Ubuntu):
status: Confirmed → Incomplete
summary: - Kernel crash when switching off wlan after wlan hangup (iwl3945)
+ wlan connection dies after a few minutes
description: updated
tags: removed: apport-kerneloops kernel-oops
Revision history for this message
Christophe (christophe-wk3) wrote :

Hey, this bug report is two years old and I switched to more recent versions since then. I don't mind it being closed.

Revision history for this message
penalvch (penalvch) wrote :

Christophe, this bug report is being closed due to your last comment regarding this being fixed with an update. For future reference you can manage the status of your own bugs by clicking on the current status in the yellow line and then choosing a new status in the revealed drop down box. You can learn more about bug statuses at https://wiki.ubuntu.com/Bugs/Status. Thank you again for taking the time to report this bug and helping to make Ubuntu better. Please submit any future bugs you may find.

Changed in linux (Ubuntu):
status: Incomplete → Invalid
Changed in xserver-xorg-driver-ati:
importance: High → Undecided
status: Confirmed → New
status: New → Invalid
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.