page allocation failure caused by rtl8139 driver

Bug #164018 reported by LGB [Gábor Lénárt]
This bug affects 7 people
Affects: linux (Ubuntu)
Status: Expired
Importance: Medium
Assigned to: Unassigned

Bug Description

This is 32 bit Ubuntu 7.10 (gutsy) running on a desktop machine with a realtek ethernet NIC:

02:03.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8139/8139C/8139C+ (rev 10)
        Subsystem: Micro-Star International Co., Ltd. Unknown device 254c
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
        Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
        Latency: 64 (8000ns min, 16000ns max)
        Interrupt: pin A routed to IRQ 16
        Region 0: I/O ports at de00 [size=256]
        Region 1: Memory at fddff000 (32-bit, non-prefetchable) [size=256]
        Capabilities: [50] Power Management version 2
                Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=375mA PME(D0-,D1+,D2+,D3hot+,D3cold+)
                Status: D0 PME-Enable- DSel=0 DScale=0 PME-

The problem: from time to time I get messages in the kernel log starting with the line 'swapper: page allocation failure. order:1, mode:0x4020'

This bug seems to be similar to #150611, but in my case it seems to be related to rtl8139 ...
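
For readers decoding the headline of these reports: `order:N` is the base-2 logarithm of the number of physically contiguous pages requested, so order:1 asks for two adjacent pages. Below is a minimal Python sketch for pulling those fields out of a log line. It assumes the 4 KiB page size usual on i686/x86_64; `parse_failure` and `PAGE_SIZE_KIB` are illustrative names, not anything from the kernel.

```python
import re

PAGE_SIZE_KIB = 4  # assumption: 4 KiB pages, the usual size on i686/x86_64

# Matches the headline as it appears in the logs in this report, e.g.
# "swapper: page allocation failure. order:1, mode:0x4020"
FAILURE_RE = re.compile(
    r"(?P<comm>\S+): page allocation failure\. "
    r"order:(?P<order>\d+), mode:(?P<mode>0x[0-9a-fA-F]+)"
)

def parse_failure(line):
    """Return (comm, order, contiguous KiB requested, mode) or None."""
    m = FAILURE_RE.search(line)
    if m is None:
        return None
    order = int(m.group("order"))
    pages = 1 << order  # an order-N request is for 2**N contiguous pages
    return m.group("comm"), order, pages * PAGE_SIZE_KIB, int(m.group("mode"), 16)
```

The `mode` value is returned as a plain integer and deliberately left undecoded, because the bit assignments of the allocation flags differ between kernel versions.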

Leann Ogasawara (leannogasawara) wrote :

The Hardy Heron kernel was recently uploaded for testing. We'd really appreciate it if you could try testing with this newer kernel and verify if this issue still exists. Unfortunately, the Hardy Heron Alpha1 LiveCD was released with the older 2.6.22 kernel. You'll have to manually install the newer Hardy Heron kernel in order to test. This should not be the case for Alpha2 which is set to come out around Dec 20. However, here are the instructions to install if you choose to do so, otherwise just wait for Alpha2 to come out:

1) edit the file /etc/apt/sources.list and add the following line:

deb http://archive.ubuntu.com/ubuntu hardy main restricted

2) sudo apt-get update
3) sudo apt-get install linux-image-2.6.24-1-generic
4) reboot and select the new kernel from the grub menu

After you've tested, please feel free to revert, i.e. boot into the old kernel, run sudo apt-get remove linux-image-2.6.24-1-generic, and remove the line from /etc/apt/sources.list . Please update this report with your results. Thanks in advance!

Changed in linux:
importance: Undecided → Medium
status: New → Incomplete
Leann Ogasawara (leannogasawara) wrote :

This bug report remains open against the actively developed kernel. Against linux-source-2.6.22, however, this bug does not meet the criteria for a stable release update and is being closed. You can learn more about the stable release update process at https://wiki.ubuntu.com/StableReleaseUpdates . Thanks!

Changed in linux-source-2.6.22:
status: New → Won't Fix
LGB [Gábor Lénárt] (lgb) wrote :

OK, I'll try (there's even 2.6.24-4 now, if I'm right). However, since I don't know what exactly triggers the problem, I don't know how I can be sure whether the 2.6.24-4 kernel solves it or not.

Leann Ogasawara (leannogasawara) wrote :

Hi LGB,

I just wanted to let you know that you can download and try the new Hardy Heron Alpha release from http://cdimage.ubuntu.com/releases/hardy/ . You should be able to then test the new kernel via the LiveCD. It might be easier than having to install the kernel by hand - which was only necessary to test 2.6.24-1 since it didn't make it into the first Alpha release. Thanks.

LGB [Gábor Lénárt] (lgb) wrote :

I've been using hardy for some time and I haven't hit this issue yet. But since the problem didn't occur very often with the gutsy kernel either (and I don't know how to trigger it), I can't say that it's fixed in hardy ...

Leann Ogasawara (leannogasawara) wrote :

Hi LGB,

Thanks for the update. It sounds like a good sign that you have not run into this while running Hardy. I'll go ahead and leave this report open for a while longer before we close it. If you are able to reproduce the issue, definitely let us know. Thanks!

LGB [Gábor Lénárt] (lgb) wrote :

I think I have a similar issue now with 2.6.24, check bug #150611 , thanks!

Leann Ogasawara (leannogasawara) wrote :

The Ubuntu Kernel Team is planning to move to the 2.6.27 kernel for the upcoming Intrepid Ibex 8.10 release. As a result, the kernel team would appreciate it if you could please test this newer 2.6.27 Ubuntu kernel. There are two ways you should be able to test:

1) If you are comfortable installing packages on your own, the linux-image-2.6.27-* package is currently available for you to install and test.

--or--

2) The upcoming Alpha5 for Intrepid Ibex 8.10 will contain this newer 2.6.27 Ubuntu kernel. Alpha5 is set to be released Thursday Sept 4. Please watch http://www.ubuntu.com/testing for Alpha5 to be announced. You should then be able to test via a LiveCD.

Please let us know immediately if this newer 2.6.27 kernel resolves the bug reported here or if the issue remains. More importantly, please open a new bug report for each new bug/regression introduced by the 2.6.27 kernel and tag the bug report with 'linux-2.6.27'. Also, please specifically note if the issue does or does not appear in the 2.6.26 kernel. Thanks again, we really appreciate your help and feedback.

LGB [Gábor Lénárt] (lgb) wrote :

I have this bug with 2.6.27-10-generic ( 2.6.27-10.20) using Ubuntu intrepid on 32 bit arch.

lgb@vega:~$ uname -a
Linux vega 2.6.27-10-generic #1 SMP Fri Nov 21 12:00:22 UTC 2008 i686 GNU/Linux

Dec 7 12:03:05 vega kernel: [356296.455435] swapper: page allocation failure. order:0, mode:0x4020
Dec 7 12:03:05 vega kernel: [356296.455449] Pid: 0, comm: swapper Tainted: P 2.6.27-10-generic #1
Dec 7 12:03:05 vega kernel: [356296.455455] [<c037c9f6>] ? printk+0x1d/0x1f
Dec 7 12:03:05 vega kernel: [356296.455470] [<c018a447>] __alloc_pages_internal+0x387/0x490
Dec 7 12:03:05 vega kernel: [356296.455479] [<c01aafc5>] alloc_slab_page+0x35/0x40
Dec 7 12:03:05 vega kernel: [356296.455486] [<c01ab6da>] new_slab+0x5a/0x1e0
Dec 7 12:03:05 vega kernel: [356296.455491] [<c01ac86d>] ? unfreeze_slab+0x7d/0xc0
Dec 7 12:03:05 vega kernel: [356296.455496] [<c01acae5>] __slab_alloc+0x1b5/0x260
Dec 7 12:03:05 vega kernel: [356296.455501] [<c01ad821>] __kmalloc_track_caller+0xd1/0x100
Dec 7 12:03:05 vega kernel: [356296.455506] [<c02ec071>] ? dev_alloc_skb+0x21/0x40
Dec 7 12:03:05 vega kernel: [356296.455514] [<c02ec071>] ? dev_alloc_skb+0x21/0x40
Dec 7 12:03:05 vega kernel: [356296.455523] [<c02ebdb2>] __alloc_skb+0x52/0x110
Dec 7 12:03:05 vega kernel: [356296.455527] [<c02ec071>] dev_alloc_skb+0x21/0x40
Dec 7 12:03:05 vega kernel: [356296.455554] [<f8938791>] rtl8139_rx+0xe1/0x2f0 [8139too]
Dec 7 12:03:05 vega kernel: [356296.455566] [<c014b22b>] ? hrtimer_get_next_event+0x10b/0x130
Dec 7 12:03:05 vega kernel: [356296.455577] [<f89390d6>] rtl8139_poll+0x46/0xc8 [8139too]
Dec 7 12:03:05 vega kernel: [356296.455587] [<f8938f84>] ? rtl8139_interrupt+0xb4/0x190 [8139too]
Dec 7 12:03:05 vega kernel: [356296.455596] [<c02f4e15>] net_rx_action+0xf5/0x230
Dec 7 12:03:05 vega kernel: [356296.455602] [<c0137672>] __do_softirq+0x92/0x120
Dec 7 12:03:05 vega kernel: [356296.455607] [<c013775d>] do_softirq+0x5d/0x60
Dec 7 12:03:05 vega kernel: [356296.455611] [<c01378d5>] irq_exit+0x55/0x90
Dec 7 12:03:05 vega kernel: [356296.455615] [<c0106c1a>] do_IRQ+0x4a/0x80
Dec 7 12:03:05 vega kernel: [356296.455620] [<c013790c>] ? irq_exit+0x8c/0x90
Dec 7 12:03:05 vega kernel: [356296.455624] [<c0113f8d>] ? smp_apic_timer_interrupt+0x5d/0x90
Dec 7 12:03:05 vega kernel: [356296.455631] [<c0105003>] common_interrupt+0x23/0x30
Dec 7 12:03:05 vega kernel: [356296.455635] [<c010acca>] ? mwait_idle+0x4a/0x50
Dec 7 12:03:05 vega kernel: [356296.455640] [<c010288d>] cpu_idle+0x7d/0x140
Dec 7 12:03:05 vega kernel: [356296.455643] [<c036f3c3>] rest_init+0x53/0x60
Dec 7 12:03:05 vega kernel: [356296.455650] =======================
Dec 7 12:03:05 vega kernel: [356296.455652] Mem-Info:
Dec 7 12:03:05 vega kernel: [356296.455655] DMA per-cpu:
Dec 7 12:03:05 vega kernel: [356296.455658] CPU 0: hi: 0, btch: 1 usd: 0
Dec 7 12:03:05 vega kernel: [356296.455660] CPU 1: hi: 0, btch: 1 usd: 0
Dec 7 12:03:05 vega kernel: [356296.455662] Normal per-cpu:
Dec 7 12:03:05 vega kernel: [356296.455665] CPU 0: hi: 186, btch: 31 usd: 118
Dec 7 12:03:05 vega kernel: [356296.455667] CPU 1: hi: 186, btch: 31 usd: 174
D...

Changed in linux:
status: Incomplete → Triaged
Amit Kucheria (amitk) wrote :

This bug was reported a while ago but there haven't been any recent comments or updates. Is this still an issue with the latest pre-release of Jaunty 9.04? Refer to http://www.ubuntu.com/testing/jaunty/beta . Please let us know.

Changed in linux (Ubuntu):
status: Triaged → Incomplete
gna (nagy-gergely) wrote :

Since Hardy is an LTS release and I am stuck with the OCFS version on 5 servers, I cannot test on Jaunty, but I can confirm the problem on 2.6.24-24-server and, as you can see, also with a bnx2 network card. So I do not want to see a statement like "Move away from LTS", because that would make me very sad.

Linux web2 2.6.24-24-server #1 SMP Wed Apr 15 16:36:01 UTC 2009 i686 GNU/Linux

swapper: page allocation failure. order:1, mode:0x4020
Pid: 0, comm: swapper Not tainted 2.6.24-24-server #1
 [<c01785ab>] __alloc_pages+0x2ab/0x380
 [<c02dbb89>] ip_local_deliver_finish+0xf9/0x210
 [<c0196226>] __slab_alloc+0x186/0x4a0
 [<c0197446>] __kmalloc_track_caller+0x106/0x110
 [<c02b4d32>] __netdev_alloc_skb+0x22/0x50
 [<c02b4d32>] __netdev_alloc_skb+0x22/0x50
 [<c02b4125>] __alloc_skb+0x55/0x120
 [<c02b4d32>] __netdev_alloc_skb+0x22/0x50
 [<f89592c1>] bnx2_poll+0x6f1/0xd10 [bnx2]
 [<c012d974>] load_balance_fair+0xc4/0x100
 [<f8958c00>] bnx2_poll+0x30/0xd10 [bnx2]
 [<f8958e15>] bnx2_poll+0x245/0xd10 [bnx2]
 [<c014c024>] getnstimeofday+0x34/0xf0
 [<c02bb7ed>] net_rx_action+0x12d/0x210
 [<c01498f8>] ktime_get+0x18/0x40
 [<c0136e42>] __do_softirq+0x82/0x110
 [<c0136f25>] do_softirq+0x55/0x60
 [<c013720d>] irq_exit+0x6d/0x80
 [<c010a930>] do_IRQ+0x40/0x70
 [<c014989e>] ktime_get_ts+0x1e/0x60
 [<c0108def>] common_interrupt+0x23/0x28
 [<c01062e6>] mwait_idle_with_hints+0x46/0x60
 [<c01066c3>] cpu_idle+0x73/0xd0
 =======================
Mem-info:
DMA per-cpu:
CPU 0: Hot: hi: 0, btch: 1 usd: 0 Cold: hi: 0, btch: 1 usd: 0
CPU 1: Hot: hi: 0, btch: 1 usd: 0 Cold: hi: 0, btch: 1 usd: 0
Normal per-cpu:
CPU 0: Hot: hi: 186, btch: 31 usd: 176 Cold: hi: 62, btch: 15 usd: 57
CPU 1: Hot: hi: 186, btch: 31 usd: 149 Cold: hi: 62, btch: 15 usd: 50
HighMem per-cpu:
CPU 0: Hot: hi: 186, btch: 31 usd: 53 Cold: hi: 62, btch: 15 usd: 14
CPU 1: Hot: hi: 186, btch: 31 usd: 181 Cold: hi: 62, btch: 15 usd: 11
Active:486821 inactive:222382 dirty:1086 writeback:43 unstable:0 free:1236121 slab:125422 mapped:5242 pagetables:1068 bounce:0
DMA free:3508kB min:68kB low:84kB high:100kB active:4kB inactive:0kB present:16256kB pages_scanned:0 all_unreclaimable? no lowmem_reserve[]: 0 873 8874 8874
Normal free:122668kB min:3744kB low:4680kB high:5616kB active:85852kB inactive:111552kB present:894080kB pages_scanned:0 all_unreclaimable? no lowmem_reserve[]: 0 0 64008 64008
HighMem free:4818308kB min:512kB low:9096kB high:17684kB active:1861428kB inactive:777976kB present:8193024kB pages_scanned:0 all_unreclaimable? no lowmem_reserve[]: 0 0 0 0
DMA: 137*4kB 48*8kB 32*16kB 0*32kB 0*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 1*2048kB 0*4096kB = 3492kB
Normal: 30507*4kB 1*8kB 1*16kB 0*32kB 1*64kB 0*128kB 0*256kB 1*512kB 0*1024kB 0*2048kB 0*4096kB = 122628kB
HighMem: 6664*4kB 14962*8kB 19387*16kB 19951*32kB 10681*64kB 4402*128kB 1783*256kB 819*512kB 323*1024kB 200*2048kB 210*4096kB = 4818304kB
Swap cache: add 1, delete 0, find 0/0, race 0+0
Free swap = 1951884kB
Total swap = 1951888kB
Free swap: 1951884kB
2293759 pages of RAM
2064383 pages of HIGHMEM
216253 reserved pages
341800 pages shared
1 pages swap cached
1086 pages dirty
43 pages writeb...
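
The per-zone free-list lines in a dump like the one above (for example the `Normal:` line) list how many free blocks of each size the buddy allocator holds. A minimal Python sketch for reading such a line follows; `parse_buddy_line` and `blocks_at_least` are hypothetical helper names introduced only for illustration.

```python
import re

def parse_buddy_line(line):
    """Parse e.g. 'Normal: 30507*4kB 1*8kB ... = 122628kB'.

    Returns (zone, {block_size_kB: count}, reported_total_kB).
    """
    zone, rest = line.split(":", 1)
    # Each "count*sizekB" token is one free list of the buddy allocator.
    blocks = {int(size): int(count)
              for count, size in re.findall(r"(\d+)\*(\d+)kB", rest)}
    total = int(re.search(r"=\s*(\d+)kB", rest).group(1))
    return zone.strip(), blocks, total

def blocks_at_least(blocks, min_kb):
    """Count free blocks of at least min_kb kB."""
    return sum(count for size, count in blocks.items() if size >= min_kb)
```

Applied to the `Normal:` line of this trace, the per-size counts add up to the reported 122628kB, of which only 4 blocks are 8kB or larger; the rest is single 4kB pages. That is consistent with a fragmented zone struggling to serve an order:1 request, though this is one reading of the numbers and not a confirmed diagnosis.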

Kaustubh (kaustubh-gadkari) wrote :

I have Ubuntu 9.04 server installed, and I see this problem too:

[199458.249123] Pid: 0, comm: swapper Not tainted 2.6.28-11-server #42-Ubuntu
[199458.249126] Call Trace:
[199458.249128] <IRQ> [<ffffffff802b694e>] __alloc_pages_internal+0x3ee/0x4f0
[199458.249144] [<ffffffff802df8d8>] alloc_slab_page+0x28/0x30
[199458.249148] [<ffffffff802e099a>] new_slab+0x5a/0x210
[199458.249152] [<ffffffff802e1fc8>] __slab_alloc+0x188/0x290
[199458.249158] [<ffffffff805acd0f>] ? __netdev_alloc_skb+0x1f/0x40
[199458.249161] [<ffffffff802e2ed7>] __kmalloc_track_caller+0xd7/0x110
[199458.249165] [<ffffffff805acd0f>] ? __netdev_alloc_skb+0x1f/0x40
[199458.249168] [<ffffffff805ac9ae>] __alloc_skb+0x6e/0x150
[199458.249172] [<ffffffff805acd0f>] __netdev_alloc_skb+0x1f/0x40
[199458.249186] [<ffffffffa0082714>] bnx2_rx_skb+0x74/0x7e0 [bnx2]
[199458.249191] [<ffffffff805ddce4>] ? ip_rcv+0x264/0x2f0
[199458.249198] [<ffffffffa008594e>] bnx2_rx_int+0x44e/0x7b0 [bnx2]
[199458.249202] [<ffffffff805ac0aa>] ? __kfree_skb+0x3a/0xa0
[199458.249208] [<ffffffffa0083fd1>] ? bnx2_tx_int+0x151/0x2a0 [bnx2]
[199458.249214] [<ffffffff80227818>] ? __trigger_all_cpu_backtrace+0x48/0x50
[199458.249220] [<ffffffffa0085d20>] bnx2_poll_work+0x70...

Ike Panhc (ikepanhc) wrote :

Hi Kaustubh,

I saw the dmesg you posted, but it seems you are using a Broadcom Ethernet card (bnx2).
Could you confirm that?
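
One quick way to see which driver a trace belongs to is to collect the module names printed in square brackets at the end of stack frames. A minimal Python sketch, assuming the frame layout used by most traces in this report (`symbol+0xoff/0xlen [module]`); `modules_in_trace` is a hypothetical helper name.

```python
import re

# Matches frames such as
#   "[<ffffffffa0082714>] bnx2_rx_skb+0x74/0x7e0 [bnx2]"
#   "[<f8938f84>] ? rtl8139_interrupt+0xb4/0x190 [8139too]"
# Frames without a trailing "[module]" belong to the core kernel and are skipped.
# The older ":tg3:tg3_poll+..." layout seen in one 2.6.24 trace is not handled.
FRAME_RE = re.compile(r"\]\s+(?:\? )?\w+\+0x[0-9a-f]+/0x[0-9a-f]+ \[(\w+)\]")

def modules_in_trace(lines):
    """Return module names in order of first appearance."""
    seen = []
    for line in lines:
        m = FRAME_RE.search(line)
        if m and m.group(1) not in seen:
            seen.append(m.group(1))
    return seen
```

Run over the frames quoted in the comments above, this yields `8139too` for the original report and `bnx2` for the 9.04 server trace.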

Blacknight (michael-blacknight) wrote :

Hi all,

My, well, let's call it my little home server (a fit-pc2), just nearly died on me.
Commands like ps -aux wouldn't complete; they just stopped halfway through.

I found this in my logs.

Jul 30 20:50:21 home kernel: [118560.183051] swapper: page allocation failure. order:0, mode:0x4020
Jul 30 20:50:22 home kernel: [118560.183069] Pid: 0, comm: swapper Not tainted 2.6.28-14-server #47-Ubuntu
Jul 30 20:50:22 home kernel: [118560.183079] Call Trace:
Jul 30 20:50:22 home kernel: [118560.183101] [<c050b3b6>] ? printk+0x18/0x1a
Jul 30 20:50:22 home kernel: [118560.183119] [<c019bdc7>] __alloc_pages_internal+0x357/0x450
Jul 30 20:50:22 home kernel: [118560.183136] [<c01bf545>] alloc_slab_page+0x35/0x40
Jul 30 20:50:22 home kernel: [118560.183150] [<c01bfc45>] new_slab+0x55/0x1e0
Jul 30 20:50:22 home kernel: [118560.183164] [<c01c0d85>] ? unfreeze_slab+0x75/0xb0
Jul 30 20:50:22 home kernel: [118560.183177] [<c01c0fdd>] __slab_alloc+0x1ad/0x260
Jul 30 20:50:22 home kernel: [118560.183190] [<c01c1c49>] __kmalloc_track_caller+0xc9/0xf0
Jul 30 20:50:22 home kernel: [118560.183205] [<c04373d3>] ? __netdev_alloc_skb+0x23/0x50
Jul 30 20:50:22 home kernel: [118560.183218] [<c04373d3>] ? __netdev_alloc_skb+0x23/0x50
Jul 30 20:50:22 home kernel: [118560.183231] [<c04370e9>] __alloc_skb+0x49/0x100
Jul 30 20:50:22 home kernel: [118560.183243] [<c04373d3>] __netdev_alloc_skb+0x23/0x50
Jul 30 20:50:22 home kernel: [118560.183277] [<f80a931d>] rtl8169_rx_fill+0xad/0x1c0 [r8169]
Jul 30 20:50:22 home kernel: [118560.183300] [<f80a97d0>] rtl8169_rx_interrupt+0x3a0/0x510 [r8169]
Jul 30 20:50:22 home kernel: [118560.183322] [<f80acf6b>] rtl8169_poll+0x2b/0xd0 [r8169]
Jul 30 20:50:22 home kernel: [118560.183339] [<c04402cd>] net_rx_action+0xed/0x1d0
Jul 30 20:50:22 home kernel: [118560.183355] [<c01467f7>] __do_softirq+0x97/0x170
Jul 30 20:50:22 home kernel: [118560.183368] [<c014692d>] do_softirq+0x5d/0x60
Jul 30 20:50:22 home kernel: [118560.183381] [<c0146aa5>] irq_exit+0x55/0x90
Jul 30 20:50:22 home kernel: [118560.183394] [<c010c1c3>] do_IRQ+0x83/0xa0
Jul 30 20:50:22 home kernel: [118560.183406] [<c0146adc>] ? irq_exit+0x8c/0x90
Jul 30 20:50:22 home kernel: [118560.183420] [<c011f8fb>] ? smp_apic_timer_interrupt+0x5b/0x90
Jul 30 20:50:22 home kernel: [118560.183433] [<c010ab13>] common_interrupt+0x23/0x28
Jul 30 20:50:22 home kernel: [118560.183447] [<c0110a12>] ? mwait_idle+0x42/0x50
Jul 30 20:50:22 home kernel: [118560.183461] [<c010884d>] cpu_idle+0x6d/0xd0
Jul 30 20:50:22 home kernel: [118560.183473] [<c04fb1de>] rest_init+0x4e/0x60
Jul 30 20:50:22 home kernel: [118560.183482] Mem-Info:
Jul 30 20:50:22 home kernel: [118560.183488] DMA per-cpu:
Jul 30 20:50:22 home kernel: [118560.183496] CPU 0: hi: 0, btch: 1 usd: 0
Jul 30 20:50:22 home kernel: [118560.183505] CPU 1: hi: 0, btch: 1 usd: 0
Jul 30 20:50:22 home kernel: [118560.183512] Normal per-cpu:
Jul 30 20:50:22 home kernel: [118560.183520] CPU 0: hi: 186, btch: 31 usd: 47
Jul 30 20:50:22 home kernel: [118560.183529] CPU 1: hi: 186, btch: 31 usd: 172
Jul 30 20:50:22 home kernel: [118560.183537] HighMem per-cpu:
Jul 30 20:50:22 home kernel: [118560.18...

Keng-Yu Lin (lexical) wrote :

Since NAPI has changed a lot in newer versions of the Linux kernel, would you please try the latest mainline build (available at http://kernel.ubuntu.com/~kernel-ppa/mainline) to see whether the issue still exists?

Marcus Blomenkamp (mblomenk) wrote :

The problem is still present in Ubuntu 9.10. In my case the machine has 120MB of memory and 192MB of swap. Swap usage was about 10MB when the following happened during heavy network IO:

[ 2489.732639] swapper: page allocation failure. order:0, mode:0x4020
[ 2489.732659] Pid: 0, comm: swapper Not tainted 2.6.31-16-generic-pae #53-Ubuntu
[ 2489.732668] Call Trace:
[ 2489.732696] [<c057481e>] ? printk+0x18/0x1a
[ 2489.732720] [<c01ba2c0>] __alloc_pages_slowpath+0x340/0x480
[ 2489.732735] [<c01ba50f>] __alloc_pages_nodemask+0x10f/0x120
[ 2489.732754] [<c01e201a>] new_slab+0x19a/0x200
[ 2489.732769] [<c01e2fcf>] __slab_alloc+0xaf/0x260
[ 2489.732783] [<c01e3f6a>] __kmalloc_track_caller+0x16a/0x180
[ 2489.732803] [<c0499c8e>] ? __netdev_alloc_skb+0x1e/0x40
[ 2489.732815] [<c0499c8e>] ? __netdev_alloc_skb+0x1e/0x40
[ 2489.732828] [<c049995d>] __alloc_skb+0x4d/0x130
[ 2489.732840] [<c0499c8e>] __netdev_alloc_skb+0x1e/0x40
[ 2489.732881] [<c8723a53>] rtl8139_rx+0xd3/0x280 [8139too]
[ 2489.732901] [<c8723c45>] rtl8139_poll+0x45/0xc0 [8139too]
[ 2489.732922] [<c0168f0d>] ? clocksource_get_next+0x3d/0x50
[ 2489.732941] [<c04a4a75>] net_rx_action+0xe5/0x1c0
[ 2489.732958] [<c87233e6>] ? rtl8139_interrupt+0x96/0x180 [8139too]
[ 2489.732978] [<c014cc40>] __do_softirq+0x90/0x1a0
[ 2489.732998] [<c0128098>] ? default_spin_lock_flags+0x8/0x10
[ 2489.733013] [<c0576bda>] ? _spin_lock_irqsave+0x2a/0x40
[ 2489.733032] [<c0106b92>] ? enable_8259A_irq+0x42/0x60
[ 2489.733044] [<c014cd8d>] do_softirq+0x3d/0x40
[ 2489.733056] [<c014cecd>] irq_exit+0x5d/0x70
[ 2489.733067] [<c0104f10>] do_IRQ+0x50/0xc0
[ 2489.733078] [<c014cec0>] ? irq_exit+0x50/0x70
[ 2489.733099] [<c011ddc7>] ? smp_apic_timer_interrupt+0x57/0x90
[ 2489.733111] [<c01039b0>] common_interrupt+0x30/0x40
[ 2489.733124] [<c0127365>] ? native_safe_halt+0x5/0x10
[ 2489.733136] [<c010a5b6>] default_idle+0x46/0xd0
[ 2489.733148] [<c010202c>] cpu_idle+0x8c/0xd0
[ 2489.733170] [<c0564815>] rest_init+0x55/0x60
[ 2489.733182] [<c07958cd>] start_kernel+0x2e6/0x2ec
[ 2489.733193] [<c0795406>] ? unknown_bootoption+0x0/0x1ab
[ 2489.733212] [<c079507c>] i386_start_kernel+0x7c/0x83
[ 2489.733220] Mem-Info:
[ 2489.733226] DMA per-cpu:
[ 2489.733234] CPU 0: hi: 0, btch: 1 usd: 0
[ 2489.733241] Normal per-cpu:
[ 2489.733248] CPU 0: hi: 42, btch: 7 usd: 41
[ 2489.733262] Active_anon:370 active_file:13055 inactive_anon:468
[ 2489.733267] inactive_file:12624 unevictable:0 dirty:0 writeback:0 unstable:0
[ 2489.733272] free:236 slab:1936 mapped:4460 pagetables:398 bounce:0
[ 2489.733288] DMA free:496kB min:176kB low:220kB high:264kB active_anon:124kB inactive_anon:184kB active_file:3592kB inactive_file:3304kB unevictable:0kB present:15804kB pages_scanned:32 all_unreclaimable? no
[ 2489.733301] lowmem_reserve[]: 0 109 109 109
[ 2489.733321] Normal free:448kB min:1252kB low:1564kB high:1876kB active_anon:1356kB inactive_anon:1688kB active_file:48628kB inactive_file:47192kB unevictable:0kB present:112584kB pages_scanned:806 all_unreclaimable? no
[ 2489.733334] lowmem_reserve[]: 0 0 0 0
[ 2489.733347] DMA: 124*4kB 0*8kB 0*16kB 0*3...

Mark Bergsma (mark-wikimedia) wrote :

We are seeing this bug often with Ubuntu 8.04 LTS on most of our servers, with a variety of network cards/drivers (bnx2, tg3, etc).

Here is an example trace:

Dec 28 00:43:42 brewster kernel: [5113787.548225] swapper: page allocation failure. order:1, mode:0x4020
Dec 28 00:43:42 brewster kernel: [5113787.548231] Pid: 0, comm: swapper Not tainted 2.6.24-25-server #1
Dec 28 00:43:42 brewster kernel: [5113787.548233]
Dec 28 00:43:42 brewster kernel: [5113787.548233] Call Trace:
Dec 28 00:43:42 brewster kernel: [5113787.548235] <IRQ> [<ffffffff8028c54d>] __alloc_pages+0x2fd/0x3d0
Dec 28 00:43:42 brewster kernel: [5113787.548261] [<ffffffff802aeea0>] new_slab+0x220/0x260
Dec 28 00:43:42 brewster kernel: [5113787.548266] [<ffffffff802af1d5>] __slab_alloc+0x2f5/0x410
Dec 28 00:43:42 brewster kernel: [5113787.548269] [<ffffffff803eb4fb>] __netdev_alloc_skb+0x2b/0x50
Dec 28 00:43:42 brewster kernel: [5113787.548273] [<ffffffff803eb4fb>] __netdev_alloc_skb+0x2b/0x50
Dec 28 00:43:42 brewster kernel: [5113787.548276] [<ffffffff802b0461>] __kmalloc_node_track_caller+0x121/0x130
Dec 28 00:43:42 brewster kernel: [5113787.548281] [<ffffffff803ea75b>] __alloc_skb+0x7b/0x160
Dec 28 00:43:42 brewster kernel: [5113787.548285] [<ffffffff803eb4fb>] __netdev_alloc_skb+0x2b/0x50
Dec 28 00:43:42 brewster kernel: [5113787.548293] [<ffffffff880f1a46>] :tg3:tg3_alloc_rx_skb+0xb6/0x190
Dec 28 00:43:42 brewster kernel: [5113787.548300] [<ffffffff880fae79>] :tg3:tg3_poll+0x8a9/0xa60
Dec 28 00:43:42 brewster kernel: [5113787.548314] [<ffffffff803f27e8>] net_rx_action+0x128/0x230
Dec 28 00:43:42 brewster kernel: [5113787.548319] [<ffffffff802440a5>] __do_softirq+0x75/0xe0
Dec 28 00:43:42 brewster kernel: [5113787.548323] [<ffffffff8020d52c>] call_softirq+0x1c/0x30
Dec 28 00:43:42 brewster kernel: [5113787.548325] [<ffffffff8020ed45>] do_softirq+0x35/0x90
Dec 28 00:43:42 brewster kernel: [5113787.548327] [<ffffffff80244028>] irq_exit+0x88/0x90
Dec 28 00:43:42 brewster kernel: [5113787.548329] [<ffffffff8020ef90>] do_IRQ+0x80/0x100
Dec 28 00:43:42 brewster kernel: [5113787.548331] [<ffffffff8020b390>] default_idle+0x0/0x40
Dec 28 00:43:42 brewster kernel: [5113787.548333] [<ffffffff8020c8b1>] ret_from_intr+0x0/0xa
Dec 28 00:43:42 brewster kernel: [5113787.548334] <EOI> [<ffffffff803dcbd0>] menu_reflect+0x0/0xb0
Dec 28 00:43:42 brewster kernel: [5113787.548343] [<ffffffff8803c234>] :processor:acpi_safe_halt+0x28/0x39
Dec 28 00:43:42 brewster kernel: [5113787.548348] [<ffffffff8803c2ac>] :processor:acpi_idle_enter_c1+0x67/0x70
Dec 28 00:43:42 brewster kernel: [5113787.548349] [<ffffffff8020b390>] default_idle+0x0/0x40
Dec 28 00:43:42 brewster kernel: [5113787.548352] [<ffffffff803dbf62>] cpuidle_idle_call+0xa2/0xe0
Dec 28 00:43:42 brewster kernel: [5113787.548354] [<ffffffff8020b390>] default_idle+0x0/0x40
Dec 28 00:43:42 brewster kernel: [5113787.548355] [<ffffffff803dbec0>] cpuidle_idle_call+0x0/0xe0
Dec 28 00:43:42 brewster kernel: [5113787.548357] [<ffffffff8020b418>] cpu_idle+0x48/0xe0
Dec 28 00:43:42 brewster kernel: [5113787.548361] [<ffffffff80632885>] start_kernel+0x2c5/0x350
Dec 28 00:43:42 brewster kernel: [5113787.548364] [<ffffffff806...

Changed in linux (Ubuntu):
status: Incomplete → New
description: updated
Jeremy Foshee (jeremyfoshee) wrote :

LGB [Gábor Lénárt],
       Please verify if this issue exists in the latest build of Lucid so that we can triage and work the issue appropriately. I look forward to the results of your testing.

Thanks!

~JFo

Changed in linux (Ubuntu):
status: New → Incomplete
LGB [Gábor Lénárt] (lgb) wrote :

I haven't run into the problem yet, it seems ... At least I couldn't find anything similar in the kernel logs ...

Oliver Leitner (shadow333) wrote :

Problem still exists:

May 1 09:32:11 localhost kernel: [36788.743138] swapper: page allocation failure. order:3, mode:0x4020
May 1 09:32:11 localhost kernel: [36788.743143] Pid: 0, comm: swapper Tainted: P 2.6.32-21-generic #32-Ubuntu
May 1 09:32:11 localhost kernel: [36788.743146] Call Trace:
May 1 09:32:11 localhost kernel: [36788.743148] <IRQ> [<ffffffff810f9d7e>] __alloc_pages_slowpath+0x56e/0x580
May 1 09:32:11 localhost kernel: [36788.743162] [<ffffffff810f9eee>] __alloc_pages_nodemask+0x15e/0x1a0
May 1 09:32:11 localhost kernel: [36788.743168] [<ffffffff81131c12>] kmalloc_large_node+0x62/0xb0
May 1 09:32:11 localhost kernel: [36788.743172] [<ffffffff81136265>] __kmalloc_node_track_caller+0x125/0x180
May 1 09:32:11 localhost kernel: [36788.743178] [<ffffffff81453a46>] ? skb_copy+0x36/0xa0
May 1 09:32:11 localhost kernel: [36788.743182] [<ffffffff81452b90>] __alloc_skb+0x80/0x190
May 1 09:32:11 localhost kernel: [36788.743186] [<ffffffff81453a46>] skb_copy+0x36/0xa0
May 1 09:32:11 localhost kernel: [36788.743193] [<ffffffffa0ba95e8>] vboxNetFltLinuxPacketHandler+0x58/0xd0 [vboxnetflt]
May 1 09:32:11 localhost kernel: [36788.743199] [<ffffffff8145d3d9>] dev_queue_xmit_nit+0x129/0x190
May 1 09:32:11 localhost kernel: [36788.743217] [<ffffffff8145d74f>] dev_hard_start_xmit+0x4f/0x1e0
May 1 09:32:11 localhost kernel: [36788.743223] [<ffffffff81475eef>] sch_direct_xmit+0x15f/0x1d0
May 1 09:32:11 localhost kernel: [36788.743227] [<ffffffff81475fcb>] __qdisc_run+0x6b/0xe0
May 1 09:32:11 localhost kernel: [36788.743231] [<ffffffff8145aec0>] net_tx_action+0x120/0x180
May 1 09:32:11 localhost kernel: [36788.743236] [<ffffffff8106e3a7>] __do_softirq+0xb7/0x1e0
May 1 09:32:11 localhost kernel: [36788.743242] [<ffffffff81031b52>] ? ack_apic_level+0x82/0x1f0
May 1 09:32:11 localhost kernel: [36788.743248] [<ffffffff810142ec>] call_softirq+0x1c/0x30
May 1 09:32:11 localhost kernel: [36788.743252] [<ffffffff81015cb5>] do_softirq+0x65/0xa0
May 1 09:32:11 localhost kernel: [36788.743255] [<ffffffff8106e245>] irq_exit+0x85/0x90
May 1 09:32:11 localhost kernel: [36788.743260] [<ffffffff81545e75>] do_IRQ+0x75/0xf0
May 1 09:32:11 localhost kernel: [36788.743264] [<ffffffff81013b13>] ret_from_intr+0x0/0x11
May 1 09:32:12 localhost kernel: [36788.743266] <EOI> [<ffffffff8130d44e>] ? acpi_idle_enter_simple+0x117/0x14b
May 1 09:32:12 localhost kernel: [36788.743276] [<ffffffff8130d447>] ? acpi_idle_enter_simple+0x110/0x14b
May 1 09:32:12 localhost kernel: [36788.743280] [<ffffffff8130d151>] ? acpi_idle_enter_bm+0xd8/0x2be
May 1 09:32:12 localhost kernel: [36788.743285] [<ffffffff81543aa6>] ? notifier_call_chain+0x16/0x80
May 1 09:32:12 localhost kernel: [36788.743291] [<ffffffff814374d7>] ? cpuidle_idle_call+0xa7/0x140
May 1 09:32:12 localhost kernel: [36788.743297] [<ffffffff81011e73>] ? cpu_idle+0xb3/0x110
May 1 09:32:12 localhost kernel: [36788.743301] [<ffffffff8153ad1b>] ? start_secondary+0xa8/0xaa
May 1 09:32:12 localhost kernel: [36788.743304] Mem-Info:
May 1 09:32:12 localhost kernel: [36788.743307] Node 0 DMA per-cpu:
May 1 09:32:12 localhost kernel: [36788.743311] CPU 0: hi: ...

madorc (orc-homesrv) wrote :

02:05.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8139/8139C/8139C+ (rev 10)
The network goes down (all other tasks work fine)...
Part of the syslog is in the attachment...

Jeremy Foshee (jeremyfoshee) wrote :

This bug report was marked as Incomplete and has not had any updated comments for quite some time. As a result this bug is being closed. Please reopen if this is still an issue in the current Ubuntu release http://www.ubuntu.com/getubuntu/download . Also, please be sure to provide any requested information that may have been missing. To reopen the bug, click on the current status under the Status column and change the status back to "New". Thanks.

[This is an automated message. Apologies if it has reached you inappropriately; please just reply to this message indicating so.]

tags: added: kj-expired
Changed in linux (Ubuntu):
status: Incomplete → Expired
Changed in linux (Ubuntu):
status: Expired → New
Alex Tomlins (alex-tomlins) wrote :

I've just hit this bug on Lucid

# uname -a
Linux arwen 2.6.32-29-generic #58-Ubuntu SMP Fri Feb 11 20:52:10 UTC 2011 x86_64 GNU/Linux

kern.log attached.

This occurred while rsyncing a large amount of data over rsh.

DaMoGan (dmgdsm) wrote :

I have likewise hit this on Lucid with latest patches:

Linux matrix 2.6.32-30-server #59-Ubuntu SMP Tue Mar 1 22:46:09 UTC 2011 x86_64 GNU/Linux

Hardware is an IBM eServer Xseries 346 Model 8840U with two attached Infortrend Eonstor 24 bay RAID arrays, each hanging off an LSI22320-R. Attached is kern.log, lsmod and lspci -vvv.

I'm in a similar situation:

DaMoGan (dmgdsm) wrote :

Oops, posted before I was finished:

I'm in a similar situation -- I'm rsyncing a bunch of stuff to the server I'm seeing the errors on.

Thanks.

DaMoGan (dmgdsm) wrote :

Ok, this is weird -- I just now saw this same bug on an updated lucid system consisting of entirely different hardware -- looks like perhaps a bug was introduced in the last kernel update?

Linux nmrfs 2.6.32-30-server #59-Ubuntu SMP Tue Mar 1 22:46:09 UTC 2011 x86_64 GNU/Linux

Hardware is a whitebox 1U server with attached 24 bay Area DNS-1400 JBOD SAS-SATA, hung off an LSI MegaRAID SAS 9280-8e.

Tim Gardner (timg-tpi) wrote :

DaMoGan - please start a new bug using 'ubuntu-bug linux'

Changed in linux (Ubuntu):
status: New → Confirmed
Péter Molnár (empece) wrote :

I have the same problem with 2.6.32-33-generic x86_64.

Josh (joshhansen) wrote :

I'm getting something similar, but with the rtl8192 driver. I was connected to a wireless network but otherwise the computer was idle when this happened. The system dropped down to the text console and was completely unresponsive.

==Other information==
Thinkpad Edge E420
Ubuntu 11.04 x86_64
Home filesystem is encrypted and mounted on .Private

# uname -a
Linux ibuse 2.6.38-11-generic #50-Ubuntu SMP Mon Sep 12 21:17:25 UTC 2011 x86_64 x86_64 x86_64 GNU/Linux

relevant kern.log output -> see attachment

# lspci -vvv
00:00.0 Host bridge: Intel Corporation 2nd Generation Core Processor Family DRAM Controller (rev 09)
 Subsystem: Lenovo Device 21e2
 Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
 Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort+ >SERR- <PERR- INTx-
 Latency: 0
 Capabilities: <access denied>
 Kernel driver in use: agpgart-intel

00:02.0 VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09) (prog-if 00 [VGA controller])
 Subsystem: Lenovo Device 21e2
 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
 Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
 Latency: 0
 Interrupt: pin A routed to IRQ 42
 Region 0: Memory at d0000000 (64-bit, non-prefetchable) [size=4M]
 Region 2: Memory at c0000000 (64-bit, prefetchable) [size=256M]
 Region 4: I/O ports at 5000 [size=64]
 Expansion ROM at <unassigned> [disabled]
 Capabilities: <access denied>
 Kernel driver in use: i915
 Kernel modules: i915

00:16.0 Communication controller: Intel Corporation 6 Series Chipset Family MEI Controller #1 (rev 04)
 Subsystem: Lenovo Device 21e2
 Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
 Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
 Latency: 0
 Interrupt: pin A routed to IRQ 11
 Region 0: Memory at d1605000 (64-bit, non-prefetchable) [size=16]
 Capabilities: <access denied>

00:1a.0 USB Controller: Intel Corporation 6 Series Chipset Family USB Enhanced Host Controller #2 (rev 04) (prog-if 20 [EHCI])
 Subsystem: Lenovo Device 21e2
 Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
 Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
 Latency: 0
 Interrupt: pin A routed to IRQ 16
 Region 0: Memory at d160a000 (32-bit, non-prefetchable) [size=1K]
 Capabilities: <access denied>
 Kernel driver in use: ehci_hcd

00:1b.0 Audio device: Intel Corporation 6 Series Chipset Family High Definition Audio Controller (rev 04)
 Subsystem: Lenovo Device 21e2
 Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
 Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
 Latency: 0, Cache Line Size: 64 bytes
 Interrupt: pin A routed to IRQ 43
 Region 0: Memory at d1600000 (64-bit, non-prefetchable) [size=16K]
 Capabiliti...

Revision history for this message
jan (jan-ubuntu-h-i-s) wrote :

I see the same behavior with a USB network stick:
Bus 002 Device 006: ID 050d:945a Belkin Components

I have added the error log from my Lucid system:
Linux 2.6.32-36-generic-pae #79-Ubuntu SMP Tue Nov 8 23:25:26 UTC 2011 i686 GNU/Linux

Memory management seems to be broken; whether this is related to the network driver is unclear to me.
The messages look dangerous to me, because processes other than the network driver are affected as well.
All of the messages in my log (tens of them) show a similar call tree inside the network driver.
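
One way to check whether the failures cluster around a single process is to tally them from the kernel log. Below is a minimal sketch in Python, assuming the log lines have the shape quoted in the bug description ('swapper: page allocation failure. order:1, mode:0x4020'). The names FAILURE_RE, tally_failures and tally_file are made up for illustration, and the log path in the usage note is only the usual Ubuntu default.

```python
import re
from collections import Counter

# Assumed line shape, taken from the message quoted in the bug description:
#   swapper: page allocation failure. order:1, mode:0x4020
# Any syslog prefix (timestamp, host, "kernel:") before the process name is ignored.
FAILURE_RE = re.compile(
    r"(?P<proc>\S+): page allocation failure\. "
    r"order:(?P<order>\d+), mode:(?P<mode>0x[0-9a-fA-F]+)"
)


def tally_failures(lines):
    """Count failures per (process, order, mode) triple."""
    counts = Counter()
    for line in lines:
        match = FAILURE_RE.search(line)
        if match:
            key = (match["proc"], int(match["order"]), match["mode"])
            counts[key] += 1
    return counts


def tally_file(path):
    """Convenience wrapper: tally failures in a log file on disk."""
    with open(path, errors="replace") as log:
        return tally_failures(log)
```

Calling tally_file("/var/log/kern.log").most_common() would list the most frequent (process, order, mode) combinations first. This only counts the header lines; it does not inspect the call trace that follows each one.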

Revision history for this message
jan (jan-ubuntu-h-i-s) wrote :

Related to a newer bug:
https://bugs.launchpad.net/ubuntu/+source/network-manager/+bug/605045

The links in that bug state that it is a kernel bug and that a solution is known. When might the fix come to Lucid?

Revision history for this message
penalvch (penalvch) wrote :

LGB [Gábor Lénárt], this bug was reported a while ago and there hasn't been any activity in it recently. We were wondering if this is still an issue? Can you try with the latest development release of Ubuntu? ISO CD images are available from http://cdimage.ubuntu.com/releases/ .

If it remains an issue, could you run the following command in the development release from a Terminal (Applications->Accessories->Terminal). It will automatically gather and attach updated debug information to this report.

apport-collect -p linux <replace-with-bug-number>

Also, if you could test the latest upstream kernel available that would be great. It will allow additional upstream developers to examine the issue. Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Please do not test the kernel in the daily folder, but the one all the way at the bottom. Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag. This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text. As well, please comment on which kernel version specifically you tested.

If this bug is fixed in the mainline kernel, please add the following tag 'kernel-fixed-upstream'.

If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'.

If you are unable to test the mainline kernel, for example it will not boot, please add the tag: 'kernel-unable-to-test-upstream', and comment as to why specifically you were unable to test it.

Please let us know your results. Thanks in advance.

tags: added: needs-upstream-testing
removed: rtl8139
no longer affects: linux-source-2.6.22 (Ubuntu)
Changed in linux (Ubuntu):
status: Confirmed → Incomplete
Revision history for this message
Launchpad Janitor (janitor) wrote :

[Expired for linux (Ubuntu) because there has been no activity for 60 days.]

Changed in linux (Ubuntu):
status: Incomplete → Expired