Bug #720095 “vsftpd causes a vmalloc space leak in Lucid” : Bugs : linux package : Ubuntu

Revision history for this message

Peter Matulis (petermatulis) wrote on 2011-02-16:

#1

BootDmesg.txt (23.1 KiB, text/plain; charset="utf-8")
ProcCpuinfo.txt (1.9 KiB, text/plain; charset="utf-8")
ProcInterrupts.txt (1.9 KiB, text/plain; charset="utf-8")
ProcModules.txt (620 bytes, text/plain; charset="utf-8")
UdevDb.txt (57.7 KiB, text/plain; charset="utf-8")
UdevLog.txt (125.4 KiB, text/plain; charset="utf-8")

description:

updated

Peter Matulis (petermatulis) wrote on 2011-02-16:

#2

feedftp (531 bytes, text/plain)

Peter Matulis (petermatulis) wrote on 2011-02-16:

#3

dmesg-oom.32.txt (245.6 KiB, text/plain)

Peter Matulis (petermatulis) wrote on 2011-02-16:

#4

vmallocinfo.32.tar (410.0 KiB, application/x-tar)

Peter Matulis (petermatulis) on 2011-02-16

description:

updated

Jeremy Foshee (jeremyfoshee) on 2011-02-16

tags:

added: kernel-key

Peter Matulis (petermatulis) on 2011-02-16

description:

updated

Andy Whitcroft (apw) wrote on 2011-02-16:

#5

From the report it seems that this was broken in v2.6.32 and maybe fixed in v2.6.35. The commit below sounds plausible and might be worth looking at:

  commit 02b709df817c0db174f249cc59e5f7fd01b64d92
  Author: Nick Piggin <email address hidden>
  Date: Mon Feb 1 22:25:57 2010 +1100

    mm: purge fragmented percpu vmap blocks

    Improve handling of fragmented per-CPU vmaps. We previously don't free
    up per-CPU maps until all its addresses have been used and freed. So
    fragmented blocks could fill up vmalloc space even if they actually had
    no active vmap regions within them.

    Add some logic to allow all CPUs to have these blocks purged in the case
    of failure to allocate a new vm area, and also put some logic to trim
    such blocks of a current CPU if we hit them in the allocation path (so
    as to avoid a large build up of them).

    Christoph reported some vmap allocation failures when using the per CPU
    vmap APIs in XFS, which cannot be reproduced after this patch and the
    previous bug fix.

    Cc: <email address hidden>
    Cc: <email address hidden>
    Tested-by: Christoph Hellwig <email address hidden>
    Signed-off-by: Nick Piggin <email address hidden>
    --
    Signed-off-by: Linus Torvalds <email address hidden>

summary:

- vsftpd causes memory leak in Lucid
+ vsftpd causes a vmalloc space leak in Lucid

Stefan Bader (smb) wrote on 2011-02-16:

#6

Seems we got this already in Lucid.

Stefan Bader (smb) wrote on 2011-02-16:

#7

It may be interesting if you had a spare disk and could do a bare metal installation of Lucid to repeat that test there.

Stefan Bader (smb) on 2011-02-16

Changed in linux (Ubuntu):
assignee:	nobody → Stefan Bader (stefan-bader-canonical)
importance:	Undecided → Medium
status:	New → Confirmed

Walter Richards (walter-richards-ec) wrote on 2011-02-16:

#8

Hi, we have this issue on bare metal installation as you say. On a Dell M605 (AMD processor) and on an IBM x3650 (Intel).

As well, it might help you to know that we tested it on SUSE with kernel 2.6.32-24, and the bug is not there.

Jef Goupil (jef-goupil-ec) wrote on 2011-02-16:

#9

Hi, I am working with Walter Richards. During our testing, we realized that it takes several hours to retrieve the memory after we stopped the ftp connections.
On our 32GB Dell server, it took 5 hours to fill all the memory (until OOM kills at 2.1GB of free memory). Then, it took 8 hours before it started to retrieve the free memory, and then it took another 7 hours to completely get the 30GB back...

Peter Matulis (petermatulis) on 2011-02-17

description:

updated

Stefan Bader (smb) wrote on 2011-02-17:

#10

Ok, so this is not really something conclusive yet, but it seems to me (when playing with that locally) that for those memory allocations to grow, there is no actual file transfer needed. Just looping through doing a connect and immediately disconnecting again showed those 2M chunks growing. So I guess I can concentrate on that area further on.
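
The connect-and-disconnect loop described in this comment could be sketched roughly as follows. This is a minimal illustration, not the test script attached to the bug (feedftp); the function name, host, port, and count are hypothetical, and it assumes that completing the TCP handshake is enough for the server to fork a handler, as the comment suggests.

```python
import socket


def connect_disconnect_loop(host, port, count, timeout=5.0):
    """Open and immediately close `count` TCP connections.

    Returns the number of connections that were established.
    """
    succeeded = 0
    for _ in range(count):
        try:
            # create_connection completes the TCP handshake; leaving the
            # `with` block closes the socket at once, so no FTP command
            # and no file transfer is ever sent.
            with socket.create_connection((host, port), timeout=timeout):
                succeeded += 1
        except OSError:
            # Refused or timed-out attempts are counted as failures
            # and are not retried.
            pass
    return succeeded
```

Pointed at an FTP server, for example connect_disconnect_loop("ftp.example.test", 21, 1000) with a hypothetical host name, this should generate connections without any transfers; whether it reproduces the growth depends on the kernel and the vsftpd configuration.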

Stefan Bader (smb) wrote on 2011-02-24:

#11

So this is what I found out so far: whenever a client connects to vsftpd, it forks off a process to handle the connection. This is done in a way that also duplicates the network namespace (besides the process namespace). This can actually be observed by the fact that every time that happens there is a message about "lo: Disabled Privacy Extensions" (which is slightly stupid to note as that is the default for lo). Anyway, cloning the network namespace also sets up the snmp mib structures, and those are allocated by pcpu.

The main problem seems to be that cleaning up those structures is done in Lucid for each interface on its own by putting it onto a work queue. This seems to be rather slow, so while the test case is running it seems the system creates new namespaces faster than it is able to clean them up. Even after stopping the test this takes a while, and because of the way that vmalloc pcpu areas are handled, they can potentially stick around even longer (the areas are not exclusively used by network namespaces; something else may use parts of an area, and the area cannot be cleaned up until the last user is gone).

Between 2.6.32 and 2.6.35, there was a series of changes that allowed batching the cleanup of network namespaces. The comment on one of those indicated that cleaning up 4000 namespaces would have taken more than 7 minutes before and could be reduced to 44 seconds (at the price of an increased cpu load). I was able to backport all required changes and this seems to avoid the build up of the vmalloc area (I am not sure about that but it felt like the speed of connects and disconnects was lower). I am still reluctant to go down that road because the required changes were somewhat big, and the more gets changed, the higher the chance of picking up some regression. Also the question is whether the test case models a realistic usage. To clarify, this is not a real leak. It is a combination of specially allocated memory and a slow / complicated policy for freeing those allocations.
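
To put the quoted upstream figures in per-namespace terms, the arithmetic can be sketched as below. It uses only the numbers from the comment above (4000 namespaces, "more than 7 minutes", 44 seconds); the 7 minutes is a lower bound, so the "before" cost and the speedup are lower bounds too. The variable and function names are illustrative.

```python
# Figures quoted from the upstream commit comment mentioned above.
NAMESPACES = 4000
BEFORE_SECONDS = 7 * 60   # "more than 7 minutes": treated as a lower bound
AFTER_SECONDS = 44


def per_namespace_ms(total_seconds, count):
    """Average cleanup cost per namespace, in milliseconds."""
    return total_seconds * 1000 / count


before_ms = per_namespace_ms(BEFORE_SECONDS, NAMESPACES)
after_ms = per_namespace_ms(AFTER_SECONDS, NAMESPACES)
speedup = BEFORE_SECONDS / AFTER_SECONDS

print(f"before batching: at least {before_ms:.0f} ms per namespace")
print(f"after batching:  about {after_ms:.0f} ms per namespace")
print(f"speedup:         at least {speedup:.1f}x")
```

These are the upstream measurements, not numbers taken on the Lucid test systems in this report, so they only indicate the order of magnitude of the improvement.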

Jef Goupil (jef-goupil-ec) wrote on 2011-02-24:

#12

Hi,
our production servers have more than 250000 vsftpd connections per day:
# zgrep CONNECT vsftpd.log.3.gz|wc -l
260210
which is about 3 connections per second. Each connection may have up to 200 file transfers. The testcase produces about 15 connections per second. That is 5 times more than our reality, but we can see the problem with only 3 connections per second.

Btw, the "Disabled Privacy Extensions" messages can be avoided by disabling IPv6 (ipv6.disable=1 on the grub command line).
Thanks
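
The rates in this comment follow directly from the log count. A small sketch of the arithmetic, using the 260210 connections counted by zgrep and the roughly 15 connections per second quoted for the test case (the names are illustrative):

```python
SECONDS_PER_DAY = 24 * 60 * 60


def connections_per_second(connections_per_day):
    """Average connection rate over a full day."""
    return connections_per_day / SECONDS_PER_DAY


production_rate = connections_per_second(260210)  # count from the zgrep above
testcase_rate = 15                                # approximate, from the comment
ratio = testcase_rate / production_rate

print(f"production: {production_rate:.2f} connections/s")
print(f"test case is about {ratio:.1f}x the production rate")
```

This is a daily average; peak rates on the production servers may well be higher than the average suggests.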

Stefan Bader (smb) wrote on 2011-02-25:

#13

The privacy extension message was more of a side note. It has been removed in current code (probably because of the limited use).

So the test case is making sense. And I was also beginning to think whether this could be seen as a security issue. As someone could bring down the server by doing many connects. It probably takes a while. Still...

Unfortunately I am not really happy with the backports I mentioned before. They really seem to slow down the rate of connects. So it could be only due to that, that the vmalloc space is not filled up quicker than it gets released. I think I have to do a few more experiments. Not sure it can be done but I would rather want to avoid making too large changes.

Stefan Bader (smb) wrote on 2011-03-17:

#14

Unfortunately this got a bit hit by other issues I was looking at. So I did not really see any small change to improve things. I guess I need to reach out for help from upstream. Meanwhile there is this series of patches I backported from 2.6.35 that allow network namespaces to be cleaned up in batches. From my feeling this seems to slow down the rate I see the test tasks connecting, but this seems to be the same on 2.6.35. Maybe that is something one can live with.
In order to give other people a chance to look at that I have prepared kernel packages and put them to:
http://people.canonical.com/~smb/lucid-netnsbp
Meanwhile I try to get some help...

Jef Goupil (jef-goupil-ec) wrote on 2011-03-21:

#15

Hi,
I installed the following files from your site and rebooted:

/root/linux-headers-2.6.32-31-server_2.6.32-31.60+netnsbp1_amd64.deb
/root/linux-headers-2.6.32-31_2.6.32-31.60+netnsbp1_all.deb
/root/linux-image-2.6.32-31-server_2.6.32-31.60+netnsbp1_amd64.deb
/root/linux-libc-dev_2.6.32-31.60+netnsbp1_amd64.deb
/root/linux-tools-2.6.32-31_2.6.32-31.60+netnsbp1_amd64.deb
/root/linux-tools-common_2.6.32-31.60+netnsbp1_all.deb

Then, my ftp tests worked properly without using any free memory... The file /proc/vmallocinfo stayed at around 5 lines (2 new lines only) (grep pcpu /proc/vmallocinfo). Before, it was constantly growing.

Seems it is fixed in this kernel level.

Thanks a lot!
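
The check used in this comment (grep pcpu /proc/vmallocinfo) can also be turned into a count and a byte total, which makes growth easier to track over time. The sketch below assumes each line has the layout "start-end size caller ...", with the size in bytes as the second field; the function names are illustrative, and reading /proc/vmallocinfo normally requires root.

```python
def summarize_pcpu(vmallocinfo_text):
    """Return (area_count, total_bytes) for lines that mention 'pcpu'."""
    count = 0
    total = 0
    for line in vmallocinfo_text.splitlines():
        if "pcpu" not in line:
            continue
        fields = line.split()
        # Assumed layout: "<start>-<end> <size in bytes> <caller> ..."
        if len(fields) < 2 or not fields[1].isdigit():
            continue
        count += 1
        total += int(fields[1])
    return count, total


def read_vmallocinfo(path="/proc/vmallocinfo"):
    """Read the whole file; this usually needs root privileges."""
    with open(path) as handle:
        return handle.read()
```

Calling summarize_pcpu(read_vmallocinfo()) at intervals while the FTP test runs should show whether the number of pcpu areas stays flat, as reported here, or keeps growing as it did before.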

Jef Goupil (jef-goupil-ec) wrote on 2011-03-21:

#16

Hi,
Sorry, I forgot to add a comment about the process netns: it was using a lot of CPU during my tests...:

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
   37 root      20   0     0    0    0 D   15  0.0   0:51.89 netns
 7218 root      20   0 33464 1328  888 D    1  0.0   0:00.03 vsftpd
 7220 root      20   0 33464 1328  888 D    1  0.0   0:00.03 vsftpd
 7221 root      20   0 33464 1328  888 D    1  0.0   0:00.03 vsftpd

I think this is what you were afraid of... slowing down. I am not sure what the effect on my production servers would be.

Thanks!

Stefan Bader (smb) wrote on 2011-03-22:

#17

Yes, basically cleanup is rather done in batches, which takes more cpu but could also affect lock contention. That and the fact that it requires backporting several patches which may cause effects we don't know of, causes me to be a bit reluctant about the changes.

Stefan Bader (smb) wrote on 2011-03-30:

#18

After trying various approaches of backports, none of which seemed really satisfying, it was decided that the safest way to go is to just turn off support for network namespaces. While this can have some impact on use-cases which try to containerize the network, the feature was too immature to be turned on in the first place. To use network namespaces in Lucid, people should use the lts backport kernel.

Changed in linux (Ubuntu Lucid):
assignee:	nobody → Stefan Bader (stefan-bader-canonical)
importance:	Undecided → Medium
status:	New → Fix Committed

Stefan Bader (smb) wrote on 2011-03-30:

#19

The problem only occurs on Lucid with network namespaces turned on. So not valid for Maverick and later.

Changed in linux (Ubuntu):
assignee:	Stefan Bader (stefan-bader-canonical) → nobody
status:	Confirmed → Invalid

Stefan Bader (smb) on 2011-03-30

description:

updated

Rachel Greenham (rachel-strangenoises) wrote on 2011-04-16:

#20

I think I've been experiencing this bug on a production vmware guest server running Lucid with vsftp being connected to frequently by client machines.

The thing is, this bug shows as being "fix committed" - and the implication I get from comment #18 is that the current production (ie: not backport) kernel has netns disabled. I'm all up to date, with kernel 2.6.32-30-server, and still seeing elevated netns cpu usage, and a general slowdown of other activity which I believe is related.

Is there something we need to do specifically to ensure netns is disabled?

Clint Byrum (clint-fewbar) wrote on 2011-04-16: Re: [Bug 720095] Re: vsftpd causes a vmalloc space leak in Lucid

#21

Excerpts from Rachel Greenham's message of Sat Apr 16 11:25:10 UTC 2011:
> I think I've been experiencing this bug on a production vmware guest
> server running Lucid with vsftp being connected to frequently by client
> machines.
>
> The thing is, this bug shows as being "fix committed" - and the
> implication I get from comment #18 is that the current production (ie:
> not backport) kernel has netns disabled. I'm all up to date, with
> kernel 2.6.32-30-server, and still seeing elevated netns cpu usage, and
> a general slowdown of other activity which I believe is related.
>
> Is there something we need to do specifically to ensure netns is
> disabled?

Rachel, Fix Committed means that the developers have it in their tree
but it hasn't been released yet. Presumably this means that netns will
be disabled in the next lucid kernel update.
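
Once an updated kernel is installed, one way to answer the question above is to look for CONFIG_NET_NS in the configuration of the running kernel. The sketch below assumes the Ubuntu convention of shipping that configuration as /boot/config-<release>; the function names are illustrative.

```python
import platform


def net_ns_enabled(config_text):
    """Return True if the kernel config text contains CONFIG_NET_NS=y.

    A disabled option normally shows up as the comment line
    "# CONFIG_NET_NS is not set" or is missing altogether.
    """
    for line in config_text.splitlines():
        if line.strip() == "CONFIG_NET_NS=y":
            return True
    return False


def running_kernel_config_path():
    """Path of the config file for the running kernel (Ubuntu convention)."""
    return f"/boot/config-{platform.release()}"
```

Reading the file at running_kernel_config_path() and passing its contents to net_ns_enabled should return False on a kernel built with network namespaces turned off.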

Rachel Greenham (rachel-strangenoises) wrote on 2011-04-18:

#22

Quite right, I should have noticed that. :-) I may just need to be patient then, although this being a production machine experiencing real problems since go-live with users connecting in volume may preclude patience. I had been considering embedding a java FTP server into our application instead, although now I've thought of that other advantages of doing so come to mind. :-) Late reply because commenting on this bug didn't subscribe me to it like I expected. Will subscribe now.

Martin Pitt (pitti) wrote on 2011-04-25: Please test proposed package

#23

Accepted linux into lucid-proposed, the package will build now and be available in a few hours. Please test and give feedback here. See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you in advance!

Steve Conklin (sconklin) wrote on 2011-04-25:

#24

This bug is awaiting verification that the kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed' to 'verification-done'.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you!

tags:

added: verification-needed-lucid

Rachel Greenham (rachel-strangenoises) wrote on 2011-04-26:

#25

Applied successfully to two test instances; can confirm absence of netns process, and nothing seems to be broken. :-) My problem is only exhibiting on a production server under load though, and while I can see elevated netns cpu usage most of the time it only becomes a problem intermittently, so it may take a little longer to install this there and see if it really helps.

Clint Byrum (clint-fewbar) on 2011-04-26

tags:

added: verification-done
removed: verification-needed-lucid

Launchpad Janitor (janitor) wrote on 2011-05-30:

#26

This bug was fixed in the package linux - 2.6.32-32.62

---------------
linux (2.6.32-32.62) lucid-proposed; urgency=low

[ Brad Figg ]

  * Release Tracking Bug
    - LP: #767370

[ Stefan Bader ]

  * (config) Disable CONFIG_NET_NS
    - LP: #720095

[ Upstream Kernel Changes ]

  * Revert "drm/radeon/kms: Fix retrying ttm_bo_init() after it failed
    once."
    - LP: #736234
  * Revert "drm/radeon: fall back to GTT if bo creation/validation in VRAM
    fails."
    - LP: #736234
  * x86: pvclock: Move scale_delta into common header
  * KVM: x86: Fix a possible backwards warp of kvmclock
  * KVM: x86: Fix kvmclock bug
  * cpuset: add a missing unlock in cpuset_write_resmask()
    - LP: #736234
  * keyboard: integer underflow bug
    - LP: #736234
  * RxRPC: Fix v1 keys
    - LP: #736234
  * ixgbe: fix for 82599 erratum on Header Splitting
    - LP: #736234
  * mm: fix possible cause of a page_mapped BUG
    - LP: #736234
  * powerpc/kdump: CPUs assume the context of the oopsing CPU
    - LP: #736234
  * powerpc/kdump: Use chip->shutdown to disable IRQs
    - LP: #736234
  * powerpc: Use more accurate limit for first segment memory allocations
    - LP: #736234
  * powerpc/pseries: Add hcall to read 4 ptes at a time in real mode
    - LP: #736234
  * powerpc/kexec: Speedup kexec hash PTE tear down
    - LP: #736234
  * powerpc/crashdump: Do not fail on NULL pointer dereferencing
    - LP: #736234
  * powerpc/kexec: Fix orphaned offline CPUs across kexec
    - LP: #736234
  * netfilter: nf_log: avoid oops in (un)bind with invalid nfproto values
    - LP: #736234
  * nfsd: wrong index used in inner loop
    - LP: #736234
  * r8169: use RxFIFO overflow workaround for 8168c chipset.
    - LP: #736234
  * Staging: comedi: jr3_pci: Don't ioremap too much space. Check result.
    - LP: #736234
  * net: don't allow CAP_NET_ADMIN to load non-netdev kernel modules,
    CVE-2011-1019
    - LP: #736234
    - CVE-2011-1019
  * ip6ip6: autoload ip6 tunnel
    - LP: #736234
  * Linux 2.6.32.33
    - LP: #736234
  * drm/radeon: fall back to GTT if bo creation/validation in VRAM fails.
    - LP: #652934, #736234
  * drm/radeon/kms: Fix retrying ttm_bo_init() after it failed once.
    - LP: #652934, #736234
  * drm: fix unsigned vs signed comparison issue in modeset ctl ioctl,
    CVE-2011-1013
    - LP: #736234
    - CVE-2011-1013
  * Linux 2.6.32.33+drm33.15
    - LP: #736234
  * econet: Fix crash in aun_incoming(). CVE-2010-4342
    - LP: #736394
    - CVE-2010-4342
  * igb: only use vlan_gro_receive if vlans are registered, CVE-2010-4263
    - LP: #737024
    - CVE-2010-4263
  * irda: prevent integer underflow in IRLMP_ENUMDEVICES, CVE-2010-4529
    - LP: #737823
    - CVE-2010-4529
  * hwmon/f71882fg: Set platform drvdata to NULL later
    - LP: #742056
  * mtd: add "platform:" prefix for platform modalias
    - LP: #742056
  * libata: no special completion processing for EH commands
    - LP: #742056
  * MIPS: MTX-1: Make au1000_eth probe all PHY addresses
    - LP: #742056
  * x86/mm: Handle mm_fault_error() in kernel space
    - LP: #742056
  * ftrace: Fix memory leak with function graph and cpu hotplug
    - LP: #742056
  * x86: Fix panic when handling "mem={invalid}" param
    - LP: #553464, #742056
  * x86: Emit "mem=nopentium ignored" warning when not supported
    - LP: #553464, #742056
  * ahci: AHCI and RAID mode SATA patch for Intel Patsburg DeviceIDs
    - LP: #742056
  * ahci: AHCI mode SATA patch for Intel DH89xxCC DeviceIDs
    - LP: #742056
  * ahci: AHCI mode SATA patch for Intel Patsburg SATA RAID controller
    - LP: #742056
  * RDMA/cma: Fix crash in request handlers
    - LP: #742056
  * IB/cm: Bump reference count on cm_id before invoking callback
    - LP: #742056
  * ath9k_hw: Fix incorrect macversion and macrev checks
    - LP: #742056
  * USB: serial/kobil_sct, fix potential tty NULL dereference
    - LP: #742056
  * USB: serial: ch341: add new id
    - LP: #742056
  * xhci: Fix cycle bit calculation during stall handling.
    - LP: #742056
  * ALSA: hda - fix digital mic selection in mixer on 92HD8X codecs
    - LP: #742056
  * PCI: remove quirk for pre-production systems
    - LP: #742056
  * PCI: add more checking to ICH region quirks
    - LP: #742056
  * PCI: do not create quirk I/O regions below PCIBIOS_MIN_IO for ICH
    - LP: #742056
  * PCI: sysfs: Fix failure path for addition of "vpd" attribute
    - LP: #742056
  * ALSA: ctxfi - Fix incorrect SPDIF status bit mask
    - LP: #742056
  * ALSA: ctxfi - Fix SPDIF status retrieval
    - LP: #742056
  * ALSA: ctxfi - Clear input settings before initialization
    - LP: #742056
  * SUNRPC: Ensure we always run the tk_callback before tk_action
    - LP: #742056
  * perf, powerpc: Handle events that raise an exception without
    overflowing
    - LP: #742056
  * ext3: Always set dx_node's fake_dirent explicitly.
    - LP: #742056
  * call_function_many: fix list delete vs add race
    - LP: #742056
  * call_function_many: add missing ordering
    - LP: #742056
  * x86: Flush TLB if PGD entry is changed in i386 PAE mode
    - LP: #742056
  * isdn: avoid calling tty_ldisc_flush() in atomic context
    - LP: #742056
  * smp_call_function_many: handle concurrent clearing of mask
    - LP: #742056
  * fix per-cpu flag problem in the cpu affinity checkers
    - LP: #742056
  * i2c: Fix typo in instantiating-devices document
    - LP: #742056
  * mmc: sdio: remember new card RCA when redetecting card
    - LP: #742056
  * powerpc/kexec: Fix race in kexec shutdown
    - LP: #742056
  * powerpc/kdump: Fix race in kdump shutdown
    - LP: #742056
  * powerpc: rtas_flash needs to use rtas_data_buf
    - LP: #742056
  * x86, binutils, xen: Fix another wrong size directive
    - LP: #742056
  * hwmon: (sht15) Fix integer overflow in humidity calculation
    - LP: #742056
  * Linux 2.6.32.34
    - LP: #742056
  * Linux 2.6.32.35
    - LP: #742056
  * aio: wake all waiters when destroying ctx
    - LP: #744921
  * shmem: let shared anonymous be nonlinear again
    - LP: #744921
  * PCI hotplug: acpiphp: set current_state to D0 in register_slot
    - LP: #744921
  * xen: set max_pfn_mapped to the last pfn mapped
    - LP: #744921
  * PCI: return correct value when writing to the "reset" attribute
    - LP: #744921
  * Prevent rt_sigqueueinfo and rt_tgsigqueueinfo from spoofing the signal
    code
    - LP: #744921
  * ext3: skip orphan cleanup on rocompat fs
    - LP: #744921
  * procfs: fix /proc/<pid>/maps heap check
    - LP: #744921
  * proc: protect mm start_code/end_code in /proc/pid/stat, CVE-2011-0726
    - LP: #744921
    - CVE-2011-0726
  * fbcon: Bugfix soft cursor detection in Tile Blitting
    - LP: #744921
  * nfsd41: modify the members value of nfsd4_op_flags
    - LP: #744921
  * nfsd: wrong index used in inner loop
    - LP: #744921
  * uvcvideo: Fix uvc_fixup_video_ctrl() format search
    - LP: #744921
  * ehci-hcd: Bug fix: don't set a QH's Halt bit
    - LP: #744921
  * USB: uss720 fixup refcount position
    - LP: #744921
  * USB: cdc-acm: fix memory corruption / panic
    - LP: #744921
  * USB: cdc-acm: fix potential null-pointer dereference
    - LP: #744921
  * USB: cdc-acm: fix potential null-pointer dereference on disconnect
    - LP: #744921
  * Input: xen-kbdfront - advertise either absolute or relative coordinates
    - LP: #744921
  * SUNRPC: Never reuse the socket port after an xs_close()
    - LP: #744921
  * fs: call security_d_instantiate in d_obtain_alias V2
    - LP: #744921
  * dcdbas: force SMI to happen when expected
    - LP: #744921
  * Linux 2.6.32.36
    - LP: #744921
  * drm/radeon/kms: check AA resolve registers on r300, CVE-2011-1016
    - LP: #745686
    - CVE-2011-1016
  * drm/radeon: fix regression with AA resolve checking, CVE-2011-1016
    - LP: #745686
    - CVE-2011-1016
  * xen: events: do not unmask event channels on resume
    - LP: #681083
  * drm/radeon/kms: check AA resolve registers on r300
    - LP: #754584
  * drm/radeon: fix regression with AA resolve checking
    - LP: #754584
  * Linux 2.6.32.36+drm33.16
    - LP: #754584
  * ALSA: hda - Fix SPDIF out regression on ALC889
    - LP: #764685
  * ALSA: Fix yet another race in disconnection
    - LP: #764685
  * perf: Better fit max unprivileged mlock pages for tools needs
    - LP: #764685
  * myri10ge: fix rmmod crash
    - LP: #764685
  * cciss: fix lost command issue
    - LP: #764685
  * sound/oss/opl3: validate voice and channel indexes
    - LP: #764685
  * mac80211: initialize sta->last_rx in sta_info_alloc
    - LP: #764685
  * ses: show devices for enclosures with no page 7
    - LP: #764685
  * ses: Avoid kernel panic when lun 0 is not mapped
    - LP: #764685
  * eCryptfs: Unlock page in write_begin error path
    - LP: #764685
  * eCryptfs: ecryptfs_keyring_auth_tok_for_sig() bug fix
    - LP: #764685
  * staging: usbip: bugfixes related to kthread conversion
    - LP: #764685
  * staging: usbip: bugfix add number of packets for isochronous frames
    - LP: #764685
  * staging: usbip: bugfix for isochronous packets and optimization
    - LP: #764685
  * staging: hv: Fix GARP not sent after Quick Migration
    - LP: #764685
  * staging: hv: use sync_bitops when interacting with the hypervisor
    - LP: #764685
  * Relax si_code check in rt_sigqueueinfo and rt_tgsigqueueinfo
    - LP: #764685
  * xfs: prevent leaking uninitialized stack memory in FSGEOMETRY_V1
    - LP: #764685
  * irda: validate peer name and attribute lengths
    - LP: #764685
  * irda: prevent heap corruption on invalid nickname
    - LP: #764685
  * nilfs2: fix data loss in mmap page write for hole blocks
    - LP: #764685
  * ASoC: Explicitly say registerless widgets have no register
    - LP: #764685
  * ALSA: ens1371: fix Creative Ectiva support
    - LP: #764685
  * ROSE: prevent heap corruption with bad facilities
    - LP: #764685
  * Btrfs: Fix uninitialized root flags for subvolumes
    - LP: #764685
  * x86, mtrr, pat: Fix one cpu getting out of sync during resume
    - LP: #764685
  * ath9k: fix a chip wakeup related crash in ath9k_start
    - LP: #764685
  * UBIFS: do not read flash unnecessarily
    - LP: #764685
  * UBIFS: fix oops on error path in read_pnode
    - LP: #764685
  * UBIFS: fix debugging failure in dbg_check_space_info
    - LP: #764685
  * quota: Don't write quota info in dquot_commit()
    - LP: #764685
  * mm: avoid wrapping vm_pgoff in mremap()
    - LP: #764685
  * p54usb: IDs for two new devices
    - LP: #764685
  * b43: allocate receive buffers big enough for max frame len + offset
    - LP: #764685
  * Bluetooth: sco: fix information leak to userspace
    - LP: #764685
  * bridge: netfilter: fix information leak
    - LP: #764685
  * Bluetooth: bnep: fix buffer overflow
    - LP: #764685
  * Bluetooth: add support for Apple MacBook Pro 8,2
    - LP: #764685
  * Treat writes as new when holes span across page boundaries
    - LP: #764685
  * char/tpm: Fix unitialized usage of data buffer
    - LP: #764685
  * netfilter: ip_tables: fix infoleak to userspace
    - LP: #764685
  * netfilter: arp_tables: fix infoleak to userspace
    - LP: #764685
  * netfilter: ipt_CLUSTERIP: fix buffer overflow
    - LP: #764685
  * ipv6: netfilter: ip6_tables: fix infoleak to userspace
    - LP: #764685
  * mfd: ab3100: world-writable debugfs *_priv files
    - LP: #764685
  * drivers/rtc/rtc-ds1511.c: world-writable sysfs nvram file
    - LP: #764685
  * drivers/misc/ep93xx_pwm.c: world-writable sysfs files
    - LP: #764685
  * econet: 4 byte infoleak to the network
    - LP: #764685
  * sound/oss: remove offset from load_patch callbacks
    - LP: #764685
  * sound: oss: midi_synth: check get_user() return value
    - LP: #764685
  * repair gdbstub to match the gdbserial protocol specification
    - LP: #764685
  * gro: Reset dev pointer on reuse
    - LP: #764685
  * gro: reset skb_iif on reuse
    - LP: #764685
  * x86, amd-ucode: Remove needless log messages
    - LP: #764685
  * x86, microcode, AMD: Extend ucode size verification
    - LP: #764685
  * powerpc/kexec: Add ifdef CONFIG_PPC_STD_MMU_64 to PPC64 code
    - LP: #764685
  * powerpc: Fix default_machine_crash_shutdown #ifdef botch
    - LP: #764685
  * Squashfs: handle corruption of directory structure
    - LP: #764685
  * sctp: fix to calc the INIT/INIT-ACK chunk length correctly is set
    - LP: #764685
  * atm/solos-pci: Don't include frame pseudo-header on transmit hex-dump
    - LP: #764685
  * ext4: fix credits computing for indirect mapped files
    - LP: #764685
  * nfsd: fix auth_domain reference leak on nlm operations
    - LP: #764685
  * CAN: Use inode instead of kernel address for /proc file
    - LP: #764685
  * exec: make argv/envp memory visible to oom-killer
    - LP: #764685
  * exec: copy-and-paste the fixes into compat_do_execve() paths
    - LP: #764685
  * xfs: zero proper structure size for geometry calls
    - LP: #764685
  * Linux 2.6.32.37
    - LP: #764685
  * Linux 2.6.32.38
    - LP: #764685
 -- Brad Figg <brad.figg@canonical.com>   Wed, 20 Apr 2011 08:28:25 -0700

Changed in linux (Ubuntu Lucid):
status:	Fix Committed → Fix Released

Serge Hallyn (serge-hallyn) wrote on 2011-05-31:

#27

lxc is now not usable on lucid.

Stefan Metzmacher (metze) wrote on 2011-06-01:

#28

Yes, lxc is broken now see https://bugs.launchpad.net/ubuntu/+source/linux/+bug/790863

Serge Hallyn (serge-hallyn) wrote on 2011-06-01:

#29

debdiff (757 bytes, text/plain)

Actually, this isn't making sense to me. CLONE_NEWNET requires privilege, so this isn't something a random user can exploit. So what is the value in turning netns support off in the kernel as opposed to just stopping vsftpd from using it? (Attached debdiff not tested, but should suffice. I'll test if it will be considered IN PLACE of turning off CONFIG_NET_NS).

Stefan Metzmacher (metze) wrote on 2011-06-01:

#30

Fixing vsftpd looks like a much better fix for this!

Rachel Greenham (rachel-strangenoises) wrote on 2011-06-02: Re: [Bug 720095] Re: vsftpd causes a vmalloc space leak in Lucid

#31

On 01/06/11 17:08, Stefan Metzmacher wrote:
> Fixing vsftpd looks like a much better fix for this

It would seem at first sight to be simpler; but presumably the problem
was that there are bugs in the implementation in the Lucid kernel (and
upstream) that won't necessarily *only* impact vsftpd users, although we
were the ones who first reported it. Certainly from my practical point
of view I'd have been happy with a simple vsftpd update to remove the
problem. :-)

The bug being in the kernel, and backporting the fix to it being deemed
too complicated (see nearer the top of this bug report thread), the
decision was therefore to disable the feature.

To those that depend on the feature, i.e. lxc users (aside: I hadn't
heard of that! After googling I may want to use it now!), given the
feature is buggy in the Lucid - and upstream - kernel *anyway*, maybe
the appropriate action is to use the Maverick backport kernel?

--
Rachel

Rachel Greenham (rachel-strangenoises) wrote on 2011-06-02:

#32

On 02/06/11 14:32, Rachel Greenham wrote:
> On 01/06/11 17:08, Stefan Metzmacher wrote:
>> Fixing vsftpd looks like a much better fix for this

Also presumably disabling it in vsftpd will hurt people who want to use
that in an lxc setting without providing an easily-applied solution.

--
Rachel

Stefan Metzmacher (metze) wrote on 2011-06-03:

#33

As far as I understand it, the problem comes from creating a new network namespace with every clone() syscall.

In an lxc setup, only the startup process creates a new network namespace, and it does so just once.

I can't see why vsftpd (without CLONE_NEWNET) won't run within an already established lxc session.
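
For readers who want to see what the namespace request discussed in this thread looks like from userspace, here is a minimal sketch. It is not vsftpd's code: vsftpd passes CLONE_NEWNET to clone(), while this sketch substitutes unshare(CLONE_NEWNET) called through ctypes in a forked child, which asks the kernel for the same kind of namespace. The function name try_new_netns and the 255 / -1 fallback codes are made up for illustration.

```python
import ctypes
import os

CLONE_NEWNET = 0x40000000  # flag value from <linux/sched.h>


def try_new_netns():
    """Fork a child that asks the kernel for a fresh network namespace.

    Returns 0 if unshare(CLONE_NEWNET) succeeded in the child, the errno
    the child saw if it failed (EPERM without CAP_SYS_ADMIN), 255 if the
    call could not be made at all, or -1 if the child was killed.
    """
    pid = os.fork()
    if pid == 0:
        # Child: any namespace created here goes away when we exit.
        code = 255  # hypothetical "could not even call unshare" marker
        try:
            libc = ctypes.CDLL(None, use_errno=True)
            if libc.unshare(CLONE_NEWNET) == 0:
                code = 0
            else:
                code = ctypes.get_errno() or 255
        finally:
            os._exit(code)
    # Parent: report how the child's request went.
    _, status = os.waitpid(pid, 0)
    return os.WEXITSTATUS(status) if os.WIFEXITED(status) else -1
```

Calling try_new_netns() as an ordinary user should normally return errno.EPERM, which matches the point in comment #29 that CLONE_NEWNET requires privilege. A server that made this request for every incoming session would create one namespace per connection, whereas an lxc container makes it once at startup.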

Alex Bligh (ubuntu-alex-org) wrote on 2011-09-07:

#34

The released resolution broke a production environment here: See #844185

I propose this is instead fixed by disabling it in vsftpd.

Stefan Bader (smb) wrote on 2011-09-07:

#35

On 07.09.2011 12:16, Alex Bligh wrote:
> The released resolution broke a production environment here: See #844185
>
> I propose this is instead fixed by disabling it in vsftpd.
>
The problem is that nobody can say that vsftpd was or is the only vector that
allows someone to DoS a system by doing something that involves network namespaces.

If netns is essential, it is probably a better solution to move to the
LTS-backports kernel, which is newer and does not have those memory cleanup issues.

Alex Bligh (ubuntu-alex-org) wrote on 2011-09-07:

#36

That is sadly not an option. LTS-backport kernel has a spectacular and easy to repeat Oops when namespaces are used. See #843892.

The kernel does not guarantee (it certainly didn't in 2.6.32) that namespaces can be created and deleted instantly and without undue system pressure. It seems to me that the bug is in applications which assume this can be done. As an example, in 2.6.32, creating 1000 interfaces and then deleting them takes a very long time on the delete side due to RCU sync issues. If a userspace program did this, prompted by an external user, our response would not be to disable creation and deletion of interfaces. Rather, we'd fix the userspace program not to do it.

Alex Bligh (ubuntu-alex-org) wrote on 2011-09-07:

#37

Note also that if vsftpd continues to use clone() with CLONE_NEWNET (i.e. network namespaces) it is likely to suffer from #843892 anyway, so not using network namespaces will give you a stability increase. (NB - I have not tested vsftpd against the bug in #843892, but as you can see from the text, it is hardly difficult to hit.)


Affects		Status	Importance	Assigned to	Milestone
	linux (Ubuntu)	Invalid	Medium	Unassigned
	Lucid	Fix Released	Medium	Stefan Bader
