auplink crashed with SIGSEGV in ftw_startup()

Bug #1442892 reported by modolo
638
This bug affects 142 people
Affects Status Importance Assigned to Milestone
aufs-tools (Ubuntu)
Confirmed
Medium
Unassigned

Bug Description

Hi!

I take docker from https://docs.docker.com/installation/ubuntulinux/ with follow instrunctions:

$ wget -qO- https://get.docker.com/ | sh

Now, I have to start docker winth "$ sudo service docker restart" after reboot.

Thanks,
Marcelo Módolo

ProblemType: Crash
DistroRelease: Ubuntu 15.04
Package: aufs-tools 1:3.2+20130722-1.1
ProcVersionSignature: Ubuntu 3.19.0-13.13-generic 3.19.3
Uname: Linux 3.19.0-13-generic x86_64
ApportVersion: 2.17-0ubuntu2
Architecture: amd64
Date: Sat Apr 11 00:13:12 2015
Dependencies:
 gcc-5-base 5-20150401-0ubuntu1
 libc6 2.21-0ubuntu4
 libgcc1 1:5-20150401-0ubuntu1
 multiarch-support 2.21-0ubuntu4
ExecutablePath: /sbin/auplink
InstallationDate: Installed on 2015-03-21 (20 days ago)
InstallationMedia: Ubuntu 15.04 "Vivid Vervet" - Alpha amd64 (20150318)
ProcCmdline: auplink /var/lib/docker/aufs/mnt/f2fce76cf809d39a6e9f43fb02f7ab85b8ce4ce43eeb7c69b8fae97ed3292cc4 flush
ProcEnviron:
 LANG=pt_BR.UTF-8
 LANGUAGE=pt_BR:pt:en
 PATH=(custom, no user)
SegvAnalysis:
 Segfault happened at: 0x7f754caf3759 <ftw_startup+105>: callq 0x7f754ca89200 <__memset_sse2>
 PC (0x7f754caf3759) ok
 source "0x7f754ca89200" (0x7f754ca89200) ok
 destination "(%rsp)" (0x7ffedd57fa20) not located in a known VMA region (needed writable region)!
 Stack memory exhausted (SP below stack segment)
SegvReason: writing unknown VMA
Signal: 11
SourcePackage: aufs-tools
StacktraceTop:
 ftw_startup (dir=0x2550010 "/var/lib/docker/aufs/mnt/f2fce76cf809d39a6e9f43fb02f7ab85b8ce4ce43eeb7c69b8fae97ed3292cc4", is_nftw=1, func=0x401458, descriptors=1048566, flags=19) at ../sysdeps/wordsize-64/../../io/ftw.c:654
 ?? ()
 ?? ()
 __libc_start_main (main=0x40126d, argc=3, argv=0x7ffeddd7fd08, init=<optimized out>, fini=<optimized out>, rtld_fini=<optimized out>, stack_end=0x7ffeddd7fcf8) at libc-start.c:289
 ?? ()
Title: auplink crashed with SIGSEGV in ftw_startup()
UpgradeStatus: No upgrade log present (probably fresh install)
UserGroups:

Revision history for this message
modolo (modolo) wrote :
Revision history for this message
Apport retracing service (apport) wrote : This bug is a duplicate

Thank you for taking the time to report this crash and helping to make this software better. This particular crash has already been reported and is a duplicate of bug #1441038, so is being marked as such. Please look at the other bug report to see if there is any missing information that you can provide, or to see if there is a workaround for the bug. Additionally, any further discussion regarding the bug should occur in the other report. Please continue to report any other bugs you may find.

information type: Private → Public
tags: removed: need-amd64-retrace
Revision history for this message
Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in aufs-tools (Ubuntu):
status: New → Confirmed
Revision history for this message
Syver Enstad (syver-enstad) wrote :

Sorry, newbie error, clicked on duplicate bug to see the details and inadvertently deleted the duplicate bug status. I don't know how to undo

Changed in aufs-tools (Ubuntu):
importance: Undecided → Medium
Revision history for this message
Sławek Ehlert (slafs) wrote :

bug #1441038 no longer exists. Is there a new one we could follow?

Revision history for this message
Ville Ranki (ville-ranki) wrote :

"Me too."

Is this related do docker? If so, I installed it from Ubuntu's repositories. Now this crash happens on every boot so it's quite annoying.

Running Ubuntu vivid (15.04).

Revision history for this message
Norm Walsh (ndw) wrote :

"Me too, too". Ubuntu Wily.

Revision history for this message
Jonas G. Drange (jonas-drange) wrote :

Confirmed for Xenial.

Revision history for this message
XerebZ (xerebz) wrote :

Seeing this on Xubuntu 16.04 Xenial

Revision history for this message
Salih EMIN (salih-emin) wrote :

It happens when I build locally a docker image. It disconnects me from the network but fortunately it reconnects back

Revision history for this message
Graham Weldon (graham-gj58) wrote :

Confirmed on Xenial

Revision history for this message
PersonalPC@live.com (personalpc) wrote :

Confirmed on a fresh install of Xenial

Revision history for this message
Andrea Del Bene (an-delbene) wrote :

+1 on a fresh install of Xenial

Revision history for this message
Peter Cornelius (aranaea) wrote :

+1 on Xenial

Revision history for this message
RH (rh-t) wrote :

+1 on Xenial (16.04)

Revision history for this message
Rostás Balázs (rostas-balazs) wrote :

+1 on Xenial

Revision history for this message
Robert Smith (rsmith31415) wrote :

Just to include additional information, this error occurs after exiting a Docker container or after booting on Ubuntu 16.04.

Revision history for this message
Andreo de Aguiar Vieira (andreoav) wrote :

+1 on Xenial - Docker installed

Revision history for this message
Kaluzki Demjan (kaluzkidemjan) wrote :

+1

16.04
curl -fsSL https://get.docker.com/ | sh
sudo usermod -aG docker <user>

Revision history for this message
Joern Heissler (joernheissler) wrote :

Created new ticket https://sourceforge.net/p/aufs/bugs/26/ with explanation and patch.

Revision history for this message
llb4ll (beatrix-vad) wrote :

+1 on Xenial - Docker installed

Revision history for this message
John Paul Adrian Glaubitz (glaubitz) wrote :

People, just a heads-up: Writing "me, too" or "+1" in this bug report is absolutely pointless without providing any additional information which is missing here. It just produces noise. If you just want to say "This problem affects me as well", please just click " This bug affects 68 people. Does this bug affect you?".

Anything else just clutters the bug report and makes the work for the people who want to work on this harder.

Revision history for this message
Krishnan Sriram Rama (krishnan-srm) wrote :

This crashed on me too.

I am on 16.10 and I use Dell Latitude E7240. This crash happens everytime I restart my machine. In addition, everytime I do a "docker run", it crashes.

I'd appreciate if someone can provide a fix or workaround?

Revision history for this message
Steven Peters (scpeters) wrote :

I recently experienced this crash after upgrading to 16.04. While debugging, I noticed that my `nvidia-docker` installation wasn't working, so I upgraded that software, and I don't see the system problem dialog on launch anymore. I'm not sure why it worked.

Revision history for this message
Fábio Antunes (fabio-h-f-antunes) wrote :

Has anyone with this issue noticed data corruption affecting docker images or containers?

Revision history for this message
Julian Raschke (jlnr) wrote :

Also affects the current build of 17.10.

Revision history for this message
Craig Jellick (craig-jellick) wrote :

If this affects you and you are using aufs as your docker storage driver just because it is the default, you can just switch to the overlay2 storage driver to circumvent this crash.

Note that switching storage drivers will cause you to "lose" all you docker images. You'll have to repull them.

Can switch the storage driver on ubuntu (w/systemd) by putting this:
```
{
    "storage-driver": "overlay2"
}
```
in /etc/docker/daemon.json and restarting docker.

Revision history for this message
Cedric Jehasse (cedricj) wrote :
Download full text (4.1 KiB)

The trace i get is this:

SegvAnalysis:
 Segfault happened at: 0x7f022f4a25b9 <ftw_startup+105>: callq 0x7f022f438240 <__memset_sse2>
 PC (0x7f022f4a25b9) ok
 source "0x7f022f438240" (0x7f022f438240) ok
 destination "(%rsp)" (0x7fffbca06f40) not located in a known VMA region (needed writable region)!
 Stack memory exhausted (SP below stack segment)
SegvReason: writing unknown VMA
SourcePackage: aufs-tools
Stacktrace:
 #0 ftw_startup (dir=0x1081010 "/var/lib/docker/aufs/mnt/5f3306c6fad9d2fc750be5dd28b3d8d58db4eaa0d5947583d318829e2ea4326e", is_nftw=1, func=0x40149c, descriptors=1048566, flags=19) at ../sysdeps/wordsize-64/../../io/ftw.c:654
         data = {dirstreams = 0x7fffbca06f40, actdir = 0, maxdir = 1048566, dirbuf = 0xff0000 <error: Cannot access memory at address 0xff0000>, dirbufsize = 0, ftw = {base = 0, level = 0}, flags = 0, cvt_arr = 0x0, func = 0x0, dev = 0, known_objects = 0x0}
         st = {st_dev = 0, st_ino = 0, st_nlink = 0, st_mode = 0, st_uid = 0, st_gid = 0, __pad0 = 0, st_rdev = 0, st_size = 1523, st_blksize = 4, st_blocks = 17345968, st_atim = {tv_sec = 1531, tv_nsec = 17345968}, st_mtim = {tv_sec = 48, tv_nsec = 17305728}, st_ctim = {tv_sec = 139647365056152, tv_nsec = 1048576}, __glibc_reserved = {19, 1048566, 4199580}}
         result = 0
         cwdfd = -1
         cwd = 0x0
         cp = <optimized out>
 #1 0x0000000000401d52 in ?? ()
 No symbol table info available.
 #2 0x00000000004013ec in ?? ()
 No symbol table info available.
 #3 0x00007f022f3c9830 in __libc_start_main (main=0x401266, argc=3, argv=0x7fffbd207228, init=<optimized out>, fini=<optimized out>, rtld_fini=<optimized out>, stack_end=0x7fffbd207218) at ../csu/libc-start.c:291
         result = <optimized out>
         unwind_buf = {cancel_jmp_buf = {{jmp_buf = {0, -7392484191596224330, 4198768, 140736366408224, 0, 0, 7392628052837490870, 7452542804783751350}, mask_was_saved = 0}}, priv = {pad = {0x0, 0x0, 0x402820, 0x7f022f783ab0 <_dl_fini>}, data = {prev = 0x0, cleanup = 0x0, canceltype = 4204576}}}
         not_first_call = <optimized out>
 #4 0x0000000000401199 in ?? ()
 No symbol table info available.

I looked at the ftw_startup implementation in glibc, and think this line could be exhausting the stack memory:
 data.dirstreams = (struct dir_data **) alloca (data.maxdir
       * sizeof (struct dir_data *));

The stacktrace shows data.maxdir is 1048566.
data.maxdir is coming from the nopendfd parameter of ntfw.

Looking at plink.c from aufs-tools package shows it uses the current number of files limit as nopenfd:
 err = getrlimit(RLIMIT_NOFILE, &rlim);
 if (err)
  AuFin("getrlimit");
 nftw(cwd, func, rlim.rlim_cur - 10,
      FTW_PHYS | FTW_MOUNT | FTW_ACTIONRETVAL);

It looks like rlim.rlim_cur is 1048576 when auplink crashes.

Is auplink started by dockerd?
I can see on my system dockerd has it's max open files set to 1048576:
cat /proc/2603/limits
Limit Soft Limit Hard Limit Units
Max cpu time unlimited unlimited seconds
Max file size unlimited unlimited bytes
Max data size unlimited unl...

Read more...

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.