aio completions are dropped

Bug #1641129 reported by Sage Weil on 2016-11-11
This bug affects 1 person
Affects          Status         Importance  Assigned to
linux (Ubuntu)   Fix Released   High        Unassigned
Trusty           Fix Released   High        Tim Gardner

Bug Description

In 3.13.0-100-generic (the 14.04 stable kernel), we are hitting an old AIO bug, described in this thread:

 http://www.gossamer-threads.com/lists/linux/kernel/1993181

and reproduced by this test program:

 http://www.kvack.org/~bcrl/20140824-aio_bug.c

The bug was introduced by commit

 f8567a3845ac05bb28f3c1b478ef752762bd39ef

and I believe it was fixed by commit

 d856f32a86b2b015ab180ab7a55e455ed8d3ccc5

I'm not certain if there were follow-on fixes after that.

The bug is affecting Ceph OSD daemons, which hit this pretty reliably when using the new BlueStore backend.
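
For readers who want to see what "dropped completions" looks like from userspace, here is a minimal sketch of the symptom check. It is not the reproducer linked above and is not expected to trigger the bug reliably; it only submits reads through the native AIO syscalls and counts how many completions come back. The function name `count_missing_completions`, the 4096-byte read size and the 2-second reap timeout are all illustrative assumptions.

```c
#define _GNU_SOURCE
#include <assert.h>
#include <errno.h>
#include <fcntl.h>
#include <linux/aio_abi.h>
#include <stdint.h>
#include <stdlib.h>
#include <sys/syscall.h>
#include <time.h>
#include <unistd.h>

/* glibc does not wrap the native AIO syscalls, so call them directly. */
static long sys_io_setup(unsigned nr, aio_context_t *ctx)
{ return syscall(SYS_io_setup, nr, ctx); }
static long sys_io_destroy(aio_context_t ctx)
{ return syscall(SYS_io_destroy, ctx); }
static long sys_io_submit(aio_context_t ctx, long n, struct iocb **cbs)
{ return syscall(SYS_io_submit, ctx, n, cbs); }
static long sys_io_getevents(aio_context_t ctx, long min_nr, long nr,
                             struct io_event *ev, struct timespec *ts)
{ return syscall(SYS_io_getevents, ctx, min_nr, nr, ev, ts); }

/*
 * Hypothetical helper: submit up to nreqs reads against `path` and return
 * how many accepted requests never produced a completion event.
 * Returns 0 when nothing was lost, -1 if the test could not be set up.
 */
long count_missing_completions(const char *path, int nreqs)
{
    enum { BLK = 4096 };              /* assumed read size */
    aio_context_t ctx = 0;
    struct iocb *cbs = NULL;
    struct io_event *events = NULL;
    char *bufs = NULL;
    long submitted = 0, completed = 0, result = -1;
    int fd;

    if (nreqs <= 0)
        return -1;
    fd = open(path, O_RDONLY);
    if (fd < 0)
        return -1;
    if (sys_io_setup((unsigned)nreqs, &ctx) < 0) {
        close(fd);
        return -1;
    }

    cbs = calloc((size_t)nreqs, sizeof *cbs);
    events = calloc((size_t)nreqs, sizeof *events);
    bufs = malloc((size_t)nreqs * BLK);
    if (!cbs || !events || !bufs)
        goto out;

    /* Submit one request at a time; stop at the first refusal (e.g. EAGAIN). */
    for (int i = 0; i < nreqs; i++) {
        struct iocb *p = &cbs[i];
        p->aio_lio_opcode = IOCB_CMD_PREAD;
        p->aio_fildes = (__u32)fd;
        p->aio_buf = (__u64)(uintptr_t)(bufs + (size_t)i * BLK);
        p->aio_nbytes = BLK;
        p->aio_offset = 0;
        p->aio_data = (__u64)i;
        if (sys_io_submit(ctx, 1, &p) != 1)
            break;
        submitted++;
    }

    /* Reap until every accepted request has completed or we time out. */
    while (completed < submitted) {
        struct timespec ts = { 2, 0 };  /* assumed timeout */
        long n = sys_io_getevents(ctx, 1, submitted - completed, events, &ts);
        if (n < 0 && errno == EINTR)
            continue;
        if (n <= 0)
            break;                      /* timeout or error: events are missing */
        completed += n;
    }
    assert(completed <= submitted);
    result = submitted - completed;

out:
    sys_io_destroy(ctx);
    free(bufs);
    free(events);
    free(cbs);
    close(fd);
    return result;
}
```

Calling `count_missing_completions()` on any readable file should return 0 on a healthy kernel. A positive return value means some accepted requests never showed up in `io_getevents`, which is the symptom described here.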

This bug is missing log files that will aid in diagnosing the problem. From a terminal window please run:

apport-collect 1641129

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
tags: added: trusty
James Page (james-page) on 2016-11-11
Changed in linux (Ubuntu):
status: Incomplete → New
importance: Undecided → High
Brad Figg (brad-figg) on 2016-11-11
Changed in linux (Ubuntu):
status: New → Incomplete
Brad Figg (brad-figg) on 2016-11-11
Changed in linux (Ubuntu):
status: Incomplete → Confirmed
Tim Gardner (timg-tpi) wrote :
Changed in linux (Ubuntu Trusty):
assignee: nobody → Tim Gardner (timg-tpi)
status: New → In Progress
tags: added: kernel-key
Changed in linux (Ubuntu Trusty):
importance: Undecided → High
Luis Henriques (henrix) on 2016-11-15
Changed in linux (Ubuntu Trusty):
status: In Progress → Fix Committed
Brad Figg (brad-figg) wrote :

This bug is awaiting verification that the kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-trusty' to 'verification-done-trusty'. If the problem still exists, change the tag 'verification-needed-trusty' to 'verification-failed-trusty'.

If verification is not done within 5 working days from today, this fix will be dropped from the source code, and this bug will be closed.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation on how to enable and use -proposed. Thank you!

tags: added: verification-needed-trusty
Po-Hsu Lin (cypressyew) wrote :

This issue is fixed with the proposed kernel (3.13.0-106-generic):

kernel@kernel-Latitude-E7440:~/Kernel$ ./a.out
Submitting: 128
Submitted: 126
Submitting: 2
Submitted too much, that's okay
Completed: 126
Submitting: 2
Submitted: 2
Completed: 2
Verifying...
OK

Thanks

tags: added: verification-done-trusty
removed: verification-needed-trusty
Launchpad Janitor (janitor) wrote :

This bug was fixed in the package linux - 3.13.0-106.153

---------------
linux (3.13.0-106.153) trusty; urgency=low

  [ Luis Henriques ]

  * Release Tracking Bug
    - LP: #1647749

  * CVE-2016-7916
    - proc: prevent accessing /proc/<PID>/environ until it's ready

  * CVE-2016-6213
    - mnt: Add a per mount namespace limit on the number of mounts

  * aio completions are dropped (LP: #1641129)
    - aio: fix reqs_available handling

  * [Hyper-V] do not lose pending heartbeat vmbus packets (LP: #1632786)
    - hv: do not lose pending heartbeat vmbus packets

  * ipv6: connected routes are missing after a down/up cycle on the loopback
    (LP: #1634545)
    - ipv6: reallocate addrconf router for ipv6 address when lo device up
    - ipv6: correctly add local routes when lo goes up

  * audit: prevent a new auditd to stop an old auditd still alive (LP: #1633404)
    - audit: stop an old auditd being starved out by a new auditd

  * Setting net.ipv4.neigh.default.gc_thresh1/2/3 on 3.13.0-97.144 or later
    causes 'invalid argument' error (LP: #1634892)
    - neigh: fix setting of default gc_* values

  * move nvme driver to linux-image (LP: #1640275)
    - [Config] Add nvme to the generic inclusion list

 -- Luis Henriques <email address hidden> Tue, 06 Dec 2016 15:00:27 +0000

Changed in linux (Ubuntu Trusty):
status: Fix Committed → Fix Released
Changed in linux (Ubuntu):
status: Confirmed → Fix Released