Bug #1505948 “Memory arena corruption with FUSE (was Memory allo...” : Bugs : linux package : Ubuntu

Revision history for this message

In Red Hat Bugzilla #1254310, Ian (ian-redhat-bugs) wrote on 2015-08-17:

#26

Download full text (4.7 KiB)

Description of problem:

After upgrading a node from F20 to F21, node crashes accessing glusterfs volume.
The remaining F20 nodes have no problem accessing the volume.

Aug 16 20:24:25 bagel kernel: [ 1810.077267] ------------[ cut here ]------------
Aug 16 20:24:25 bagel kernel: [ 1810.081945] kernel BUG at mm/slub.c:3413!
Aug 16 20:24:25 bagel kernel: [ 1810.085998] invalid opcode: 0000 [#1] SMP
Aug 16 20:24:25 bagel kernel: [ 1810.090177] Modules linked in: vhost_net vhost m
acvtap macvlan ebt_arp ebtable_nat fuse nfsv3 nfs_acl nfs lockd grace sunrpc fsca
che ebtable_filter ebtables ip6table_filter ip6_tables softdog scsi_transport_isc
si xt_physdev br_netfilter nf_conntrack_ipv4 nf_defrag_ipv4 xt_multiport xt_connt
rack nf_conntrack vfat fat coretemp kvm_intel kvm bcache iTCO_wdt crct10dif_pclmu
l ipmi_devintf crc32_pclmul iTCO_vendor_support gpio_ich igb crc32c_intel ptp pps
_core lpc_ich ghash_clmulni_intel i2c_i801 mfd_core ipmi_si dca ipmi_msghandler i
2c_ismt tpm_tis shpchp tpm acpi_cpufreq ast i2c_algo_bit drm_kms_helper ttm drm 8
021q garp mrp tun bridge stp llc bonding
Aug 16 20:24:25 bagel kernel: [ 1810.149526] CPU: 1 PID: 4794 Comm: qemu-system-x
86 Not tainted 4.1.4-100.fc21.x86_64 #1
Aug 16 20:24:25 bagel kernel: [ 1810.157603] Hardware name: Supermicro A1SRM-2758
F/A1SRM-2758F, BIOS 1.2 02/16/2015
Aug 16 20:24:25 bagel kernel: [ 1810.165246] task: ffff88085a1313c0 ti: ffff8803b
09b4000 task.ti: ffff8803b09b4000
Aug 16 20:24:25 bagel kernel: [ 1810.172800] RIP: 0010:[<ffffffff81208532>] [<ff
ffffff81208532>] kfree+0x152/0x160
Aug 16 20:24:25 bagel kernel: [ 1810.180467] RSP: 0018:ffff8803b09b7c98 EFLAGS:
00010246
Aug 16 20:24:25 bagel kernel: [ 1810.185833] RAX: 005ffff80000002c RBX: ffff88020
08b9960 RCX: dead000000200200
Aug 16 20:24:25 bagel kernel: [ 1810.193032] RDX: 000077ff80000000 RSI: ffff88085
a1313c0 RDI: ffff8802008b9960
Aug 16 20:24:25 bagel kernel: [ 1810.200231] RBP: ffff8803b09b7cb8 R08: ffff8803b
09b7c80 R09: ffffea0008022e40
Aug 16 20:24:25 bagel kernel: [ 1810.207431] R10: 0000000000002fe4 R11: 000000000
0000000 R12: 0000000149928000
Aug 16 20:24:25 bagel kernel: [ 1810.214629] R13: ffffffffa02e5c8c R14: ffff8803b
09b7e50 R15: ffff8801009b5600
Aug 16 20:24:25 bagel kernel: [ 1810.221829] FS: 00007f35609ff700(0000) GS:ffff88087fc40000(0000) knlGS:0000000000000000
Aug 16 20:24:25 bagel kernel: [ 1810.229992] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Aug 16 20:24:25 bagel kernel: [ 1810.235799] CR2: 00007fbf24022a98 CR3: 0000000100a81000 CR4: 00000000001027e0
Aug 16 20:24:25 bagel kernel: [ 1810.243001] Stack:
Aug 16 20:24:25 bagel kernel: [ 1810.245037] ffff8802008b9960 ffff8802008b9960 0000000149928000 ffff8803b09b7da8
Aug 16 20:24:25 bagel kernel: [ 1810.252590] ffff8803b09b7d48 ffffffffa02e5c8c 0000000000004800 ffff8806eea842c0
Aug 16 20:24:25 bagel kernel: [ 1810.260145] 0000000000004800 00000001f4000000 000000014992c800 0000000000000000
Aug 16 20:24:25 bagel kernel: [ 1810.267699] Call Trace:
Aug 16 20:24:25 bagel kernel: [ 1810.270189] [<ffffffffa02e5c8c>] fuse_direct_IO+0x20c/0x340 [fuse]
Aug 16 20:24:25 bagel kernel: [ 1810.276525] [<ffffffff811ac2fa>] generic_file_read_iter+0x4ca/0x6...

Description of problem:

After upgrading a node from F20 to F21, node crashes accessing glusterfs volume.
The remaining F20 nodes have no problem accessing the volume.

Aug 16 20:24:25 bagel kernel: [ 1810.077267] ------------[ cut here ]------------
Aug 16 20:24:25 bagel kernel: [ 1810.081945] kernel BUG at mm/slub.c:3413!
Aug 16 20:24:25 bagel kernel: [ 1810.085998] invalid opcode: 0000 [#1] SMP
Aug 16 20:24:25 bagel kernel: [ 1810.090177] Modules linked in: vhost_net vhost m
acvtap macvlan ebt_arp ebtable_nat fuse nfsv3 nfs_acl nfs lockd grace sunrpc fsca
che ebtable_filter ebtables ip6table_filter ip6_tables softdog scsi_transport_isc
si xt_physdev br_netfilter nf_conntrack_ipv4 nf_defrag_ipv4 xt_multiport xt_connt
rack nf_conntrack vfat fat coretemp kvm_intel kvm bcache iTCO_wdt crct10dif_pclmu
l ipmi_devintf crc32_pclmul iTCO_vendor_support gpio_ich igb crc32c_intel ptp pps
_core lpc_ich ghash_clmulni_intel i2c_i801 mfd_core ipmi_si dca ipmi_msghandler i
2c_ismt tpm_tis shpchp tpm acpi_cpufreq ast i2c_algo_bit drm_kms_helper ttm drm 8
021q garp mrp tun bridge stp llc bonding
Aug 16 20:24:25 bagel kernel: [ 1810.149526] CPU: 1 PID: 4794 Comm: qemu-system-x
86 Not tainted 4.1.4-100.fc21.x86_64 #1
Aug 16 20:24:25 bagel kernel: [ 1810.157603] Hardware name: Supermicro A1SRM-2758
F/A1SRM-2758F, BIOS 1.2 02/16/2015
Aug 16 20:24:25 bagel kernel: [ 1810.165246] task: ffff88085a1313c0 ti: ffff8803b
09b4000 task.ti: ffff8803b09b4000
Aug 16 20:24:25 bagel kernel: [ 1810.172800] RIP: 0010:[<ffffffff81208532>]  [<ff
ffffff81208532>] kfree+0x152/0x160
Aug 16 20:24:25 bagel kernel: [ 1810.180467] RSP: 0018:ffff8803b09b7c98  EFLAGS:
00010246
Aug 16 20:24:25 bagel kernel: [ 1810.185833] RAX: 005ffff80000002c RBX: ffff88020
08b9960 RCX: dead000000200200
Aug 16 20:24:25 bagel kernel: [ 1810.193032] RDX: 000077ff80000000 RSI: ffff88085
a1313c0 RDI: ffff8802008b9960
Aug 16 20:24:25 bagel kernel: [ 1810.200231] RBP: ffff8803b09b7cb8 R08: ffff8803b
09b7c80 R09: ffffea0008022e40
Aug 16 20:24:25 bagel kernel: [ 1810.207431] R10: 0000000000002fe4 R11: 000000000
0000000 R12: 0000000149928000
Aug 16 20:24:25 bagel kernel: [ 1810.214629] R13: ffffffffa02e5c8c R14: ffff8803b
09b7e50 R15: ffff8801009b5600
Aug 16 20:24:25 bagel kernel: [ 1810.221829] FS:  00007f35609ff700(0000) GS:ffff88087fc40000(0000) knlGS:0000000000000000
Aug 16 20:24:25 bagel kernel: [ 1810.229992] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Aug 16 20:24:25 bagel kernel: [ 1810.235799] CR2: 00007fbf24022a98 CR3: 0000000100a81000 CR4: 00000000001027e0
Aug 16 20:24:25 bagel kernel: [ 1810.243001] Stack:
Aug 16 20:24:25 bagel kernel: [ 1810.245037]  ffff8802008b9960 ffff8802008b9960 0000000149928000 ffff8803b09b7da8
Aug 16 20:24:25 bagel kernel: [ 1810.252590]  ffff8803b09b7d48 ffffffffa02e5c8c 0000000000004800 ffff8806eea842c0
Aug 16 20:24:25 bagel kernel: [ 1810.260145]  0000000000004800 00000001f4000000 000000014992c800 0000000000000000
Aug 16 20:24:25 bagel kernel: [ 1810.267699] Call Trace:
Aug 16 20:24:25 bagel kernel: [ 1810.270189]  [<ffffffffa02e5c8c>] fuse_direct_IO+0x20c/0x340 [fuse]
Aug 16 20:24:25 bagel kernel: [ 1810.276525]  [<ffffffff811ac2fa>] generic_file_read_iter+0x4ca/0x600
Aug 16 20:24:25 bagel kernel: [ 1810.282941]  [<ffffffffa02e22ac>] fuse_file_read_iter+0x4c/0x70 [fuse]
Aug 16 20:24:25 bagel kernel: [ 1810.289531]  [<ffffffff81227e1e>] __vfs_read+0xce/0x100
Aug 16 20:24:25 bagel kernel: [ 1810.294810]  [<ffffffff8122849a>] vfs_read+0x8a/0x140
Aug 16 20:24:25 bagel kernel: [ 1810.299910]  [<ffffffff812295c2>] SyS_pread64+0x92/0xc0
Aug 16 20:24:25 bagel kernel: [ 1810.305186]  [<ffffffff8179a76e>] system_call_fastpath+0x12/0x71
Aug 16 20:24:25 bagel kernel: [ 1810.311253] Code: 00 4d 8b 49 30 e9 35 ff ff ff 0f 1f 80 00 00 00 00 4c 89 d1 48 89 da 4c 89 ce e8 ca f9 ff ff e9 73 ff ff ff 0f 1f 44 00 00 0f 0b <0f> 0b 66 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 55 89
Aug 16 20:24:25 bagel kernel: [ 1810.336949] RIP  [<ffffffff81208532>] kfree+0x152/0x160
Aug 16 20:24:25 bagel kernel: [ 1810.344889]  RSP <ffff8803b09b7c98>
Aug 16 20:24:25 bagel kernel: [ 1810.360802] ---[ end trace 76f7ea1ab5ea1b36 ]---

Version-Release number of selected component (if applicable):

kernel-4.1.4-100.fc21.x86_64
glusterfs-fuse-3.5.5-2.fc21.x86_64

How reproducible:

Every time I would start a VM whose disk lived on the gluster volume, the crash would happen immediately. The node would become mostly unresponsive and require a hard reset.

Steps to Reproduce:
1. glusterfs distributed-replicated volume across 3 F20 nodes.
2. upgrade one node from F20 to F21
3. attempt to run a VM on the new F21 node (accessing a disk image on the gluster volume)

Actual results:
Accessing files on gluster volume causes node crash.

Expected results:
No crash.

Additional info:

Revision history for this message

Martin Gerhard Loschwitz (martin-loschwitz) wrote on 2015-10-14:

#1

lspci -vvnn log Edit (208.4 KiB, text/plain)

Revision history for this message

Martin Gerhard Loschwitz (martin-loschwitz) wrote on 2015-10-14:

#2

uname -a log Edit (115 bytes, text/plain)

Revision history for this message

Launchpad Janitor (janitor) wrote on 2015-10-14:

#3

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux (Ubuntu):
status:	New → Confirmed

Revision history for this message

Martin Gerhard Loschwitz (martin-loschwitz) wrote on 2015-10-14:

#4

/proc/version output Edit (152 bytes, text/plain)

Revision history for this message

Martin Gerhard Loschwitz (martin-loschwitz) wrote on 2015-10-14:

#5

As already mentioned in my email to the fuse developer mailing list, we have also tried to create direct i/o traffic on the affected mount directly but were not able to reproduce the issue. The problem only ever occurs once Qemu starts to run stuff on top of the FUSE mount. Other reports of this issue (identical or similar) have mentioned Qemu or VMware-based emulation as well.

description:	updated
summary:	- Memory allocation failure crashes kernel hard + Memory allocation failure crashes kernel hard, presumably related to + FUSE

Revision history for this message

Maik Zumstrull (m-zumstrull) wrote on 2015-10-14: Re: Memory allocation failure crashes kernel hard, presumably related to FUSE

#6

https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1398465 also appears to be the same issue.

Revision history for this message

Martin Gerhard Loschwitz (martin-loschwitz) wrote on 2015-10-14:

#7

We can now confirm that the issue does not happen with 4.0.9. This leads to the assumption that the problem has either been fixed between 4.0 and 4.0.9, or, and I consider this much more likely, the problem was introduced between 4.0 and 4.1 on the main branch.

Revision history for this message

Martin Gerhard Loschwitz (martin-loschwitz) wrote on 2015-10-14:

#8

https://git.kernel.org/cgit/linux/kernel/git/stable/linux-stable.git/diff/fs/fuse/file.c?id=refs/tags/v4.1&id2=refs/tags/v4.0 is the diff between the version that works and the one that is broken. The candidate that broke functionality is likely here:

https://lkml.org/lkml/2015/4/13/871
https://lkml.org/lkml/2015/4/15/475

Joseph Salisbury (jsalisbury) on 2015-10-21

Changed in linux (Ubuntu):
importance:	Undecided → High
tags:	added: kernel-da-key wily

Revision history for this message

Robert Doebbelin (2-robert-3) wrote on 2015-10-26:

#9

Patch to build ntfs-3g against fuse3 Edit (4.7 KiB, text/plain)

Duplicating my post to the fuse developer mailing list here:

Hi all,

the kernel crash can be triggered if async direct IO is used which comes with Fuse 3.0_pre0 (i.e. current head). My workload was to install CentOS7 on a newly created qcow2 disk. The kernel (Fedora 21; 4.1.8-100.fc21.x86_64) crashed in 2/2 runs using qemu/kvm atop of ntfs-3g built against fuse3:

1) Build fuse3 from current head
2) Build ntfs-3g against fuse3 (feel free to use the attached patch. It assumes that pkg-config is able to find fuse3, so install fuse3.pc in a PKG_CONFIG_PATH)
3) ntfs-3g: ./configure --with-fuse=external; make
4) "src/lowntfs-3g --version" should now print 'lowntfs-3g 2015.3.14 external FUSE 30'

5) create and mount an NTFS volume
6) create a VM disk: qemu-img create -f qcow2 disk.qcow2 20G
7) make sure that the VM actually uses async direct io (cache='none' io='native')

In my case the kernel crashed around 12 minutes after the VM was started.

Regards,
Robert

Revision history for this message

In Red Hat Bugzilla #1254310, Fedora (fedora-redhat-bugs) wrote on 2015-11-04:

#27

This message is a reminder that Fedora 21 is nearing its end of life.
Approximately 4 (four) weeks from now Fedora will stop maintaining
and issuing updates for Fedora 21. It is Fedora's policy to close all
bug reports from releases that are no longer maintained. At that time
this bug will be closed as EOL if it remains open with a Fedora 'version'
of '21'.

Package Maintainer: If you wish for this bug to remain open because you
plan to fix it in a currently maintained version, simply change the 'version'
to a later Fedora version.

Thank you for reporting this issue and we are sorry that we were not
able to fix it before Fedora 21 is end of life. If you would still like
to see this bug fixed and are able to reproduce it against a later version
of Fedora, you are encouraged change the 'version' to a later Fedora
version prior this bug is closed as described in the policy above.

Although we aim to fix as many bugs as possible during every release's
lifetime, sometimes those efforts are overtaken by events. Often a
more recent Fedora release includes newer upstream software that fixes
bugs or makes them obsolete.

Revision history for this message

In Red Hat Bugzilla #1254310, Ian (ian-redhat-bugs) wrote on 2015-11-08:

#28

Confirming that this still occurs on fresh Fedora 22 install with:

kernel-4.2.5-201.fc22.x86_64
glusterfs-fuse-3.7.5-1.fc22.x86_64
fuse-2.9.4-3.fc22.x86_64
fuse-libs-2.9.4-3.fc22.x86_64

Revision history for this message

In Red Hat Bugzilla #1254310, Ian (ian-redhat-bugs) wrote on 2015-11-11:

#29

Found a workaround (not a fix):

I recompiled kernel-4.2.5-201.fc22.x86_64 to use the older SLAB allocator instead of the default SLUB allocator. Problem avoided. No more crash when using glusterfs (fuse).

Now.. what the -bleep- is wrong with SLUB?

While using SLAB is a workaround (at least it seems to be working so far; knock-on-wood), I am uncertain what performance impacts it is going to have on my virtualization cluster. :-(

And without a run/boot-time method of switching between allocators, I am now going to have to compile my customized kernel from here on out.. not a big deal, but a nuisance.. and have to take extra care to make sure to never boot into a distro-built kernel by mistake and have everything come crashing down.

Andy Whitcroft (apw) on 2016-01-27

summary:

- Memory allocation failure crashes kernel hard, presumably related to
- FUSE
+ Memory arena corruption with FUSE (was Memory allocation failure crashes
+ kernel hard, presumably related to FUSE)

Revision history for this message

Maik Zumstrull (m-zumstrull) wrote on 2016-01-27:

#10

Screen Shot 2016-01-26 at 10.00.03.png Edit (111.6 KiB, image/png)

We've been able to confirm an out of bounds write in fuse_direct_io with the slub_debug boot option on linux-lts-wily.

Revision history for this message

Maik Zumstrull (m-zumstrull) wrote on 2016-01-27:

#11

Screen Shot 2016-01-26 at 10.00.03.png Edit (111.6 KiB, image/png)

We've been able to confirm an out of bounds write in fuse_direct_io with the slub_debug boot option on linux-lts-wily.

Revision history for this message

Robert Doebbelin (2-robert-3) wrote on 2016-01-27:

#12

Download full text (5.0 KiB)

Enabling KASAN on a Wily kernel prints the following:

Jan 27 12:02:05 ubuntu kernel: ==================================================================
Jan 27 12:02:05 ubuntu kernel: BUG: KASan: use after free in fuse_direct_IO+0xb1a/0xcc0 at addr ffff88036c414390
Jan 27 12:02:05 ubuntu kernel: Read of size 8 by task qemu-system-x86/2784
Jan 27 12:02:05 ubuntu kernel: =============================================================================
Jan 27 12:02:05 ubuntu kernel: BUG kmalloc-128 (Tainted: G I ): kasan: bad access detected
Jan 27 12:02:05 ubuntu kernel: -----------------------------------------------------------------------------
Jan 27 12:02:05 ubuntu kernel: Disabling lock debugging due to kernel taint
Jan 27 12:02:05 ubuntu kernel: INFO: Slab 0xffffea000db10500 objects=32 used=26 fp=0xffff88036c414e80 flags=0x2ffff0000000080
Jan 27 12:02:05 ubuntu kernel: INFO: Object 0xffff88036c414380 @offset=896 fp=0x (null)
Jan 27 12:02:05 ubuntu kernel: Bytes b4 ffff88036c414370: 18 00 00 00 40 27 a3 1f 3b 56 00 00 00 00 00 00 ....@'..;V......
Jan 27 12:02:05 ubuntu kernel: Object ffff88036c414380: 00 00 00 00 00 00 00 00 00 f0 75 35 00 00 00 00 ..........u5....
Jan 27 12:02:05 ubuntu kernel: Object ffff88036c414390: 80 27 67 81 ff ff ff ff 00 00 00 00 00 00 00 00 .'g.............
Jan 27 12:02:05 ubuntu kernel: Object ffff88036c4143a0: 05 00 00 00 00 00 00 00 80 82 44 ad 05 88 ff ff ..........D.....
Jan 27 12:02:05 ubuntu kernel: Object ffff88036c4143b0: 00 00 00 00 00 00 00 00 10 e1 bc 56 49 56 00 00 ...........VIV..
Jan 27 12:02:05 ubuntu kernel: Object ffff88036c4143c0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
Jan 27 12:02:05 ubuntu kernel: Object ffff88036c4143d0: 00 00 00 00 00 00 00 00 80 f6 85 6d 03 88 ff ff ...........m....
Jan 27 12:02:05 ubuntu kernel: Object ffff88036c4143e0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
Jan 27 12:02:05 ubuntu kernel: Object ffff88036c4143f0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
Jan 27 12:02:05 ubuntu kernel: CPU: 0 PID: 2784 Comm: qemu-system-x86 Tainted: G B I 4.2.0-25-generic 0000030
Jan 27 12:02:05 ubuntu kernel: Hardware name: IBM System x3550 M2 -[794654G]-/49Y6512 , BIOS -[D6E131CUS-1.05]- 11/25/2009
Jan 27 12:02:05 ubuntu kernel: ffff88036c414380 00000000d939cde9 ffff8805adf0f7c8 ffffffff828cafee
Jan 27 12:02:05 ubuntu kernel: 0000000000000080 ffff880373803680 ffff8805adf0f7f8 ffffffff81546759
Jan 27 12:02:05 ubuntu kernel: ffff880373803680 ffffea000db10500 ffff88036c414380 ffff8805ad56d600
Jan 27 12:02:05 ubuntu kernel: Call Trace:

Jan 27 12:02:05 ubuntu kernel: [< inline >] __dump_stack linux-4.2.0/lib/dump_stack.c:15
Jan 27 12:02:05 ubuntu kernel: [<ffffffff828cafee>] dump_stack+0x45/0x57 linux-4.2.0/lib/dump_stack.c:50
Jan 27 12:02:05 ubuntu kernel: [<ffffffff81546759>] print_trailer+0xf9/0x150 linux-4.2.0/mm/slub.c:650
Jan 27 12:02:05 ubuntu kernel: [<ffffffff8154b9c8>] object_err+0x38/0x50 linux-4.2.0/mm/slub.c:657
Jan 27 12:02:05 ubuntu kernel: [< inline >] print_address_description linux-4.2.0/mm/kasan/report.c:120
Jan 27 12:02:05 ubuntu kernel: [<ffffffff8154e3d8>] kasan_report_error+0x1e8/0x3f0 linux-4.2.0/...

Enabling KASAN on a Wily kernel prints the following:

Jan 27 12:02:05 ubuntu kernel: ==================================================================
Jan 27 12:02:05 ubuntu kernel: BUG: KASan: use after free in fuse_direct_IO+0xb1a/0xcc0 at addr ffff88036c414390
Jan 27 12:02:05 ubuntu kernel: Read of size 8 by task qemu-system-x86/2784
Jan 27 12:02:05 ubuntu kernel: =============================================================================
Jan 27 12:02:05 ubuntu kernel: BUG kmalloc-128 (Tainted: G I ): kasan: bad access detected
Jan 27 12:02:05 ubuntu kernel: -----------------------------------------------------------------------------
Jan 27 12:02:05 ubuntu kernel: Disabling lock debugging due to kernel taint
Jan 27 12:02:05 ubuntu kernel: INFO: Slab 0xffffea000db10500 objects=32 used=26 fp=0xffff88036c414e80 flags=0x2ffff0000000080
Jan 27 12:02:05 ubuntu kernel: INFO: Object 0xffff88036c414380 @offset=896 fp=0x (null)
Jan 27 12:02:05 ubuntu kernel: Bytes b4 ffff88036c414370: 18 00 00 00 40 27 a3 1f 3b 56 00 00 00 00 00 00 ....@'..;V......
Jan 27 12:02:05 ubuntu kernel: Object ffff88036c414380: 00 00 00 00 00 00 00 00 00 f0 75 35 00 00 00 00 ..........u5....
Jan 27 12:02:05 ubuntu kernel: Object ffff88036c414390: 80 27 67 81 ff ff ff ff 00 00 00 00 00 00 00 00 .'g.............
Jan 27 12:02:05 ubuntu kernel: Object ffff88036c4143a0: 05 00 00 00 00 00 00 00 80 82 44 ad 05 88 ff ff ..........D.....
Jan 27 12:02:05 ubuntu kernel: Object ffff88036c4143b0: 00 00 00 00 00 00 00 00 10 e1 bc 56 49 56 00 00 ...........VIV..
Jan 27 12:02:05 ubuntu kernel: Object ffff88036c4143c0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
Jan 27 12:02:05 ubuntu kernel: Object ffff88036c4143d0: 00 00 00 00 00 00 00 00 80 f6 85 6d 03 88 ff ff ...........m....
Jan 27 12:02:05 ubuntu kernel: Object ffff88036c4143e0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
Jan 27 12:02:05 ubuntu kernel: Object ffff88036c4143f0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
Jan 27 12:02:05 ubuntu kernel: CPU: 0 PID: 2784 Comm: qemu-system-x86 Tainted: G B I 4.2.0-25-generic 0000030
Jan 27 12:02:05 ubuntu kernel: Hardware name: IBM System x3550 M2 -[794654G]-/49Y6512 , BIOS -[D6E131CUS-1.05]- 11/25/2009
Jan 27 12:02:05 ubuntu kernel: ffff88036c414380 00000000d939cde9 ffff8805adf0f7c8 ffffffff828cafee
Jan 27 12:02:05 ubuntu kernel: 0000000000000080 ffff880373803680 ffff8805adf0f7f8 ffffffff81546759
Jan 27 12:02:05 ubuntu kernel: ffff880373803680 ffffea000db10500 ffff88036c414380 ffff8805ad56d600
Jan 27 12:02:05 ubuntu kernel: Call Trace:

Jan 27 12:02:05 ubuntu kernel: [< inline >] __dump_stack linux-4.2.0/lib/dump_stack.c:15
Jan 27 12:02:05 ubuntu kernel: [<ffffffff828cafee>] dump_stack+0x45/0x57 linux-4.2.0/lib/dump_stack.c:50
Jan 27 12:02:05 ubuntu kernel: [<ffffffff81546759>] print_trailer+0xf9/0x150 linux-4.2.0/mm/slub.c:650
Jan 27 12:02:05 ubuntu kernel: [<ffffffff8154b9c8>] object_err+0x38/0x50 linux-4.2.0/mm/slub.c:657
Jan 27 12:02:05 ubuntu kernel: [< inline >] print_address_description linux-4.2.0/mm/kasan/report.c:120
Jan 27 12:02:05 ubuntu kernel: [<ffffffff8154e3d8>] kasan_report_error+0x1e8/0x3f0 linux-4.2.0/mm/kasan/report.c:193
Jan 27 12:02:05 ubuntu kernel: [< inline >] kasan_report linux-4.2.0/mm/kasan/report.c:230
Jan 27 12:02:05 ubuntu kernel: [<ffffffff8154e791>] __asan_report_load8_noabort+0x61/0x70 linux-4.2.0/mm/kasan/report.c:251
Jan 27 12:02:05 ubuntu kernel: [<ffffffff818d8bfa>] fuse_direct_IO+0xb1a/0xcc0 linux-4.2.0/fs/fuse/file.c:2842
Jan 27 12:02:05 ubuntu kernel: [<ffffffff8145eda6>] generic_file_direct_write+0x246/0x540 linux-4.2.0/mm/filemap.c:2398
Jan 27 12:02:05 ubuntu kernel: [<ffffffff818da16c>] fuse_file_write_iter+0x31c/0x780 linux-4.2.0/fs/fuse/file.c:1182
Jan 27 12:02:05 ubuntu kernel: [<ffffffff81673aba>] aio_run_iocb+0x68a/0x870 linux-4.2.0/fs/aio.c:1446
Jan 27 12:02:05 ubuntu kernel: [< inline >] io_submit_one linux-4.2.0/fs/aio.c:1548
Jan 27 12:02:05 ubuntu kernel: [<ffffffff81676567>] do_io_submit+0x4a7/0xb40 linux-4.2.0/fs/aio.c:1606
Jan 27 12:02:05 ubuntu kernel: [< inline >] SYSC_io_submit linux-4.2.0/fs/aio.c:1631
Jan 27 12:02:05 ubuntu kernel: [<ffffffff81676c10>] SyS_io_submit+0x10/0x20 linux-4.2.0/fs/aio.c:1628
Jan 27 12:02:05 ubuntu kernel: [<ffffffff828dc632>] entry_SYSCALL_64_fastpath+0x16/0x75 linux-4.2.0/arch/x86/entry/entry_64.S:186
Jan 27 12:02:05 ubuntu kernel: Memory state around the buggy address:
Jan 27 12:02:05 ubuntu kernel: ffff88036c414280: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
Jan 27 12:02:05 ubuntu kernel: ffff88036c414300: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
Jan 27 12:02:05 ubuntu kernel: >ffff88036c414380: fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
Jan 27 12:02:05 ubuntu kernel: ^
Jan 27 12:02:05 ubuntu kernel: ffff88036c414400: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
Jan 27 12:02:05 ubuntu kernel: ffff88036c414480: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 fc
Jan 27 12:02:05 ubuntu kernel: ==================================================================

Revision history for this message

Andy Whitcroft (apw) wrote on 2016-01-28:

#13

Interesting that implies that we submitted some kind of async IO, and the IO must have completed and free(io). This implies that the io->req count is getting out of sync with the world. A quick eyeball says we are handling them right, but something is exploding. To try and confirm this is correct I have built a test kernel with a debugging patch applied. This bumps the io->req from 1 (the pending report for the submission of the IO) to 100. If the theory is right the io->req should go to 99 or fewer. If that occurs we should be able to detect it and report the type of the IO in flight. I also have tried to correct for it in the case where that is possible.

Would you be able to test the kernel at the below URL and let me know what you see in dmesg. If the detection triggers we should see "fuse_direct_IO: io->reg would have gone negative" messages, and I would be interested in the content of those when it occurs:

http://people.canonical.com/~apw/lp1505948-wily/

Builds will be there shortly. Please report any results back here.

Revision history for this message

Robert Doebbelin (2-robert-3) wrote on 2016-01-29:

#14

Download full text (18.5 KiB)

The bug triggers with the debug kernel, however there is no message like "fuse_direct_IO: io->reg would have gone negative" in the journal:

Jan 29 16:22:18 ubuntu dnsmasq-dhcp[896]: DHCPREQUEST(virbr0) 192.168.122.93 52:54:00:45:1c:61
Jan 29 16:22:18 ubuntu dnsmasq-dhcp[896]: DHCPACK(virbr0) 192.168.122.93 52:54:00:45:1c:61
Jan 29 16:22:51 ubuntu kernel: BUG: unable to handle kernel paging request at ffff8800904b06c0
Jan 29 16:22:51 ubuntu kernel: IP: [<ffffffff811df264>] __kmalloc+0x94/0x250
Jan 29 16:22:51 ubuntu kernel: PGD 1ff0067 PUD 3738b6063 PMD 0
Jan 29 16:22:51 ubuntu kernel: Oops: 0000 [#1] SMP
Jan 29 16:22:51 ubuntu kernel: Modules linked in: xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter ip_tables x_tables nls_iso8859_1 ipmi_ssif ipmi_devintf gpio_ich coretemp kvm_intel serio_raw kvm input_leds cdc_ether usbnet mii lpc_ich i7core_edac ioatdma edac_core i5500_temp shpchp dca 8250_fintek ipmi_si mac_hid ipmi_msghandler sunrpc autofs4 hid_generic mptsas mptscsih usbhid mptbase psmouse hid pata_acpi scsi_transport_sas bnx2
Jan 29 16:22:51 ubuntu kernel: CPU: 4 PID: 21954 Comm: qemu-system-x86 Tainted: G I 4.2.0-27-generic #32lp1505948v201601281755
Jan 29 16:22:51 ubuntu kernel: Hardware name: IBM System x3550 M2 -[794654G]-/49Y6512 , BIOS -[D6E131CUS-1.05]- 11/25/2009
Jan 29 16:22:51 ubuntu kernel: task: ffff880380e98c80 ti: ffff8803811d4000 task.ti: ffff8803811d4000
Jan 29 16:22:51 ubuntu kernel: RIP: 0010:[<ffffffff811df264>] [<ffffffff811df264>] __kmalloc+0x94/0x250
Jan 29 16:22:51 ubuntu kernel: RSP: 0018:ffff8803811d79c8 EFLAGS: 00010286
Jan 29 16:22:51 ubuntu kernel: RAX: 0000000000000000 RBX: 00000000000000d0 RCX: 000000000009d36e
Jan 29 16:22:51 ubuntu kernel: RDX: 000000000009d36d RSI: 0000000000000000 RDI: 0000000000019aa0
Jan 29 16:22:51 ubuntu kernel: RBP: ffff8803811d7a08 R08: ffff88067fc19aa0 R09: ffffffff812f8d56
Jan 29 16:22:51 ubuntu kernel: R10: ffff8800904b06c0 R11: 000000000000081a R12: 00000000000000d0
Jan 29 16:22:51 ubuntu kernel: R13: 0000000000000058 R14: ffff8803738037c0 R15: ffff8803738037c0
Jan 29 16:22:51 ubuntu kernel: FS: 00007f384a78eb00(0000) GS:ffff88067fc00000(0000) knlGS:0000000000000000
Jan 29 16:22:51 ubuntu kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Jan 29 16:22:51 ubuntu kernel: CR2: ffff8800904b06c0 CR3: 00000002da9d5000 CR4: 00000000000026e0
Jan 29 16:22:51 ubuntu kernel: Stack:
Jan 29 16:22:51 ubuntu kernel: ffff8803811d7a18 ffffffff812f8d56 ffff880371e2b200 ffff8805993ae0d0
Jan 29 16:22:51 ubuntu kernel: 000000000000000b 00000000000000d0 0000000000000058 ffff8805993ae210
Jan 29 16:22:51 ubuntu kernel: ffff8803811d7a58 ffffffff812f8d56 ffff8803811d7a38 ffff8805993ae0d0
Jan 29 16:22:51 ubuntu kernel: Call Trace:
Jan 29 16:22:51 ubuntu kernel: [<ffffffff812f8d56>] ? __fuse_request_alloc+0x56/0xd0
Jan 29 16:22:51 ubuntu kernel: [<ffffffff812f8d56>] __fuse_request_alloc+0x56/0xd0
Jan 29 16:22:51 ubuntu kernel: [<ffffffff812f9026>] _...

The bug triggers with the debug kernel, however there is no message like "fuse_direct_IO: io->reg would have gone negative" in the journal:

Jan 29 16:22:18 ubuntu dnsmasq-dhcp[896]: DHCPREQUEST(virbr0) 192.168.122.93 52:54:00:45:1c:61
Jan 29 16:22:18 ubuntu dnsmasq-dhcp[896]: DHCPACK(virbr0) 192.168.122.93 52:54:00:45:1c:61
Jan 29 16:22:51 ubuntu kernel: BUG: unable to handle kernel paging request at ffff8800904b06c0
Jan 29 16:22:51 ubuntu kernel: IP: [<ffffffff811df264>] __kmalloc+0x94/0x250
Jan 29 16:22:51 ubuntu kernel: PGD 1ff0067 PUD 3738b6063 PMD 0 
Jan 29 16:22:51 ubuntu kernel: Oops: 0000 [#1] SMP 
Jan 29 16:22:51 ubuntu kernel: Modules linked in: xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter ip_tables x_tables nls_iso8859_1 ipmi_ssif ipmi_devintf gpio_ich coretemp kvm_intel serio_raw kvm input_leds cdc_ether usbnet mii lpc_ich i7core_edac ioatdma edac_core i5500_temp shpchp dca 8250_fintek ipmi_si mac_hid ipmi_msghandler sunrpc autofs4 hid_generic mptsas mptscsih usbhid mptbase psmouse hid pata_acpi scsi_transport_sas bnx2
Jan 29 16:22:51 ubuntu kernel: CPU: 4 PID: 21954 Comm: qemu-system-x86 Tainted: G          I     4.2.0-27-generic #32lp1505948v201601281755
Jan 29 16:22:51 ubuntu kernel: Hardware name: IBM System x3550 M2 -[794654G]-/49Y6512     , BIOS -[D6E131CUS-1.05]- 11/25/2009
Jan 29 16:22:51 ubuntu kernel: task: ffff880380e98c80 ti: ffff8803811d4000 task.ti: ffff8803811d4000
Jan 29 16:22:51 ubuntu kernel: RIP: 0010:[<ffffffff811df264>]  [<ffffffff811df264>] __kmalloc+0x94/0x250
Jan 29 16:22:51 ubuntu kernel: RSP: 0018:ffff8803811d79c8  EFLAGS: 00010286
Jan 29 16:22:51 ubuntu kernel: RAX: 0000000000000000 RBX: 00000000000000d0 RCX: 000000000009d36e
Jan 29 16:22:51 ubuntu kernel: RDX: 000000000009d36d RSI: 0000000000000000 RDI: 0000000000019aa0
Jan 29 16:22:51 ubuntu kernel: RBP: ffff8803811d7a08 R08: ffff88067fc19aa0 R09: ffffffff812f8d56
Jan 29 16:22:51 ubuntu kernel: R10: ffff8800904b06c0 R11: 000000000000081a R12: 00000000000000d0
Jan 29 16:22:51 ubuntu kernel: R13: 0000000000000058 R14: ffff8803738037c0 R15: ffff8803738037c0
Jan 29 16:22:51 ubuntu kernel: FS:  00007f384a78eb00(0000) GS:ffff88067fc00000(0000) knlGS:0000000000000000
Jan 29 16:22:51 ubuntu kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Jan 29 16:22:51 ubuntu kernel: CR2: ffff8800904b06c0 CR3: 00000002da9d5000 CR4: 00000000000026e0
Jan 29 16:22:51 ubuntu kernel: Stack:
Jan 29 16:22:51 ubuntu kernel:  ffff8803811d7a18 ffffffff812f8d56 ffff880371e2b200 ffff8805993ae0d0
Jan 29 16:22:51 ubuntu kernel:  000000000000000b 00000000000000d0 0000000000000058 ffff8805993ae210
Jan 29 16:22:51 ubuntu kernel:  ffff8803811d7a58 ffffffff812f8d56 ffff8803811d7a38 ffff8805993ae0d0
Jan 29 16:22:51 ubuntu kernel: Call Trace:
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff812f8d56>] ? __fuse_request_alloc+0x56/0xd0
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff812f8d56>] __fuse_request_alloc+0x56/0xd0
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff812f9026>] __fuse_get_req+0x1d6/0x280
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff810bd7d0>] ? wake_atomic_t_function+0x60/0x60
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff812f90e0>] fuse_get_req+0x10/0x20
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff8130389d>] fuse_direct_io+0x4fd/0x5c0
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff812fce2f>] ? fuse_getxattr+0x12f/0x160
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff811de8e7>] ? kmem_cache_alloc_trace+0x187/0x1f0
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff8130445f>] ? fuse_direct_IO+0xff/0x3b0
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff813044f3>] fuse_direct_IO+0x193/0x3b0
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff811843b9>] generic_file_direct_write+0xb9/0x180
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff81304efc>] fuse_file_write_iter+0x15c/0x2e0
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff813267cd>] ? security_file_permission+0x3d/0xc0
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff81304da0>] ? fuse_perform_write+0x540/0x540
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff8124adff>] aio_run_iocb+0x27f/0x2e0
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff8123f046>] ? fsnotify+0x316/0x4a0
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff8121b265>] ? __fget_light+0x25/0x60
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff8124bcdb>] do_io_submit+0x24b/0x4f0
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff810a6240>] ? wake_up_q+0x70/0x70
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff8124bf90>] SyS_io_submit+0x10/0x20
Jan 29 16:22:51 ubuntu kernel:  [<ffffffff817f2532>] entry_SYSCALL_64_fastpath+0x16/0x75
Jan 29 16:22:51 ubuntu kernel: Code: 08 65 4c 03 05 36 af e2 7e 49 83 78 10 00 4d 8b 10 0f 84 36 01 00 00 4d 85 d2 0f 84 2d 01 00 00 49 63 46 20 48 8d 4a 01 49 8b 3e <49> 8b 1c 02 4c 89 d0 65 48 0f c7 0f 0f 94 c0 84 c0 74 bb 49 63 
Jan 29 16:22:51 ubuntu kernel: RIP  [<ffffffff811df264>] __kmalloc+0x94/0x250
Jan 29 16:22:51 ubuntu kernel:  RSP <ffff8803811d79c8>
Jan 29 16:22:51 ubuntu kernel: CR2: ffff8800904b06c0
Jan 29 16:22:51 ubuntu kernel: ---[ end trace 1ebba465731d9933 ]---
Jan 29 16:22:52 ubuntu kernel: BUG: unable to handle kernel paging request at ffff8800904b06c0
Jan 29 16:22:52 ubuntu kernel: IP: [<ffffffff811de7da>] kmem_cache_alloc_trace+0x7a/0x1f0
Jan 29 16:22:52 ubuntu kernel: PGD 1ff0067 PUD 3738b6063 PMD 0 
Jan 29 16:22:52 ubuntu kernel: Oops: 0000 [#2] SMP 
Jan 29 16:22:52 ubuntu kernel: Modules linked in: xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter ip_tables x_tables nls_iso8859_1 ipmi_ssif ipmi_devintf gpio_ich coretemp kvm_intel serio_raw kvm input_leds cdc_ether usbnet mii lpc_ich i7core_edac ioatdma edac_core i5500_temp shpchp dca 8250_fintek ipmi_si mac_hid ipmi_msghandler sunrpc autofs4 hid_generic mptsas mptscsih usbhid mptbase psmouse hid pata_acpi scsi_transport_sas bnx2
Jan 29 16:22:52 ubuntu kernel: CPU: 4 PID: 21994 Comm: qemu-system-x86 Tainted: G      D   I     4.2.0-27-generic #32lp1505948v201601281755
Jan 29 16:22:52 ubuntu kernel: Hardware name: IBM System x3550 M2 -[794654G]-/49Y6512     , BIOS -[D6E131CUS-1.05]- 11/25/2009
Jan 29 16:22:52 ubuntu kernel: task: ffff88048cb88000 ti: ffff88062d63c000 task.ti: ffff88062d63c000
Jan 29 16:22:52 ubuntu kernel: RIP: 0010:[<ffffffff811de7da>]  [<ffffffff811de7da>] kmem_cache_alloc_trace+0x7a/0x1f0
Jan 29 16:22:52 ubuntu kernel: RSP: 0018:ffff88062d63fb68  EFLAGS: 00010286
Jan 29 16:22:52 ubuntu kernel: RAX: 0000000000000000 RBX: 00000000000000d0 RCX: 000000000009d36e
Jan 29 16:22:52 ubuntu kernel: RDX: 000000000009d36d RSI: 00000000000000d0 RDI: 0000000000019aa0
Jan 29 16:22:52 ubuntu kernel: RBP: ffff88062d63fba8 R08: ffff88067fc19aa0 R09: ffffffff8130445f
Jan 29 16:22:52 ubuntu kernel: R10: 0000000000000000 R11: 0000000000000337 R12: 00000000000000d0
Jan 29 16:22:52 ubuntu kernel: R13: ffff8803738037c0 R14: ffff8800904b06c0 R15: ffff8803738037c0
Jan 29 16:22:52 ubuntu kernel: FS:  00007f4931ae9b00(0000) GS:ffff88067fc00000(0000) knlGS:0000000000000000
Jan 29 16:22:52 ubuntu kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 29 16:22:52 ubuntu kernel: CR2: ffff8800904b06c0 CR3: 000000020a958000 CR4: 00000000000026e0
Jan 29 16:22:52 ubuntu kernel: Stack:
Jan 29 16:22:52 ubuntu kernel:  ffffffff8130445f 0000000000000048 0000000000000008 000000000a465000
Jan 29 16:22:52 ubuntu kernel:  ffff880636b1cc00 ffff88000ee95d00 0000000000000001 ffff88062d63fc78
Jan 29 16:22:52 ubuntu kernel:  ffff88062d63fc48 ffffffff8130445f ffff88062d63fb58 0000000000000000
Jan 29 16:22:52 ubuntu kernel: Call Trace:
Jan 29 16:22:52 ubuntu kernel:  [<ffffffff8130445f>] ? fuse_direct_IO+0xff/0x3b0
Jan 29 16:22:52 ubuntu kernel:  [<ffffffff8130445f>] fuse_direct_IO+0xff/0x3b0
Jan 29 16:22:52 ubuntu kernel:  [<ffffffff811843b9>] generic_file_direct_write+0xb9/0x180
Jan 29 16:22:52 ubuntu kernel:  [<ffffffff81304efc>] fuse_file_write_iter+0x15c/0x2e0
Jan 29 16:22:52 ubuntu kernel:  [<ffffffff813267cd>] ? security_file_permission+0x3d/0xc0
Jan 29 16:22:52 ubuntu kernel:  [<ffffffff81304da0>] ? fuse_perform_write+0x540/0x540
Jan 29 16:22:52 ubuntu kernel:  [<ffffffff8124adff>] aio_run_iocb+0x27f/0x2e0
Jan 29 16:22:52 ubuntu kernel:  [<ffffffff8123f046>] ? fsnotify+0x316/0x4a0
Jan 29 16:22:52 ubuntu kernel:  [<ffffffff8121b265>] ? __fget_light+0x25/0x60
Jan 29 16:22:52 ubuntu kernel:  [<ffffffff8124bcdb>] do_io_submit+0x24b/0x4f0
Jan 29 16:22:52 ubuntu kernel:  [<ffffffff810a6240>] ? wake_up_q+0x70/0x70
Jan 29 16:22:52 ubuntu kernel:  [<ffffffff8124bf90>] SyS_io_submit+0x10/0x20
Jan 29 16:22:52 ubuntu kernel:  [<ffffffff817f2532>] entry_SYSCALL_64_fastpath+0x16/0x75
Jan 29 16:22:52 ubuntu kernel: Code: 65 4c 03 05 c1 b9 e2 7e 49 83 78 10 00 4d 8b 30 0f 84 2b 01 00 00 4d 85 f6 0f 84 22 01 00 00 49 63 45 20 48 8d 4a 01 49 8b 7d 00 <49> 8b 1c 06 4c 89 f0 65 48 0f c7 0f 0f 94 c0 84 c0 74 b9 49 63 
Jan 29 16:22:52 ubuntu kernel: RIP  [<ffffffff811de7da>] kmem_cache_alloc_trace+0x7a/0x1f0
Jan 29 16:22:52 ubuntu kernel:  RSP <ffff88062d63fb68>
Jan 29 16:22:52 ubuntu kernel: CR2: ffff8800904b06c0
Jan 29 16:22:52 ubuntu kernel: ---[ end trace 1ebba465731d9934 ]---
Jan 29 16:22:53 ubuntu kernel: BUG: unable to handle kernel paging request at ffff8800904b06c0
Jan 29 16:22:53 ubuntu kernel: IP: [<ffffffff811de7da>] kmem_cache_alloc_trace+0x7a/0x1f0
Jan 29 16:22:53 ubuntu kernel: PGD 1ff0067 PUD 3738b6063 PMD 0 
Jan 29 16:22:53 ubuntu kernel: Oops: 0000 [#3] SMP 
Jan 29 16:22:53 ubuntu kernel: Modules linked in: xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter ip_tables x_tables nls_iso8859_1 ipmi_ssif ipmi_devintf gpio_ich coretemp kvm_intel serio_raw kvm input_leds cdc_ether usbnet mii lpc_ich i7core_edac ioatdma edac_core i5500_temp shpchp dca 8250_fintek ipmi_si mac_hid ipmi_msghandler sunrpc autofs4 hid_generic mptsas mptscsih usbhid mptbase psmouse hid pata_acpi scsi_transport_sas bnx2
Jan 29 16:22:53 ubuntu kernel: CPU: 4 PID: 21888 Comm: qemu-system-x86 Tainted: G      D   I     4.2.0-27-generic #32lp1505948v201601281755
Jan 29 16:22:53 ubuntu kernel: Hardware name: IBM System x3550 M2 -[794654G]-/49Y6512     , BIOS -[D6E131CUS-1.05]- 11/25/2009
Jan 29 16:22:53 ubuntu kernel: task: ffff8806681a5780 ti: ffff88005c210000 task.ti: ffff88005c210000
Jan 29 16:22:53 ubuntu kernel: RIP: 0010:[<ffffffff811de7da>]  [<ffffffff811de7da>] kmem_cache_alloc_trace+0x7a/0x1f0
Jan 29 16:22:53 ubuntu kernel: RSP: 0018:ffff88005c213b68  EFLAGS: 00010286
Jan 29 16:22:53 ubuntu kernel: RAX: 0000000000000000 RBX: 00000000000000d0 RCX: 000000000009d36e
Jan 29 16:22:53 ubuntu kernel: RDX: 000000000009d36d RSI: 00000000000000d0 RDI: 0000000000019aa0
Jan 29 16:22:53 ubuntu kernel: RBP: ffff88005c213ba8 R08: ffff88067fc19aa0 R09: ffffffff8130445f
Jan 29 16:22:53 ubuntu kernel: R10: ffffea001664eb00 R11: 0000000000000f1b R12: 00000000000000d0
Jan 29 16:22:53 ubuntu kernel: R13: ffff8803738037c0 R14: ffff8800904b06c0 R15: ffff8803738037c0
Jan 29 16:22:53 ubuntu kernel: FS:  00007f975c60fb00(0000) GS:ffff88067fc00000(0000) knlGS:0000000000000000
Jan 29 16:22:53 ubuntu kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 29 16:22:53 ubuntu kernel: CR2: ffff8800904b06c0 CR3: 00000003705cf000 CR4: 00000000000026e0
Jan 29 16:22:53 ubuntu kernel: Stack:
Jan 29 16:22:53 ubuntu kernel:  ffffffff8130445f 0000000000000048 0000000000000008 000000000a1b0000
Jan 29 16:22:53 ubuntu kernel:  ffff8804aa0a2c80 ffff8802b7d4db00 0000000000000001 ffff88005c213c78
Jan 29 16:22:53 ubuntu kernel:  ffff88005c213c48 ffffffff8130445f ffff88005c213b58 0000000000000000
Jan 29 16:22:53 ubuntu kernel: Call Trace:
Jan 29 16:22:53 ubuntu kernel:  [<ffffffff8130445f>] ? fuse_direct_IO+0xff/0x3b0
Jan 29 16:22:53 ubuntu kernel:  [<ffffffff8130445f>] fuse_direct_IO+0xff/0x3b0
Jan 29 16:22:53 ubuntu kernel:  [<ffffffff811843b9>] generic_file_direct_write+0xb9/0x180
Jan 29 16:22:53 ubuntu kernel:  [<ffffffff81304efc>] fuse_file_write_iter+0x15c/0x2e0
Jan 29 16:22:53 ubuntu kernel:  [<ffffffff813267cd>] ? security_file_permission+0x3d/0xc0
Jan 29 16:22:53 ubuntu kernel:  [<ffffffff81304da0>] ? fuse_perform_write+0x540/0x540
Jan 29 16:22:53 ubuntu kernel:  [<ffffffff8124adff>] aio_run_iocb+0x27f/0x2e0
Jan 29 16:22:53 ubuntu kernel:  [<ffffffff811b57bf>] ? handle_mm_fault+0xb7f/0x17e0
Jan 29 16:22:53 ubuntu kernel:  [<ffffffff8121b265>] ? __fget_light+0x25/0x60
Jan 29 16:22:53 ubuntu kernel:  [<ffffffff8124bcdb>] do_io_submit+0x24b/0x4f0
Jan 29 16:22:53 ubuntu kernel:  [<ffffffff8124bf90>] SyS_io_submit+0x10/0x20
Jan 29 16:22:53 ubuntu kernel:  [<ffffffff817f2532>] entry_SYSCALL_64_fastpath+0x16/0x75
Jan 29 16:22:53 ubuntu kernel: Code: 65 4c 03 05 c1 b9 e2 7e 49 83 78 10 00 4d 8b 30 0f 84 2b 01 00 00 4d 85 f6 0f 84 22 01 00 00 49 63 45 20 48 8d 4a 01 49 8b 7d 00 <49> 8b 1c 06 4c 89 f0 65 48 0f c7 0f 0f 94 c0 84 c0 74 b9 49 63 
Jan 29 16:22:53 ubuntu kernel: RIP  [<ffffffff811de7da>] kmem_cache_alloc_trace+0x7a/0x1f0
Jan 29 16:22:53 ubuntu kernel:  RSP <ffff88005c213b68>
Jan 29 16:22:53 ubuntu kernel: CR2: ffff8800904b06c0
Jan 29 16:22:53 ubuntu kernel: ---[ end trace 1ebba465731d9935 ]---
Jan 29 16:22:54 ubuntu kernel: BUG: unable to handle kernel paging request at ffff8800904b06c0
Jan 29 16:22:54 ubuntu kernel: IP: [<ffffffff811df264>] __kmalloc+0x94/0x250
Jan 29 16:22:54 ubuntu kernel: PGD 1ff0067 PUD 3738b6063 PMD 0 
Jan 29 16:22:54 ubuntu kernel: Oops: 0000 [#4] SMP 
Jan 29 16:22:54 ubuntu kernel: Modules linked in: xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter ip_tables x_tables nls_iso8859_1 ipmi_ssif ipmi_devintf gpio_ich coretemp kvm_intel serio_raw kvm input_leds cdc_ether usbnet mii lpc_ich i7core_edac ioatdma edac_core i5500_temp shpchp dca 8250_fintek ipmi_si mac_hid ipmi_msghandler sunrpc autofs4 hid_generic mptsas mptscsih usbhid mptbase psmouse hid pata_acpi scsi_transport_sas bnx2
Jan 29 16:22:54 ubuntu kernel: CPU: 4 PID: 294 Comm: jbd2/sda2-8 Tainted: G      D   I     4.2.0-27-generic #32lp1505948v201601281755
Jan 29 16:22:54 ubuntu kernel: Hardware name: IBM System x3550 M2 -[794654G]-/49Y6512     , BIOS -[D6E131CUS-1.05]- 11/25/2009
Jan 29 16:22:54 ubuntu kernel: task: ffff88066e496400 ti: ffff88036e98c000 task.ti: ffff88036e98c000
Jan 29 16:22:54 ubuntu kernel: RIP: 0010:[<ffffffff811df264>]  [<ffffffff811df264>] __kmalloc+0x94/0x250
Jan 29 16:22:54 ubuntu kernel: RSP: 0018:ffff88036e98f898  EFLAGS: 00010286
Jan 29 16:22:54 ubuntu kernel: RAX: 0000000000000000 RBX: 0000000000008050 RCX: 000000000009d36e
Jan 29 16:22:54 ubuntu kernel: RDX: 000000000009d36d RSI: 0000000000000000 RDI: 0000000000019aa0
Jan 29 16:22:54 ubuntu kernel: RBP: ffff88036e98f8d8 R08: ffff88067fc19aa0 R09: ffffffff812b0d79
Jan 29 16:22:54 ubuntu kernel: R10: ffff8800904b06c0 R11: 0000000000000004 R12: 0000000000008050
Jan 29 16:22:54 ubuntu kernel: R13: 0000000000000060 R14: ffff8803738037c0 R15: ffff8803738037c0
Jan 29 16:22:54 ubuntu kernel: FS:  0000000000000000(0000) GS:ffff88067fc00000(0000) knlGS:0000000000000000
Jan 29 16:22:54 ubuntu kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Jan 29 16:22:54 ubuntu kernel: CR2: ffff8800904b06c0 CR3: 0000000001c0c000 CR4: 00000000000026e0
Jan 29 16:22:54 ubuntu kernel: Stack:
Jan 29 16:22:54 ubuntu kernel:  ffff8803703880c0 ffffffff812b0d79 0000000000000000 0000000000002056
Jan 29 16:22:54 ubuntu kernel:  0000000000002056 0000000000000000 0000000000000000 ffff880370388000
Jan 29 16:22:54 ubuntu kernel:  ffff88036e98f948 ffffffff812b0d79 ffff8803703880c0 ffff88066e496468
Jan 29 16:22:54 ubuntu kernel: Call Trace:
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff812b0d79>] ? ext4_find_extent+0x1b9/0x320
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff812b0d79>] ext4_find_extent+0x1b9/0x320
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff812b5488>] ext4_ext_map_blocks+0x88/0xe30
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff810b345b>] ? dequeue_task_fair+0x36b/0x700
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff81285e6b>] ext4_map_blocks+0x9b/0x4a0
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff817eec80>] ? bit_wait+0x60/0x60
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff8128632f>] _ext4_get_block+0xbf/0x220
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff810ed9d7>] ? ktime_get+0x37/0xa0
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff812864a6>] ext4_get_block+0x16/0x20
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff812325ee>] generic_block_bmap+0x4e/0x70
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff812d5828>] ? journal_submit_data_buffers+0x48/0x1b0
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff81283327>] ext4_bmap+0x77/0xe0
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff81217b1c>] bmap+0x1c/0x30
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff812dd70f>] jbd2_journal_bmap+0x2f/0x80
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff812dd7cb>] jbd2_journal_next_log_block+0x6b/0x80
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff812dd9db>] jbd2_journal_get_descriptor_buffer+0x2b/0xb0
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff812d64e1>] jbd2_journal_commit_transaction+0x991/0x1690
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff810b345b>] ? dequeue_task_fair+0x36b/0x700
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff810b2821>] ? put_prev_entity+0x31/0x420
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff810e5b8e>] ? try_to_del_timer_sync+0x5e/0x90
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff812da9da>] kjournald2+0xca/0x250
Jan 29 16:22:54 ubuntu kernel:  [<ffffffff810bd7d0>] ? wake_atomic_t_function+0x60/0x60
Jan 29 16:22:55 ubuntu kernel:  [<ffffffff812da910>] ? commit_timeout+0x10/0x10
Jan 29 16:22:55 ubuntu kernel:  [<ffffffff8109ae48>] kthread+0xd8/0xf0
Jan 29 16:22:55 ubuntu kernel:  [<ffffffff8109ad70>] ? kthread_create_on_node+0x1f0/0x1f0
Jan 29 16:22:55 ubuntu kernel:  [<ffffffff817f295f>] ret_from_fork+0x3f/0x70
Jan 29 16:22:55 ubuntu kernel:  [<ffffffff8109ad70>] ? kthread_create_on_node+0x1f0/0x1f0
Jan 29 16:22:55 ubuntu kernel: Code: 08 65 4c 03 05 36 af e2 7e 49 83 78 10 00 4d 8b 10 0f 84 36 01 00 00 4d 85 d2 0f 84 2d 01 00 00 49 63 46 20 48 8d 4a 01 49 8b 3e <49> 8b 1c 02 4c 89 d0 65 48 0f c7 0f 0f 94 c0 84 c0 74 bb 49 63 
Jan 29 16:22:55 ubuntu kernel: RIP  [<ffffffff811df264>] __kmalloc+0x94/0x250
Jan 29 16:22:55 ubuntu kernel:  RSP <ffff88036e98f898>
Jan 29 16:22:55 ubuntu kernel: CR2: ffff8800904b06c0
Jan 29 16:22:55 ubuntu kernel: ---[ end trace 1ebba465731d9936 ]---

Revision history for this message

In Red Hat Bugzilla #1254310, Ian (ian-redhat-bugs) wrote on 2016-02-13:

#30

Download full text (4.3 KiB)

I am trying to get some traction on this bug, open for 6 months with no responses.

I have attempted to remove some variables from the equation to see what factors are potentially contributing to this kernel BUG.

First test:

I have replicated the issue on a host that does NOT run a glusterfsd, and thus only consumes a vm image from a separate server, eliminating any potential conflict from having both glusterfs server and client on the same node.

Also, the original hosts used when this bug was first reported were Supermicro Avoton Atom C2750/58. This new replication of the fault is on an older Dell PE2950 (Xeon E54xx), so the specific hardware does not seem to be a factor in the bug.

Reproduction steps:

- Fresh install of Fedora Server 22, minimal package set, with online updates.
- rpm -Uvh http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm.
- Add this node as a new host via oVirt WebAdmin.
- Start a VM on this new node, using a disk image that resides on a glusterfs storage domain.
- Boom!

kernel-4.3.4-200.fc22.x86_64
glusterfs-fuse-3.7.6-1.fc22.x86_64
fuse-2.9.4-3.fc22.x86_64
fuse-libs-2.9.4-3.fc22.x86_64

[ 316.458148] ------------[ cut here ]------------
[ 316.459052] kernel BUG at mm/slub.c:3517!
[ 316.459052] invalid opcode: 0000 [#1] SMP
[ 316.459052] Modules linked in: vhost_net vhost macvtap macvlan ebt_arp ebtable_nat tun nfsv3 nfs fscache fuse ebtable_filter ebtables ip6table_filter ip6_tables scsi_transport_iscsi xt_physdev br_netfilter nf_conntrack_ipv4 nf_defrag_ipv4 xt_multiport xt_conntrack dm_service_time nf_conntrack coretemp kvm_intel iTCO_wdt ipmi_ssif iTCO_vendor_support gpio_ich kvm ipmi_devintf dcdbas bnx2 lpc_ich ipmi_si i5000_edac edac_core ipmi_msghandler i5k_amb shpchp fjes acpi_cpufreq tpm_tis tpm nfsd 8021q auth_rpcgss garp mrp bridge nfs_acl lockd stp grace llc sunrpc bonding dm_multipath amdkfd amd_iommu_v2 radeon i2c_algo_bit drm_kms_helper ttm drm ata_generic serio_raw pata_acpi megaraid_sas
[ 316.515263] CPU: 2 PID: 3055 Comm: qemu-system-x86 Not tainted 4.3.4-200.fc22.x86_64 #1
[ 316.515263] Hardware name: Dell Inc. PowerEdge 2950/0M332H, BIOS 2.7.0 10/30/2010
[ 316.515263] task: ffff88041cbbb980 ti: ffff880418e94000 task.ti: ffff880418e94000
[ 316.515263] RIP: 0010:[<ffffffff81203edc>] [<ffffffff81203edc>] kfree+0x12c/0x130
[ 316.515263] RSP: 0018:ffff880418e97cc8 EFLAGS: 00010246
[ 316.515263] RAX: 003ffff800000000 RBX: ffff88002a43fea0 RCX: dead000000000200
[ 316.515263] RDX: 000077ff80000000 RSI: ffff88041cbbb980 RDI: ffff88002a43fea0
[ 316.515263] RBP: ffff880418e97ce0 R08: ffff880418e97ca8 R09: ffffea0000a90fc0
[ 316.515263] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000006e30400
[ 316.515263] R13: ffffffffa054c60e R14: ffff88042b32e400 R15: ffff880418e97dc8
[ 316.515263] FS: 00007f1c4e3ff700(0000) GS:ffff88043fc80000(0000) knlGS:0000000000000000
[ 316.515263] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[ 316.515263] CR2: 0000000000000000 CR3: 0000000418c96000 CR4: 00000000000026e0
[ 316.515263] Stack:
[ 316.515263] ffff88002a43fea0 0000000006e30400 ffff880418e97e60 ffff880418e97d68
[ 316.515263] ffffffffa054c60e 0000000000007c00 ffff88041c944c...

I am trying to get some traction on this bug, open for 6 months with no responses.

I have attempted to remove some variables from the equation to see what factors are potentially contributing to this kernel BUG.

First test:

I have replicated the issue on a host that does NOT run a glusterfsd, and thus only consumes a vm image from a separate server, eliminating any potential conflict from having both glusterfs server and client on the same node.

Also, the original hosts used when this bug was first reported were Supermicro Avoton Atom C2750/58. This new replication of the fault is on an older Dell PE2950 (Xeon E54xx), so the specific hardware does not seem to be a factor in the bug.

Reproduction steps:

- Fresh install of Fedora Server 22, minimal package set, with online updates.
- rpm -Uvh http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm.
- Add this node as a new host via oVirt WebAdmin.
- Start a VM on this new node, using a disk image that resides on a glusterfs storage domain.
- Boom!

kernel-4.3.4-200.fc22.x86_64
glusterfs-fuse-3.7.6-1.fc22.x86_64
fuse-2.9.4-3.fc22.x86_64
fuse-libs-2.9.4-3.fc22.x86_64

[  316.458148] ------------[ cut here ]------------
[  316.459052] kernel BUG at mm/slub.c:3517!
[  316.459052] invalid opcode: 0000 [#1] SMP 
[  316.459052] Modules linked in: vhost_net vhost macvtap macvlan ebt_arp ebtable_nat tun nfsv3 nfs fscache fuse ebtable_filter ebtables ip6table_filter ip6_tables scsi_transport_iscsi xt_physdev br_netfilter nf_conntrack_ipv4 nf_defrag_ipv4 xt_multiport xt_conntrack dm_service_time nf_conntrack coretemp kvm_intel iTCO_wdt ipmi_ssif iTCO_vendor_support gpio_ich kvm ipmi_devintf dcdbas bnx2 lpc_ich ipmi_si i5000_edac edac_core ipmi_msghandler i5k_amb shpchp fjes acpi_cpufreq tpm_tis tpm nfsd 8021q auth_rpcgss garp mrp bridge nfs_acl lockd stp grace llc sunrpc bonding dm_multipath amdkfd amd_iommu_v2 radeon i2c_algo_bit drm_kms_helper ttm drm ata_generic serio_raw pata_acpi megaraid_sas
[  316.515263] CPU: 2 PID: 3055 Comm: qemu-system-x86 Not tainted 4.3.4-200.fc22.x86_64 #1
[  316.515263] Hardware name: Dell Inc. PowerEdge 2950/0M332H, BIOS 2.7.0 10/30/2010
[  316.515263] task: ffff88041cbbb980 ti: ffff880418e94000 task.ti: ffff880418e94000
[  316.515263] RIP: 0010:[<ffffffff81203edc>]  [<ffffffff81203edc>] kfree+0x12c/0x130
[  316.515263] RSP: 0018:ffff880418e97cc8  EFLAGS: 00010246
[  316.515263] RAX: 003ffff800000000 RBX: ffff88002a43fea0 RCX: dead000000000200
[  316.515263] RDX: 000077ff80000000 RSI: ffff88041cbbb980 RDI: ffff88002a43fea0
[  316.515263] RBP: ffff880418e97ce0 R08: ffff880418e97ca8 R09: ffffea0000a90fc0
[  316.515263] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000006e30400
[  316.515263] R13: ffffffffa054c60e R14: ffff88042b32e400 R15: ffff880418e97dc8
[  316.515263] FS:  00007f1c4e3ff700(0000) GS:ffff88043fc80000(0000) knlGS:0000000000000000
[  316.515263] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[  316.515263] CR2: 0000000000000000 CR3: 0000000418c96000 CR4: 00000000000026e0
[  316.515263] Stack:
[  316.515263]  ffff88002a43fea0 0000000006e30400 ffff880418e97e60 ffff880418e97d68
[  316.515263]  ffffffffa054c60e 0000000000007c00 ffff88041c944c00 0000000280000000
[  316.515263]  0000000000007c00 0000000006e38000 0000000000000000 0000000000000000
[  316.515263] Call Trace:
[  316.515263]  [<ffffffffa054c60e>] fuse_direct_IO+0x1ee/0x310 [fuse]
[  316.515263]  [<ffffffff811a791b>] generic_file_read_iter+0x47b/0x5c0
[  316.515263]  [<ffffffffa054910c>] fuse_file_read_iter+0x4c/0x70 [fuse]
[  316.515263]  [<ffffffff81223346>] __vfs_read+0xc6/0x100
[  316.515263]  [<ffffffff81223d73>] vfs_read+0x83/0x130
[  316.515263]  [<ffffffff81224c85>] SyS_pread64+0x95/0xb0
[  316.515263]  [<ffffffff8178182e>] entry_SYSCALL_64_fastpath+0x12/0x71
[  316.515263] Code: 2a 49 8b 01 31 f6 f6 c4 40 74 04 41 8b 71 68 4c 89 cf e8 58 a2 fa ff eb a0 4c 89 d1 48 89 da 4c 89 ce e8 78 fa ff ff eb 90 0f 0b <0f> 0b 66 90 66 66 66 66 90 55 48 89 e5 41 57 41 56 41 55 41 54 
[  316.515263] RIP  [<ffffffff81203edc>] kfree+0x12c/0x130
[  316.515263]  RSP <ffff880418e97cc8>
[  316.904456] ---[ end trace a63508bc8d44e7be ]---

Second test:

Same as above, but with a fresh install of CentOS 7.2 (1511).
Result: No bug triggered. The VM runs just fine.

kernel-3.10.0-327.4.5.el7.x86_64
glusterfs-fuse-3.7.6-1.el7.x86_64
fuse-2.9.2-6.el7.x86_64
fuse-libs-2.9.2-6.el7.x86_64

Revision history for this message

Martin Gerhard Loschwitz (martin-loschwitz) wrote on 2016-02-25:

#15

This also affects the Xenial Standard Kernel.

Revision history for this message

Seth Forshee (sforshee) wrote on 2016-03-10:

#16

I've been looking at the code, but I haven't found anything aside from the two races mentioned on the mailing list thread. Those could explain the original problems, but I don't have any ideas about the problems seen with the fixes applied yet.

I'm trying to reproduce now using the steps you provided in xenial but am not having any luck. My vm installed just fine and has been running for half an hour now with some synthesized disk IO. Anything you might have forgot to mention in the steps - ntfs-3g mount options, sepcific version of ntfs-3g to use, etc?

Revision history for this message

Seth Forshee (sforshee) wrote on 2016-03-10:

#17

I don't seem to be able to reproduce.

I did try making a patch though that you can try that adds a separate reference count to fuse_io_priv separate from the request count. I don't know if it fixes anything that moving spin_unlock() doesn't, but to me this seems more straightforward and less error prone than having the request count serve kind of as a reference count but not really.

A build with my patch and the iocb use-after-free fix are at http://people.canonical.com/~sforshee/lp1505948/.

Revision history for this message

Robert Doebbelin (2-robert-3) wrote on 2016-03-11:

#18

Thank you Seth for taking a close look at the problem and my proposed fix. As mentioned on the mailing list my test runs fine now with the two fixes.

However, I prefer your fix as it prevents us from running into this issue again. Our test system is happily installing VMs for two hours now using your build. Please propose your patch.

Revision history for this message

Seth Forshee (sforshee) wrote on 2016-03-11: Re: [Bug 1505948] Re: Memory arena corruption with FUSE (was Memory allocation failure crashes kernel hard, presumably related to FUSE)

#19

On Fri, Mar 11, 2016 at 01:03:32PM -0000, Robert Doebbelin wrote:
> Thank you Seth for taking a close look at the problem and my proposed
> fix. As mentioned on the mailing list my test runs fine now with the two
> fixes.
>
> However, I prefer your fix as it prevents us from running into this
> issue again. Our test system is happily installing VMs for two hours now
> using your build. Please propose your patch.

I'm not subscribed to fuse-devel and hadn't refreshed the mailing list
thread so I didn't realize that you had discovered that the hang was
unrelated. That's good.

I'm happy to send the patches, I'll go ahead and send both my patch and
your iocb patch after I make sure it all applies/builds okay on 4.5.

Revision history for this message

Robert Doebbelin (2-robert-3) wrote on 2016-03-11:

#20

Download full text (6.0 KiB)

Great, thanks!

Robert
Am 11.03.2016 15:01 schrieb "Seth Forshee" <email address hidden>:

> On Fri, Mar 11, 2016 at 01:03:32PM -0000, Robert Doebbelin wrote:
> > Thank you Seth for taking a close look at the problem and my proposed
> > fix. As mentioned on the mailing list my test runs fine now with the two
> > fixes.
> >
> > However, I prefer your fix as it prevents us from running into this
> > issue again. Our test system is happily installing VMs for two hours now
> > using your build. Please propose your patch.
>
> I'm not subscribed to fuse-devel and hadn't refreshed the mailing list
> thread so I didn't realize that you had discovered that the hang was
> unrelated. That's good.
>
> I'm happy to send the patches, I'll go ahead and send both my patch and
> your iocb patch after I make sure it all applies/builds okay on 4.5.
>
> --
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/1505948
>
> Title:
> Memory arena corruption with FUSE (was Memory allocation failure
> crashes kernel hard, presumably related to FUSE)
>
> Status in linux package in Ubuntu:
> Confirmed
> Status in linux source package in Wily:
> Confirmed
> Status in linux package in Fedora:
> Unknown
>
> Bug description:
> Hello everybody,
>
> Linux 4.1, 4.2 or 4.3-rc leads to an immediate kernel panic in our
> setup when trying to start a Qemu process on top of a fuse-based
> mount. Here is an example stacktrace:
>
> [ 739.807817] BUG: unable to handle kernel paging request at
> ffff8800a4104ea0
> [ 739.840201] IP: [<ffffffff811cc95a>] kmem_cache_alloc_trace+0x7a/0x1f0
> [ 739.870309] PGD 2fee067 PUD 2fbf4dd063 PMD 0
> [ 739.890418] Oops: 0000 [#1] SMP
> [ 739.905265] Modules linked in: nbd vport_vxlan vport_gre gre
> ebtable_filter ebtables openvswitch ib_iser rdma_cm iw_cm ib_cm ib_sa
> ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi
> ipt_REJECT nf_reject_ipv4 nf_conntrack_ipv4 nf_defrag_ipv4 iptable_filter
> xt_CT iptable_raw ip_tables xt_tcpudp ip6t_REJECT nf_reject_ipv6 xt_limit
> nf_conntrack_ipv6 nf_defrag_ipv6 xt_multiport xt_conntrack nf_conntrack
> ip6table_filter ip6_tables x_tables dm_crypt ipmi_ssif intel_rapl iosf_mbi
> x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul
> crc32_pclmul ghash_clmulni_intel aesni_intel aes_x86_64 lrw gf128mul
> glue_helper ablk_helper cryptd kvm_intel kvm ipmi_devintf vhost_net vhost
> macvtap macvlan joydev input_leds dm_multipath scsi_dh bonding sb_edac
> 8021q garp hpilo mrp stp ipmi_si llc edac_core lpc_ich ioatdma 8250_fintek
> ipmi_msghandler lp shpchp acpi_power_meter mac_hid parport nls_iso8859_1
> sch_fq_codel xfs libcrc32c btrfs xor raid6_pq ixgbe ses enclosure
> hid_generic dca vxlan usbhid ip6_udp_tunnel tg3 udp_tunnel ptp hid pps_core
> hpsa mdio wmi
> [ 740.345300] CPU: 8 PID: 10550 Comm: qemu-system-x86 Not tainted
> 4.2.0-040200-generic #201508301530
> [ 740.386879] Hardware name: HP ProLiant DL380 Gen9, BIOS P89 05/06/2015
> [ 740.416827] task: ffff882f8e958dc0 ti: ffff882f28c20000 task.ti:
> ffff882f28c20000
> [ 740.451672] RIP: 0010:[<ffffffff811cc...

Great, thanks!

Robert
Am 11.03.2016 15:01 schrieb "Seth Forshee" <seth.forshee+lp@canonical.com>:

> On Fri, Mar 11, 2016 at 01:03:32PM -0000, Robert Doebbelin wrote:
> > Thank you Seth for taking a close look at the problem and my proposed
> > fix. As mentioned on the mailing list my test runs fine now with the two
> > fixes.
> >
> > However, I prefer your fix as it prevents us from running into this
> > issue again. Our test system is happily installing VMs for two hours now
> > using your build. Please propose your patch.
>
> I'm not subscribed to fuse-devel and hadn't refreshed the mailing list
> thread so I didn't realize that you had discovered that the hang was
> unrelated. That's good.
>
> I'm happy to send the patches, I'll go ahead and send both my patch and
> your iocb patch after I make sure it all applies/builds okay on 4.5.
>
> --
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/1505948
>
> Title:
>   Memory arena corruption with FUSE (was Memory allocation failure
>   crashes kernel hard, presumably related to FUSE)
>
> Status in linux package in Ubuntu:
>   Confirmed
> Status in linux source package in Wily:
>   Confirmed
> Status in linux package in Fedora:
>   Unknown
>
> Bug description:
>   Hello everybody,
>
>   Linux 4.1, 4.2 or 4.3-rc leads to an immediate kernel panic in our
>   setup when trying to start a Qemu process on top of a fuse-based
>   mount. Here is an example stacktrace:
>
>   [  739.807817] BUG: unable to handle kernel paging request at
> ffff8800a4104ea0
>   [  739.840201] IP: [<ffffffff811cc95a>] kmem_cache_alloc_trace+0x7a/0x1f0
>   [  739.870309] PGD 2fee067 PUD 2fbf4dd063 PMD 0
>   [  739.890418] Oops: 0000 [#1] SMP
>   [  739.905265] Modules linked in: nbd vport_vxlan vport_gre gre
> ebtable_filter ebtables openvswitch ib_iser rdma_cm iw_cm ib_cm ib_sa
> ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi
> ipt_REJECT nf_reject_ipv4 nf_conntrack_ipv4 nf_defrag_ipv4 iptable_filter
> xt_CT iptable_raw ip_tables xt_tcpudp ip6t_REJECT nf_reject_ipv6 xt_limit
> nf_conntrack_ipv6 nf_defrag_ipv6 xt_multiport xt_conntrack nf_conntrack
> ip6table_filter ip6_tables x_tables dm_crypt ipmi_ssif intel_rapl iosf_mbi
> x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul
> crc32_pclmul ghash_clmulni_intel aesni_intel aes_x86_64 lrw gf128mul
> glue_helper ablk_helper cryptd kvm_intel kvm ipmi_devintf vhost_net vhost
> macvtap macvlan joydev input_leds dm_multipath scsi_dh bonding sb_edac
> 8021q garp hpilo mrp stp ipmi_si llc edac_core lpc_ich ioatdma 8250_fintek
> ipmi_msghandler lp shpchp acpi_power_meter mac_hid parport nls_iso8859_1
> sch_fq_codel xfs libcrc32c btrfs xor raid6_pq ixgbe ses enclosure
> hid_generic dca vxlan usbhid ip6_udp_tunnel tg3 udp_tunnel ptp hid pps_core
> hpsa mdio wmi
>   [  740.345300] CPU: 8 PID: 10550 Comm: qemu-system-x86 Not tainted
> 4.2.0-040200-generic #201508301530
>   [  740.386879] Hardware name: HP ProLiant DL380 Gen9, BIOS P89 05/06/2015
>   [  740.416827] task: ffff882f8e958dc0 ti: ffff882f28c20000 task.ti:
> ffff882f28c20000
>   [  740.451672] RIP: 0010:[<ffffffff811cc95a>]  [<ffffffff811cc95a>]
> kmem_cache_alloc_trace+0x7a/0x1f0
>   [  740.494047] RSP: 0018:ffff882f28c23c68  EFLAGS: 00010286
>   [  740.518425] RAX: 0000000000000000 RBX: 00000000000000d0 RCX:
> 00000000000026b3
>   [  740.551611] RDX: 00000000000026b2 RSI: 00000000000000d0 RDI:
> ffff882fbf407840
>   [  740.584846] RBP: ffff882f28c23ca8 R08: 0000000000019920 R09:
> ffffe8d000200ab0
>   [  740.618287] R10: ffffffff812e8dcd R11: ffffea00bca0ac00 R12:
> 00000000000000d0
>   [  740.651320] R13: ffff882fbf407840 R14: ffff8800a4104ea0 R15:
> ffff882fbf407840
>   [  740.684195] FS:  00007f2642ffd700(0000) GS:ffff882fbfa00000(0000)
> knlGS:0000000000000000
>   [  740.722030] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
>   [  740.749469] CR2: ffff8800a4104ea0 CR3: 0000002f26f83000 CR4:
> 00000000001426e0
>   [  740.783390] Stack:
>   [  740.792577]  ffffffff812e8dcd 0000000000000048 0000000000000002
> ffff882f908c8468
>   [  740.827003]  0000000001bef000 ffff882f928e4600 ffff882f28c23e48
> ffff882f28c23d70
>   [  740.860971]  ffff882f28c23d38 ffffffff812e8dcd 0000000000000001
> ffff882f908c8300
>   [  740.894994] Call Trace:
>   [  740.906211]  [<ffffffff812e8dcd>] ? fuse_direct_IO+0xdd/0x280
>   [  740.932940]  [<ffffffff812e8dcd>] fuse_direct_IO+0xdd/0x280
>   [  740.958866]  [<ffffffff8117750e>] generic_file_direct_write+0x9e/0x150
>   [  740.989318]  [<ffffffff812e96bc>] fuse_file_write_iter+0x15c/0x2e0
>   [  741.017725]  [<ffffffff811e94a7>] __vfs_write+0xa7/0xf0
>   [  741.041787]  [<ffffffff811e9b09>] vfs_write+0xa9/0x190
>   [  741.065307]  [<ffffffff811ea9d9>] SyS_pwrite64+0x69/0xa0
>   [  741.090141]  [<ffffffff81085b57>] ? SyS_rt_sigprocmask+0x67/0xb0
>   [  741.135924]  [<ffffffff817a8e32>] entry_SYSCALL_64_fastpath+0x16/0x75
>   [  741.183478] Code: 4c 03 05 32 d8 e3 7e 4d 8b 30 49 8b 40 10 4d 85 f6
> 0f 84 22 01 00 00 48 85 c0 0f 84 19 01 00 00 49 63 47 20 48 8d 4a 01 4d 8b
> 07 <49> 8b 1c 06 4c 89 f0 65 49 0f c7 08 0f 94 c0 84 c0 74 b9 49 63
>   [  741.306817] RIP  [<ffffffff811cc95a>]
> kmem_cache_alloc_trace+0x7a/0x1f0
>
>   The problem has also been documented by somebody else in the Fedora
>   bug tracker at https://bugzilla.redhat.com/show_bug.cgi?id=1254310
>
>   This behaviour is 100% reproducible. I have asked the fuse-devel
>   mailinglist for advice, but up to this point with no success:
>
>   http://sourceforge.net/p/fuse/mailman/message/34537139/
>
>   We are still investigating if this issue is also happening with 4.0
>   and will add the information to this bug report once we have it. Any
>   help on debugging will be greatly appreciated.
>
> To manage notifications about this bug go to:
> https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1505948/+subscriptions
>

--

--
*Quobyte* GmbH
Hardenbergplatz 2 - 10623 Berlin - Germany
+49-30-814 591 800 - www.quobyte.com
Amtsgericht Berlin-Charlottenburg, HRB 149012B
management board: Dr. Felix Hupfeld, Dr. Björn Kolbeck, Dr. Jan Stender

Revision history for this message

In Red Hat Bugzilla #1254310, Miklos (miklos-redhat-bugs) wrote on 2016-03-16:

#31

Created attachment 1137049
proposed patch #1

Revision history for this message

In Red Hat Bugzilla #1254310, Miklos (miklos-redhat-bugs) wrote on 2016-03-16:

#32

Created attachment 1137050
proposed patch #2

Could you please test with these two patches?

Revision history for this message

In Red Hat Bugzilla #1254310, Ian (ian-redhat-bugs) wrote on 2016-03-16:

#33

Miklos,

Those patches look promising. I will endeavour to test them ASAP. If not today, then by the end of the week.

In the interest of not introducing any additional variables into the tests at this point, I will switch my current in-production kernel (kernel-4.2.5-201.fc22.x86_64 recompiled to use SLAB) back to the default/broken SLUB-based allocator, with your two patches applied and test that.

Whether that works or not, I will then apply the patches against the latest kernel-4.4.4-200.fc22 and test that as well.

Thank you for your work on this. I am very pleased to see this bug finally get some attention.

Revision history for this message

In Red Hat Bugzilla #1254310, Ian (ian-redhat-bugs) wrote on 2016-03-16:

#34

Miklos,

Yahoo! The above two patches have allowed me to return to the SLUB allocator without fuse crashing. VMs started up with no problem, just as they should. This is with kernel 4.2.5.

Having one test node with the patches running VMs for only a few minutes now, I am tentatively calling this one a success. I'll will try the patches on 4.4.4 shortly, but I expect that to work as well.

What are the odds that the Fedora kernel team will incorporate these patches without waiting for it to hit mainline/stable upstream first?

Seth Forshee (sforshee) on 2016-03-22

description:	updated
Changed in linux (Ubuntu Wily):
assignee:	nobody → Seth Forshee (sforshee)
status:	Confirmed → In Progress
Changed in linux (Ubuntu Xenial):
assignee:	nobody → Seth Forshee (sforshee)
status:	Confirmed → In Progress

Seth Forshee (sforshee) on 2016-03-22

Changed in linux (Ubuntu Xenial):
status:	In Progress → Fix Committed

Revision history for this message

Launchpad Janitor (janitor) wrote on 2016-03-29:

#21

Download full text (4.2 KiB)

This bug was fixed in the package linux - 4.4.0-16.32

---------------
linux (4.4.0-16.32) xenial; urgency=low

[ Tim Gardner ]

* Release Tracking Bug
- LP: #1561727

  * fix thermal throttling due to commit "Thermal: initialize thermal zone
    device correctly" (LP: #1561676)
    - Thermal: Ignore invalid trip points

  * Thinkpad T460: Trackpoint mouse buttons instantly generate "release" event
    on press (LP: #1553811)
    - SAUCE: (noup) Input: synaptics - handle spurious release of trackstick
      buttons, again

  * reading /sys/kernel/security/apparmor/profiles requires CAP_MAC_ADMIN
    (LP: #1560583)
    - SAUCE: apparmor: Allow ns_root processes to open profiles file
    - SAUCE: apparmor: Consult sysctl when reading profiles in a user ns

* linux: sync virtualbox drivers to 5.0.16-dfsg-2 (LP: #1561492)
- ubuntu: vbox -- update to 5.0.16-dfsg-2

  * s390/kconfig: CONFIG_NUMA without CONFIG_NUMA_EMU does not make any sense on
    s390x (LP: #1557690)
    - [Config] CONFIG_NUMA_BALANCING_DEFAULT_ENABLED=n for s390x

  * spl/zfs fails to build on s390x (LP: #1519814)
    - [Config] s390x -- re-enable zfs
    - [Config] zfs -- disable powerpc until the test failures can be resolved

* linux: sync to ZFS 0.6.5.6 stable release (LP: #1561483)
- SAUCE: (noup) Update spl to 0.6.5.6-0ubuntu1, zfs to 0.6.5.6-0ubuntu1

  * zfs: enable zfs for 64bit powerpc kernels (LP: #1558871)
    - [Packaging] zfs -- handle rprovides via dpkg-gencontrol
    - [Config] powerpc -- convert zfs configuration to custom_override

  * Memory arena corruption with FUSE (was Memory allocation failure crashes
    kernel hard, presumably related to FUSE) (LP: #1505948)
    - SAUCE: (noup) fuse: do not use iocb after it may have been freed
    - SAUCE: (noup) fuse: Add reference counting for fuse_io_priv

* cgroup namespaces: add a 'nsroot=' mountinfo field (LP: #1560489)
- SAUCE: (noup) cgroup namespaces: add a 'nsroot=' mountinfo field

* linux packaging: clear remaining redundant delta (LP: #1560445)
- [Debian] Remove generated intermediate files on clean

  * arm64: guest hangs when ntpd is running (LP: #1549494)
    - Revert "hrtimer: Add support for CLOCK_MONOTONIC_RAW"
    - Revert "hrtimer: Catch illegal clockids"
    - Revert "KVM: arm/arm64: timer: Switch to CLOCK_MONOTONIC_RAW"

  * Need enough contiguous memory to support GICv3 ITS table (LP: #1558828)
    - [Config] CONFIG_FORCE_MAX_ZONEORDER=13 on arm64
    - SAUCE: (no-up) arm64: gicv3: its: Increase FORCE_MAX_ZONEORDER for Cavium
      ThunderX

  * update arcmsr to version v1.30.00.22-20151126 to fix card timeouts
    (LP: #1559609)
    - arcmsr: fixed getting wrong configuration data
    - arcmsr: fixes not release allocated resource
    - arcmsr: make code more readable
    - arcmsr: adds code to support new Areca adapter ARC1203
    - arcmsr: changes driver version number
    - arcmsr: more readability improvements
    - arcmsr: Split dma resource allocation to a new function
    - arcmsr: change driver version to v1.30.00.22-20151126

* server image has no keyboard, desktop image works (LP: #1559692)
- [Config] Rework input-modules (d-i) list

* PMU sup...

This bug was fixed in the package linux - 4.4.0-16.32

---------------
linux (4.4.0-16.32) xenial; urgency=low

[ Tim Gardner ]

* Release Tracking Bug
    - LP: #1561727

* fix thermal throttling due to commit "Thermal: initialize thermal zone
    device correctly"  (LP: #1561676)
    - Thermal: Ignore invalid trip points

* Thinkpad T460: Trackpoint mouse buttons instantly generate "release" event
    on press (LP: #1553811)
    - SAUCE: (noup) Input: synaptics - handle spurious release of trackstick
      buttons, again

* reading /sys/kernel/security/apparmor/profiles requires CAP_MAC_ADMIN
    (LP: #1560583)
    - SAUCE: apparmor: Allow ns_root processes to open profiles file
    - SAUCE: apparmor: Consult sysctl when reading profiles in a user ns

* linux: sync virtualbox drivers to 5.0.16-dfsg-2 (LP: #1561492)
    - ubuntu: vbox -- update to 5.0.16-dfsg-2

* s390/kconfig: CONFIG_NUMA without CONFIG_NUMA_EMU does not make any sense on
    s390x (LP: #1557690)
    - [Config] CONFIG_NUMA_BALANCING_DEFAULT_ENABLED=n for s390x

* spl/zfs fails to build on s390x (LP: #1519814)
    - [Config] s390x -- re-enable zfs
    - [Config] zfs -- disable powerpc until the test failures can be resolved

* linux: sync to ZFS 0.6.5.6 stable release (LP: #1561483)
    - SAUCE: (noup) Update spl to 0.6.5.6-0ubuntu1, zfs to 0.6.5.6-0ubuntu1

* zfs: enable zfs for 64bit powerpc kernels (LP: #1558871)
    - [Packaging] zfs -- handle rprovides via dpkg-gencontrol
    - [Config] powerpc -- convert zfs configuration to custom_override

* Memory arena corruption with FUSE (was Memory allocation failure crashes
    kernel hard, presumably related to FUSE) (LP: #1505948)
    - SAUCE: (noup) fuse: do not use iocb after it may have been freed
    - SAUCE: (noup) fuse: Add reference counting for fuse_io_priv

* cgroup namespaces: add a 'nsroot=' mountinfo field (LP: #1560489)
    - SAUCE: (noup) cgroup namespaces: add a 'nsroot=' mountinfo field

* linux packaging: clear remaining redundant delta (LP: #1560445)
    - [Debian] Remove generated intermediate files on clean

* arm64: guest hangs when ntpd is running (LP: #1549494)
    - Revert "hrtimer: Add support for CLOCK_MONOTONIC_RAW"
    - Revert "hrtimer: Catch illegal clockids"
    - Revert "KVM: arm/arm64: timer: Switch to CLOCK_MONOTONIC_RAW"

* Need enough contiguous memory to support GICv3 ITS table (LP: #1558828)
    - [Config] CONFIG_FORCE_MAX_ZONEORDER=13 on arm64
    - SAUCE: (no-up) arm64: gicv3: its: Increase FORCE_MAX_ZONEORDER for Cavium
      ThunderX

* update arcmsr to version v1.30.00.22-20151126 to fix card timeouts
    (LP: #1559609)
    - arcmsr: fixed getting wrong configuration data
    - arcmsr: fixes not release allocated resource
    - arcmsr: make code more readable
    - arcmsr: adds code to support new Areca adapter ARC1203
    - arcmsr: changes driver version number
    - arcmsr: more readability improvements
    - arcmsr: Split dma resource allocation to a new function
    - arcmsr: change driver version to v1.30.00.22-20151126

* server image has no keyboard, desktop image works (LP: #1559692)
    - [Config] Rework input-modules (d-i) list

* PMU support for Cavium ThunderX (LP: #1559349)
    - arm64: perf: Rename Cortex A57 events
    - arm64/perf: Add Cavium ThunderX PMU support
    - arm64: perf: Enable PMCR long cycle counter bit
    - arm64: perf: Extend event mask for ARMv8.1
    - arm64: dts: Add Cavium ThunderX specific PMU

* Show ARM PMU events in perf stat (LP: #1559350)
    - drivers/perf: kill armpmu_register
    - arm: perf: Convert event enums to #defines
    - arm: perf: Add event descriptions
    - arm64: perf: Convert event enums to #defines
    - arm64: perf: Add event descriptions
    - ARM: perf: add format entry to describe event -> config mapping
    - arm64: perf: add format entry to describe event -> config mapping

* [Bug]HSW/BDW EDAC driver reports wrong DIMM (LP: #1559904)
    - EDAC/sb_edac: Fix computation of channel address

* 5-10 second delay in kernel boot with kernel command line ip= (LP: #1259861)
    - [Config] disable CONFIG_IP_PNP

* Miscellaneous Ubuntu changes
    - [Debian] Silence the reconstruct script

-- Tim Gardner <tim.gardner@canonical.com>  Mon, 21 Mar 2016 10:15:31 -0600

Changed in linux (Ubuntu Xenial):
status:	Fix Committed → Fix Released

Brad Figg (brad-figg) on 2016-03-29

Changed in linux (Ubuntu Wily):
status:	In Progress → Fix Committed

Revision history for this message

Kamal Mostafa (kamalmostafa) wrote on 2016-04-20:

#22

This bug is awaiting verification that the kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-wily' to 'verification-done-wily'.

If verification is not done by 5 working days from today, this fix will be dropped from the source code, and this bug will be closed.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you!

tags:

added: verification-needed-wily

Revision history for this message

Martin Gerhard Loschwitz (martin-loschwitz) wrote on 2016-04-21:

#23

Done.

tags:

added: verification-done-wily
removed: verification-needed-wily

Revision history for this message

In Red Hat Bugzilla #1254310, Ian (ian-redhat-bugs) wrote on 2016-05-08:

#35

Confirmed fixed on all nodes of my production cluster with the FUSE patches included in kernel-4.4.8-200.fc22.x86_64.

Revision history for this message

Launchpad Janitor (janitor) wrote on 2016-05-09:

#24

Download full text (30.4 KiB)

This bug was fixed in the package linux - 4.2.0-36.41

---------------
linux (4.2.0-36.41) wily; urgency=low

[ Kamal Mostafa ]

* Release Tracking Bug
- LP: #1571667

[ Benjamin Tissoires ]

  * SAUCE: Input: synaptics - handle spurious release of trackstick
    buttons, again
    - LP: #1553811

[ dann frazier ]

  * Revert "SAUCE: arm64, numa, dt: adding dt based numa support using dt
    node property arm, associativity"
    - LP: #1558828
  * Revert "SAUCE: Documentation: arm64/arm: dt bindings for numa."
    - LP: #1558828
  * Revert "SAUCE: arm64, numa: adding numa support for arm64 platforms."
    - LP: #1558828
  * Revert "[Config] Enable NUMA on ARM64"
    - LP: #1558828

[ K. Y. Srinivasan ]

  * SAUCE: (noup): Drivers: hv: vmbus: Fix a bug in
    hv_need_to_signal_on_read()
    - LP: #1556264

[ Kamal Mostafa ]

* [debian] BugLink: close LP: bugs only for Launchpad urls
* [Config] updateconfigs after v4.2.8-ckt7

[ Upstream Kernel Changes ]

  * Revert "jffs2: Fix lock acquisition order bug in jffs2_write_begin"
    - LP: #1561677
  * tipc: fix connection abort during subscription cancel
    - LP: #1561677
  * tipc: fix nullptr crash during subscription cancel
    - LP: #1561677
  * s390/mm: four page table levels vs. fork
    - LP: #1561677
  * Input: aiptek - fix crash on detecting device without endpoints
    - LP: #1561677
  * wext: fix message delay/ordering
    - LP: #1561677
  * cfg80211/wext: fix message ordering
    - LP: #1561677
  * mac80211: fix use of uninitialised values in RX aggregation
    - LP: #1561677
  * mac80211: minstrel: Change expected throughput unit back to Kbps
    - LP: #1561677
  * libata: fix HDIO_GET_32BIT ioctl
    - LP: #1561677
  * iwlwifi: mvm: inc pending frames counter also when txing non-sta
    - LP: #1561677
  * [media] adv7604: fix tx 5v detect regression
    - LP: #1561677
  * ahci: add new Intel device IDs
    - LP: #1561677
  * ahci: Order SATA device IDs for codename Lewisburg
    - LP: #1561677
  * Adding Intel Lewisburg device IDs for SATA
    - LP: #1561677
  * ASoC: samsung: Use IRQ safe spin lock calls
    - LP: #1561677
  * mac80211: minstrel_ht: set default tx aggregation timeout to 0
    - LP: #1561677
  * usb: chipidea: otg: change workqueue ci_otg as freezable
    - LP: #1561677
  * jffs2: Fix page lock / f->sem deadlock
    - LP: #1561677
  * Fix directory hardlinks from deleted directories
    - LP: #1561677
  * iommu/amd: Fix boot warning when device 00:00.0 is not iommu covered
    - LP: #1561677
  * iommu/amd: Apply workaround for ATS write permission check
    - LP: #1561677
  * libata: Align ata_device's id on a cacheline
    - LP: #1561677
  * can: gs_usb: fixed disconnect bug by removing erroneous use of kfree()
    - LP: #1561677
  * fbcon: set a default value to blink interval
    - LP: #1561677
  * KVM: x86: fix root cause for missed hardware breakpoints
    - LP: #1561677
  * arm64: vmemmap: use virtual projection of linear region
    - LP: #1561677
  * vfio: fix ioctl error handling
    - LP: #1561677
  * ALSA: ctl: Fix ioctls for X32 ABI
    - LP: #1561677
  * ALSA: pcm: Fix ioctls for X32 ABI
    - LP: #1561677
  * ALSA: rawmidi: Fix ioct...

This bug was fixed in the package linux - 4.2.0-36.41

---------------
linux (4.2.0-36.41) wily; urgency=low

[ Kamal Mostafa ]

* Release Tracking Bug
    - LP: #1571667

[ Benjamin Tissoires ]

* SAUCE: Input: synaptics - handle spurious release of trackstick
    buttons, again
    - LP: #1553811

[ dann frazier ]

* Revert "SAUCE: arm64, numa, dt: adding dt based numa support using dt
    node property arm, associativity"
    - LP: #1558828
  * Revert "SAUCE: Documentation: arm64/arm: dt bindings for numa."
    - LP: #1558828
  * Revert "SAUCE: arm64, numa: adding numa support for arm64 platforms."
    - LP: #1558828
  * Revert "[Config] Enable NUMA on ARM64"
    - LP: #1558828

[ K. Y. Srinivasan ]

* SAUCE: (noup): Drivers: hv: vmbus: Fix a bug in
    hv_need_to_signal_on_read()
    - LP: #1556264

[ Kamal Mostafa ]

* [debian] BugLink: close LP: bugs only for Launchpad urls
  * [Config] updateconfigs after v4.2.8-ckt7

[ Upstream Kernel Changes ]

* Revert "jffs2: Fix lock acquisition order bug in jffs2_write_begin"
    - LP: #1561677
  * tipc: fix connection abort during subscription cancel
    - LP: #1561677
  * tipc: fix nullptr crash during subscription cancel
    - LP: #1561677
  * s390/mm: four page table levels vs. fork
    - LP: #1561677
  * Input: aiptek - fix crash on detecting device without endpoints
    - LP: #1561677
  * wext: fix message delay/ordering
    - LP: #1561677
  * cfg80211/wext: fix message ordering
    - LP: #1561677
  * mac80211: fix use of uninitialised values in RX aggregation
    - LP: #1561677
  * mac80211: minstrel: Change expected throughput unit back to Kbps
    - LP: #1561677
  * libata: fix HDIO_GET_32BIT ioctl
    - LP: #1561677
  * iwlwifi: mvm: inc pending frames counter also when txing non-sta
    - LP: #1561677
  * [media] adv7604: fix tx 5v detect regression
    - LP: #1561677
  * ahci: add new Intel device IDs
    - LP: #1561677
  * ahci: Order SATA device IDs for codename Lewisburg
    - LP: #1561677
  * Adding Intel Lewisburg device IDs for SATA
    - LP: #1561677
  * ASoC: samsung: Use IRQ safe spin lock calls
    - LP: #1561677
  * mac80211: minstrel_ht: set default tx aggregation timeout to 0
    - LP: #1561677
  * usb: chipidea: otg: change workqueue ci_otg as freezable
    - LP: #1561677
  * jffs2: Fix page lock / f->sem deadlock
    - LP: #1561677
  * Fix directory hardlinks from deleted directories
    - LP: #1561677
  * iommu/amd: Fix boot warning when device 00:00.0 is not iommu covered
    - LP: #1561677
  * iommu/amd: Apply workaround for ATS write permission check
    - LP: #1561677
  * libata: Align ata_device's id on a cacheline
    - LP: #1561677
  * can: gs_usb: fixed disconnect bug by removing erroneous use of kfree()
    - LP: #1561677
  * fbcon: set a default value to blink interval
    - LP: #1561677
  * KVM: x86: fix root cause for missed hardware breakpoints
    - LP: #1561677
  * arm64: vmemmap: use virtual projection of linear region
    - LP: #1561677
  * vfio: fix ioctl error handling
    - LP: #1561677
  * ALSA: ctl: Fix ioctls for X32 ABI
    - LP: #1561677
  * ALSA: pcm: Fix ioctls for X32 ABI
    - LP: #1561677
  * ALSA: rawmidi: Fix ioctls X32 ABI
    - LP: #1561677
  * ALSA: timer: Fix broken compat timer user status ioctl
    - LP: #1561677
  * ALSA: timer: Fix ioctls for X32 ABI
    - LP: #1561677
  * cifs: fix out-of-bounds access in lease parsing
    - LP: #1561677
  * CIFS: Fix SMB2+ interim response processing for read requests
    - LP: #1561677
  * Fix cifs_uniqueid_to_ino_t() function for s390x
    - LP: #1561677
  * arm/arm64: KVM: Fix ioctl error handling
    - LP: #1561677
  * MIPS: kvm: Fix ioctl error handling.
    - LP: #1561677
  * ALSA: hdspm: Fix wrong boolean ctl value accesses
    - LP: #1561677
  * ALSA: hdspm: Fix zero-division
    - LP: #1561677
  * ALSA: hdsp: Fix wrong boolean ctl value accesses
    - LP: #1561677
  * use ->d_seq to get coherency between ->d_inode and ->d_flags
    - LP: #1561677
  * USB: qcserial: add Dell Wireless 5809e Gobi 4G HSPA+ (rev3)
    - LP: #1561677
  * USB: cp210x: Add ID for Parrot NMEA GPS Flight Recorder
    - LP: #1561677
  * ASoC: dapm: Fix ctl value accesses in a wrong type
    - LP: #1561677
  * ASoC: wm8958: Fix enum ctl accesses in a wrong type
    - LP: #1561677
  * ASoC: wm8994: Fix enum ctl accesses in a wrong type
    - LP: #1561677
  * ASoC: wm_adsp: Fix enum ctl accesses in a wrong type
    - LP: #1561677
  * USB: serial: option: add support for Telit LE922 PID 0x1045
    - LP: #1561677
  * USB: serial: option: add support for Quectel UC20
    - LP: #1561677
  * ALSA: usb-audio: Add a quirk for Plantronics DA45
    - LP: #1561677
  * mac80211: check PN correctly for GCMP-encrypted fragmented MPDUs
    - LP: #1561677
  * mac80211: Fix Public Action frame RX in AP mode
    - LP: #1561677
  * i2c: brcmstb: allocate correct amount of memory for regmap
    - LP: #1561677
  * ALSA: seq: oss: Don't drain at closing a client
    - LP: #1561677
  * parisc: Fix ptrace syscall number and return value modification
    - LP: #1561677
  * drm/ast: Fix incorrect register check for DRAM width
    - LP: #1561677
  * USB: qcserial: add Sierra Wireless EM74xx device ID
    - LP: #1561677
  * drm/amdgpu/pm: update current crtc info after setting the powerstate
    - LP: #1561677
  * drm/radeon/pm: update current crtc info after setting the powerstate
    - LP: #1561677
  * drm/amdgpu: return from atombios_dp_get_dpcd only when error
    - LP: #1561677
  * PM / sleep / x86: Fix crash on graph trace through x86 suspend
    - LP: #1561677
  * ALSA: hda - Fix mic issues on Acer Aspire E1-472
    - LP: #1561677
  * ovl: fix working on distributed fs as lower layer
    - LP: #1561677
  * ovl: fix getcwd() failure after unsuccessful rmdir
    - LP: #1561677
  * ovl: ignore lower entries when checking purity of non-directory entries
    - LP: #1561677
  * MIPS: traps: Fix SIGFPE information leak from `do_ov' and
    `do_trap_or_bp'
    - LP: #1561677
  * ubi: Fix out of bounds write in volume update code
    - LP: #1561677
  * target: Drop incorrect ABORT_TASK put for completed commands
    - LP: #1561677
  * ARM: OMAP2+: hwmod: Introduce ti,no-idle dt property
    - LP: #1561677
  * ARM: dts: dra7: do not gate cpsw clock due to errata i877
    - LP: #1561677
  * PCI: Allow a NULL "parent" pointer in pci_bus_assign_domain_nr()
    - LP: #1561677
  * KVM: PPC: Book3S HV: Sanitize special-purpose register values on guest
    exit
    - LP: #1561677
  * ncpfs: fix a braino in OOM handling in ncp_fill_cache()
    - LP: #1561677
  * jffs2: reduce the breakage on recovery from halfway failed rename()
    - LP: #1561677
  * KVM: VMX: disable PEBS before a guest entry
    - LP: #1561677
  * arm64: account for sparsemem section alignment when choosing vmemmap
    offset
    - LP: #1561677
  * tracing: Fix check for cpu online when event is disabled
    - LP: #1561677
  * KVM: MMU: fix ept=0/pte.u=1/pte.w=0/CR0.WP=0/CR4.SMEP=1/EFER.NX=0 combo
    - LP: #1561677
  * dmaengine: at_xdmac: fix residue computation
    - LP: #1561677
  * MIPS: Fix build error when SMP is used without GIC
    - LP: #1561677
  * IB/core: Use GRH when the path hop-limit > 0
    - LP: #1561677
  * dmaengine: pxa_dma: fix cyclic transfers
    - LP: #1561677
  * MIPS: smp.c: Fix uninitialised temp_foreign_map
    - LP: #1561677
  * tcp: fix tcpi_segs_in after connection establishment
    - LP: #1561677
  * be2net: Don't leak iomapped memory on removal.
    - LP: #1561677
  * tcp: convert cached rtt from usec to jiffies when feeding initial rto
    - LP: #1561677
  * ext4: iterate over buffer heads correctly in move_extent_per_page()
    - LP: #1561677
  * ppp: release rtnl mutex when interface creation fails
    - LP: #1561677
  * net/mlx4_core: Allow resetting VF admin mac to zero
    - LP: #1561677
  * ipv6: re-enable fragment header matching in ipv6_find_hdr
    - LP: #1561677
  * net/mlx5e: Remove wrong poll CQ optimization
    - LP: #1561677
  * cdc_ncm: do not call usbnet_link_change from cdc_ncm_bind
    - LP: #1561677
  * net: qca_spi: Don't clear IFF_BROADCAST
    - LP: #1561677
  * net: moxa: fix an error code
    - LP: #1561677
  * mld, igmp: Fix reserved tailroom calculation
    - LP: #1561677
  * Linux 4.2.8-ckt6
    - LP: #1561677
  * (upstream) net/mlx5e: Avoid NULL pointer access in case of
    configuration failure
    - LP: #1528466
  * PCI: Disable IO/MEM decoding for devices with non-compliant BARs
    - LP: #1559929
  * x86/PCI: Mark Broadwell-EP Home Agent & PCU as having non-compliant
    BARs
    - LP: #1559929
  * fuse: do not use iocb after it may have been freed
    - LP: #1505948
  * fuse: Add reference counting for fuse_io_priv
    - LP: #1505948
  * intel_idle: prevent SKL-H boot failure when C8+C9+C10 enabled
    - LP: #1559918
  * crypto: skcipher - Add crypto_skcipher_has_setkey
    - LP: #1556562
  * crypto: algif_skcipher - Add key check exception for cipher_null
    - LP: #1556562
  * crypto: algif_skcipher - Do not assume that req is unchanged
    - LP: #1556562
  * crypto: algif_skcipher - Do not dereference ctx without socket lock
    - LP: #1556562
  * proc: revert /proc/<pid>/maps [stack:TID] annotation
    - LP: #1547231
  * ACPI / processor: Request native thermal interrupt handling via _OSC
    - LP: #1559923
  * gpiolib: do not allow to insert an empty gpiochip
    - LP: #1566544
  * gpio: add a data pointer to gpio_chip
    - LP: #1566544
  * gpio: rcar: Add Runtime PM handling for interrupts
    - LP: #1566544
  * ipv4: Don't do expensive useless work during inetdev destroy.
    - LP: #1566544
  * Input: powermate - fix oops with malicious USB descriptors
    - LP: #1566544
  * USB: iowarrior: fix oops with malicious USB descriptors
    - LP: #1566544
  * ALSA: usb-audio: Fix NULL dereference in create_fixed_stream_quirk()
    - LP: #1566544
  * ALSA: usb-audio: Add sanity checks for endpoint accesses
    - LP: #1566544
  * include/linux/poison.h: fix LIST_POISON{1,2} offset
    - LP: #1566544
  * Input: ati_remote2 - fix crashes on detecting device with invalid
    descriptor
    - LP: #1566544
  * USB: cdc-acm: more sanity checking
    - LP: #1566544
  * drm/i915: Workaround CHV pipe C cursor fail
    - LP: #1566544
  * EDAC, amd64_edac: Shift wrapping issue in f1x_get_norm_dct_addr()
    - LP: #1566544
  * crypto: ccp - Add hash state import and export support
    - LP: #1566544
  * clk: rockchip: add pclk_cpu to the list of rk3188 critical clocks
    - LP: #1566544
  * clk: rockchip: Add pclk_peri to critical clocks on RK3066/RK3188
    - LP: #1566544
  * clk: rockchip: add hclk_cpubus to the list of rk3188 critical clocks
    - LP: #1566544
  * tty: Fix GPF in flush_to_ldisc(), part 2
    - LP: #1566544
  * media: v4l2-compat-ioctl32: fix missing length copy in
    put_v4l2_buffer32
    - LP: #1566544
  * pwc: Add USB id for Philips Spc880nc webcam
    - LP: #1566544
  * crypto: ccp - Limit the amount of information exported
    - LP: #1566544
  * crypto: ccp - Don't assume export/import areas are aligned
    - LP: #1566544
  * 8250: use callbacks to access UART_DLL/UART_DLM
    - LP: #1566544
  * net: irda: Fix use-after-free in irtty_open()
    - LP: #1566544
  * mei: bus: check if the device is enabled before data transfer
    - LP: #1566544
  * staging: comedi: ni_tiocmd: change mistaken use of start_src for
    start_arg
    - LP: #1566544
  * tools/hv: Use include/uapi with __EXPORTED_HEADERS__
    - LP: #1566544
  * tpm: fix the rollback in tpm_chip_register()
    - LP: #1566544
  * tpm: fix the cleanup of struct tpm_chip
    - LP: #1566544
  * ARM: dts: armada-375: use armada-370-sata for SATA
    - LP: #1566544
  * usb: retry reset if a device times out
    - LP: #1566544
  * HID: fix hid_ignore_special_drivers module parameter
    - LP: #1566544
  * scripts/coccinelle: modernize &
    - LP: #1566544
  * adv7511: TX_EDID_PRESENT is still 1 after a disconnect
    - LP: #1566544
  * saa7134: Fix bytesperline not being set correctly for planar formats
    - LP: #1566544
  * tpm_crb: tpm2_shutdown() must be called before tpm_chip_unregister()
    - LP: #1566544
  * perf tools: Dont stop PMU parsing on alias parse error
    - LP: #1566544
  * Bluetooth: btusb: Add new AR3012 ID 13d3:3395
    - LP: #1542564, #1566544
  * Bluetooth: Add new AR3012 ID 0489:e095
    - LP: #1542944, #1566544
  * aacraid: Fix RRQ overload
    - LP: #1566544
  * aacraid: Fix memory leak in aac_fib_map_free
    - LP: #1566544
  * aic7xxx: Fix queue depth handling
    - LP: #1566544
  * mtd: onenand: fix deadlock in onenand_block_markbad
    - LP: #1566544
  * md/raid5: Compare apples to apples (or sectors to sectors)
    - LP: #1566544
  * RAID5: check_reshape() shouldn't call mddev_suspend
    - LP: #1566544
  * RAID5: revert e9e4c377e2f563 to fix a livelock
    - LP: #1566544
  * crypto: ccp - memset request context to zero during import
    - LP: #1566544
  * Bluetooth: btusb: Add a new AR3012 ID 04ca:3014
    - LP: #1546694, #1566544
  * mmc: sdhci: fix data timeout (part 1)
    - LP: #1566544
  * mmc: sdhci: fix data timeout (part 2)
    - LP: #1566544
  * perf tools: Fix python extension build
    - LP: #1566544
  * IB/srpt: Simplify srpt_handle_tsk_mgmt()
    - LP: #1566544
  * bttv: Width must be a multiple of 16 when capturing planar formats
    - LP: #1566544
  * watchdog: rc32434_wdt: fix ioctl error handling
    - LP: #1566544
  * nfsd4: fix bad bounds checking
    - LP: #1566544
  * xfs: fix two memory leaks in xfs_attr_list.c error paths
    - LP: #1566544
  * quota: Fix possible GPF due to uninitialised pointers
    - LP: #1566544
  * mtip32xx: Fix broken service thread handling
    - LP: #1566544
  * mtip32xx: Remove unwanted code from taskfile error handler
    - LP: #1566544
  * mtip32xx: Print exact time when an internal command is interrupted
    - LP: #1566544
  * mtip32xx: Avoid issuing standby immediate cmd during FTL rebuild
    - LP: #1566544
  * mtip32xx: Fix for rmmod crash when drive is in FTL rebuild
    - LP: #1566544
  * mtip32xx: Handle safe removal during IO
    - LP: #1566544
  * mtip32xx: Handle FTL rebuild failure state during device initialization
    - LP: #1566544
  * of: alloc anywhere from memblock if range not specified
    - LP: #1566544
  * usb: hub: fix a typo in hub_port_init() leading to wrong logic
    - LP: #1566544
  * KVM: i8254: change PIT discard tick policy
    - LP: #1566544
  * sched/cputime: Fix steal time accounting vs. CPU hotplug
    - LP: #1566544
  * libnvdimm: Fix security issue with DSM IOCTL.
    - LP: #1566544
  * rt2x00: add new rt2800usb device Buffalo WLI-UC-G450
    - LP: #1566544
  * pinctrl-bcm2835: Fix cut-and-paste error in "pull" parsing
    - LP: #1566544
  * perf/core: Fix perf_sched_count derailment
    - LP: #1566544
  * perf/x86/intel: Use PAGE_SIZE for PEBS buffer size on Core2
    - LP: #1566544
  * perf/x86/intel: Fix PEBS warning by only restoring active PMU in pmi
    - LP: #1566544
  * sched/cputime: Fix steal_account_process_tick() to always return
    jiffies
    - LP: #1566544
  * bcache: fix race of writeback thread starting before complete
    initialization
    - LP: #1566544
  * bcache: cleaned up error handling around register_cache()
    - LP: #1566544
  * bcache: fix cache_set_flush() NULL pointer dereference on OOM
    - LP: #1566544
  * be2iscsi: set the boot_kset pointer to NULL in case of failure
    - LP: #1566544
  * md/raid5: preserve STRIPE_PREREAD_ACTIVE in break_stripe_batch_list
    - LP: #1566544
  * drm/radeon: Don't drop DP 2.7 Ghz link setup on some cards.
    - LP: #1566544
  * sg: fix dxferp in from_to case
    - LP: #1566544
  * jbd2: fix FS corruption possibility in jbd2_journal_destroy() on umount
    path
    - LP: #1566544
  * ALSA: hda - Apply reboot D3 fix for CX20724 codec, too
    - LP: #1566544
  * EDAC/sb_edac: Fix computation of channel address
    - LP: #1566544
  * Bluetooth: btusb: Add a new AR3012 ID 13d3:3472
    - LP: #1552925, #1566544
  * ALSA: pcm: Avoid "BUG:" string for warnings again
    - LP: #1566544
  * dm snapshot: disallow the COW and origin devices from being identical
    - LP: #1566544
  * dm thin metadata: don't issue prefetches if a transaction abort has
    failed
    - LP: #1566544
  * dm cache: make sure every metadata function checks fail_io
    - LP: #1566544
  * iser-target: Fix identification of login rx descriptor type
    - LP: #1566544
  * iser-target: Add new state ISER_CONN_BOUND to isert_conn
    - LP: #1566544
  * iser-target: Separate flows for np listeners and connections cma events
    - LP: #1566544
  * ALSA: hda - fix the mic mute button and led problem for a Lenovo AIO
    - LP: #1555912, #1566544
  * xtensa: ISS: don't hang if stdin EOF is reached
    - LP: #1566544
  * xtensa: fix preemption in {clear,copy}_user_highpage
    - LP: #1566544
  * xtensa: clear all DBREAKC registers on start
    - LP: #1566544
  * Bluetooth: Fix potential buffer overflow with Add Advertising
    - LP: #1566544
  * ARC: [BE] readl()/writel() to work in Big Endian CPU configuration
    - LP: #1566544
  * bus: imx-weim: Take the 'status' property value into account
    - LP: #1566544
  * ALSA: intel8x0: Add clock quirk entry for AD1981B on IBM ThinkPad X41.
    - LP: #1566544
  * s390/pci: enforce fmb page boundary rule
    - LP: #1566544
  * drm/radeon: rework fbdev handling on chips with no connectors
    - LP: #1566544
  * md: multipath: don't hardcopy bio in .make_request path
    - LP: #1566544
  * net: mvneta: enable change MAC address when interface is up
    - LP: #1566544
  * dm: fix rq_end_stats() NULL pointer in dm_requeue_original_request()
    - LP: #1566544
  * HID: i2c-hid: fix OOB write in i2c_hid_set_or_send_report()
    - LP: #1566544
  * ALSA: hda - Fix unconditional GPIO toggle via automute
    - LP: #1566544
  * mmc: mmc_spi: Add Card Detect comments and fix CD GPIO case
    - LP: #1566544
  * nfsd: fix deadlock secinfo+readdir compound
    - LP: #1566544
  * vfs: show_vfsstat: do not ignore errors from show_devname method
    - LP: #1566544
  * x86/iopl: Fix iopl capability check on Xen PV
    - LP: #1566544
  * crypto: marvell/cesa - forward devm_ioremap_resource() error code
    - LP: #1566544
  * mmc: sdhci: Fix override of timeout clk wrt max_busy_timeout
    - LP: #1566544
  * drm/amdgpu: include the right version of gmc header files for iceland
    - LP: #1566544
  * Input: ims-pcu - sanity check against missing interfaces
    - LP: #1566544
  * watchdog: don't run proc_watchdog_update if new value is same as old
    - LP: #1566544
  * mm: memcontrol: reclaim when shrinking memory.high below usage
    - LP: #1566544
  * mm: memcontrol: reclaim and OOM kill when shrinking memory.max below
    usage
    - LP: #1566544
  * x86/apic: Fix suspicious RCU usage in
    smp_trace_call_function_interrupt()
    - LP: #1566544
  * USB: usb_driver_claim_interface: add sanity checking
    - LP: #1566544
  * USB: uas: Reduce can_queue to MAX_CMNDS
    - LP: #1566544
  * tracing: Have preempt(irqs)off trace preempt disabled functions
    - LP: #1566544
  * tracing: Fix crash from reading trace_pipe with sendfile
    - LP: #1566544
  * splice: handle zero nr_pages in splice_to_pipe()
    - LP: #1566544
  * ALSA: usb-audio: add Microsoft HD-5001 to quirks
    - LP: #1566544
  * writeback, cgroup: fix premature wb_put() in
    locked_inode_to_wb_and_lock_list()
    - LP: #1566544
  * fs-writeback: unplug before cond_resched in writeback_sb_inodes
    - LP: #1566544
  * writeback, cgroup: fix use of the wrong bdi_writeback which mismatches
    the inode
    - LP: #1566544
  * bitops: Do not default to __clear_bit() for __clear_bit_unlock()
    - LP: #1566544
  * target: Fix target_release_cmd_kref shutdown comp leak
    - LP: #1566544
  * KVM: VMX: avoid guest hang on invalid invept instruction
    - LP: #1566544
  * KVM: fix spin_lock_init order on x86
    - LP: #1566544
  * tracing: Fix trace_printk() to print when not using bprintk()
    - LP: #1566544
  * fs/coredump: prevent fsuid=0 dumps into user-controlled directories
    - LP: #1566544
  * rapidio/rionet: fix deadlock on SMP
    - LP: #1566544
  * staging: comedi: ni_mio_common: fix the ni_write[blw]() functions
    - LP: #1566544
  * staging: android: ion_test: fix check of
    platform_device_register_simple() error code
    - LP: #1566544
  * ideapad-laptop: Add ideapad Y700 (15) to the no_hw_rfkill DMI list
    - LP: #1566544
  * MAINTAINERS: Update mailing list and web page for hwmon subsystem
    - LP: #1566544
  * ocfs2/dlm: fix race between convert and recovery
    - LP: #1566544
  * ocfs2/dlm: fix BUG in dlm_move_lockres_to_recovery_list
    - LP: #1566544
  * mm/page_alloc: prevent merging between isolated and other pageblocks
    - LP: #1566544
  * mac80211: avoid excessive stack usage in sta_info
    - LP: #1566544
  * clk: xgene: Add missing parenthesis when clearing divider value
    - LP: #1566544
  * clk: qcom: msm8960: Fix ce3_src register offset
    - LP: #1566544
  * xen kconfig: don't "select INPUT_XEN_KBDDEV_FRONTEND"
    - LP: #1566544
  * ppp: take reference on channels netns
    - LP: #1566544
  * mdio-sun4i: oops in error handling in probe
    - LP: #1566544
  * clk: rockchip: free memory in error cases when registering clock
    branches
    - LP: #1566544
  * ARC: bitops: Remove non relevant comments
    - LP: #1566544
  * mac80211: fix txq queue related crashes
    - LP: #1566544
  * net: Fix use after free in the recvmmsg exit path
    - LP: #1566544
  * ath9k: fix misleading indentation
    - LP: #1566544
  * sctp: fix the transports round robin issue when init is retransmitted
    - LP: #1566544
  * ethernet: micrel: fix some error codes
    - LP: #1566544
  * megaraid_sas: add missing curly braces in ioctl handler
    - LP: #1566544
  * clk-divider: make sure read-only dividers do not write to their
    register
    - LP: #1566544
  * misc/bmp085: Enable building as a module
    - LP: #1566544
  * HID: logitech: fix Dual Action gamepad support
    - LP: #1566544
  * net/mlx5: Make command timeout way shorter
    - LP: #1566544
  * ASoC: ssm4567: Reset device before regcache_sync()
    - LP: #1566544
  * fbdev: da8xx-fb: fix videomodes of lcd panels
    - LP: #1566544
  * clk: qcom: msm8960: fix ce3_core clk enable register
    - LP: #1566544
  * ipvs: correct initial offset of Call-ID header search in SIP
    persistence engine
    - LP: #1566544
  * drm/i915: Cleanup phys status page too
    - LP: #1566544
  * ata: ahci_xgene: dereferencing uninitialized pointer in probe
    - LP: #1566544
  * ath9k: fix buffer overrun for ar9287
    - LP: #1566544
  * perf tools: handle spaces in file names obtained from /proc/pid/maps
    - LP: #1566544
  * rtc: ds1685: passing bogus values to irq_restore
    - LP: #1566544
  * ARM: davinci: make I2C support optional
    - LP: #1566544
  * drm/amdkfd: uninitialized variable in
    dbgdev_wave_control_set_registers()
    - LP: #1566544
  * mtd: map: fix .set_vpp() documentation
    - LP: #1566544
  * ARM: OMAP3: Add cpuidle parameters table for omap3430
    - LP: #1566544
  * efi: Expose non-blocking set_variable() wrapper to efivars
    - LP: #1566544
  * rtc: vr41xx: Wire up alarm_irq_enable
    - LP: #1566544
  * sunrpc/cache: drop reference when sunrpc_cache_pipe_upcall() detects a
    race
    - LP: #1566544
  * ipv4: fix broadcast packets reception
    - LP: #1566544
  * lpfc: fix misleading indentation
    - LP: #1566544
  * sched/preempt, sh: kmap_coherent relies on disabled preemption
    - LP: #1566544
  * ipip: Properly mark ipip GRO packets as encapsulated.
    - LP: #1566544
  * spi/rockchip: Make sure spi clk is on in rockchip_spi_set_cs
    - LP: #1566544
  * ASoC: s3c24xx: use const snd_soc_component_driver pointer
    - LP: #1566544
  * mlx4: add missing braces in verify_qp_parameters
    - LP: #1566544
  * clk: meson: Fix meson_clk_register_clks() signature type mismatch
    - LP: #1566544
  * coda: fix error path in case of missing pdata on non-DT platform
    - LP: #1566544
  * kbuild/mkspec: fix grub2 installkernel issue
    - LP: #1566544
  * bpf: avoid copying junk bytes in bpf_get_current_comm()
    - LP: #1566544
  * mac80211: fix unnecessary frame drops in mesh fwding
    - LP: #1566544
  * mtd: brcmnand: Fix v7.1 register offsets
    - LP: #1566544
  * mac80211: fix ibss scan parameters
    - LP: #1566544
  * at803x: fix reset handling
    - LP: #1566544
  * rtc: hym8563: fix invalid year calculation
    - LP: #1566544
  * perf pmu: Fix misleadingly indented assignment (whitespace)
    - LP: #1566544
  * paride: make 'verbose' parameter an 'int' again
    - LP: #1566544
  * regulator: s5m8767: fix get_register() error handling
    - LP: #1566544
  * ppp: ensure file->private_data can't be overridden
    - LP: #1566544
  * clk: versatile: sp810: support reentrance
    - LP: #1566544
  * net: add description for len argument of dev_get_phys_port_name
    - LP: #1566544
  * net: bcmgenet: fix dma api length mismatch
    - LP: #1566544
  * ARM: prima2: always enable reset controller
    - LP: #1566544
  * drivers/misc/ad525x_dpot: AD5274 fix RDAC read back errors
    - LP: #1566544
  * perf stat: Document --detailed option
    - LP: #1566544
  * v4l: vsp1: Set the SRU CTRL0 register when starting the stream
    - LP: #1566544
  * ipvs: drop first packet to redirect conntrack
    - LP: #1566544
  * rtc: max77686: Properly handle regmap_irq_get_virq() error code
    - LP: #1566544
  * x86/iopl/64: Properly context-switch IOPL on Xen PV
    - LP: #1566544
  * Linux 4.2.8-ckt7
    - LP: #1566544
  * PKCS#7: pkcs7_validate_trust(): initialize the _trusted output argument
    - LP: #1571027
  * ALSA: hda - Asus N750JV external subwoofer fixup
    - LP: #1571027
  * ALSA: hda - Fix white noise on Asus N750JV headphone
    - LP: #1571027
  * ALSA: hda - Apply fix for white noise on Asus N550JV, too
    - LP: #1571027
  * drm/radeon: add a dpm quirk for sapphire Dual-X R7 370 2G D5
    - LP: #1571027
  * fs: add file_dentry()
    - LP: #1571027
  * nfs: use file_dentry()
    - LP: #1571027
  * hwmon: (max1111) Return -ENODEV from max1111_read_channel if not
    instantiated
    - LP: #1571027
  * drm/radeon: add another R7 370 quirk
    - LP: #1571027
  * drm/radeon: add a dpm quirk for all R7 370 parts
    - LP: #1571027
  * powerpc/mm: Fixup preempt underflow with huge pages
    - LP: #1571027
  * pinctrl: pistachio: fix mfio84-89 function description and pinmux.
    - LP: #1571027
  * pinctrl: sunxi: Fix A33 external interrupts not working
    - LP: #1571027
  * usb: renesas_usbhs: avoid NULL pointer derefernce in
    usbhsf_pkt_handler()
    - LP: #1571027
  * usb: renesas_usbhs: disable TX IRQ before starting TX DMAC transfer
    - LP: #1571027
  * btrfs: fix crash/invalid memory access on fsync when using overlayfs
    - LP: #1571027
  * ALSA: usb-audio: Minor code cleanup in create_fixed_stream_quirk()
    - LP: #1571027
  * ALSA: usb-audio: Fix double-free in error paths after
    snd_usb_add_audio_stream() call
    - LP: #1571027
  * USB: mct_u232: add sanity checking in probe
    - LP: #1571027
    - CVE-2016-3136
  * USB: cypress_m8: add endpoint sanity check
    - LP: #1571027
    - CVE-2016-3137
  * USB: digi_acceleport: do sanity checking for the number of ports
    - LP: #1571027
  * [media] au0828: fix au0828_v4l2_close() dev_state race condition
    - LP: #1571027
  * [media] au0828: Fix dev_state handling
    - LP: #1571027
  * sd: Fix excessive capacity printing on devices with blocks bigger than
    512 bytes
    - LP: #1571027
  * drm/dp: move hw_mutex up the call stack
    - LP: #1571027
  * drm/udl: Use unlocked gem unreferencing
    - LP: #1571027
  * ext4: add lockdep annotations for i_data_sem
    - LP: #1571027
  * ALSA: hda - fix front mic problem for a HP desktop
    - LP: #1564712, #1571027
  * KVM: x86: Inject pending interrupt even if pending nmi exist
    - LP: #1571027
  * ALSA: timer: Use mod_timer() for rearming the system timer
    - LP: #1571027
  * mm: fix invalid node in alloc_migrate_target()
    - LP: #1571027
  * iio: st_magn: always define ST_MAGN_TRIGGER_SET_STATE
    - LP: #1571027
  * ext4: ignore quota mount options if the quota feature is enabled
    - LP: #1571027
  * xen/events: Mask a moving irq
    - LP: #1571027
  * usb: renesas_usbhs: fix to avoid using a disabled ep in
    usbhsg_queue_done()
    - LP: #1571027
  * mac80211: properly deal with station hashtable insert errors
    - LP: #1571027
  * compiler-gcc: disable -ftracer for __noclone functions
    - LP: #1571027
  * rbd: use GFP_NOIO consistently for request allocations
    - LP: #1571027
  * Btrfs: fix file/data loss caused by fsync after rename and new inode
    - LP: #1571027
  * USB: serial: ftdi_sio: Add support for ICP DAS I-756xU devices
    - LP: #1571027
  * USB: serial: cp210x: Adding GE Healthcare Device ID
    - LP: #1571027
  * USB: option: add "D-Link DWM-221 B1" device id
    - LP: #1571027
  * virtio: virtio 1.0 cs04 spec compliance for reset
    - LP: #1571027
  * libnvdimm: fix smart data retrieval
    - LP: #1571027
  * gpio: pca953x: Use correct u16 value for register word write
    - LP: #1571027
  * parisc: Avoid function pointers for kernel exception routines
    - LP: #1571027
  * parisc: Fix kernel crash with reversed copy_from_user()
    - LP: #1571027
  * parisc: Unbreak handling exceptions from kernel modules
    - LP: #1571027
  * net: macb: replace macb_writel() call by queue_writel() to update queue
    ISR
    - LP: #1571027
  * net: bcmgenet: fix dev->stats.tx_bytes accounting
    - LP: #1571027
  * net: bcmgenet: fix skb_len in bcmgenet_xmit_single()
    - LP: #1571027
  * ipv6: udp: fix UDP_MIB_IGNOREDMULTI updates
    - LP: #1571027
  * pinctrl: nomadik: fix pull debug print inversion
    - LP: #1571027
  * ip6_tunnel: set rtnl_link_ops before calling register_netdevice
    - LP: #1571027
  * KVM: x86: move steal time initialization to vcpu entry time
    - LP: #1571027
  * lib/ucs2_string: Add ucs2 -> utf8 helper functions
    - LP: #1571027
  * efi: Use ucs2_as_utf8 in efivarfs instead of open coding a bad version
    - LP: #1571027
  * efi: Do variable name validation tests in utf8
    - LP: #1571027
  * efi: Make our variable validation list include the guid
    - LP: #1571027
  * efi: Make efivarfs entries immutable by default
    - LP: #1571027
  * efi: Add pstore variables to the deletion whitelist
    - LP: #1571027
  * lib/ucs2_string: Correct ucs2 -> utf8 conversion
    - LP: #1571027
  * ipr: Fix out-of-bounds null overwrite
    - LP: #1571027
  * ipr: Fix regression when loading firmware
    - LP: #1571027
  * perf/x86/intel: Fix PEBS data source interpretation on Nehalem/Westmere
    - LP: #1571027
  * ALSA: hda - Add new GPU codec ID 0x10de0082 to snd-hda
    - LP: #1571027
  * mwifiex: fix corner case association failure
    - LP: #1571027
  * net: phy: at803x: Request 'reset' GPIO only for AT8030 PHY
    - LP: #1571027
  * Linux 4.2.8-ckt8
    - LP: #1571027

-- Kamal Mostafa <kamal@canonical.com>  Mon, 18 Apr 2016 06:54:19 -0700

Changed in linux (Ubuntu Wily):
status:	Fix Committed → Fix Released
status:	Fix Committed → Fix Released

Bug Watch Updater (bug-watch-updater) on 2017-10-26

Changed in linux (Fedora):
importance:	Unknown → Critical
status:	Unknown → Won't Fix

Ubuntu
linux package

Memory arena corruption with FUSE (was Memory allocation failure crashes kernel hard, presumably related to FUSE)

Bug Description

Related branches

CVE References

Other bug subscribers

Bug attachments

Remote bug watches

	Status	Importance	Assigned to
linux (Fedora)	Won't Fix	Critical	redhat-bugs #1254310
linux (Ubuntu)	Fix Released	High	Seth Forshee
Wily	Fix Released	High	Seth Forshee
Xenial	Fix Released	High	Seth Forshee

Ubuntulinux package

Memory arena corruption with FUSE (was Memory allocation failure crashes kernel hard, presumably related to FUSE)

Bug Description

Related branches

CVE References

Other bug subscribers

Bug attachments

Remote bug watches

Ubuntu
linux package