Bug #1322407 “NFS kernel server creates a kworker with 100% CPU ...” : Bugs : linux package : Ubuntu

Revision history for this message

Mark Haidekker (mhaidekk) wrote on 2014-05-23:

#1

AlsaDevices.txt Edit (492 bytes, text/plain; charset="utf-8")
BootDmesg.txt Edit (69.0 KiB, text/plain; charset="utf-8")
Card0.Codecs.codec.0.txt Edit (15.0 KiB, text/plain; charset="utf-8")
CurrentDmesg.txt Edit (4.7 KiB, text/plain; charset="utf-8")
Dependencies.txt Edit (3.1 KiB, text/plain; charset="utf-8")
Lspci.txt Edit (17.7 KiB, text/plain; charset="utf-8")
Lsusb.txt Edit (692 bytes, text/plain; charset="utf-8")
PciMultimedia.txt Edit (589 bytes, text/plain; charset="utf-8")
ProcCpuinfo.txt Edit (6.4 KiB, text/plain; charset="utf-8")
ProcInterrupts.txt Edit (2.9 KiB, text/plain; charset="utf-8")
ProcModules.txt Edit (3.6 KiB, text/plain; charset="utf-8")
UdevDb.txt Edit (171.1 KiB, text/plain; charset="utf-8")
UdevLog.txt Edit (350.0 KiB, text/plain; charset="utf-8")

description:	updated
description:	updated

Revision history for this message

Brad Figg (brad-figg) wrote on 2014-05-23: Status changed to Confirmed

#2

This change was made by a bot.

Changed in linux (Ubuntu):
status:	New → Confirmed

Revision history for this message

Stefan Bader (smb) wrote on 2014-05-23:

#3

Can you be more specific about the configuration of the server (/etc/exports,/etc/defaul/nfs-*) and what kind of client you use. I just tried this on two VMs and saw not issues (at least with a basic NFSv4 setup).

Revision history for this message

Mark Haidekker (mhaidekk) wrote on 2014-05-23:

#4

/etc/exports has one line that is not a comment:

/home 192.168.1.0/255.255.255.0(rw,sync,root_squash,no_subtree_check)

/etc/defaults/nfs-common:

NEED_STATD=no
# Options for rpc.statd.
STATDOPTS=
# Do you want to start the gssd daemon? It is required for Kerberos mounts.
NEED_GSSD=no

/etc/defaults/nfs-kernel-server

# Number of servers to start up
RPCNFSDCOUNT=8
# Runtime priority of server (see nice(1))
RPCNFSDPRIORITY=0
# Options for rpc.mountd.
RPCMOUNTDOPTS="--manage-gids"
# Do you want to start the svcgssd daemon? It is only required for Kerberos
# exports. Valid alternatives are "yes" and "no"; the default is "no".
NEED_SVCGSSD="no"
# Options for rpc.svcgssd.
RPCSVCGSSDOPTS=""
# Options for rpc.nfsd.
RPCNFSDOPTS="--debug --syslog"

################
Note that I tried several alternatives, such as

RPCMOUNTDOPTS="--manage-gids --no-nfs-version 4"

and other combinations, to no effect. I am mounting Ubuntu 12.04 clients. The relevant entry in /etc/mtab for one of these clients looks like this:

192.168.1.2:/home /mnt/nfshome nfs rw,vers=4,addr=192.168.1.2,clientaddr=192.168.1.9 0 0

The client's nfs-common version is 1:1.2.5-3ubuntu3.1. Also, it makes no difference whether I use mount -t nfs or mount -t nfs4 (in the first case the /etc/mtab entry has vers=4 in the options, in the other case it does not, but the filesystem type is nfs4.

Revision history for this message

Mark Haidekker (mhaidekk) wrote on 2014-05-23:

#5

Oh, and maybe of less relevance, but to keep it complete, here is the output of "top":

top - 13:46:47 up 5 days, 56 min, 2 users, load average: 1.00, 0.97, 1.01
Tasks: 172 total, 2 running, 170 sleeping, 0 stopped, 0 zombie
%Cpu(s): 0.0 us, 12.0 sy, 0.0 ni, 83.3 id, 0.0 wa, 4.8 hi, 0.0 si, 0.0 st
KiB Mem: 16415556 total, 2105640 used, 14309916 free, 390756 buffers
KiB Swap: 4194296 total, 0 used, 4194296 free. 865744 cached Mem

PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
86 root 20 0 0 0 0 R 100.0 0.0 6867:11 kworker/2:1
15637 root 20 0 24956 1760 1172 R 0.3 0.0 0:00.03 top
...

Joseph Salisbury (jsalisbury) on 2014-05-25

Changed in linux (Ubuntu):
importance:	Undecided → High
tags:	added: kernel-key

Revision history for this message

Stefan Bader (smb) wrote on 2014-05-26:

#6

Probably the relevant part I missed initially is that probably this involves connections going on and succeeding for a bit and fail at some point. Where it is unclear how many connections have been going on and so on.
The function tracing you did does show that there is some kind of loop going on but does not allow to figure out any real details. I wonder whether you could try to enable some of the nfs debugging from /proc/sys/sunrpc/*_debug. Which allows to enable various pieces of internal debugging.

#define RPCDBG_XPRT 0x0001
#define RPCDBG_CALL 0x0002
#define RPCDBG_DEBUG 0x0004
#define RPCDBG_NFS 0x0008
#define RPCDBG_AUTH 0x0010
#define RPCDBG_BIND 0x0020
#define RPCDBG_SCHED 0x0040
#define RPCDBG_TRANS 0x0080
#define RPCDBG_SVCXPRT 0x0100
#define RPCDBG_SVCDSP 0x0200
#define RPCDBG_MISC 0x0400
#define RPCDBG_CACHE 0x0800
#define RPCDBG_ALL 0x7fff

So echoing 524287 into the various /proc interfaces should enable all debugging. Not sure which if them maybe starting with nfsd_debug and/or rpc_debug. Maybe this allows to narrow down what goes wrong in a better way.

Revision history for this message

Mark Haidekker (mhaidekk) wrote on 2014-05-26:

#7

A few bits for clarification before I get started: (1) Your suggested value, 524287, clearly is 0x7ffff. This is (RPCDBG_ALL | 0x78000), i.e., with four extra bits, correct?

(2) I guess I can harvest the debug messages from /var/log/syslog, correct? Meaning, once the kworker runs amok, I simply copy syslog and attach it to one of my responses?

I'll start doing this now.

Revision history for this message

Stefan Bader (smb) wrote on 2014-05-26:

#8

Oh, sorry, actually it was just me hitting too many Fs. So 32767 (0x7fff) should be enough. I guess the other value is ok, too just sets too many bits in the mask.

Yes, the messages should end up in syslog. With all debugging turned on there will be quite a bit of logging going on. Hope this does not change timing in a way that the problem does not show up anymore.

Revision history for this message

Mark Haidekker (mhaidekk) wrote on 2014-05-27:

#9

Funny you should mention timing... right now, the server does not show the runaway kworker. nfds faithfully spits out a few debug messages per second, but acts normally otherwise. I have put the server back in production, see if I can catch the runaway kworker when I have more users -- I'll get some flak if nfsd hangs again, but that's life. I am also setting up another, similar NFS server for testing purposes. Will report once I have more information.

Revision history for this message

Mark Haidekker (mhaidekk) wrote on 2014-05-28:

#10

Download full text (12.3 KiB)

OK -- bug can be reproduced. Here is an interesting observation: When a *second* nfs server is connected to the network , the bug seems to be suppressed, for what it's worth. This one really puzzles me. The runaway kworker appeared only after I shut down the old server.

As for debugging, the component that seems to create an insane amount of debug messages is rpcd. I am pasting snippets from nfsd's debugging-enabled and rpcd's debugging-enabled here, please let me know if you need more of the syslog file.

######### NFSD ##############
(...)
May 28 13:06:41 marcato kernel: [184158.621702] nfsd: fh_compose(exp 09:04/2 (filename redacted), ino=10619455)
May 28 13:06:41 marcato kernel: [184158.621714] nfsd: fh_compose(exp 09:04/2 (filename redacted), ino=10618649)
May 28 13:06:41 marcato kernel: [184158.621731] nfsd: fh_compose(exp 09:04/2 (filename redacted), ino=10619242)
May 28 13:06:41 marcato kernel: [184158.621742] nfsd: fh_compose(exp 09:04/2 (filename redacted), ino=10619422)
May 28 13:06:41 marcato kernel: [184158.621754] nfsd: fh_compose(exp 09:04/2 (filename redacted), ino=10619296)
May 28 13:06:41 marcato kernel: [184158.621761] nfsv4 compound op ffff880417a01080 opcnt 2 #2: 26: status 0
May 28 13:06:41 marcato kernel: [184158.621764] nfsv4 compound returned 0
May 28 13:06:41 marcato kernel: [184158.622472] nfsd_dispatch: vers 4 proc 1
May 28 13:06:41 marcato kernel: [184158.622484] nfsv4 compound op #1/2: 22 (OP_PUTFH)
May 28 13:06:41 marcato kernel: [184158.622491] nfsd: fh_verify(28: 01060001 57c17980 8845d969 d9acb495 eab15095 00a205da)
May 28 13:06:41 marcato kernel: [184158.622507] nfsv4 compound op ffff880417a01080 opcnt 2 #1: 22: status 0
May 28 13:06:41 marcato kernel: [184158.622511] nfsv4 compound op #2/2: 26 (OP_READDIR)
May 28 13:06:41 marcato kernel: [184158.622549] nfsd: fh_verify(28: 01060001 57c17980 8845d969 d9acb495 eab15095 00a205da)

######### RPCD ##############
(...)
May 28 13:07:29 marcato kernel: [184207.248774] RPC: 22256 call_connect_status (status -11)
May 28 13:07:29 marcato kernel: [184207.248777] svc: socket ffff880417284d00 TCP data ready (svsk ffff8803e3ff2000)
May 28 13:07:29 marcato kernel: [184207.248781] svc: transport ffff8803e3ff2000 served by daemon ffff8800ce6da000
May 28 13:07:29 marcato kernel: [184207.248791] RPC: 22256 call_bind (status 0)
May 28 13:07:29 marcato kernel: [184207.248793] svc: server ffff8800ce6da000, pool 0, transport ffff8803e3ff2000, inuse=2
May 28 13:07:29 marcato kernel: [184207.248795] svc: tcp_recv ffff8803e3ff2000 data 1 conn 0 close 0
May 28 13:07:29 marcato kernel: [184207.248800] svc: socket ffff8803e3ff2000 recvfrom(ffff8803e3ff22bc, 0) = 4
May 28 13:07:29 marcato kernel: [184207.248802] svc: TCP record, 188 bytes
May 28 13:07:29 marcato kernel: [184207.248806] svc: socket ffff8803e3ff2000 recvfrom(ffff880415a7c0bc, 3908) = 188
May 28 13:07:29 marcato kernel: [184207.248809] svc: TCP final record (188 bytes)
May 28 13:07:29 marcato kernel: [184207.248812] svc: got len=188
May 28 13:07:29 marcato kernel: [184207.248815] svc: svc_authenticate (1)
May 28 13:07:29 marcato kernel: [184207.248819] svc: calling dispatcher
May 28 13:07:29 marcato kernel: [184207.248826]...

OK -- bug can be reproduced. Here is an interesting observation: When a *second* nfs server is connected to the network , the bug seems to be suppressed, for what it's worth. This one really puzzles me. The runaway kworker appeared only after I shut down the old server.

As for debugging, the component that seems to create an insane amount of debug messages is rpcd. I am pasting snippets from nfsd's debugging-enabled and rpcd's debugging-enabled here, please let me know if you need more of the syslog file.

######### NFSD ##############
(...)
May 28 13:06:41 marcato kernel: [184158.621702] nfsd: fh_compose(exp 09:04/2 (filename redacted), ino=10619455)
May 28 13:06:41 marcato kernel: [184158.621714] nfsd: fh_compose(exp 09:04/2 (filename redacted), ino=10618649)
May 28 13:06:41 marcato kernel: [184158.621731] nfsd: fh_compose(exp 09:04/2 (filename redacted), ino=10619242)
May 28 13:06:41 marcato kernel: [184158.621742] nfsd: fh_compose(exp 09:04/2 (filename redacted), ino=10619422)
May 28 13:06:41 marcato kernel: [184158.621754] nfsd: fh_compose(exp 09:04/2 (filename redacted), ino=10619296)
May 28 13:06:41 marcato kernel: [184158.621761] nfsv4 compound op ffff880417a01080 opcnt 2 #2: 26: status 0
May 28 13:06:41 marcato kernel: [184158.621764] nfsv4 compound returned 0
May 28 13:06:41 marcato kernel: [184158.622472] nfsd_dispatch: vers 4 proc 1
May 28 13:06:41 marcato kernel: [184158.622484] nfsv4 compound op #1/2: 22 (OP_PUTFH)
May 28 13:06:41 marcato kernel: [184158.622491] nfsd: fh_verify(28: 01060001 57c17980 8845d969 d9acb495 eab15095 00a205da)
May 28 13:06:41 marcato kernel: [184158.622507] nfsv4 compound op ffff880417a01080 opcnt 2 #1: 22: status 0
May 28 13:06:41 marcato kernel: [184158.622511] nfsv4 compound op #2/2: 26 (OP_READDIR)
May 28 13:06:41 marcato kernel: [184158.622549] nfsd: fh_verify(28: 01060001 57c17980 8845d969 d9acb495 eab15095 00a205da)

######### RPCD ##############
(...)
May 28 13:07:29 marcato kernel: [184207.248774] RPC: 22256 call_connect_status (status -11)
May 28 13:07:29 marcato kernel: [184207.248777] svc: socket ffff880417284d00 TCP data ready (svsk ffff8803e3ff2000)
May 28 13:07:29 marcato kernel: [184207.248781] svc: transport ffff8803e3ff2000 served by daemon ffff8800ce6da000
May 28 13:07:29 marcato kernel: [184207.248791] RPC: 22256 call_bind (status 0)
May 28 13:07:29 marcato kernel: [184207.248793] svc: server ffff8800ce6da000, pool 0, transport ffff8803e3ff2000, inuse=2
May 28 13:07:29 marcato kernel: [184207.248795] svc: tcp_recv ffff8803e3ff2000 data 1 conn 0 close 0
May 28 13:07:29 marcato kernel: [184207.248800] svc: socket ffff8803e3ff2000 recvfrom(ffff8803e3ff22bc, 0) = 4
May 28 13:07:29 marcato kernel: [184207.248802] svc: TCP record, 188 bytes
May 28 13:07:29 marcato kernel: [184207.248806] svc: socket ffff8803e3ff2000 recvfrom(ffff880415a7c0bc, 3908) = 188
May 28 13:07:29 marcato kernel: [184207.248809] svc: TCP final record (188 bytes)
May 28 13:07:29 marcato kernel: [184207.248812] svc: got len=188
May 28 13:07:29 marcato kernel: [184207.248815] svc: svc_authenticate (1)
May 28 13:07:29 marcato kernel: [184207.248819] svc: calling dispatcher
May 28 13:07:29 marcato kernel: [184207.248826] RPC: 22256 call_connect xprt ffff8802fd03c000 is not connected
May 28 13:07:29 marcato kernel: [184207.248835] RPC: 22256 xprt_connect xprt ffff8802fd03c000 is not connected
May 28 13:07:29 marcato kernel: [184207.248844] RPC: 22256 sleep_on(queue "xprt_pending" time 4340888821)
May 28 13:07:29 marcato kernel: [184207.248851] RPC: 22256 added to queue ffff8802fd03c258 "xprt_pending"
May 28 13:07:29 marcato kernel: [184207.248862] RPC: 22256 setting alarm for 9000 ms
May 28 13:07:29 marcato kernel: [184207.248871] RPC:       xs_connect scheduled xprt ffff8802fd03c000
May 28 13:07:29 marcato kernel: [184207.248885] RPC:       disconnecting xprt ffff8802fd03c000 to reuse port
May 28 13:07:29 marcato kernel: [184207.248893] RPC:       AF_UNSPEC connect return code 0
May 28 13:07:29 marcato kernel: [184207.248902] RPC:       worker connecting xprt ffff8802fd03c000 via tcp to 0.0.0.0 (port 58162)
May 28 13:07:29 marcato kernel: [184207.248921] RPC:       xs_tcp_state_change client ffff8802fd03c000...
May 28 13:07:29 marcato kernel: [184207.248922] RPC:       state 7 conn 0 dead 0 zapped 1 sk_shutdown 3
May 28 13:07:29 marcato kernel: [184207.248924] RPC:       disconnected transport ffff8802fd03c000
May 28 13:07:29 marcato kernel: [184207.248925] RPC: 22256 __rpc_wake_up_task (now 4340888822)
May 28 13:07:29 marcato kernel: [184207.248926] RPC: 22256 disabling timer
May 28 13:07:29 marcato kernel: [184207.248928] RPC: 22256 removed from queue ffff8802fd03c258 "xprt_pending"
May 28 13:07:29 marcato kernel: [184207.248929] RPC:       __rpc_wake_up_task done
May 28 13:07:29 marcato kernel: [184207.248931] RPC:       ffff8802fd03c000 connect status 115 connected 0 sock state 7
May 28 13:07:29 marcato kernel: [184207.248933] RPC: 22256 __rpc_execute flags=0x681
May 28 13:07:29 marcato kernel: [184207.248934] RPC: 22256 xprt_connect_status: retrying
May 28 13:07:29 marcato kernel: [184207.248936] RPC: 22256 call_connect_status (status -11)
May 28 13:07:29 marcato kernel: [184207.248937] RPC: 22256 call_bind (status 0)
May 28 13:07:29 marcato kernel: [184207.248939] RPC: 22256 call_connect xprt ffff8802fd03c000 is not connected
May 28 13:07:29 marcato kernel: [184207.248940] RPC: 22256 xprt_connect xprt ffff8802fd03c000 is not connected
May 28 13:07:29 marcato kernel: [184207.248942] RPC: 22256 sleep_on(queue "xprt_pending" time 4340888822)
May 28 13:07:29 marcato kernel: [184207.248943] RPC: 22256 added to queue ffff8802fd03c258 "xprt_pending"
May 28 13:07:29 marcato kernel: [184207.248944] RPC: 22256 setting alarm for 9000 ms
May 28 13:07:29 marcato kernel: [184207.248946] RPC:       xs_connect scheduled xprt ffff8802fd03c000
May 28 13:07:29 marcato kernel: [184207.248948] RPC:       disconnecting xprt ffff8802fd03c000 to reuse port
May 28 13:07:29 marcato kernel: [184207.248949] RPC:       AF_UNSPEC connect return code 0
May 28 13:07:29 marcato kernel: [184207.248951] RPC:       worker connecting xprt ffff8802fd03c000 via tcp to 0.0.0.0 (port 58162)
May 28 13:07:29 marcato kernel: [184207.248959] RPC:       xs_tcp_state_change client ffff8802fd03c000...
May 28 13:07:29 marcato kernel: [184207.248961] RPC:       state 7 conn 0 dead 0 zapped 1 sk_shutdown 3
May 28 13:07:29 marcato kernel: [184207.248962] RPC:       disconnected transport ffff8802fd03c000
May 28 13:07:29 marcato kernel: [184207.248964] RPC: 22256 __rpc_wake_up_task (now 4340888822)
May 28 13:07:29 marcato kernel: [184207.248977] RPC: 22256 call_connect xprt ffff8802fd03c000 is not connected
May 28 13:07:29 marcato kernel: [184207.248979] RPC: 22256 xprt_connect xprt ffff8802fd03c000 is not connected
May 28 13:07:29 marcato kernel: [184207.248980] RPC: 22256 sleep_on(queue "xprt_pending" time 4340888822)
May 28 13:07:29 marcato kernel: [184207.248982] RPC: 22256 added to queue ffff8802fd03c258 "xprt_pending"
May 28 13:07:29 marcato kernel: [184207.248984] svc: socket ffff8803e3ff2000 sendto([ffff88011f47d000 64... ], 4092) = 4092 (addr 192.168.1.16, port=839)
May 28 13:07:29 marcato kernel: [184207.248986] svc: server ffff8800ce6da000 waiting for data (to = 900000)
May 28 13:07:29 marcato kernel: [184207.248987] RPC: 22256 setting alarm for 9000 ms
May 28 13:07:29 marcato kernel: [184207.248989] RPC:       xs_connect scheduled xprt ffff8802fd03c000
May 28 13:07:29 marcato kernel: [184207.248990] RPC:       disconnecting xprt ffff8802fd03c000 to reuse port
May 28 13:07:29 marcato kernel: [184207.248991] RPC:       AF_UNSPEC connect return code 0
May 28 13:07:29 marcato kernel: [184207.248993] RPC:       worker connecting xprt ffff8802fd03c000 via tcp to 0.0.0.0 (port 58162)
May 28 13:07:29 marcato kernel: [184207.249002] RPC:       xs_tcp_state_change client ffff8802fd03c000...
May 28 13:07:29 marcato kernel: [184207.249004] RPC:       state 7 conn 0 dead 0 zapped 1 sk_shutdown 3
May 28 13:07:29 marcato kernel: [184207.249005] RPC:       disconnected transport ffff8802fd03c000
May 28 13:07:29 marcato kernel: [184207.249006] RPC: 22256 __rpc_wake_up_task (now 4340888822)
May 28 13:07:29 marcato kernel: [184207.249007] RPC: 22256 disabling timer
May 28 13:07:29 marcato kernel: [184207.249008] RPC: 22256 removed from queue ffff8802fd03c258 "xprt_pending"
May 28 13:07:29 marcato kernel: [184207.249009] RPC:       __rpc_wake_up_task done
May 28 13:07:29 marcato kernel: [184207.249011] RPC:       ffff8802fd03c000 connect status 115 connected 0 sock state 7
May 28 13:07:29 marcato kernel: [184207.249012] RPC: 22256 __rpc_execute flags=0x681
May 28 13:07:29 marcato kernel: [184207.249013] RPC: 22256 xprt_connect_status: retrying
May 28 13:07:29 marcato kernel: [184207.249014] RPC: 22256 call_connect_status (status -11)
May 28 13:07:29 marcato kernel: [184207.249016] RPC: 22256 call_bind (status 0)
May 28 13:07:29 marcato kernel: [184207.249017] RPC: 22256 call_connect xprt ffff8802fd03c000 is not connected
May 28 13:07:29 marcato kernel: [184207.249018] RPC: 22256 xprt_connect xprt ffff8802fd03c000 is not connected
May 28 13:07:29 marcato kernel: [184207.249019] RPC: 22256 sleep_on(queue "xprt_pending" time 4340888822)
May 28 13:07:29 marcato kernel: [184207.249020] RPC: 22256 added to queue ffff8802fd03c258 "xprt_pending"
May 28 13:07:29 marcato kernel: [184207.249021] RPC: 22256 setting alarm for 9000 ms
May 28 13:07:29 marcato kernel: [184207.249023] RPC:       xs_connect scheduled xprt ffff8802fd03c000
May 28 13:07:29 marcato kernel: [184207.249024] RPC:       disconnecting xprt ffff8802fd03c000 to reuse port
May 28 13:07:29 marcato kernel: [184207.249026] RPC:       AF_UNSPEC connect return code 0
May 28 13:07:29 marcato kernel: [184207.249027] RPC:       worker connecting xprt ffff8802fd03c000 via tcp to 0.0.0.0 (port 58162)
May 28 13:07:29 marcato kernel: [184207.249034] RPC:       xs_tcp_state_change client ffff8802fd03c000...
May 28 13:07:29 marcato kernel: [184207.249035] RPC:       state 7 conn 0 dead 0 zapped 1 sk_shutdown 3
May 28 13:07:29 marcato kernel: [184207.249036] RPC:       disconnected transport ffff8802fd03c000
May 28 13:07:29 marcato kernel: [184207.249037] RPC: 22256 __rpc_wake_up_task (now 4340888822)
May 28 13:07:29 marcato kernel: [184207.249038] RPC: 22256 disabling timer
May 28 13:07:29 marcato kernel: [184207.249040] RPC: 22256 removed from queue ffff8802fd03c258 "xprt_pending"
May 28 13:07:29 marcato kernel: [184207.249041] RPC:       __rpc_wake_up_task done
May 28 13:07:29 marcato kernel: [184207.249042] RPC:       ffff8802fd03c000 connect status 115 connected 0 sock state 7
May 28 13:07:29 marcato kernel: [184207.249043] RPC: 22256 __rpc_execute flags=0x681
May 28 13:07:29 marcato kernel: [184207.249044] RPC: 22256 xprt_connect_status: retrying
May 28 13:07:29 marcato kernel: [184207.249046] RPC: 22256 call_connect_status (status -11)
May 28 13:07:29 marcato kernel: [184207.249047] RPC: 22256 call_bind (status 0)
May 28 13:07:29 marcato kernel: [184207.249048] RPC: 22256 call_connect xprt ffff8802fd03c000 is not connected
May 28 13:07:29 marcato kernel: [184207.249054] RPC: 22256 xprt_connect xprt ffff8802fd03c000 is not connected
May 28 13:07:29 marcato kernel: [184207.249062] RPC: 22256 sleep_on(queue "xprt_pending" time 4340888822)
May 28 13:07:29 marcato kernel: [184207.249070] RPC: 22256 added to queue ffff8802fd03c258 "xprt_pending"
May 28 13:07:29 marcato kernel: [184207.249078] RPC: 22256 setting alarm for 9000 ms
May 28 13:07:29 marcato kernel: [184207.249086] RPC:       xs_connect scheduled xprt ffff8802fd03c000
May 28 13:07:29 marcato kernel: [184207.249093] RPC:       disconnecting xprt ffff8802fd03c000 to reuse port
May 28 13:07:29 marcato kernel: [184207.249099] RPC:       AF_UNSPEC connect return code 0
May 28 13:07:29 marcato kernel: [184207.249106] RPC:       worker connecting xprt ffff8802fd03c000 via tcp to 0.0.0.0 (port 58162)
May 28 13:07:29 marcato kernel: [184207.249119] RPC:       xs_tcp_state_change client ffff8802fd03c000...
May 28 13:07:29 marcato kernel: [184207.249127] RPC:       state 7 conn 0 dead 0 zapped 1 sk_shutdown 3
May 28 13:07:29 marcato kernel: [184207.249136] RPC:       disconnected transport ffff8802fd03c000
May 28 13:07:29 marcato kernel: [184207.249145] RPC: 22256 __rpc_wake_up_task (now 4340888822)
May 28 13:07:29 marcato kernel: [184207.249153] RPC: 22256 disabling timer
May 28 13:07:29 marcato kernel: [184207.249159] RPC: 22256 removed from queue ffff8802fd03c258 "xprt_pending"

Revision history for this message

Mark Haidekker (mhaidekk) wrote on 2014-05-28:

#11

One more observation that I'll post here as *TENTATIVE*. All clients are , as I wrote earlier, 12.04 clients. The bug, as described above, occurs whenever one of the client's fstab entry uses the nfs filesystem:

192.168.1.2:/home /marcato/home nfs defaults,user,exec,hard,auto,nolock 0 0

However, the NFS server appears to act normally when all clients use nfs4:

192.168.1.2:/home /marcato/home nfs4 defaults,user,exec,hard,auto,nolock 0 0

Interesting, because the /etc/mtab of the corresponding client has vers=4 in the options, which those clients that use nfs4 don't have. So it appears that, after all, all clients use version 4, right?

Revision history for this message

Mark Haidekker (mhaidekk) wrote on 2014-05-29:

#12

Strike #11 above. Observation is not correct. The runaway kworker appears even when all clients use "nfs4".

Revision history for this message

Stefan Bader (smb) wrote on 2014-05-29:

#13

If /proc/mounts shows nvsvers=4 I would assume as well that nfsv4 is used. Formally I had the feeling that the examples looked like for really using nfsv4 one would need to have one entry in /etc/exports declaring a fsid=0 (iow the root) and then clients would ask for paths relative to that root. On one hand I *think* that I remember someone asking on irc and having issues when not following that. Which was another thing that seemed to have worked before.

For the debugging, I think it does give me a rough idea, just need to match that up against some code. Which will probably take a bit. But somehow it appears to try to connect but then to think the socket needs to be released for some reason. And possibly it either is failing to release it with some unexpected condition or incorrectly assumes it is still connected and repeats this forever. Cannot really think of a reason why this is related (or not) to another server running (at least yet).

Revision history for this message

Mark Haidekker (mhaidekk) wrote on 2014-05-29:

#14

I saw the thread with the /etc/exports entry that has fsid=0 (export filesystem root), and, of course, I tried that. Somehow, I did not get any valid exports when, for example, using

/home 192.168.1.0/255.255.255.0(rw,fsid=0,root_squash,no_subtree_check)
/home/samba 192.168.1.0/255.255.255.0(rw,root_squash,no_subtree_check)

Also, I had the same problem when I dsabled NFSv4 and limited the kernel server to NFSv3 (with options in /etc/defaults)

So I moved away from using the fsid=0 option.

Stefan, if you need me to do anything, le me know. I have done my own share of printk-debugging, although I am not sure if I am out of my depth with nfs.

Revision history for this message

Mark Haidekker (mhaidekk) wrote on 2014-05-30:

#15

I made one more observation, and it is fully reproducible. Frankly, it would appear that the bug may not be as impactful as originally though (if this is a bug, to begin with). Here it is:

On one of the Linux clients, Windows runs in Virtualbox. The NFS share is mounted within the Linux host, and the mount point is available in the virtual environment as a vboxdrv shared folder. Within Windows, the user uses a translation automation tool called "Trados". If -- and only if -- this program is running, the apparent runaway kworker uses 100% CPU. When Trados is closed, the CPU core returns to idle within ~20 to 30 seconds.

This, finally, explains many observations I made before, which are mere coincidences. It also explains why a second NFS server that I brought online today does not exhibit the runaway behavior. Only the combination of all three: NFS, Virtualbox (Win 7 guest), and Trados, puts a heavy load on the rpc daemon. No other Windows program in the virtual box does that.

Why and how? I used tcpdump to see if anything insane is going on, but it is not -- neither on the client's physical eth0 interface nor on the virtual vboxnet0 interface. I did not find anything suspicious. Also, I cannot explain why the old 10.04 NFS server didn't show this behavior. NFSv3 perhaps? And why does it affect the rpc daemon?

In conclusion, I believe that I found the cause (albeit not the reason) for the runaway behavior. I also believe that the combination of software packages is very unusual, and this bug is not likely to affect many people. I propose to reduce the importance level of this bug to "low".

Latsly, it makes me shudder to think about what Trados might do to the hard disks of people who use it locally, with natively installed Windows.

Revision history for this message

Stefan Bader (smb) wrote on 2014-05-30:

#16

Now this also explains why I had a hard time (iow was not being able to) reproducing this here. Since this indeed is a rather unusual corner case, I can put it a little lower on the list. Still I would try to understand the debug output enough to have some better idea about how that windows app is getting NFS into such a mess. Maybe there is some way to at least weaken the effects.

About the fsid=0 exports. Yeah, it seems like a bit of a twisted way of setting things up. That face that I have to mount "<server>:/" for the export defined as fsid=0 I can grasp. But then how other mounts are matched to be below this and to be NFSv4 is not really that obvious.

Revision history for this message

Mark Haidekker (mhaidekk) wrote on 2014-05-30:

#17

Well, I am game.

I'll safely assume that Trados is rather one of the less-frequently used programs in this community, so it is obviously up to me to run the actual tests. If you could help me by suggesting debugging functions or actions, I'll gladly provide more information.

Joseph Salisbury (jsalisbury) on 2014-06-03

tags:

removed: kernel-key

Revision history for this message

Stefan Bader (smb) wrote on 2014-06-04:

#18

Deeper inspection of the logs looks like the problem is some connection attempt when xprt is not connected. Part of that procedure is to re-use the connection which forces the xprt to disconnect (so the socket can be re-used). This triggers a state change (TCP_CLOSE) and wakes up the task waiting for the connection. But the connection state then in INPROGRESS which somehow gets translated into EGAIN and that triggers call_bind which repeats the re-use of socket process.

With that lead, I found two commits upstream referring to this commit that introduces that behaviour:

* 561ec1603171 (SUNRPC: call_connect_status should recheck bind..)

The two fixes related to that are:

* 1fa3e2e SUNRPC: Ensure call_connect_status() deals correctly with SOFTCONN tasks
* 485f225 SUNRPC: Ensure that call_connect times out correctly

The latter would at least cause timeouts to be re-adjusted before looping back into call_bind. So it might be worth trying those. I build a trusty kernel with those two patches added. The debs are at http://people.canonical.com/~smb/lp1322407/
Could you install those on the server side and see whether this helps with the problem?

Revision history for this message

Mark Haidekker (mhaidekk) wrote on 2014-06-05:

#19

Preliminary results: This fixed it.

I say "preliminary", because we tested it only for a short moment. After some heavy Trados use yesterday, the NFS kernel daemon "hung" again (rpc timeout, probably choked on its own connect/reconnect state queue), I installed the kernel packages you prepared. We briefly ran Trados and the runaway kworker no longer appeared.

I would like to observe this a bit more and report in a few days.

Revision history for this message

Mark Haidekker (mhaidekk) wrote on 2014-06-11:

#20

FIX CONFIRMED. We have been operating with the new NFS server for several days, and the runaway kworker has not reappeared. Not even once. The NFS daemon has never "hung" during these days.

I'd say this is it. Thanks, Stefan.

Revision history for this message

Stefan Bader (smb) wrote on 2014-06-12:

#21

Thanks Mark. I will propose those for 14.04 (Trusty) then.

Changed in linux (Ubuntu Trusty):
assignee:	nobody → Stefan Bader (smb)
importance:	Undecided → High
status:	New → In Progress
Changed in linux (Ubuntu):
status:	Confirmed → Fix Released

Revision history for this message

Stefan Bader (smb) wrote on 2014-06-12:

#22

The fix-released for the development kernel is based on both patches being in the 3.14.4 upstream stable tree.

Revision history for this message

Gordon Dracup (gordon-dracup) wrote on 2014-06-24:

#23

I appear to still having this issue, despite the fix being release. 3.13.0-29-generic

Happy to provide more details.

Revision history for this message

Gordon Dracup (gordon-dracup) wrote on 2014-06-24:

#24

Problem exists on 3.13.0-29-generic. Clean 14.04 server install. Export of root sub folder. NFSv4. Client 12.04 mount. Kworker 100% every time the folder is accessed. Testing on a single server and single client.

Revision history for this message

Dilshod (dilshod-z) wrote on 2014-06-24:

#25

Linux 3.13.0-29-generic #53-Ubuntu SMP Wed Jun 4 21:00:20 UTC 2014 x86_64 x86_64 x86_64 GNU/Linux

I have this problem. Any suggestion how to fix it? Do this require upgrading to newer version of kernel?

Revision history for this message

davidwca (davidwca) wrote on 2014-06-25:

#26

Any confirmed release date for the fixed kernel? I'm using Stefan's custom build which fixes the problem.

Revision history for this message

Gordon Dracup (gordon-dracup) wrote on 2014-06-25:

#27

FIX CONFIRMED. I can confirm that Stefan's fix in #18 resolves my issue, and is unrelated to Trados.

Many thanks Stefan.

Revision history for this message

Stefan Bader (smb) wrote on 2014-06-25:

#28

It might be a bit unclear but the main task is "fix released" because Utopic (14.10 and current trunk) is based on 3.15 right now (will move to 3.16 before release). So Utopic is ok. For Trusty this is still in the mill. The patches have references to this report. So you should see an automatic post here when a kernel with the fixes hits proposed.
Since I found both patches in the git repo/branch used to prepare the next update, I mark the Trusty task as "fix committed". That still does not mean there is a kernel to test officially. Just that it is in the repository.

Changed in linux (Ubuntu Trusty):
status:	In Progress → Fix Committed

Revision history for this message

Luis Henriques (henrix) wrote on 2014-07-10:

#29

This bug is awaiting verification that the kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-trusty' to 'verification-done-trusty'.

If verification is not done by 5 working days from today, this fix will be dropped from the source code, and this bug will be closed.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you!

tags:

added: verification-needed-trusty

Revision history for this message

Daniele Viganò (daniele-vigano) wrote on 2014-07-10:

#30

I have the same issue on Ubuntu 12.04 LTS using Trusty kernel:

Linux version 3.13.0-29-generic (buildd@roseapple) (gcc version 4.6.3 (Ubuntu/Linaro 4.6.3-1ubuntu5) ) #53~precise1-Ubuntu SMP Wed Jun 4 22:06:25 UTC 2014

Distributor ID: Ubuntu
Description: Ubuntu 12.04.4 LTS
Release: 12.04
Codename: precise

Revision history for this message

Andrew G Meyer (andrewgmeyer) wrote on 2014-07-11:

#31

Using henrix's -proposed kernel appears to resolve the issue for me.

Under 3.13.0-32-generic #56-Ubuntu, Kworker is no longer running away, whereas before under *.30-generic #55 it was.

My setup is a server with a nfs client on a KVM VM coming in over openvswitch. Restarting the NFS client VM seemed to hang the NFS daemon on the server, and Kworker was running away.

I replicated that same scenario, and everything appears to be working correctly now. The only difference was that there was a significant amount of NFS activity that I didn't recreate, but as I said, everything appears normal now.

Revision history for this message

Andrew G Meyer (andrewgmeyer) wrote on 2014-07-11:

#32

tags:added: verification-done-trusty

tags:

added: verification-done-trusty
removed: verification-needed-trusty

Revision history for this message

Mark Haidekker (mhaidekk) wrote on 2014-07-11:

#33

Thanks, Andrew, for testing and for updating the tags. I have just tested the -proposed kernel and can confirm the fix, too. You beat me by an hour or so in adding verification-done-trusty.

Mark

Revision history for this message

Gordon Dracup (gordon-dracup) wrote on 2014-07-13:

#34

Unbutu Server 14.04 Linux Package versions Edit (75.5 KiB, image/png)

I confirm also that the proposed kernel changes #56 fixes this issue. Problem still occurs with #55. See attached package versions which work. Sorry, took me ages to work out how to install proposed kernel changes to test this. :-)

Revision history for this message

Launchpad Janitor (janitor) wrote on 2014-07-16:

#35

Download full text (35.8 KiB)

This bug was fixed in the package linux - 3.13.0-32.57

---------------
linux (3.13.0-32.57) trusty; urgency=low

[ Upstream Kernel Changes ]

  * l2tp: Privilege escalation in ppp over l2tp sockets
    - LP: #1341472
    - CVE-2014-4943

linux (3.13.0-32.56) trusty; urgency=low

[ Luis Henriques ]

  * Merged back Ubuntu-3.13.0-30.55 security release
  * Revert "x86_64,ptrace: Enforce RIP <= TASK_SIZE_MAX (CVE-2014-4699)"
    - LP: #1337339
  * Release Tracking Bug
    - LP: #1338524

[ Upstream Kernel Changes ]

  * ptrace,x86: force IRET path after a ptrace_stop()
    - LP: #1337339
    - CVE-2014-4699
  * hpsa: add new Smart Array PCI IDs (May 2014)
    - LP: #1337516

linux (3.13.0-31.55) trusty; urgency=low

[ Luis Henriques ]

* Release Tracking Bug
- LP: #1336278

[ Andy Whitcroft ]

  * [Config] switch hyper-keyboard to virtual
    - LP: #1325306
  * [Packaging] linux-udeb-flavour -- standardise on linux prefix

[ dann frazier ]

* [Config] CONFIG_GPIO_DWAPB=m
- LP: #1334823

[ Feng Kan ]

  * SAUCE: (no-up) arm64: dts: Add Designware GPIO dts binding to APM
    X-Gene platform
    - LP: #1334823

[ John Johansen ]

* SAUCE: (no-up) apparmor: fix apparmor spams log with warning message
- LP: #1308761

[ Kamal Mostafa ]

* [Config] updateconfigs ACPI_PROCFS_POWER=y after v3.13.11.4 rebase

[ Loc Ho ]

* SAUCE: (no-up) phy-xgene: Use correct tuning for Mustang
- LP: #1335636

[ Michael Ellerman ]

  * SAUCE: (no-up) powerpc/perf: Ensure all EBB register state is cleared
    on fork()
    - LP: #1328914

[ Ming Lei ]

* Revert "SAUCE: (no-up) rtc: Add X-Gene SoC Real Time Clock Driver"
- LP: #1274305

[ Suman Tripathi ]

  * SAUCE: (no-up) libahci: Implement the function ahci_restart_engine to
    restart the port dma engine.
    - LP: #1335645
  * SAUCE: (no-up) ata: Fix the dma state machine lockup for the IDENTIFY
    DEVICE PIO mode command.
    - LP: #1335645

[ Tim Gardner ]

  * [Config] CONFIG_POWERNV_CPUFREQ=y for powerpc, ppc64el
    - LP: #1324571
  * [Debian] Add UTS_UBUNTU_RELEASE_ABI to utsrelease.h
    - LP: #1327619
  * [Config] CONFIG_HAVE_MEMORYLESS_NODES=y
    - LP: #1332063
  * [Config] CONFIG_HID_RMI=m
    - LP: #1305522

[ Upstream Kernel Changes ]

  * Revert "offb: Add palette hack for little endian"
    - LP: #1333430
  * Revert "net: mvneta: fix usage as a module on RGMII configurations"
    - LP: #1333837
  * Revert "USB: serial: add usbid for dell wwan card to sierra.c"
    - LP: #1333837
  * Revert "macvlan : fix checksums error when we are in bridge mode"
    - LP: #1333838
  * serial: uart: add hw flow control support configuration
    - LP: #1328295
  * mm/numa: Remove BUG_ON() in __handle_mm_fault()
    - LP: #1323165
  * Tools: hv: Handle the case when the target file exists correctly
    - LP: #1306215
  * Documentation/devicetree/bindings: add documentation for the APM X-Gene
    SoC RTC DTS binding
    - LP: #1274305
  * drivers/rtc: add APM X-Gene SoC RTC driver
    - LP: #1274305
  * arm64: add APM X-Gene SoC RTC DTS entry
    - LP: #1274305
  * powerpc/perf: Add Power8 cache & TLB events
    - LP: #1328914
  * powerpc/perf: Configure BH...

This bug was fixed in the package linux - 3.13.0-32.57

---------------
linux (3.13.0-32.57) trusty; urgency=low

[ Upstream Kernel Changes ]

* l2tp: Privilege escalation in ppp over l2tp sockets
    - LP: #1341472
    - CVE-2014-4943

linux (3.13.0-32.56) trusty; urgency=low

[ Luis Henriques ]

* Merged back Ubuntu-3.13.0-30.55 security release
  * Revert "x86_64,ptrace: Enforce RIP <= TASK_SIZE_MAX (CVE-2014-4699)"
    - LP: #1337339
  * Release Tracking Bug
    - LP: #1338524

[ Upstream Kernel Changes ]

* ptrace,x86: force IRET path after a ptrace_stop()
    - LP: #1337339
    - CVE-2014-4699
  * hpsa: add new Smart Array PCI IDs (May 2014)
    - LP: #1337516

linux (3.13.0-31.55) trusty; urgency=low

[ Luis Henriques ]

* Release Tracking Bug
    - LP: #1336278

[ Andy Whitcroft ]

* [Config] switch hyper-keyboard to virtual
    - LP: #1325306
  * [Packaging] linux-udeb-flavour -- standardise on linux prefix

[ dann frazier ]

* [Config] CONFIG_GPIO_DWAPB=m
    - LP: #1334823

[ Feng Kan ]

* SAUCE: (no-up) arm64: dts: Add Designware GPIO dts binding to APM
    X-Gene platform
    - LP: #1334823

[ John Johansen ]

* SAUCE: (no-up) apparmor: fix apparmor spams log with warning message
    - LP: #1308761

[ Kamal Mostafa ]

* [Config] updateconfigs ACPI_PROCFS_POWER=y after v3.13.11.4 rebase

[ Loc Ho ]

* SAUCE: (no-up) phy-xgene: Use correct tuning for Mustang
    - LP: #1335636

[ Michael Ellerman ]

* SAUCE: (no-up) powerpc/perf: Ensure all EBB register state is cleared
    on fork()
    - LP: #1328914

[ Ming Lei ]

* Revert "SAUCE: (no-up) rtc: Add X-Gene SoC Real Time Clock Driver"
    - LP: #1274305

[ Suman Tripathi ]

* SAUCE: (no-up) libahci: Implement the function ahci_restart_engine to
    restart the port dma engine.
    - LP: #1335645
  * SAUCE: (no-up) ata: Fix the dma state machine lockup for the IDENTIFY
    DEVICE PIO mode command.
    - LP: #1335645

[ Tim Gardner ]

* [Config] CONFIG_POWERNV_CPUFREQ=y for powerpc, ppc64el
    - LP: #1324571
  * [Debian] Add UTS_UBUNTU_RELEASE_ABI to utsrelease.h
    - LP: #1327619
  * [Config] CONFIG_HAVE_MEMORYLESS_NODES=y
    - LP: #1332063
  * [Config] CONFIG_HID_RMI=m
    - LP: #1305522

[ Upstream Kernel Changes ]

* Revert "offb: Add palette hack for little endian"
    - LP: #1333430
  * Revert "net: mvneta: fix usage as a module on RGMII configurations"
    - LP: #1333837
  * Revert "USB: serial: add usbid for dell wwan card to sierra.c"
    - LP: #1333837
  * Revert "macvlan : fix checksums error when we are in bridge mode"
    - LP: #1333838
  * serial: uart: add hw flow control support configuration
    - LP: #1328295
  * mm/numa: Remove BUG_ON() in __handle_mm_fault()
    - LP: #1323165
  * Tools: hv: Handle the case when the target file exists correctly
    - LP: #1306215
  * Documentation/devicetree/bindings: add documentation for the APM X-Gene
    SoC RTC DTS binding
    - LP: #1274305
  * drivers/rtc: add APM X-Gene SoC RTC driver
    - LP: #1274305
  * arm64: add APM X-Gene SoC RTC DTS entry
    - LP: #1274305
  * powerpc/perf: Add Power8 cache & TLB events
    - LP: #1328914
  * powerpc/perf: Configure BHRB filter before enabling PMU interrupts
    - LP: #1328914
  * powerpc/perf: Define perf_event_print_debug() to print PMU register
    values
    - LP: #1328914
  * powerpc: Add a cpu feature CPU_FTR_PMAO_BUG
    - LP: #1328914
  * powerpc/perf: Add lost exception workaround
    - LP: #1328914
  * powerpc/perf: Reject EBB events which specify a sample_type
    - LP: #1328914
  * powerpc/perf: Clean up the EBB hash defines a little
    - LP: #1328914
  * powerpc/perf: Avoid mutating event in power8_get_constraint()
    - LP: #1328914
  * powerpc/perf: Add BHRB constraint and IFM MMCRA handling for EBB
    - LP: #1328914
  * powerpc/perf: Enable BHRB access for EBB events
    - LP: #1328914
  * powerpc/perf: Fix handling of L3 events with bank == 1
    - LP: #1328914
  * Bluetooth: Add support for Intel Bluetooth device [8087:0a2a]
    - LP: #1329184
  * iwlwifi: mvm: disable beacon filtering
    - LP: #1293569
  * SUNRPC: Ensure that call_connect times out correctly
    - LP: #1322407
  * SUNRPC: Ensure call_connect_status() deals correctly with SOFTCONN
    tasks
    - LP: #1322407
  * bitops: Fix signedness of compile-time hweight implementations
    - LP: #1321791
  * cpumask.h: silence warning with -Wsign-compare
    - LP: #1321791
  * fbdev/fb.h: silence warning with -Wsign-compare
    - LP: #1321791
  * rtlwifi: make MSI support a module parameter
    - LP: #1296591
  * rtlwifi: rtl8188ee: add msi module parameter
    - LP: #1296591
  * rtlwifi: rtl8723be: add msi module parameter
    - LP: #1296591
  * net: avoid dependency of net_get_random_once on nop patching
    - LP: #1330671
  * x86-64, modify_ldt: Make support for 16-bit segments a runtime option
    - LP: #1328965
  * ALSA: usb-audio: Prevent printk ratelimiting from spamming kernel log
    while DEBUG not defined
    - LP: #1319457
  * btrfs: fix defrag 32-bit integer overflow
    - LP: #1324953
  * dell-laptop: Only install the i8042 filter when rfkill is active
    - LP: #1289238
  * kthread: ensure locality of task_struct allocations
    - LP: #1332063
  * slub: search partial list on numa_mem_id(), instead of numa_node_id()
    - LP: #1332063
  * powerpc/numa: Enable USE_PERCPU_NUMA_NODE_ID
    - LP: #1332063
  * powerpc/numa: Enable CONFIG_HAVE_MEMORYLESS_NODES
    - LP: #1332063
  * drm/i915: Allow user modes to exceed DVI 165MHz limit
    - LP: #1332220
  * HID: rmi: introduce RMI driver for Synaptics touchpads
    - LP: #1305522
  * HID: rmi: do not stop the device at the end of probe
    - LP: #1305522
  * HID: rmi: check for the existence of some optional queries before
    reading query 12
    - LP: #1305522
  * HID: rmi: do not fetch more than 16 bytes in a query
    - LP: #1305522
  * HID: rmi: fix wrong struct field name
    - LP: #1305522
  * HID: rmi: fix masks for x and w_x data
    - LP: #1305522
  * HID: rmi: do not handle touchscreens through hid-rmi
    - LP: #1305522
  * ipv6: Fix regression caused by efe4208 in udp_v6_mcast_next()
    - LP: #1332420
  * HID: core: do not scan constant input report
    - LP: #1333837
  * drm/radeon: fix audio pin counts for DCE6+ (v2)
    - LP: #1333837
  * mac80211: fix software remain-on-channel implementation
    - LP: #1333837
  * mac80211: exclude AP_VLAN interfaces from tx power calculation
    - LP: #1333837
  * iwlwifi: add new 7265 HW IDs
    - LP: #1333837
  * parisc: fix epoll_pwait syscall on compat kernel
    - LP: #1333837
  * iwlwifi: add MODULE_FIRMWARE for 7265
    - LP: #1333837
  * dma: edma: fix incorrect SG list handling
    - LP: #1333837
  * ALSA: hda/realtek - Add support of ALC288 codec
    - LP: #1333837
  * xen/spinlock: Don't enable them unconditionally.
    - LP: #1333837
  * tick-common: Fix wrong check in tick_check_replacement()
    - LP: #1333837
  * tick-sched: Check tick_nohz_enabled in tick_nohz_switch_to_nohz()
    - LP: #1333837
  * ALSA: hda - add headset mic detect quirk for a Dell laptop
    - LP: #1297581, #1333837
  * ALSA: hda/realtek - Add headset Mic support for Dell machine
    - LP: #1333837
  * staging: r8188eu: Calling rtw_get_stainfo() with a NULL sta_addr will
    return NULL
    - LP: #1333837
  * cifs: Wait for writebacks to complete before attempting write.
    - LP: #1333837
  * mlx4_en: don't use napi_synchronize inside mlx4_en_netpoll
    - LP: #1333837
  * mei: ignore client writing state during cb completion
    - LP: #1333837
  * staging: r8712u: Fix case where ethtype was never obtained and always
    be checked against 0
    - LP: #1333837
  * staging: r8188eu: Fix case where ethtype was never obtained and always
    be checked against 0
    - LP: #1333837
  * USB: serial: ftdi_sio: add id for Brainboxes serial cards
    - LP: #1333837
  * usb: option driver, add support for Telit UE910v2
    - LP: #1333837
  * USB: cp210x: Add 8281 (Nanotec Plug & Drive)
    - LP: #1333837
  * USB: pl2303: add ids for Hewlett-Packard HP POS pole displays
    - LP: #1333837
  * USB: usb_wwan: fix handling of missing bulk endpoints
    - LP: #1333837
  * USB: fix crash during hotplug of PCI USB controller card
    - LP: #1333837
  * USB: cdc-acm: Remove Motorola/Telit H24 serial interfaces from ACM
    driver
    - LP: #1333837
  * Drivers: hv: vmbus: Negotiate version 3.0 when running on ws2012r2
    hosts
    - LP: #1333837
  * serial: omap: Fix missing pm_runtime_resume handling by simplifying
    code
    - LP: #1333837
  * drm/radeon: disable mclk dpm on R7 260X
    - LP: #1333837
  * drm/radeon: fix runpm handling on APUs (v4)
    - LP: #1333837
  * drm/radeon: add support for newer mc ucode on SI (v2)
    - LP: #1333837
  * drm/radeon: add support for newer mc ucode on CI (v2)
    - LP: #1333837
  * drm/radeon: re-enable mclk dpm on R7 260X asics
    - LP: #1333837
  * drm/radeon: memory leak on bo reservation failure. v2
    - LP: #1333837
  * drm/radeon/si: make sure mc ucode is loaded before checking the size
    - LP: #1333837
  * drm/radeon/ci: make sure mc ucode is loaded before checking the size
    - LP: #1333837
  * init/Kconfig: move the trusted keyring config option to general setup
    - LP: #1333837
  * mm/hugetlb.c: add cond_resched_lock() in return_unused_surplus_pages()
    - LP: #1333837
  * thp: close race between split and zap huge pages
    - LP: #1333837
  * coredump: fix va_list corruption
    - LP: #1333837
  * powerpc/tm: Disable IRQ in tm_recheckpoint
    - LP: #1333837
  * ACPI / EC: Process rather than discard events in acpi_ec_clear
    - LP: #1333837
  * ath9k: Fix sequence number assignment for non-data frames
    - LP: #1333837
  * xhci: Switch Intel Lynx Point ports to EHCI on shutdown.
    - LP: #1333837
  * iio: adc: at91_adc: Repair broken platform_data support
    - LP: #1333837
  * iio: querying buffer scan_mask should return 0/1
    - LP: #1333837
  * iio: cm36651: Fix i2c client leak and possible NULL pointer dereference
    - LP: #1333837
  * libata: Update queued trim blacklist for M5x0 drives
    - LP: #1333837
  * pata_at91: fix ata_host_activate() failure handling
    - LP: #1333837
  * ext4: avoid possible overflow in ext4_map_blocks()
    - LP: #1333837
  * ext4: FIBMAP ioctl causes BUG_ON due to handle EXT_MAX_BLOCKS
    - LP: #1333837
  * ext4: note the error in ext4_end_bio()
    - LP: #1333837
  * ext4: fix jbd2 warning under heavy xattr load
    - LP: #1333837
  * ext4: move ext4_update_i_disksize() into mpage_map_and_submit_extent()
    - LP: #1333837
  * ext4: use i_size_read in ext4_unaligned_aio()
    - LP: #1333837
  * locks: allow __break_lease to sleep even when break_time is 0
    - LP: #1333837
  * usb: gadget: zero: Fix SuperSpeed enumeration for alternate setting 1
    - LP: #1333837
  * ahci: do not request irq for dummy port
    - LP: #1333837
  * genirq: Allow forcing cpu affinity of interrupts
    - LP: #1333837
  * irqchip: Gic: Support forced affinity setting
    - LP: #1333837
  * clocksource: Exynos_mct: Use irq_force_affinity() in cpu bringup
    - LP: #1333837
  * clocksource: Exynos_mct: Register clock event after request_irq()
    - LP: #1333837
  * nfsd: set timeparms.to_maxval in setup_callback_client
    - LP: #1333837
  * ahci: Do not receive interrupts sent by dummy ports
    - LP: #1333837
  * libata/ahci: accommodate tag ordered controllers
    - LP: #1333837
  * drm/radeon: disable dpm on rv770 by default
    - LP: #1333837
  * Input: synaptics - add min/max quirk for ThinkPad T431s, L440, L540, S1
    Yoga and X1
    - LP: #1333837
  * drm/radeon: fix count in cik_sdma_ring_test()
    - LP: #1333837
  * drm/radeon: properly unregister hwmon interface (v2)
    - LP: #1333837
  * drm/radeon/pm: don't walk the crtc list before it has been initialized
    (v2)
    - LP: #1333837
  * drm/radeon: fix ATPX detection on non-VGA GPUs
    - LP: #1333837
  * drm/radeon: don't allow runpm=1 on systems with out ATPX
    - LP: #1333837
  * mm: make fixup_user_fault() check the vma access rights too
    - LP: #1333837
  * ARM: 8027/1: fix do_div() bug in big-endian systems
    - LP: #1333837
  * ARM: 8030/1: ARM : kdump : add arch_crash_save_vmcoreinfo
    - LP: #1333837
  * ARM: pxa: hx4700.h: include "irqs.h" for PXA_NR_BUILTIN_GPIO
    - LP: #1333837
  * ARM: tegra: remove UART5/UARTE from tegra124.dtsi
    - LP: #1333837
  * USB: serial: fix sysfs-attribute removal deadlock
    - LP: #1333837
  * 8250_core: Fix unwanted TX chars write
    - LP: #1333837
  * serial: 8250: Fix thread unsafe __dma_tx_complete function
    - LP: #1333837
  * Btrfs: fix inode caching vs tree log
    - LP: #1333837
  * xhci: For streams the css flag most be read from the stream-ctx on ep
    stop
    - LP: #1333837
  * usb: xhci: Prefer endpoint context dequeue pointer over stopped_trb
    - LP: #1333837
  * USB: io_ti: fix firmware download on big-endian machines
    - LP: #1333837
  * usb: qcserial: add Sierra Wireless EM7355
    - LP: #1333837
  * usb: qcserial: add Sierra Wireless MC73xx
    - LP: #1333837
  * usb: qcserial: add Sierra Wireless MC7305/MC7355
    - LP: #1333837
  * usb: option: add Olivetti Olicard 500
    - LP: #1333837
  * usb: option: add Alcatel L800MA
    - LP: #1333837
  * usb: option: add and update a number of CMOTech devices
    - LP: #1333837
  * word-at-a-time: avoid undefined behaviour in zero_bytemask macro
    - LP: #1333837
  * s390/chsc: fix SEI usage on old FW levels
    - LP: #1333837
  * irqchip: armada-370-xp: fix invalid cast of signed value into unsigned
    variable
    - LP: #1333837
  * irqchip: armada-370-xp: implement the ->check_device() msi_chip
    operation
    - LP: #1333837
  * irqchip: armada-370-xp: Fix releasing of MSIs
    - LP: #1333837
  * ASoC: dapm: Fix widget double free with auto-disable DAPM kcontrol
    - LP: #1333837
  * drm/i915: Don't check gmch state on inherited configs
    - LP: #1333837
  * drm/vmwgfx: Make sure user-space can't DMA across buffer object
    boundaries v2
    - LP: #1333837
  * of/irq: do irq resolution in platform_get_irq
    - LP: #1333837
  * s390/bpf,jit: initialize A register if 1st insn is BPF_S_LDX_B_MSH
    - LP: #1333837
  * drm/i915: Don't WARN nor handle unexpected hpd interrupts on gmch
    platforms
    - LP: #1333837
  * module: remove warning about waiting module removal.
    - LP: #1333837
  * ALSA: hda - add headset mic detect quirk for a Dell laptop
    - LP: #1297581, #1333837
  * arm: KVM: fix possible misalignment of PGDs and bounce page
    - LP: #1333837
  * KVM: ARM: vgic: Fix sgi dispatch problem
    - LP: #1333837
  * KVM: async_pf: mm->mm_users can not pin apf->mm
    - LP: #1333837
  * ftrace/module: Hardcode ftrace_module_init() call into load_module()
    - LP: #1333837
  * [SCSI] mpt2sas: Don't disable device twice at suspend.
    - LP: #1333837
  * [SCSI] virtio-scsi: Skip setting affinity on uninitialized vq
    - LP: #1333837
  * drivercore: deferral race condition fix
    - LP: #1333837
  * hrtimer: Prevent all reprogramming if hang detected
    - LP: #1333837
  * hrtimer: Prevent remote enqueue of leftmost timers
    - LP: #1333837
  * timer: Prevent overflow in apply_slack
    - LP: #1333837
  * ARC: !PREEMPT: Ensure Return to kernel mode is IRQ safe
    - LP: #1333837
  * aio: fix potential leak in aio_run_iocb().
    - LP: #1333837
  * dm cache: fix writethrough mode quiescing in cache_map
    - LP: #1333837
  * fix races between __d_instantiate() and checks of dentry flags
    - LP: #1333837
  * net: Fix ns_capable check in sock_diag_put_filterinfo
    - LP: #1333837
  * rt2x00: fix beaconing on USB
    - LP: #1333837
  * rtlwifi: rtl8188ee: initialize packet_beacon
    - LP: #1333837
  * Input: synaptics - add min/max quirk for ThinkPad Edge E431
    - LP: #1333837
  * Input: atkbd - fix keyboard not working on some LG laptops
    - LP: #1333837
  * Bluetooth: Fix triggering BR/EDR L2CAP Connect too early
    - LP: #1333837
  * drm/i915: Break encoder->crtc link separately in intel_sanitize_crtc()
    - LP: #1333837
  * iio:imu:mpu6050: Fixed segfault in Invensens MPU driver due to null
    dereference
    - LP: #1333837
  * ALSA: hda - add headset mic detect quirk for a Dell laptop
    - LP: #1297581, #1333837
  * rtlwifi: rtl8192se: Fix regression due to commit 1bf4bbb
    - LP: #1333837
  * rtl8192cu: Fix unbalanced irq enable in error path of rtl92cu_hw_init()
    - LP: #1333837
  * drm/radeon/uvd: use lower clocks on old UVD to boot v2
    - LP: #1333837
  * drm/radeon: use pflip irq on R600+ v2
    - LP: #1333837
  * drm/radeon: check buffer relocation offset
    - LP: #1333837
  * drm/nouveau/acpi: allow non-optimus setups to load vbios from acpi
    - LP: #1333837
  * drm/nouveau: fix another lock unbalance in nouveau_crtc_page_flip
    - LP: #1333837
  * ALSA: usb-audio: work around corrupted TEAC UD-H01 feedback data
    - LP: #1333837
  * USB: OHCI: fix problem with global suspend on ATI controllers
    - LP: #1333837
  * usb: qcserial: add a number of Dell devices
    - LP: #1333837
  * usb: storage: shuttle_usbat: fix discs being detected twice
    - LP: #1333837
  * fsl-usb: do not test for PHY_CLK_VALID bit on controller version 1.6
    - LP: #1333837
  * tty: serial: 8250_core.c Bug fix for Exar chips.
    - LP: #1333837
  * drivers/tty/hvc: don't free hvc_console_setup after init
    - LP: #1333837
  * tty: Fix lockless tty buffer race
    - LP: #1333837
  * USB: Nokia 305 should be treated as unusual dev
    - LP: #1333837
  * USB: Nokia 5300 should be treated as unusual dev
    - LP: #1333837
  * HID: add NO_INIT_REPORTS quirk for Synaptics Touch Pad V 103S
    - LP: #1333837
  * ALSA: hda - hdmi: Set converter channel count even without sink
    - LP: #1333837
  * Input: elantech - fix touchpad initialization on Gigabyte U2442
    - LP: #1333837
  * posix_acl: handle NULL ACL in posix_acl_equiv_mode
    - LP: #1333837
  * mm/page-writeback.c: fix divide by zero in pos_ratio_polynom
    - LP: #1333837
  * mm: compaction: detect when scanners meet in isolate_freepages
    - LP: #1333837
  * mm/compaction: make isolate_freepages start at pageblock boundary
    - LP: #1333837
  * autofs: fix lockref lookup
    - LP: #1333837
  * libata: Blacklist queued trim for Crucial M500
    - LP: #1333837
  * Linux 3.13.11.3
    - LP: #1333837
  * net: sctp: wake up all assocs if sndbuf policy is per socket
    - LP: #1333838
  * net: sctp: test if association is dead in sctp_wake_up_waiters
    - LP: #1333838
  * l2tp: take PMTU from tunnel UDP socket
    - LP: #1333838
  * net: core: don't account for udp header size when computing seglen
    - LP: #1333838
  * bonding: Remove debug_fs files when module init fails
    - LP: #1333838
  * bridge: Fix double free and memory leak around br_allowed_ingress
    - LP: #1333838
  * ipv6: Limit mtu to 65575 bytes
    - LP: #1333838
  * gre: don't allow to add the same tunnel twice
    - LP: #1333838
  * vti: don't allow to add the same tunnel twice
    - LP: #1333838
  * ipv4: return valid RTA_IIF on ip route get
    - LP: #1333838
  * filter: prevent nla extensions to peek beyond the end of the message
    - LP: #1333838
  * ip6_gre: don't allow to remove the fb_tunnel_dev
    - LP: #1333838
  * vlan: Fix lockdep warning when vlan dev handle notification
    - LP: #1333838
  * net: Find the nesting level of a given device by type.
    - LP: #1333838
  * net: Allow for more then a single subclass for netif_addr_lock
    - LP: #1333838
  * vlan: Fix lockdep warning with stacked vlan devices.
    - LP: #1333838
  * macvlan: Fix lockdep warnings with stacked macvlan devices
    - LP: #1333838
  * tg3: update rx_jumbo_pending ring param only when jumbo frames are
    enabled
    - LP: #1333838
  * net: sctp: cache auth_enable per endpoint
    - LP: #1333838
  * rtnetlink: Warn when interface's information won't fit in our packet
    - LP: #1333838
  * rtnetlink: Only supply IFLA_VF_PORTS information when RTEXT_FILTER_VF
    is set
    - LP: #1333838
  * ipv6: fib: fix fib dump restart
    - LP: #1333838
  * bridge: Handle IFLA_ADDRESS correctly when creating bridge device
    - LP: #1333838
  * sctp: reset flowi4_oif parameter on route lookup
    - LP: #1333838
  * net: qmi_wwan: add Sierra Wireless EM7355
    - LP: #1333838
  * net: qmi_wwan: add Sierra Wireless MC73xx
    - LP: #1333838
  * net: qmi_wwan: add Sierra Wireless MC7305/MC7355
    - LP: #1333838
  * net: qmi_wwan: add Olivetti Olicard 500
    - LP: #1333838
  * net: qmi_wwan: add Alcatel L800MA
    - LP: #1333838
  * net: qmi_wwan: add a number of CMOTech devices
    - LP: #1333838
  * net: qmi_wwan: add a number of Dell devices
    - LP: #1333838
  * slip: fix spinlock variant
    - LP: #1333838
  * net: sctp: Potentially-Failed state should not be reached from
    unconfirmed state
    - LP: #1333838
  * net: sctp: Don't transition to PF state when transport has exhausted
    'Path.Max.Retrans'.
    - LP: #1333838
  * mactap: Fix checksum errors for non-gso packets in bridge mode
    - LP: #1333838
  * tcp_cubic: fix the range of delayed_ack
    - LP: #1333838
  * vsock: Make transport the proto owner
    - LP: #1333838
  * net: cdc_ncm: fix buffer overflow
    - LP: #1333838
  * ip_tunnel: Set network header properly for IP_ECN_decapsulate()
    - LP: #1333838
  * net: cdc_mbim: __vlan_find_dev_deep need rcu_read_lock
    - LP: #1333838
  * net: ipv4: ip_forward: fix inverted local_df test
    - LP: #1333838
  * net: ipv6: send pkttoobig immediately if orig frag size > mtu
    - LP: #1333838
  * ipv4: fib_semantics: increment fib_info_cnt after fib_info allocation
    - LP: #1333838
  * net: cdc_mbim: handle unaccelerated VLAN tagged frames
    - LP: #1333838
  * macvlan: Don't propagate IFF_ALLMULTI changes on down interfaces.
    - LP: #1333838
  * sfc: fix calling of free_irq with already free vector
    - LP: #1333838
  * ip6_tunnel: fix potential NULL pointer dereference
    - LP: #1333838
  * net: filter: x86: fix JIT address randomization
    - LP: #1333838
  * net: filter: s390: fix JIT address randomization
    - LP: #1333838
  * ipv6: fix calculation of option len in ip6_append_data
    - LP: #1333838
  * rtnetlink: wait for unregistering devices in rtnl_link_unregister()
    - LP: #1333838
  * net: gro: make sure skb->cb[] initial content has not to be zero
    - LP: #1333838
  * batman-adv: fix reference counting imbalance while sending fragment
    - LP: #1333838
  * batman-adv: increase orig refcount when storing ref in gw_node
    - LP: #1333838
  * batman-adv: fix local TT check for outgoing arp requests in DAT
    - LP: #1333838
  * ip_tunnel: Initialize the fallback device properly
    - LP: #1333838
  * ipv4: initialise the itag variable in __mkroute_input
    - LP: #1333838
  * net-gro: reset skb->truesize in napi_reuse_skb()
    - LP: #1333838
  * netfilter: ipv4: defrag: set local_df flag on defragmented skb
    - LP: #1333838
  * ima: introduce ima_kernel_read()
    - LP: #1333838
  * ima: audit log files opened with O_DIRECT flag
    - LP: #1333838
  * percpu: make pcpu_alloc_chunk() use pcpu_mem_free() instead of kfree()
    - LP: #1333838
  * workqueue: fix bugs in wq_update_unbound_numa() failure path
    - LP: #1333838
  * [media] fc2580: fix tuning failure on 32-bit arch
    - LP: #1333838
  * memory: mvebu-devbus: fix the conversion of the bus width
    - LP: #1333838
  * ARM: orion5x: fix target ID for crypto SRAM window
    - LP: #1333838
  * workqueue: make rescuer_thread() empty wq->maydays list before exiting
    - LP: #1333838
  * workqueue: fix a possible race condition between rescuer and
    pwq-release
    - LP: #1333838
  * spi: core: Ignore unsupported Dual/Quad Transfer Mode bits
    - LP: #1333838
  * device_cgroup: rework device access check and exception checking
    - LP: #1333838
  * PCI: mvebu: fix off-by-one in the computed size of the mbus windows
    - LP: #1333838
  * bus: mvebu-mbus: allow several windows with the same target/attribute
    - LP: #1333838
  * PCI: mvebu: split PCIe BARs into multiple MBus windows when needed
    - LP: #1333838
  * ARM: mvebu: fix NOR bus-width in Armada XP GP Device Tree
    - LP: #1333838
  * ARM: mvebu: fix NOR bus-width in Armada XP DB Device Tree
    - LP: #1333838
  * ARM: mvebu: fix NOR bus-width in Armada XP OpenBlocks AX3 Device Tree
    - LP: #1333838
  * crypto: caam - add allocation failure handling in SPRINTFCAT macro
    - LP: #1333838
  * ARM: common: edma: Fix xbar mapping
    - LP: #1333838
  * clk: Fix double free due to devm_clk_register()
    - LP: #1333838
  * [media] media-device: fix infoleak in ioctl media_enum_entities()
    - LP: #1333838
  * ARM: dts: kirkwood: fix mislocated pcie-controller nodes
    - LP: #1333838
  * device_cgroup: check if exception removal is allowed
    - LP: #1333838
  * md: avoid possible spinning md thread at shutdown.
    - LP: #1333838
  * ACPI: Remove Kconfig symbol ACPI_PROCFS
    - LP: #1333838
  * ACPI: Revert "ACPI: Remove CONFIG_ACPI_PROCFS_POWER and cm_sbsc.c"
    - LP: #1333838
  * ACPI: Revert "ACPI / Battery: Remove battery's proc directory"
    - LP: #1333838
  * NFSd: Move default initialisers from create_client() to alloc_client()
    - LP: #1333838
  * NFSd: call rpc_destroy_wait_queue() from free_client()
    - LP: #1333838
  * genirq: Provide irq_force_affinity fallback for non-SMP
    - LP: #1333838
  * libata: clean up ZPODD when a port is detached
    - LP: #1333838
  * ACPI / blacklist: Add dmi_enable_osi_linux quirk for Asus EEE PC 1015PX
    - LP: #1333838
  * ACPI: Revert "ACPI / AC: convert ACPI ac driver to platform bus"
    - LP: #1333838
  * ACPI / processor: do not mark present at boot but not onlined CPU as
    onlined
    - LP: #1333838
  * NFSD: Call ->set_acl with a NULL ACL structure if no entries
    - LP: #1333838
  * ALSA: hda - add headset mic detect quirks for three Dell laptops
    - LP: #1297581, #1333838
  * gpio: mcp23s08: Bug fix of SPI device tree registration.
    - LP: #1333838
  * drm/i915/vlv: reset VLV media force wake request register
    - LP: #1333838
  * ARM: dts: i.MX53: Fix ipu register space size
    - LP: #1333838
  * mm, thp: close race between mremap() and split_huge_page()
    - LP: #1333838
  * intel_pstate: Set turbo VID for BayTrail
    - LP: #1333838
  * powerpc/powernv: Reset root port in firmware
    - LP: #1333838
  * hrtimer: Set expiry time before switch_hrtimer_base()
    - LP: #1333838
  * hwmon: (emc1403) fix inverted store_hyst()
    - LP: #1333838
  * hwmon: (emc1403) Fix resource leak on module unload
    - LP: #1333838
  * hwmon: (emc1403) Support full range of known chip revision numbers
    - LP: #1333838
  * iommu/amd: Fix interrupt remapping for aliased devices
    - LP: #1333838
  * ASoC: wm8962: Update register CLASS_D_CONTROL_1 to be non-volatile
    - LP: #1333838
  * [media] V4L2: ov7670: fix a wrong index, potentially Oopsing the kernel
    from user-space
    - LP: #1333838
  * [media] V4L2: fix VIDIOC_CREATE_BUFS in 64- / 32-bit compatibility mode
    - LP: #1333838
  * x86, mm, hugetlb: Add missing TLB page invalidation for hugetlb_cow()
    - LP: #1333838
  * i2c: designware: Mask all interrupts during i2c controller enable
    - LP: #1333838
  * i2c: s3c2410: resume race fix
    - LP: #1333838
  * i2c: rcar: bail out on zero length transfers
    - LP: #1333838
  * dm crypt: fix cpu hotplug crash by removing per-cpu structure
    - LP: #1333838
  * metag: fix memory barriers
    - LP: #1333838
  * metag: Reduce maximum stack size to 256MB
    - LP: #1333838
  * drm/i915: restore backlight precision when converting from ACPI
    - LP: #1333838
  * drm/i915: Increase WM memory latency values on SNB
    - LP: #1333838
  * PCI: shpchp: Check bridge's secondary (not primary) bus speed
    - LP: #1333838
  * parisc: ratelimit userspace segfault printing
    - LP: #1333838
  * parisc: Improve LWS-CAS performance
    - LP: #1333838
  * Target/iser: Fix wrong connection requests list addition
    - LP: #1333838
  * Target/iser: Fix iscsit_accept_np and rdma_cm racy flow
    - LP: #1333838
  * iscsi-target: Change BUG_ON to REJECT in iscsit_process_nop_out
    - LP: #1333838
  * tcm_fc: Fix free-after-use regression in ft_free_cmd
    - LP: #1333838
  * target: Don't allow setting WC emulation if device doesn't support
    - LP: #1333838
  * arm: dts: Fix missing device_type="memory" for ste-ccu8540
    - LP: #1333838
  * mips: dts: Fix missing device_type="memory" property in memory nodes
    - LP: #1333838
  * arm64: fix pud_huge() for 2-level pagetables
    - LP: #1333838
  * libceph: fix corruption when using page_count 0 page in rbd
    - LP: #1333838
  * clk: tegra: use pll_ref as the pll_e parent
    - LP: #1333838
  * clk: tegra: Fix wrong value written to PLLE_AUX
    - LP: #1333838
  * target: fix memory leak on XCOPY
    - LP: #1333838
  * sysfs: make sure read buffer is zeroed
    - LP: #1333838
  * cfg80211: free sme on connection failures
    - LP: #1333838
  * sched: Sanitize irq accounting madness
    - LP: #1333838
  * sched: Use CPUPRI_NR_PRIORITIES instead of MAX_RT_PRIO in cpupri check
    - LP: #1333838
  * mac80211: fix suspend vs. association race
    - LP: #1333838
  * mac80211: fix on-channel remain-on-channel
    - LP: #1333838
  * af_iucv: wrong mapping of sent and confirmed skbs
    - LP: #1333838
  * perf: Limit perf_event_attr::sample_period to 63 bits
    - LP: #1333838
  * perf: Prevent false warning in perf_swevent_add
    - LP: #1333838
  * drm/gf119-/disp: fix nasty bug which can clobber SOR0's clock setup
    - LP: #1333838
  * drm/radeon: also try GART for CPU accessed buffers
    - LP: #1333838
  * drm/radeon: handle non-VGA class pci devices with ATRM
    - LP: #1333838
  * drm/radeon: fix register typo on si
    - LP: #1333838
  * drm/radeon: avoid segfault on device open when accel is not working.
    - LP: #1333838
  * drm/radeon/pm: don't allow debugfs/sysfs access when PX card is off
    (v2)
    - LP: #1333838
  * can: peak_pci: prevent use after free at netdev removal
    - LP: #1333838
  * nfsd4: remove lockowner when removing lock stateid
    - LP: #1333838
  * nfsd4: warn on finding lockowner without stateid's
    - LP: #1333838
  * dma: mv_xor: Flush descriptors before activating a channel
    - LP: #1333838
  * dmaengine: fix dmaengine_unmap failure
    - LP: #1333838
  * hwpoison, hugetlb: lock_page/unlock_page does not match for handling a
    free hugepage
    - LP: #1333838
  * mm/memory-failure.c: fix memory leak by race between poison and
    unpoison
    - LP: #1333838
  * ARM: OMAP3: clock: Back-propagate rate change from cam_mclk to dpll4_m5
    on all OMAP3 platforms
    - LP: #1333838
  * dmaengine: dw: went back to plain {request,free}_irq() calls
    - LP: #1333838
  * ARM: omap5: hwmod_data: Correct IDLEMODE for McPDM
    - LP: #1333838
  * Input: synaptics - add min/max quirk for the ThinkPad W540
    - LP: #1333838
  * ARM: OMAP2+: nand: Fix NAND on OMAP2 and OMAP3 boards
    - LP: #1333838
  * futex: Add another early deadlock detection check
    - LP: #1333838
  * futex: Prevent attaching to kernel threads
    - LP: #1333838
  * ARM: OMAP4: Fix the boot regression with CPU_IDLE enabled
    - LP: #1333838
  * cpufreq: remove race while accessing cur_policy
    - LP: #1333838
  * cpufreq: cpu0: drop wrong devm usage
    - LP: #1333838
  * ARM: imx: fix error handling in ipu device registration
    - LP: #1333838
  * ALSA: hda - Fix onboard audio on Intel H97/Z97 chipsets
    - LP: #1333838
  * ARM: 8051/1: put_user: fix possible data corruption in put_user
    - LP: #1333838
  * ARM: 8064/1: fix v7-M signal return
    - LP: #1333838
  * Input: synaptics - T540p - unify with other LEN0034 models
    - LP: #1333838
  * drm/i915: Only copy back the modified fields to userspace from
    execbuffer
    - LP: #1333838
  * dm cache: always split discards on cache block boundaries
    - LP: #1333838
  * virtio_blk: don't crash, report error if virtqueue is broken.
    - LP: #1333838
  * virtio_blk: fix race between start and stop queue
    - LP: #1333838
  * powerpc: Fix 64 bit builds with binutils 2.24
    - LP: #1333838
  * powerpc, kexec: Fix "Processor X is stuck" issue during kexec from ST
    mode
    - LP: #1333838
  * rtmutex: Fix deadlock detector for real
    - LP: #1333838
  * drm/radeon: avoid crash if VM command submission isn't available
    - LP: #1333838
  * drm/radeon: don't allow RADEON_GEM_DOMAIN_CPU for command submission
    - LP: #1333838
  * iwlwifi: mvm: fix setting channel in monitor mode
    - LP: #1333838
  * Staging: speakup: Move pasting into a work item
    - LP: #1333838
  * USB: Avoid runtime suspend loops for HCDs that can't handle
    suspend/resume
    - LP: #1333838
  * can: only rename enabled led triggers when changing the netdev name
    - LP: #1333838
  * USB: io_ti: fix firmware download on big-endian machines (part 2)
    - LP: #1333838
  * USB: ftdi_sio: add NovaTech OrionLXm product ID
    - LP: #1333838
  * USB: serial: option: add support for Novatel E371 PCIe card
    - LP: #1333838
  * USB: cdc-wdm: properly include types.h
    - LP: #1333838
  * md: always set MD_RECOVERY_INTR when aborting a reshape or other
    "resync".
    - LP: #1333838
  * xhci: delete endpoints from bandwidth list before freeing whole device
    - LP: #1333838
  * md: always set MD_RECOVERY_INTR when interrupting a reshape thread.
    - LP: #1333838
  * ALSA: hda/analog - Fix silent output on ASUS A8JN
    - LP: #1333838
  * drm/radeon/dpm: resume fixes for some systems
    - LP: #1333838
  * drm/radeon: use the CP DMA on CIK
    - LP: #1333838
  * ALSA: hda/realtek - Correction of fixup codes for PB V7900 laptop
    - LP: #1333838
  * ALSA: hda/realtek - Fix COEF widget NID for ALC260 replacer fixup
    - LP: #1333838
  * iser-target: Add missing target_put_sess_cmd for ImmedateData failure
    - LP: #1333838
  * iscsi-target: Fix wrong buffer / buffer overrun in
    iscsi_change_param_value()
    - LP: #1333838
  * percpu-refcount: fix usage of this_cpu_ops
    - LP: #1333838
  * target: Fix alua_access_state attribute OOPs for un-configured devices
    - LP: #1333838
  * mm: rmap: fix use-after-free in __put_anon_vma
    - LP: #1333838
  * mm: add !pte_present() check on existing hugetlb_entry callbacks
    - LP: #1333838
  * target: Fix NULL pointer dereference for XCOPY in target_put_sess_cmd
    - LP: #1333838
  * Linux 3.13.11.4
    - LP: #1333838
  * powerpc/powernv: Infrastructure to read opal messages in generic
    format.
    - LP: #1334268
  * powerpc/powernv: Infrastructure to support OPAL async completion
    - LP: #1334268
  * powerpc/powernv: Enable fetching of platform sensor data
    - LP: #1334268
  * powerpc/powernv: Fix endian issues with sensor code
    - LP: #1334268
  * powerpc/powernv: Add OPAL message log interface
    - LP: #1334268
  * powerpc/powernv: Fix kexec races going back to OPAL
    - LP: #1334268
  * powerpc/powernv: Fix little endian issues in OPAL flash code
    - LP: #1334268
  * powerpc/powernv: Fix little endian issues with opal_do_notifier calls
    - LP: #1334268
  * powerpc/powernv: Fix little endian issues in OPAL error log code
    - LP: #1334268
  * powerpc/powernv: Create OPAL sglist helper functions and fix endian
    issues
    - LP: #1334268
  * powerpc/powernv: Fix little endian issues in OPAL dump code
    - LP: #1334268
  * powerpc: Fix error return in rtas_flash module init
    - LP: #1334268
  * powerpc/powernv: Increase candidate fw image size
    - LP: #1334268
  * powerpc/powernv: Return secondary CPUs to firmware before FW update
    - LP: #1334268
  * powerpc/powernv: Pass buffer size to OPAL validate flash call
    - LP: #1334268
  * gpio: add a driver for the Synopsys DesignWare APB GPIO block
    - LP: #1334823
  * gpio: dwapb: drop irq_setup_generic_chip()
    - LP: #1334823
  * gpio: dwapb: use a second irq chip
    - LP: #1334823
  * lzo: properly check for overruns
    - LP: #1335313
    - CVE-2014-4608
  * lz4: ensure length does not wrap
    - LP: #1335314
    - CVE-2014-4611
  * netfilter: nf_nat: fix oops on netns removal
    - LP: #1314274
  * ALSA: hda - add device ID for Broadwell display audio controller
    - LP: #1188091
  * ALSA: hda - add codec ID for Broadwell display audio codec
    - LP: #1188091
  * ALSA: hda/hdmi - apply all Haswell fix-ups to Broadwell display codec
    - LP: #1188091
  * ALSA: hda - using POS_FIX_LPIB on Broadwell HDMI Audio
    - LP: #1188091
 -- Luis Henriques <luis.henriques@canonical.com>   Mon, 14 Jul 2014 10:03:44 +0100

Changed in linux (Ubuntu Trusty):
status:	Fix Committed → Fix Released

Revision history for this message

Yuri Sa (9-yuri-1) wrote on 2015-01-23:

#36

Sorry to ressurect an old thread, but I did all of this but the problem still persisted.

I have some 25+ clients connected to NFS server.

When i dug into the clients, it was 2 of them with connection failures (one on the eth card, another the cable was damaged)...

When I corrected those problems, my Kworker related to NFS went back to usual 0.5%....

Ubuntu
linux package

NFS kernel server creates a kworker with 100% CPU usage, then hangs randomly

Bug Description

Related branches

CVE References

Duplicates of this bug

Other bug subscribers

Bug attachments

Remote bug watches

Affects		Status	Importance	Assigned to	Milestone
	linux (Ubuntu)	Fix Released	High	Unassigned
	Trusty	Fix Released	High	Stefan Bader

Ubuntulinux package

NFS kernel server creates a kworker with 100% CPU usage, then hangs randomly

Bug Description

Related branches

CVE References

Duplicates of this bug

Other bug subscribers

Bug attachments

Remote bug watches

Ubuntu
linux package