Bug #974664 “Kernel Oops - BUG: unable to handle kernel paging r...” : Bugs : linux package : Ubuntu

Revision history for this message

rvaliant (rvaliant) wrote on 2012-04-05:

#1

AcpiTables.txt Edit (137.2 KiB, text/plain; charset="utf-8")
AlsaDevices.txt Edit (711 bytes, text/plain; charset="utf-8")
AplayDevices.txt Edit (382 bytes, text/plain; charset="utf-8")
ArecordDevices.txt Edit (415 bytes, text/plain; charset="utf-8")
AudioDevicesInUse.txt Edit (192 bytes, text/plain; charset="utf-8")
BootDmesg.txt Edit (77.7 KiB, text/plain; charset="utf-8")
Card0.Amixer.info.txt Edit (171 bytes, text/plain; charset="utf-8")
Card0.Amixer.values.txt Edit (4.5 KiB, text/plain; charset="utf-8")
Card0.Codecs.codec.0.txt Edit (13.8 KiB, text/plain; charset="utf-8")
Card1.Amixer.info.txt Edit (178 bytes, text/plain; charset="utf-8")
Card1.Amixer.values.txt Edit (124 bytes, text/plain; charset="utf-8")
Card1.Codecs.codec.0.txt Edit (1.1 KiB, text/plain; charset="utf-8")
CurrentDmesg.txt Edit (5.9 KiB, text/plain; charset="utf-8")
Dependencies.txt Edit (1.9 KiB, text/plain; charset="utf-8")
Lspci.txt Edit (28.1 KiB, text/plain; charset="utf-8")
Lsusb.txt Edit (705 bytes, text/plain; charset="utf-8")
OopsText.txt Edit (2.6 KiB, text/plain; charset="utf-8")
PciMultimedia.txt Edit (3.3 KiB, text/plain; charset="utf-8")
ProcCpuinfo.txt Edit (2.6 KiB, text/plain; charset="utf-8")
ProcInterrupts.txt Edit (1.9 KiB, text/plain; charset="utf-8")
ProcModules.txt Edit (2.8 KiB, text/plain; charset="utf-8")
UdevDb.txt Edit (120.8 KiB, text/plain; charset="utf-8")
UdevLog.txt Edit (242.0 KiB, text/plain; charset="utf-8")
WifiSyslog.txt Edit (167.5 KiB, text/plain; charset="utf-8")

Brad Figg (brad-figg) on 2012-04-05

Changed in linux (Ubuntu):
status:	New → Confirmed

Revision history for this message

penalvch (penalvch) wrote on 2012-04-06:

#2

rvaliant, thank you for reporting this bug and helping make Ubuntu better. Please answer these questions:

* Is this reproducible?
* If so, what specific steps should we take to recreate this bug?
* Could you also please test the latest upstream kernel available? It will allow additional upstream developers to examine the issue. Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag. This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text. Please let us know your results.

Thanks in advance.

summary:	- BUG: unable to handle kernel paging request at ffffffffffffffb8 + Kernel Oops - BUG: unable to handle kernel paging request at + ffffffffffffffb8; RIP: 0010:[<ffffffffa05a5839>] [<ffffffffa05a5839>] + nfs_have_delegation+0x9/0x40 [nfs]
Changed in linux (Ubuntu):
status:	Confirmed → Incomplete
tags:	added: needs-upstream-testing

Joseph Salisbury (jsalisbury) on 2012-04-06

Changed in linux (Ubuntu):
importance:	Undecided → Medium
tags:	added: nfs-have-delegation

Revision history for this message

In Red Hat Bugzilla #811138, Bert (bert-redhat-bugs) wrote on 2012-04-10:

#30

Download full text (3.9 KiB)

Description of problem:

Kernel traceback on NFS4 locking attempt

Version-Release number of selected component (if applicable):

kernel 3.3.1-2 (and 3.3.1-3)

How reproducible:

Always

Steps to Reproduce:
1. Install kernel 3.3.1-{2,3}; mount a users home over NFS4
2. Start e.g. pidgin, which tries to lock some file
3.

Actual results:

*) Pidgin crashes with 'Killed'. Strace shows that the last activity
is:

[...]
open("/users/visics/deknuydt/.config/enchant/en_US.dic", O_RDONLY) = 11
flock(11, LOCK_EX <unfinished ...>
+++ killed by SIGKILL +++
Killed

*) In dmesg, you see:

[ 322.987247] BUG: unable to handle kernel paging request at ffffffffffffffb8
[ 322.987330] IP: [<ffffffffa0e140e9>] nfs_have_delegation+0x9/0x40 [nfs]
[ 322.987418] PGD 1c07067 PUD 1c08067 PMD 0
[ 322.987495] Oops: 0000 [#1] SMP
[ 322.987565] CPU 1
[ 322.987574] Modules linked in: nfs fscache nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack sha256_generic dm_crypt nvidia(PO) nfsd uinput lockd nfs_acl auth_rpcgss sunrpc snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device snd_pcm iTCO_wdt i2c_i801 iTCO_vendor_support intel_rng microcode r8169 i2c_core mii snd_timer snd soundcore snd_page_alloc serio_raw firewire_ohci firewire_core crc_itu_t sata_sil24 video [last unloaded: scsi_wait_scan]
[ 322.988230]
[ 322.988230] Pid: 1697, comm: pidgin Tainted: P O 3.3.1-3.fc16.x86_64 #1 transtec AG /DG31PR
[ 322.988230] RIP: 0010:[<ffffffffa0e140e9>] [<ffffffffa0e140e9>] nfs_have_delegation+0x9/0x40 [nfs]
[ 322.988230] RSP: 0018:ffff880112e55dd8 EFLAGS: 00010246
[ 322.988230] RAX: ffff880122411800 RBX: ffff880112e55e68 RCX: 00000000ffffd8ca
[ 322.988230] RDX: 0000000000000000 RSI: 0000000000000001 RDI: 0000000000000000
[ 322.988230] RBP: ffff880112e55dd8 R08: 0000000000016560 R09: ffffea00044b8400
[ 322.988230] R10: ffffffffa0e0457a R11: 0000000000000000 R12: 00000000ffffd8ca
[ 322.988230] R13: ffff8801218be000 R14: ffff88011573cc00 R15: ffff8801224b0480
[ 322.988230] FS: 00007fdda2060980(0000) GS:ffff88012fc80000(0000) knlGS:0000000000000000
[ 322.988230] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 322.988230] CR2: ffffffffffffffb8 CR3: 0000000112c07000 CR4: 00000000000006e0
[ 322.988230] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 322.988230] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[ 322.988230] Process pidgin (pid: 1697, threadinfo ffff880112e54000, task ffff880112dd2e60)
[ 322.988230] Stack:
[ 322.988230] ffff880112e55e28 ffffffffa0e01fa1 ffff880112e11e60 0000000000000000
[ 322.988230] ffff8801224b0480 ffff8801224b0780 ffff8801224b0480 0000000000000082
[ 322.988230] ffff880112e55e68 00000000fffffff5 ffff880112e55eb8 ffffffffa0e04b9c
[ 322.988230] Call Trace:
[ 322.988230] [<ffffffffa0e01fa1>] nfs4_handle_exception+0x241/0x3a0 [nfs]
[ 322.988230] [<ffffffffa0e04b9c>] nfs4_proc_lock+0xec/0x440 [nfs]
[ 322.988230] [<ffffffffa0de518d>] do_setlk+0xed/0x110 [nfs]
[ 322.988230] [<ffffffffa0de5239>] nfs_flock+0x89/0xe0 [nfs]
[ 322.988230] [<ffffffff811cbd53>] sys_flock+0x113/0x1c0
[ 322.988230] [<ffffffff815fbfe9>] system...

On our desktop clients kernel 3.3.1-5.fc16.x86_64 shows the same nfs_have_delegation error when using an NFS mounted home directory. This system was just installed today with fedora-updates repo enabled. To reproduce edit any existing file in the home directory with gedit and try to save it. I can confirm that kernel 3.3.0-8.fc16.x86_64 does not have this problem.

Apr 14 22:40:11 client kernel: [  913.752179] BUG: unable to handle kernel paging request at ffffffffffffffb8
Apr 14 22:40:11 client kernel: [  913.752204] IP: [<ffffffffa039c0e9>] nfs_have_delegation+0x9/0x40 [nfs]
Apr 14 22:40:11 client kernel: [  913.752234] PGD 1c07067 PUD 1c08067 PMD 0 
Apr 14 22:40:11 client kernel: [  913.752245] Oops: 0000 [#1] SMP 
Apr 14 22:40:11 client kernel: [  913.752256] CPU 1 
Apr 14 22:40:11 client kernel: [  913.752260] Modules linked in: nfs fscache auth_rpcgss nfs_acl lockd ip6t_REJECT nf_conntrack_ipv6 nf_defr
ag_ipv6 ip6table_filter ip6_tables nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack tpm_infineon snd_hda_codec_analog hp_wmi sparse_ke
ymap rfkill ppdev snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device snd_pcm microcode snd_timer snd soundcore snd_page_alloc seri
o_raw iTCO_wdt iTCO_vendor_support parport_pc parport e1000e tpm_tis tpm tpm_bios sunrpc uinput ata_generic pata_acpi nouveau ttm drm_kms_he
lper drm i2c_core mxm_wmi video wmi [last unloaded: scsi_wait_scan]
Apr 14 22:40:11 client kernel: [  913.752401] 
Apr 14 22:40:11 client kernel: [  913.752405] Pid: 2786, comm: gedit Tainted: G          I  3.3.1-5.fc16.x86_64 #1 ***REMOVED***
Apr 14 22:40:11 client kernel: [  913.752424] RIP: 0010:[<ffffffffa039c0e9>]  [<ffffffffa039c0e9>] nfs_have_delegation+0x9/0x40 [nfs]
Apr 14 22:40:11 client kernel: [  913.752447] RSP: 0018:ffff88005a6a3dd8  EFLAGS: 00010246
Apr 14 22:40:11 client kernel: [  913.752453] RAX: ffff88006428b400 RBX: ffff88005a6a3e68 RCX: 00000000ffffd8ca
Apr 14 22:40:11 client kernel: [  913.752461] RDX: 0000000000000000 RSI: 0000000000000001 RDI: 0000000000000000
Apr 14 22:40:11 client kernel: [  913.752476] RBP: ffff88005a6a3dd8 R08: 0000000000016560 R09: ffffea0001694980
Apr 14 22:40:11 client kernel: [  913.752483] R10: ffffffffa038c57a R11: 0000000000000000 R12: 00000000ffffd8ca
Apr 14 22:40:11 client kernel: [  913.752656] R13: ffff8800659ea800 R14: ffff880064289400 R15: ffff8800523599c0
Apr 14 22:40:11 client kernel: [  913.752664] FS:  00007f5846754980(0000) GS:ffff88007da80000(0000) knlGS:0000000000000000
Apr 14 22:40:11 client kernel: [  913.752674] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Apr 14 22:40:11 client kernel: [  913.752680] CR2: ffffffffffffffb8 CR3: 00000000642bb000 CR4: 00000000000006e0
Apr 14 22:40:11 client kernel: [  913.752691] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Apr 14 22:40:11 client kernel: [  913.752699] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Apr 14 22:40:11 client kernel: [  913.752707] Process gedit (pid: 2786, threadinfo ffff88005a6a2000, task ffff8800646fdcc0)
Apr 14 22:40:11 client kernel: [  913.752725] Stack:
Apr 14 22:40:11 client kernel: [  913.752729]  ffff88005a6a3e28 ffffffffa0389fa1 ffff88005a527660 0000000000000000
Apr 14 22:40:11 client kernel: [  913.752766]  ffff8800523599c0 ffff880052359000 ffff8800523599c0 0000000000000082
Apr 14 22:40:11 client kernel: [  913.752786]  ffff88005a6a3e68 00000000fffffff5 ffff88005a6a3eb8 ffffffffa038cb9c
Apr 14 22:40:11 client kernel: [  913.752810] Call Trace:
Apr 14 22:40:11 client kernel: [  913.752827]  [<ffffffffa0389fa1>] nfs4_handle_exception+0x241/0x3a0 [nfs]
Apr 14 22:40:11 client kernel: [  913.752846]  [<ffffffffa038cb9c>] nfs4_proc_lock+0xec/0x440 [nfs]
Apr 14 22:40:11 client kernel: [  913.752860]  [<ffffffffa036d18d>] do_setlk+0xed/0x110 [nfs]
Apr 14 22:40:11 client kernel: [  913.752872]  [<ffffffffa036d239>] nfs_flock+0x89/0xe0 [nfs]
Apr 14 22:40:11 client kernel: [  913.752882]  [<ffffffff811cbd23>] sys_flock+0x113/0x1c0
Apr 14 22:40:11 client kernel: [  913.752895]  [<ffffffff815fc069>] system_call_fastpath+0x16/0x1b
Apr 14 22:40:11 client kernel: [  913.752902] Code: fd ff e9 40 fe ff ff 66 66 2e 0f 1f 84 00 00 00 00 00 55 48 89 e5 66 66 66 66 90 f0 80 4
f 48 04 5d c3 55 48 89 e5 66 66 66 66 90 <48> 8b 57 b8 31 c0 48 85 d2 74 0c 8b 4a 30 83 e6 03 21 f1 39 f1 
Apr 14 22:40:11 client kernel: [  913.753124] RIP  [<ffffffffa039c0e9>] nfs_have_delegation+0x9/0x40 [nfs]
Apr 14 22:40:11 client kernel: [  913.753145]  RSP <ffff88005a6a3dd8>
Apr 14 22:40:11 client kernel: [  913.753150] CR2: ffffffffffffffb8
Apr 14 22:40:11 client kernel: [  913.753161] ---[ end trace a7919e7f17c0a727 ]---

Revision history for this message

In Red Hat Bugzilla #811138, Bert (bert-redhat-bugs) wrote on 2012-04-16:

#35

Some more tests:

1) kernels 3.3.2-1.fc1{6,7}.x86_64 from Koji have exactly the same problem.

2) With 3.4.0-0.rc2.git3.1.fc18.x86_64 I get no crash of e.g. pidgin,
but a hang, and this stuff in the dmesg

[ 216.350962] nfs4_reclaim_open_state: 3257 callbacks suppressed
[ 216.350965] NFS: nfs4_reclaim_open_state: Lock reclaim failed!
[ 216.352550] NFS: nfs4_reclaim_open_state: Lock reclaim failed!
[ 216.354033] NFS: nfs4_reclaim_open_state: Lock reclaim failed!
[ 216.355580] NFS: nfs4_reclaim_open_state: Lock reclaim failed!
[ 216.357055] NFS: nfs4_reclaim_open_state: Lock reclaim failed!
[ 216.358522] NFS: nfs4_reclaim_open_state: Lock reclaim failed!
[ 216.360042] NFS: nfs4_reclaim_open_state: Lock reclaim failed!
[ 216.361513] NFS: nfs4_reclaim_open_state: Lock reclaim failed!
[ 216.363006] NFS: nfs4_reclaim_open_state: Lock reclaim failed!
[ 216.364582] NFS: nfs4_reclaim_open_state: Lock reclaim failed!

This repeats endlessly till the offending program is killed.

Abt. 120 machines affected here.

Revision history for this message

In Red Hat Bugzilla #811138, Josh (josh-redhat-bugs) wrote on 2012-04-16:

#36

*** Bug 812633 has been marked as a duplicate of this bug. ***

Revision history for this message

Dan Bishop (danbishop) wrote on 2012-04-16:

#4

This problem is reproduced by logging into Ubuntu with a user's home directory mounted over NFS... though only if the server is also running 12.04.

I have a stable server running 10.04 and 12.04 clients can still mount NFS home directories from there, moving to the test server, configured in the same way but running 12.04, clients crash as above.

Revision history for this message

Joseph Salisbury (jsalisbury) wrote on 2012-04-16:

#5

Has anyone affected by this bug had a chance to test the latest mainline kernel:

http://kernel.ubuntu.com/~kernel-ppa/mainline/v3.4-rc3-precise/

tags:

added: kernel-da-key kernel-key

Revision history for this message

rvaliant (rvaliant) wrote on 2012-04-16:

#6

Actually, my server is Ubuntu 11.10, so if this is an NFS related issue, it's not confined to 12.04. I have some time today and will try the latest mainline kernel on my client as suggested by jsalisbury and report back - if someone doesn't beat me to it.

Revision history for this message

In Red Hat Bugzilla #811138, Jeff (jeff-redhat-bugs) wrote on 2012-04-16:

#37

Looks like this is probably fixable by pushing commit 14977489 to stable.

Revision history for this message

Ian Morris (ipm) wrote on 2012-04-16:

#7

I'm seeing the same OOps -- again Logging into a 12.04 client system with autofs5, LDAP, kerberos authenticated nfs4 mounts. However this is to a server which is running 10.04 (contrary to #4 it seems) In my case the triggering process is gnome-keyring-d.

Running the mainline kernel (linux-image-3.4.0-030400rc3-generic_3.4.0-030400rc3.201204152235_amd64.deb) I no longer get the oops but get lots of messages:

[ 48.701213] NFS: nfs4_reclaim_open_state: Lock reclaim failed!
[ 48.701990] NFS: nfs4_reclaim_open_state: Lock reclaim failed!
[ 53.696076] nfs4_reclaim_open_state: 6440 callbacks suppressed

Until the client was upgraded to 12.04, it was running 11.10 perfectly in this configuration.

Revision history for this message

In Red Hat Bugzilla #811138, Jeff (jeff-redhat-bugs) wrote on 2012-04-16:

#38

cc'ing Trond...

It looks like this regression crept in with commit 3114ea7a. That commit was
sent to stable kernel series, but commit 149774 (which fixed a bug in the
above commit) was not.

Trond, any chance you can send 149774 to stable as well? We might also want
to take that into Fedora for now until it makes its way into stable kernels.

Revision history for this message

Luis Henriques (henrix) wrote on 2012-04-17:

#8

It looks like commit 14977489ffdb80d4caf5a184ba41b23b02fbacd9 should fix this issue. I have built a test kernel that includes this commit. Could someone check if it solves the problem? The test kernel can be obtained here:

http://people.canonical.com/~henrix/lp974664/

Revision history for this message

Ian Morris (ipm) wrote on 2012-04-17:

#9

Re: #8, tried this kernel with the same effect as running the mainline kernel in #7 specifically many many messages of the type:

NFS: nfs4_reclaim_open_state: Lock reclaim failed!

Revision history for this message

In Red Hat Bugzilla #811138, Josh (josh-redhat-bugs) wrote on 2012-04-17:

#39

I've added 149774 to F15-F17 now. Rawhide already has it.

Revision history for this message

In Red Hat Bugzilla #811138, Josh (josh-redhat-bugs) wrote on 2012-04-17:

#40

I've started a scratch build here:

http://koji.fedoraproject.org/koji/taskinfo?taskID=3999546

testing would be appreciated.

Revision history for this message

Luis Henriques (henrix) wrote on 2012-04-17:

#10

Ian: Thanks for testing the kernel. Apparently, these messages are harmless and the mainline kernel has already a ratelimiting patch for this. I have backported the ratelimiting commit (96dcadc2fdd111dca90d559f189a30c65394451a) on top of the kernel you tested and uploaded it here:

http://people.canonical.com/~henrix/lp974664v2/

Please let me know if this new kernel fixes it.

Revision history for this message

Ian Morris (ipm) wrote on 2012-04-17:

#11

Packet capture Edit (211.3 KiB, image/png)

Re: #10, Luis, thanks for your efforts on this. By running http://people.canonical.com/~henrix/lp974664v2, the Oops is indeed resolved and the rate limiting is definitely reducing the numbers of messages produced but none the less there are many many dozens of messages produced still -- just like with the mainline kernel infact. I assume the mainline kernel also had the rate limiting patch applied.

I'm not entirely sure about the "harmless" nature of the messages however. My perception is that logging in is somewhat slower -- this may be down to the beta nature of the release at present however what I am observing, even after logon of the user has completed is a high level of network activity. Running a packet trace and a wireshark dump of this activity, it seems that the activity is down to NFS and it seems (see screenshot attached) to be stuck in somewhat of a loop -- specifically continually getting an error when trying to lock the file (each time the same file). I wonder if the slowness has the same root cause (the shere number of failed retries -- looking at the packet trace it seems, if I'm not mistaken, there's about 1000 per second of these!)

I observe the file being processed is user.keystore and in my case, the prior oops was triggered by gnome-keyring-d so I can't help feeling these issues are related. However I will admit I'm no expert on NFS so I could be seeing ghosts :-)

Let me know if there is any information I can provide that might help.

Revision history for this message

Luis Henriques (henrix) wrote on 2012-04-17:

#12

Ian, thanks again for testing and helping triagging this bug.

When I said "harmless" (and I'm far from being an expert on NFS as well!) I was not aware that the flood of messages being logged would be kept that high. Obviously, logging such a huge amount of messages will at least impact the performance. My comment referred to the fact that the log was actually on an error path that would be retried again.

I'll ping upstreams for further information and post any development here.

Revision history for this message

In Red Hat Bugzilla #811138, Tim (tim-redhat-bugs) wrote on 2012-04-17:

#41

I'm also seeing this (didn't report; tainted kernel, though I now have another box set up with an untainted kernel to play with)

I grabbed the SRPM and built myself a kernel-3.3.2-3.fc17.x86_64.rpm

Alas, I can't try my 100% reliable reproducer of "hit the New Message button in kmail" because this kernel gives me my NFS home directory owned by an unfeasibly large UID :-)

Under 3.3.0-8.fc17.x86_64 the files are owned by 'tim' but under kernel-3.3.2-3.fc17.x86_64 it's someone called 4294967294 with GID 4294967294. Which would be a 32-bit -2 which since uid_t is unsigned 32-bit, could be a signed return code cast where it oughtn't.

I know about getting /etc/idmapd.conf correct. I can flip back and forth between the states by rebooting and selecting the different kernels with no other change.

Revision history for this message

In Red Hat Bugzilla #811138, Josh (josh-redhat-bugs) wrote on 2012-04-17:

#42

(In reply to comment #11)
> I'm also seeing this (didn't report; tainted kernel, though I now have another
> box set up with an untainted kernel to play with)
>
> I grabbed the SRPM and built myself a kernel-3.3.2-3.fc17.x86_64.rpm
>
> Alas, I can't try my 100% reliable reproducer of "hit the New Message button in
> kmail" because this kernel gives me my NFS home directory owned by an
> unfeasibly large UID :-)

F17 has additional NFS patches that are needed to work with the F17 userspace utilities. You would need to have an F17 SRPM for it to work properly.

Revision history for this message

In Red Hat Bugzilla #811138, Bert (bert-redhat-bugs) wrote on 2012-04-18:

#43

To Josh Boyer in reply to comment #10:

I tried your 3.3.2-3.fc16.x86_64 build from Koji. It behaves like the 3.4.0
kernel from F18: the program (pidgin in this case) hangs, and dmesg spits, by the thousands:

[ 96.542908] nfs4_reclaim_open_state: Lock reclaim failed!
[ 96.543687] nfs4_reclaim_open_state: Lock reclaim failed!
[ 96.544439] nfs4_reclaim_open_state: Lock reclaim failed!
[ 96.545258] nfs4_reclaim_open_state: Lock reclaim failed!
[ 96.546066] nfs4_reclaim_open_state: Lock reclaim failed!

So no solution yet ...

Revision history for this message

In Red Hat Bugzilla #811138, Tim (tim-redhat-bugs) wrote on 2012-04-18:

#44

(In reply to comment #12)
> (In reply to comment #11)
> > I'm also seeing this (didn't report; tainted kernel, though I now have another
> > box set up with an untainted kernel to play with)
> >
> > I grabbed the SRPM and built myself a kernel-3.3.2-3.fc17.x86_64.rpm
> >
> > Alas, I can't try my 100% reliable reproducer of "hit the New Message button in
> > kmail" because this kernel gives me my NFS home directory owned by an
> > unfeasibly large UID :-)
>
> F17 has additional NFS patches that are needed to work with the F17 userspace
> utilities. You would need to have an F17 SRPM for it to work properly.

Any instructions for finding/making one? I furtled through koji.fedoraproject.org and couldn't find one. Would it be useful to try a 3.4.0 as comment #13 has? (I'm guessing here that f18 has the NFS patches of which you speak) Or should I drop that spare machine back to f16 to try this out?

Revision history for this message

Dan Bishop (danbishop) wrote on 2012-04-18:

#13

"Re: #10, Luis, thanks for your efforts on this. By running http://people.canonical.com/~henrix/lp974664v2, the Oops is indeed resolved and the rate limiting is definitely reducing the numbers of messages produced but none the less there are many many dozens of messages produced still -- just like with the mainline kernel infact. I assume the mainline kernel also had the rate limiting patch applied."

I can confirm that here too, using Luis' kernel on the client and the latest, untouched, untainted 3.2.0-23-generic on the server.

Revision history for this message

Dan Bishop (danbishop) wrote on 2012-04-18:

#14

Can this not be marked as confirmed now?

Revision history for this message

Luis Henriques (henrix) wrote on 2012-04-18:

#15

Ian, there's something I forgot to ask: could you take a look at the server logs (running Lucid, I believe) and check whether there's some extra information there? Please attach any relevant information (in particular the kernel logs).

Changed in linux (Ubuntu):
status:	Incomplete → Confirmed

Revision history for this message

Thomas Spitzlei (t-spitzlei) wrote on 2012-04-18:

#16

Trace - nfs_have_delegation Edit (3.2 KiB, text/plain)

Same Problem here:
Server 10.04 LTS Stock-Kernel

Clients: 11.04 - Custom Kernel, nfs-home

Kernel 3.1, 3.2 from kernel.org:
BUG: unable to handle kernel paging request at ffffffffffffffb8
IP: [<ffffffffa0e83bfb>] nfs_have_delegation+0xb/0x40 [nfs]

Kernel 3.4 from kernel.org:
NFS: nfs4_reclaim_open_state: Lock reclaim failed!

Appears when trying to run for example tomboy, gedit, pidgin. No 3.x (3.1, 3.2, 3.4) - Kernel allows to start the apps with a nfs-home.

No entries in Serverlogs.

Revision history for this message

In Red Hat Bugzilla #811138, Josh (josh-redhat-bugs) wrote on 2012-04-18:

#45

*** Bug 813732 has been marked as a duplicate of this bug. ***

Revision history for this message

In Red Hat Bugzilla #811138, Josh (josh-redhat-bugs) wrote on 2012-04-18:

#46

(In reply to comment #14)
> > F17 has additional NFS patches that are needed to work with the F17 userspace
> > utilities. You would need to have an F17 SRPM for it to work properly.
>
> Any instructions for finding/making one? I furtled through
> koji.fedoraproject.org and couldn't find one. Would it be useful to try a 3.4.0
> as comment #13 has? (I'm guessing here that f18 has the NFS patches of which
> you speak) Or should I drop that spare machine back to f16 to try this out?

I started an F17 scratch build here:

http://koji.fedoraproject.org/koji/taskinfo?taskID=4001445

Revision history for this message

In Red Hat Bugzilla #811138, Josh (josh-redhat-bugs) wrote on 2012-04-18:

#47

(In reply to comment #13)
> To Josh Boyer in reply to comment #10:
>
> I tried your 3.3.2-3.fc16.x86_64 build from Koji. It behaves like the 3.4.0
> kernel from F18: the program (pidgin in this case) hangs, and dmesg spits, by
> the thousands:
>
> [ 96.542908] nfs4_reclaim_open_state: Lock reclaim failed!
> [ 96.543687] nfs4_reclaim_open_state: Lock reclaim failed!
> [ 96.544439] nfs4_reclaim_open_state: Lock reclaim failed!
> [ 96.545258] nfs4_reclaim_open_state: Lock reclaim failed!
> [ 96.546066] nfs4_reclaim_open_state: Lock reclaim failed!
>
> So no solution yet ...

Well, it's not oopsing at least.

Jeff?

Revision history for this message

Ian Morris (ipm) wrote on 2012-04-18:

#17

Re: #15, Correct -- server running lucid with stock kernel (from updates). There's no messages in the server logs that I can see (same as #16)

Revision history for this message

In Red Hat Bugzilla #811138, Tim (tim-redhat-bugs) wrote on 2012-04-18:

#48

OK. Same "nfs4_reclaim_open_state: Lock reclaim failed!" here.

Even more trivial way to reproduce, BTW:

bash$ /usr/bin/flock /file/on/nfs echo Fish

Revision history for this message

In Red Hat Bugzilla #811138, Roland (roland-redhat-bugs) wrote on 2012-04-18:

#49

With kernel 3.3.2-3.fc16.x86_64 and saving a file on NFS mounted partition using gedit I have a similar result as described in comment #13: no longer a kernel oops and dmesg showing many lines of "Lock reclaim failed!":

[ 302.568318] nfs4_reclaim_open_state: Lock reclaim failed!
[ 302.569517] nfs4_reclaim_open_state: Lock reclaim failed!
[ 302.570724] nfs4_reclaim_open_state: Lock reclaim failed!
[ 302.571894] nfs4_reclaim_open_state: Lock reclaim failed!
[ 302.573091] nfs4_reclaim_open_state: Lock reclaim failed!
[ 302.574305] nfs4_reclaim_open_state: Lock reclaim failed!
[ 302.575437] nfs4_reclaim_open_state: Lock reclaim failed!

I cannot reproduce the "Lock reclaim failed!" error with the flock command in comment #18 though. Next I tried running

/usr/bin/flock ~/a sleep 10

in one terminal and

/usr/bin/flock ~/a echo foo

in another terminal and this kind of locking seems to work fine since the second flock waits with executing the echo command until the sleep command has exited.

Revision history for this message

Luis Henriques (henrix) wrote on 2012-04-18:

#18

I have ping'ed upstreams about the issue and two new patches (not yet upstream) have been provided. I have uploaded a new test kernel here:

http://people.canonical.com/~henrix/lp974664v3/

It contains the two mentioned patches and the other two you have already tested.

Please let me know if this new kernel finally solves the issue.

Revision history for this message

Ian Morris (ipm) wrote on 2012-04-18:

#19

Re: #18, Luis, I think you've cracked it! I have just installed that kernel and I am happy to report that I'm not getting an oops, Nor am I getting any messages on the client, the network traffic is back down to normal levels, I'm not getting any messages on the server side and last but not least, my subjective test of the time it takes to login time seems to be back to normal!

Revision history for this message

Thomas Spitzlei (t-spitzlei) wrote on 2012-04-19:

#20

Thats it, Luis! Bug is gone with your kernel.

Where can i get the patches to compile my own? When are the fixes @kernel.org?

Thanks!

Revision history for this message

Shawn Haggett (podge-9) wrote on 2012-04-19:

#21

I can confirm as well that using the kernel from #18 and performing the same steps as before fails to generate the bug and everything seems to work correctly.

Thanks as well!

Revision history for this message

Luis Henriques (henrix) wrote on 2012-04-19:

#22

That's great, thank you for testing. So, to summarise, here's the list of commits that were present in this last kernel you have tested:

- From Linux mainline:
- 14977489ffdb80d4caf5a184ba41b23b02fbacd9
- 96dcadc2fdd111dca90d559f189a30c65394451a

- From linux-nfs, branch bugfixes:
- 487790f27df9bb27d3400486bd021dd59edc7589
- 5de4815015e550bdd33f39650554325540356f0c

Changed in linux (Ubuntu):
status:	Confirmed → Triaged

Revision history for this message

Dan Bishop (danbishop) wrote on 2012-04-19:

#23

Same here, thank you so much Luis!

Is there any way this will make it into precise before launch now? It seems almost unthinkable that an LTS could be released without the ability to mount NFS home directories :s

Is there anything we can do to help this process?

Revision history for this message

In Red Hat Bugzilla #811138, Tim (tim-redhat-bugs) wrote on 2012-04-19:

#50

Hmm. I have more interesting information....

On 3.3.0-8.fc17.x86_64, where it works, /usr/bin/flock does this (which is odd, I suppose, but still...):

open("/home/tim/tmp/spon", O_RDONLY|O_CREAT|O_NOCTTY, 0666) = 3
flock(3, LOCK_EX) = -1 EIO (Input/output error)
access("/home/tim/tmp/spon", R_OK|W_OK) = 0
close(3) = 0
open("/home/tim/tmp/spon", O_RDWR|O_CREAT|O_NOCTTY, 0666) = 3
flock(3, LOCK_EX) = 0

So it opens the file read-only, then tries to LOCK_EX and the NFS server (entirely reasonably) says "NFS4ERR_OPENMODE, you silly person". Then and only then does /usr/bin/flock get around to checking the permissions, opening it read/write, and doing another LOCK_EX, which works.

On my 3.3.2-4.fc17.x86_64, however, /usr/bin/flock only gets as far as

open("/home/tim/tmp/spon", O_RDONLY|O_CREAT|O_NOCTTY, 0666) = 3
flock(3, LOCK_EX ....never returns

while a sniffer trace shows a continuous stream of OPEN(readonly)/LOCK(write) operations which get a continous stream of replies of the form "yeah, all right" and "what are you smoking?" respectively.

So I would venture to suggest that the problem lurks around the -NFS4ERR_OPENMODE case in nfs4_handle_exception() in fs/nfs/nfs4proc.c. I've got printk()s which clearly show it's going through that path, calling nfs4_schedule_stateid_recovery() and dropping down to wait_on_recovery:

Someone more familiar with the code will probably beat me to working out what the precise circumstances are in which it should be retrying and not returning the error, but I shall have a go at working that out anyway, 'cos I have a warped sense of fun...

Revision history for this message

In Red Hat Bugzilla #811138, Jeff (jeff-redhat-bugs) wrote on 2012-04-19:

#51

There are a couple of patches that Trond has proposed upstream and for stable for the other (non-oops) problem. Would any of the folks suffering from it be able to test this and see if it fixes the issue?

http://thread.gmane.org/gmane.linux.nfs/48690/focus=48705

Revision history for this message

In Red Hat Bugzilla #811138, Tim (tim-redhat-bugs) wrote on 2012-04-19:

#52

(In reply to comment #21)
> There are a couple of patches that Trond has proposed upstream and for stable
> for the other (non-oops) problem. Would any of the folks suffering from it be
> able to test this and see if it fixes the issue?
>
> http://thread.gmane.org/gmane.linux.nfs/48690/focus=48705

Applied and tried.
This looks like correct behaviour:

[tim@groke ~]$ /usr/bin/flock ~/tmp/spon echo Fish
flock: /home/tim/tmp/spon: Bad file descriptor

(flock() should never have worked, really :-)

I will try this out with my original kmail problem when I have access to that desktop again in a few hours.

Revision history for this message

In Red Hat Bugzilla #811138, Jeff (jeff-redhat-bugs) wrote on 2012-04-19:

#53

By default, nfs.ko does flock() emulation on top of POSIX locks.

If you find that programs that use flock don't work properly on top of NFS, you can mount with local_lock=flock and that will disable that behavior. Of course, no other client will be aware of flock locks at that point...

Luis Henriques (henrix) on 2012-04-19

description:

updated

Revision history for this message

In Red Hat Bugzilla #811138, Tim (tim-redhat-bugs) wrote on 2012-04-19:

#54

(In reply to comment #23)
> By default, nfs.ko does flock() emulation on top of POSIX locks.
>
> If you find that programs that use flock don't work properly on top of NFS, you
> can mount with local_lock=flock and that will disable that behavior. Of course,
> no other client will be aware of flock locks at that point...

In which case this may still be a problem. I mount with local_lock=none

The only way to get correct flock() emulation would be to silently reopen the FD as writable on the NFS server if LOCK_EX is requested (but presumably keep letting the VFS believe it's read-only). flock() will cheerfully let you exclusive-lock a read-only FD. Ick :-)

Revision history for this message

In Red Hat Bugzilla #811138, Tim (tim-redhat-bugs) wrote on 2012-04-19:

#55

Aha! flock in f17's util-linux-2.21.1 has a test for EIO, and retries read-write. That might need an extra

case EBADF:

The check is not present in f16's util-linux-2.20.1

Was wondering why f16 behaved differently.

Revision history for this message

In Red Hat Bugzilla #811138, Tim (tim-redhat-bugs) wrote on 2012-04-19:

#56

(In reply to comment #22)
> I will try this out with my original kmail problem when I have access to that
> desktop again in a few hours.

OK, the original kmail problem is fixed too. This appears simply to ignore the error it gets back from flock() anyway, once it actually returns :-)

Interestingly, it too attempts an exclusive lock on a read-only open of a file with read-write permission. Depending on how common this behaviour is, it might be less than useless to silently upgrade the NFS open to read-write on a flock(fd,LOCK_EX) IF (a) the file is actually writable AND (b) it is somehow possible to know that it *was* flock() requesting this operation and not fcntl().

...wanders off mumbling...

Joseph Salisbury (jsalisbury) on 2012-04-24

tags:

removed: kernel-key

Revision history for this message

Martin Pitt (pitti) wrote on 2012-04-25: Please test proposed package

#24

Hello rvaliant, or anyone else affected,

Accepted linux into precise-proposed. The package will build now and be available in a few hours. Please test and give feedback here. See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you in advance!

Changed in linux (Ubuntu Precise):
status:	Triaged → Fix Committed

Revision history for this message

In Red Hat Bugzilla #811138, Bert (bert-redhat-bugs) wrote on 2012-04-26:

#57

3.3.2-6.fc16.x86_64 is all now. These

nfs4_reclaim_open_state: Lock reclaim failed!

messages are filling /var/log/messages at about 1MB/sec. So with
an average / filesystem of 8 or 16GB, you can estimate the remaining
uptime after it hits you ...

So can I have this kernel OOPS back please?

Revision history for this message

Launchpad Janitor (janitor) wrote on 2012-04-26:

#25

This bug was fixed in the package linux - 3.2.0-24.37

---------------
linux (3.2.0-24.37) precise-proposed; urgency=low

[ Herton Ronaldo Krzesinski ]

  * d-i: Add hid-logitech-dj to input-modules
    - LP: #975198
  * d-i: Add rtl8187 driver to nic-usb-modules
    - LP: #971719

[ Ian Abbott ]

  * SAUCE: staging: comedi: Add module parameters for default buffer size
    - LP: #981234
  * SAUCE: staging: comedi: Add kernel config for default buffer sizes
    - LP: #981234

[ K. Y. Srinivasan ]

* SAUCE: hv_storvsc: Account for in-transit packets in the RESET path
- LP: #978394

[ Leann Ogasawara ]

* [Config] Set CONFIG_COMEDI_DEFAULT_BUF_[SIZE_KB,MAXSIZE_KB]
- LP: #981234

[ Luis Henriques ]

* SAUCE: ite-cir: postpone ISR registration
- LP: #984387

[ Manoj Iyer ]

  * SAUCE: Bluetooth: btusb: Add vendor specific ID (0489 e042) for
    BCM20702A0
    - LP: #980965

[ Tim Gardner ]

  * Extract firmware module info during getabi
  * [Config] Remove hiq-quanta module references
    - LP: #913164
  * [Config] powerpc-smp: build in ATI and RADEON frame buffer drivers
    - LP: #949288

[ Trond Myklebust ]

  * SAUCE: NFSv4: Ensure that the LOCK code sets exception->inode
    - LP: #974664
  * SAUCE: NFSv4: Ensure that we check lock exclusive/shared type against
    open modes
    - LP: #974664

[ Upstream Kernel Changes ]

  * Input: psmouse - allow drivers to use psmouse_{de,}activate
    - LP: #969334
  * Input: psmouse - use psmouse_[de]activate() from sentelic and hgpk
    drivers
    - LP: #969334
  * Input: sentelic - refactor code for upcoming new hardware support
    - LP: #969334
  * Input: sentelic - enabling absolute coordinates output for newer
    hardware
    - LP: #969334
  * Input: sentelic - minor code cleanup
    - LP: #969334
  * Input: sentelic - improve packet debugging information
    - LP: #969334
  * Input: sentelic - filter taps in absolute mode
    - LP: #969334
  * drm/i915: Fixes distorted external screen image on HP 2730p
    - LP: #796030
  * NFSv4: Minor cleanups for nfs4_handle_exception and
    nfs4_async_handle_error
    - LP: #974664
  * NFSv4: Rate limit the state manager for lock reclaim warning messages
    - LP: #974664
  * HID: multitouch: merge quanta driver into hid-multitouch
    - LP: #913164
  * HID: usbhid: add quirk no_get for quanta 3008 devices
    - LP: #913164
-- Leann Ogasawara <email address hidden> Tue, 24 Apr 2012 07:47:49 -0700

This bug was fixed in the package linux - 3.2.0-24.37

---------------
linux (3.2.0-24.37) precise-proposed; urgency=low

[ Herton Ronaldo Krzesinski ]

* d-i: Add hid-logitech-dj to input-modules
    - LP: #975198
  * d-i: Add rtl8187 driver to nic-usb-modules
    - LP: #971719

[ Ian Abbott ]

* SAUCE: staging: comedi: Add module parameters for default buffer size
    - LP: #981234
  * SAUCE: staging: comedi: Add kernel config for default buffer sizes
    - LP: #981234

[ K. Y. Srinivasan ]

* SAUCE: hv_storvsc: Account for in-transit packets in the RESET path
    - LP: #978394

[ Leann Ogasawara ]

* [Config] Set CONFIG_COMEDI_DEFAULT_BUF_[SIZE_KB,MAXSIZE_KB]
    - LP: #981234

[ Luis Henriques ]

* SAUCE: ite-cir: postpone ISR registration
    - LP: #984387

[ Manoj Iyer ]

* SAUCE: Bluetooth: btusb: Add vendor specific ID (0489 e042) for
    BCM20702A0
    - LP: #980965

[ Tim Gardner ]

* Extract firmware module info during getabi
  * [Config] Remove hiq-quanta module references
    - LP: #913164
  * [Config] powerpc-smp: build in ATI and RADEON frame buffer drivers
    - LP: #949288

[ Trond Myklebust ]

* SAUCE: NFSv4: Ensure that the LOCK code sets exception->inode
    - LP: #974664
  * SAUCE: NFSv4: Ensure that we check lock exclusive/shared type against
    open modes
    - LP: #974664

[ Upstream Kernel Changes ]

* Input: psmouse - allow drivers to use psmouse_{de,}activate
    - LP: #969334
  * Input: psmouse - use psmouse_[de]activate() from sentelic and hgpk
    drivers
    - LP: #969334
  * Input: sentelic - refactor code for upcoming new hardware support
    - LP: #969334
  * Input: sentelic - enabling absolute coordinates output for newer
    hardware
    - LP: #969334
  * Input: sentelic - minor code cleanup
    - LP: #969334
  * Input: sentelic - improve packet debugging information
    - LP: #969334
  * Input: sentelic - filter taps in absolute mode
    - LP: #969334
  * drm/i915: Fixes distorted external screen image on HP 2730p
    - LP: #796030
  * NFSv4: Minor cleanups for nfs4_handle_exception and
    nfs4_async_handle_error
    - LP: #974664
  * NFSv4: Rate limit the state manager for lock reclaim warning messages
    - LP: #974664
  * HID: multitouch: merge quanta driver into hid-multitouch
    - LP: #913164
  * HID: usbhid: add quirk no_get for quanta 3008 devices
    - LP: #913164
 -- Leann Ogasawara <leann.ogasawara@canonical.com>   Tue, 24 Apr 2012 07:47:49 -0700

Changed in linux (Ubuntu Precise):
status:	Fix Committed → Fix Released

Revision history for this message

In Red Hat Bugzilla #811138, Ben (ben-redhat-bugs) wrote on 2012-04-27:

#58

(In reply to comment #27)
>
> 3.3.2-6.fc16.x86_64 is all now. These
>
> nfs4_reclaim_open_state: Lock reclaim failed!
>
> messages are filling /var/log/messages at about 1MB/sec. So with
> an average / filesystem of 8 or 16GB, you can estimate the remaining
> uptime after it hits you ...
>
> So can I have this kernel OOPS back please?

It looks like this is avoided in mainline due to a previous commit:

commit 96dcadc2fdd111dca90d559f189a30c65394451a
Author: William Dauchy <email address hidden>
Date: Wed Mar 14 12:32:04 2012 +0100

NFSv4: Rate limit the state manager for lock reclaim warning messages

This should presumably go into stable as well, though it seems to me that that still leaves a bug to fix - either the warning is real and we should try to avoid the problem, or the warning is bogus and should be removed.

Revision history for this message

In Red Hat Bugzilla #811138, Tim (tim-redhat-bugs) wrote on 2012-04-27:

#59

(In reply to comment #28)
> (In reply to comment #27)
> >
> > 3.3.2-6.fc16.x86_64 is all now. These
> >
> > nfs4_reclaim_open_state: Lock reclaim failed!
> >
>
> It looks like this is avoided in mainline due to a previous commit:
>
> commit 96dcadc2fdd111dca90d559f189a30c65394451a
> Author: William Dauchy <email address hidden>
> Date: Wed Mar 14 12:32:04 2012 +0100
>
> NFSv4: Rate limit the state manager for lock reclaim warning messages

Either that or the patch mentioned in comment #21 isn't in yet. If you have stuff hanging on a flock() attempt, you will need that one or one very similar.

Launchpad Janitor (janitor) on 2012-04-27

Changed in linux (Ubuntu):
status:	Fix Committed → Fix Released

Revision history for this message

In Red Hat Bugzilla #811138, Rik (rik-redhat-bugs) wrote on 2012-05-02:

#60

Hi,

I've applied the following patches on top of the 3.3.4 kernel I got from Koji:

https://admin.fedoraproject.org/updates/kernel-3.3.4-1.fc16?_csrf_token=1827f10d3a7a13866a70eb79f6cef4fbc950611d

They seem to resolve this bug. I believe these patches have been queued by Greg KH for the 3.3 stable queue.

I added the following patches:

NFS: put open context on error in nfs_flush_multi
commit 8ccd271f7a3a846ce6f85ead0760d9d12994a611 upstream

nfs: Enclose hostname in brackets when needed in
commit 98a2139f4f4d7b5fcc3a54c7fddbe88612abed20 upstream.

NFSv4: Ensure that we check lock exclusive/shared type against open modes
commit 55725513b5ef9d462aa3e18527658a0362aaae83 upstream

NFSv4: Ensure that the LOCK code sets exception->inode
commit 05ffe24f5290dc095f98fbaf84afe51ef404ccc5 upstream

I believe only the last two are needed to fix this issue.

Please apply these patches to the next Fedora kernel update.

Rik

Revision history for this message

In Red Hat Bugzilla #811138, Josh (josh-redhat-bugs) wrote on 2012-05-02:

#61

(In reply to comment #30)
> Hi,
>
> I've applied the following patches on top of the 3.3.4 kernel I got from Koji:
>
> https://admin.fedoraproject.org/updates/kernel-3.3.4-1.fc16?_csrf_token=1827f10d3a7a13866a70eb79f6cef4fbc950611d
>
> They seem to resolve this bug. I believe these patches have been queued by Greg
> KH for the 3.3 stable queue.
>
> I added the following patches:
>
> NFS: put open context on error in nfs_flush_multi
> commit 8ccd271f7a3a846ce6f85ead0760d9d12994a611 upstream
>
> nfs: Enclose hostname in brackets when needed in
> commit 98a2139f4f4d7b5fcc3a54c7fddbe88612abed20 upstream.
>
> NFSv4: Ensure that we check lock exclusive/shared type against open modes
> commit 55725513b5ef9d462aa3e18527658a0362aaae83 upstream
>
> NFSv4: Ensure that the LOCK code sets exception->inode
> commit 05ffe24f5290dc095f98fbaf84afe51ef404ccc5 upstream
>
> I believe only the last two are needed to fix this issue.
>
> Please apply these patches to the next Fedora kernel update.

Justin put all of those into the F17 kernel last night. We'll pick them up in f16 relatively soon.

Revision history for this message

In Red Hat Bugzilla #811138, Bert (bert-redhat-bugs) wrote on 2012-05-04:

#62

Solved in kernel-3.3.4-3.fc16.

Tnx all!

Revision history for this message

Andrei Terechko (terechko) wrote on 2012-05-10:

#27

extract /var/log/syslog with the bug description Edit (4.1 KiB, text/plain)

I observe the same kernel bug on Ubuntu 11.10 (oneiric) with kernel 3.0.0-19-generic, see the attached syslog. Interestingly, it's not there in the vanilla install of Ubuntu 11.10 with kernel 3.0.0-12-generic (tested by booting an older kernel from grub). This issue manifests itself when I log in using an LDAP user with home on an NFS mount.

Any chance of fixing it on Ubuntu 11.10?

Revision history for this message

Herton R. Krzesinski (herton) wrote on 2012-05-14:

#28

Tagging this bug as verified for Precise. Patch is already released with Precise, but showing up on ti-omap4 SRU report where it isn't a specific ti-omap4 change.

tags:

added: verification-done-precise

Revision history for this message

Adam Conrad (adconrad) wrote on 2012-11-14: Update Released

#29

The verification of this Stable Release Update has completed successfully and the package has now been released to -updates. Subsequently, the Ubuntu Stable Release Updates Team is being unsubscribed and will not receive messages about this bug report. In the event that you encounter a regression using the package from -updates please report a new bug using ubuntu-bug and tag the bug report regression-update so we can easily find any regresssions.

Bug Watch Updater (bug-watch-updater) on 2017-10-28

Changed in linux:
importance:	Unknown → High
status:	Unknown → Fix Released

penalvch (penalvch) on 2017-10-29

Changed in linux:
importance:	High → Undecided
status:	Fix Released → New
status:	New → Invalid

Ubuntu
linux package

Kernel Oops - BUG: unable to handle kernel paging request at ffffffffffffffb8; RIP: 0010:[<ffffffffa05a5839>] [<ffffffffa05a5839>] nfs_have_delegation+0x9/0x40 [nfs]

Bug Description

Related branches

Duplicates of this bug

Other bug subscribers

Bug attachments

Remote bug watches

	Status	Importance	Assigned to
Linux	Invalid	Undecided	Unassigned
linux (Ubuntu)	Fix Released	Medium	Unassigned
Precise	Fix Released	Medium	Unassigned

Ubuntulinux package

Kernel Oops - BUG: unable to handle kernel paging request at ffffffffffffffb8; RIP: 0010:[<ffffffffa05a5839>] [<ffffffffa05a5839>] nfs_have_delegation+0x9/0x40 [nfs]

Bug Description

Related branches

Duplicates of this bug

Other bug subscribers

Bug attachments

Remote bug watches

Ubuntu
linux package