kernel bug rpc nfs client
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
linux (Ubuntu) | Fix Released | Critical | Unassigned |
Hardy | Fix Released | High | Stefan Bader |
Bug Description
SRU justification:
Impact: With NFS-mounted filesystems that use autobinding, there is a good
chance that cloned clients call rpcb_getport_ and thus trigger a BUG()
statement in the code.
Fix: A cherry-pick from upstream that walks up to the parent client to find out
who "owns" the port. The fix was verified to work.
Testcase: see below
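The idea behind the fix in the SRU justification above can be illustrated with a small user-space sketch. This is not the upstream patch: the struct and function names (`sketch_rpc_clnt`, `sketch_find_port_owner`) are hypothetical, and the sketch assumes that a non-cloned client points to itself as its own parent while each clone points to the client it was cloned from.

```c
#include <stddef.h>

/*
 * Hypothetical, simplified stand-in for an RPC client structure.
 * Assumption: a root (non-cloned) client is its own parent, and a
 * cloned client's parent is the client it was cloned from.
 */
struct sketch_rpc_clnt {
    struct sketch_rpc_clnt *parent;
};

/*
 * Instead of treating a call from a cloned client as a fatal error,
 * walk up the chain of parents until the root client is reached.
 * That root is taken to be the client that "owns" the port.
 */
static struct sketch_rpc_clnt *
sketch_find_port_owner(struct sketch_rpc_clnt *clnt)
{
    while (clnt->parent != clnt)
        clnt = clnt->parent;
    return clnt;
}
```

With this approach a clone of a clone resolves to the same owner as the original client, so the port lookup can proceed on behalf of that owner rather than hitting an assertion.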
---
After viewing a film on an nfs4 share for about 20 minutes:
Apr 5 22:19:46 lena kernel: [10870.179278] ------------[ cut here ]------------
Apr 5 22:19:46 lena kernel: [10870.179287] kernel BUG at /build/
Apr 5 22:19:46 lena kernel: [10870.179292] invalid opcode: 0000 [#1] SMP
Apr 5 22:19:46 lena kernel: [10870.179297] Modules linked in: des_generic cbc af_packet binfmt_misc rfcomm l2cap bluetooth ppdev autofs4 ipv6 acpi_cpufreq cpufreq_userspace cpufreq_stats cpufreq_powersave cpufreq_ondemand freq_table cpufreq_
Apr 5 22:19:46 lena kernel: 394 ieee1394 ehci_hcd uhci_hcd usbcore tg3 thermal processor fan fbcon tileblit font bitblit softcursor fuse
Apr 5 22:19:46 lena kernel: [10870.179436]
Apr 5 22:19:46 lena kernel: [10870.179440] Pid: 3393, comm: mplayer Tainted: P (2.6.24-14-generic #1)
Apr 5 22:19:46 lena kernel: [10870.179445] EIP: 0060:[<f8e7f540>] EFLAGS: 00010206 CPU: 1
Apr 5 22:19:46 lena kernel: [10870.179483] EIP is at rpcb_getport_
Apr 5 22:19:46 lena kernel: [10870.179487] EAX: c2a97c00 EBX: c2a97c00 ECX: c2a97c80 EDX: f8e98b40
Apr 5 22:19:46 lena kernel: [10870.179491] ESI: eb35ef00 EDI: eb35e900 EBP: c2b47000 ESP: c298fcc8
Apr 5 22:19:46 lena kernel: [10870.179495] DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068
Apr 5 22:19:46 lena kernel: [10870.179500] Process mplayer (pid: 3393, ti=c298e000 task=c29685a0 task.ti=c298e000)
Apr 5 22:19:46 lena kernel: [10870.179503] Stack: c299d448 c298fd00 00000000 c298fd00 00000000 c298fd08 c0318059 c2a97c00
Apr 5 22:19:46 lena kernel: [10870.179514] c2a97c80 00000001 f8e770a0 c2a97c80 c03180cb c2a97c00 c2b47000 c298fd80
Apr 5 22:19:46 lena kernel: [10870.179525] c2a97c80 f8e70195 f8e71cc8 c298fd14 c298fd14 00000000 00000000 c2a97c00
Apr 5 22:19:46 lena kernel: [10870.179535] Call Trace:
Apr 5 22:19:46 lena kernel: [10870.179565] [__wait_
Apr 5 22:19:46 lena kernel: [10870.179587] [<f8e770a0>] rpc_wait_
Apr 5 22:19:46 lena kernel: [10870.179633] [nfs:out_
Apr 5 22:19:46 lena kernel: [10870.179656] [<f8e70195>] call_bind+0x45/0x80 [sunrpc]
Apr 5 22:19:46 lena kernel: [10870.179688] [<f8e71cc8>] call_timeout+
Apr 5 22:19:46 lena kernel: [10870.179739] [<f8e77542>] __rpc_execute+
Apr 5 22:19:46 lena kernel: [10870.179787] [<f8e76bf4>] rpc_set_
Apr 5 22:19:46 lena kernel: [10870.179835] [<f8e70f38>] rpc_do_
Apr 5 22:19:46 lena kernel: [10870.179888] [<f8e7100d>] rpc_call_
Apr 5 22:19:46 lena kernel: [10870.179935] [<f8f03c65>] _nfs4_proc_
Apr 5 22:19:46 lena kernel: [10870.179994] [<f8f0515b>] nfs4_proc_
Apr 5 22:19:46 lena kernel: [10870.180049] [<f8ef47c2>] __nfs_revalidat
Apr 5 22:19:46 lena kernel: [10870.180111] [enqueue_
Apr 5 22:19:46 lena kernel: [10870.180126] [enqueue_
Apr 5 22:19:46 lena kernel: [10870.180135] [enqueue_
Apr 5 22:19:46 lena kernel: [10870.180148] [nfs:__
Apr 5 22:19:46 lena kernel: [10870.180156] [find_busiest_
Apr 5 22:19:46 lena kernel: [10870.180165] [balance_
Apr 5 22:19:46 lena kernel: [10870.180199] [<f8ef5653>] nfs_revalidate_
Apr 5 22:19:46 lena kernel: [10870.180247] [<f8ef31b5>] nfs_file_
Apr 5 22:19:46 lena kernel: [10870.180303] [ext3:do_
Apr 5 22:19:46 lena kernel: [10870.180313] [nfs:schedule+
Apr 5 22:19:46 lena kernel: [10870.180341] [<c0141b70>] autoremove_
Apr 5 22:19:46 lena kernel: [10870.180355] [do_nanosleep+
Apr 5 22:19:46 lena kernel: [10870.180367] [hrtimer_
Apr 5 22:19:46 lena kernel: [10870.180384] [hrtimer_
Apr 5 22:19:46 lena kernel: [10870.180397] [<c018df30>] do_sync_
Apr 5 22:19:46 lena kernel: [10870.180405] [vfs_read+
Apr 5 22:19:46 lena kernel: [10870.180419] [sys_read+
Apr 5 22:19:46 lena kernel: [10870.180433] [sysenter_
Apr 5 22:19:46 lena kernel: [10870.180463] =======
Apr 5 22:19:46 lena kernel: [10870.180465] Code: ac 00 00 00 c7 44 24 08 04 4a e8 f8 c7 04 24 30 81 e8 f8 89 44 24 04 e8 2f e9 2a c7 c7 44 24 20 a0 ff ff ff e9 0c fe ff ff 66 90 <0f> 0b eb fe 0f b7 b0 ac 00 00 00 8b 57 24 8b 4f 20 8b 5f 2c 8b
Apr 5 22:19:46 lena kernel: [10870.180521] EIP: [<f8e7f540>] rpcb_getport_
Apr 5 22:19:46 lena kernel: [10870.180571] ---[ end trace f4581af82034a623 ]---
and the video stream stops. This bug is more or less reliably reproducible on my computer.
Changed in linux:
assignee: nobody → stefan-bader-canonical
importance: Undecided → High
status: New → In Progress
Changed in linux:
status: In Progress → Fix Committed
description: updated
tags: added: verification-done, removed: verification-needed
The error didn't occur on 2.6.24-12-generic, but did occur on 2.6.24-14-generic, so I think this is a regression introduced by -13 or -14.