nfs4 client hangs on LUCID
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
nfs4-acl-tools (Ubuntu) | New | Undecided | Unassigned | | |
Bug Description
We observe client hangs with NFSv4 in the following situations:
- a network outage
- a server outage
- an expiration of a Kerberos ticket.
The server is an SL 5.5 machine with the latest kernel.
We often see the following errors in /var/log/messages on the client:
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181453] chromium-brow D 0000000000000000 0 31737 31716 0x00000000
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181457] ffff88015b5bb8e8 0000000000000086 0000000000015bc0 0000000000015bc0
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181460] ffff880329cb5f38 ffff88015b5bbfd8 0000000000015bc0 ffff880329cb5b80
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181463] 0000000000015bc0 ffff88015b5bbfd8 0000000000015bc0 ffff880329cb5f38
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181466] Call Trace:
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181485] [<ffffffffa0d01
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181491] [<ffffffff81541
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181499] [<ffffffffa0d01
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181502] [<ffffffff81542
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181510] [<ffffffffa0d01
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181513] [<ffffffff81542
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181517] [<ffffffff81084
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181524] [<ffffffffa0d01
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181534] [<ffffffffa0d07
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181543] [<ffffffffa0d07
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181551] [<ffffffffa0d07
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181556] [<ffffffff8113b
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181563] [<ffffffffa0cf6
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181567] [<ffffffff810f3
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181570] [<ffffffff810f4
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181573] [<ffffffff810f5
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181576] [<ffffffff810f5
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181582] [<ffffffffa0cf6
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181585] [<ffffffff81142
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181589] [<ffffffff8117c
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181592] [<ffffffff81084
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181595] [<ffffffff8117c
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181600] [<ffffffff81252
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181602] [<ffffffff81143
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181605] [<ffffffff81143
Dec 2 14:36:19 th-ws-i706 kernel: [707908.181609] [<ffffffff81012
After that error message appears, the affected processes become unkillable, the client sends a packet storm to the server, and apport hangs as well, so I had to report this manually.
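For triage, the task header line that precedes each call trace can be parsed to recover the blocked process's name, state, and PID. The following is only a minimal sketch: it assumes the header layout seen in the log above (command, state, address, free stack, PID, parent PID, flags), and the names `TASK_HEADER` and `parse_task_header` are illustrative, not part of any existing tool.

```python
import re

# Matches the task header that precedes a call trace, e.g.
# "... kernel: [707908.181453] chromium-brow D 0000000000000000 0 31737 31716 0x00000000"
# Assumed field order: comm, state, address, free stack, pid, ppid, flags.
TASK_HEADER = re.compile(
    r"kernel: \[\s*(?P<uptime>\d+\.\d+)\]\s+"
    r"(?P<comm>\S.*?)\s+(?P<state>[A-Z])\s+"
    r"(?P<addr>[0-9a-f]{8,16})\s+(?P<free>\d+)\s+"
    r"(?P<pid>\d+)\s+(?P<ppid>\d+)\s+0x(?P<flags>[0-9a-f]+)\s*$"
)


def parse_task_header(line):
    """Return a dict describing the task, or None if the line is not a task header."""
    match = TASK_HEADER.search(line)
    if match is None:
        return None
    return {
        "comm": match.group("comm"),      # truncated command name as logged
        "state": match.group("state"),    # "D" means uninterruptible sleep
        "pid": int(match.group("pid")),
        "ppid": int(match.group("ppid")),
    }


if __name__ == "__main__":
    import sys

    # Print every task logged in state "D" from a syslog file given as argv[1].
    with open(sys.argv[1], errors="replace") as log:
        for raw in log:
            task = parse_task_header(raw.rstrip("\n"))
            if task and task["state"] == "D":
                print(task["pid"], task["comm"])
```

Run against /var/log/messages, this would list the PIDs of tasks that the kernel logged in state D, which helps to tell whether every hung process is blocked on the NFS mount.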
If you need any further information, please ask. The problem is really urgent for us.
affects: | ubuntu → nfs4-acl-tools (Ubuntu) |
Hi, why do you think this affects nfs4-acl-tools? I think it is a problem deep inside the kernel.