NFS4 kills system (no reboot possible)
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
Fix Released
|
Undecided
|
Unassigned |
Bug Description
I have migrated from a NFS3 infrastructure to NFS4 (without kerberos) (to workaround #525154)
Setup:
* ubuntu/lucid amd64
autofs5/lucid uptodate 5.0.4-3.1ubuntu5
linux-
nfs-
* create /srv/nfs4 and export it via NFS4
* have bind mounts from /srv/nfs4 to the traditional mount points of the exported shares
With NFS4 I can't use bind mount with autofs (out of the box).
So I have to access "shared" drives locally also with NFS4.
If I copy a big file (e.g. a CD image) to a share mounted via NFS4 locally after short time the system is blocked.
* LoadAvg grows to infinity
* After some time I see messages about blocked task correlated to nfs or accessing nfs shares on the local sever and all clients accessing this server
* shutdown/reboot will also blocked and not come to an end
To reboot the system I have to issue a hard reboot on the server console
The problem doesn't occur if I:
* copy only smaller files
* copy files from client to the server (e.g. a 160GB hdd image was processed without error)
The problem occurs on both nfs servers.
Hardware:
2 similar boxes with:
* TYAN Thunder K8WE S2895
* 2 Opteron K8 CPUs
* SCSI and SATA hdds
[As a workaround I will create diverted autofs configuration with explicit local binding mounts.]
---
Architecture: amd64
DistroRelease: Ubuntu 10.04
Package: linux (not installed)
ProcEnviron:
LANGUAGE=
PATH=(custom, no user)
LANG=de_DE.UTF-8
SHELL=/bin/bash
LC_PAPER=
ProcVersionSign
Regression: Yes
Reproducible: Yes
Tags: lucid regression-release needs-upstream-
Uname: Linux 2.6.32-24-server x86_64
UserGroups:
affects: | nfs-utils (Ubuntu) → linux (Ubuntu) |
tags: | added: apport-collected |
description: | updated |
Changed in linux (Ubuntu): | |
status: | Incomplete → New |
Hi H.-Dirk,
Please be sure to confirm this issue exists with the latest development release of Ubuntu. ISO CD images are available from http:// cdimage. ubuntu. com/releases/ . If the issue remains, please run the following command from a Terminal (Applications- >Accessories- >Terminal) . It will automatically gather and attach updated debug information to this report.
apport-collect -p linux 578866
Also, if you could test the latest upstream kernel available that would be great. It will allow additional upstream developers to examine the issue. Refer to https:/ /wiki.ubuntu. com/KernelMainl ineBuilds . Once you've tested the upstream kernel, please remove the 'needs- upstream- testing' tag. This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs- upstream- testing' text. Please let us know your results.
Thanks in advance.
[This is an automated message. Apologies if it has reached you inappropriately; please just reply to this message indicating so.]