NFS latency spikes during openat syscalls associated with test_stateid RPC command
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
New
|
Undecided
|
Unassigned |
Bug Description
About once a week the NFS mounts on our eda servers (the nfs clients) slow to a crawl for a large number of workloads. strace reveals that openat system calls for files in the mounts take between 0.1 and 0.3 seconds this can be seen directly by touching any file in the homedir (this normally takes 0.001-0.002 seconds). This causes obvious problems, an IPython instance in a conda environment installed in NFS takes around 2-3 minutes to start for example.
I originally thought that this was a manifestation of #2009325, but we are on 5.15.0-89 and have seen this problem across several kernel versions.
Additionally we don't see access rpc calls but we do see loads of test_stateid calls:
- bursts of 300 or so periodically every couple seconds while idle
- 380,452 test_stateid calls when an ipython instance is starting (zero access calls in the same period, when at idle we see 10,267 test_stateid calls over the same ~2:30 time period and again no access calls)
We're not using kerberos, just ip based authentication, our mount options are:
exoserver.
We see this on servers with both linux-image-
---
ProblemType: Bug
ApportVersion: 2.20.11-
Architecture: amd64
AudioDevicesInUse:
USER PID ACCESS COMMAND
/dev/snd/
CasperMD5CheckR
DistroRelease: Ubuntu 20.04
InstallationDate: Installed on 2022-04-05 (618 days ago)
InstallationMedia: Ubuntu 20.04.4 LTS "Focal Fossa" - Release amd64 (20220223)
IwConfig:
lo no wireless extensions.
enp69s0 no wireless extensions.
enp68s0 no wireless extensions.
MachineType: ASUS System Product Name
NonfreeKernelMo
Package: linux (not installed)
ProcEnviron:
TERM=screen
PATH=(custom, no user)
LANG=en_US.UTF-8
SHELL=/bin/zsh
ProcFB: 0 EFI VGA
ProcKernelCmdLine: BOOT_IMAGE=
ProcVersionSign
PulseList: Error: command ['pacmd', 'list'] failed with exit code 1: No PulseAudio daemon running, or not running as session daemon.
RelatedPackageV
linux-
linux-
linux-firmware 1.187.39
RfKill:
Tags: focal
Uname: Linux 5.15.0-89-generic x86_64
UpgradeStatus: No upgrade log present (probably fresh install)
UserGroups: N/A
_MarkForUpload: True
dmi.bios.date: 03/04/2022
dmi.bios.release: 16.3
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 1603
dmi.board.
dmi.board.name: ROG ZENITH II EXTREME
dmi.board.vendor: ASUSTeK COMPUTER INC.
dmi.board.version: Rev 1.xx
dmi.chassis.
dmi.chassis.type: 3
dmi.chassis.vendor: Default string
dmi.chassis.
dmi.modalias: dmi:bvnAmerican
dmi.product.family: To be filled by O.E.M.
dmi.product.name: System Product Name
dmi.product.sku: SKU
dmi.product.
dmi.sys.vendor: ASUS
apport information