nfs-ganesha server crashes regularly
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
nfs-ganesha (Ubuntu) |
New
|
Undecided
|
Unassigned | ||
Focal |
New
|
Undecided
|
Unassigned | ||
ntirpc (Ubuntu) |
New
|
Undecided
|
Unassigned | ||
Focal |
New
|
Undecided
|
Unassigned |
Bug Description
nfs-ganesha server crashes regularly.
It doesn't happen all the time or easily reproducible. But when it does crash, the backtrace looks like:
(gdb) bt
#0 atomic_
#1 svc_rqst_
#2 0x00007f620fa76565 in svc_rqst_
#3 svc_rqst_epoll_loop (wpe=0x564dc852
#4 0x00007f620fa816d6 in work_pool_thread (arg=0x7f606400
#5 0x00007f621029a6db in start_thread (arg=0x7f6097cf
#6 0x00007f620fdbb61f in clone () at ../sysdeps/
This was using 3.0.3 on Bionic (available via Ubuntu Cloud Archive packages).
Upstream nfs-ganesha developers suggested that a "number of fixes" related to libntirpc fixed what looks like a race condition.
libntirpc is a submodule used in nfs-ganesha and it's where the problem comes form:
https:/
There were a number of commits that went in since 3.0 [0]. Given the crash isn't reproducible easily, it's not straightforward to find the commits that fixed the issue between 3.0.3 and 3.5 for a potential SRU.
In a user environment where the problem occurred, they were able to test nfs-ganesha 3.5 and confirmed that it didn't crash over several days load test whereas 3.0.3 crashed at least once a day under a similar load/test environment.
[0] https:/
[1] https:/
tags: | added: sts |
description: | updated |
In Ubuntu, nfs-ganesha pulls ntirpc from the libntirpc-dev package, rather than using the version vendored into nfs-ganesha upstream