Brick SEGFAULTs in 11.1
Affects | Status | Importance | Assigned to | Milestone | ||
---|---|---|---|---|---|---|
glusterfs (Ubuntu) | Status tracked in Oracular | |||||
Noble |
In Progress
|
Undecided
|
Athos Ribeiro | |||
Oracular |
Triaged
|
Undecided
|
Athos Ribeiro |
Bug Description
[ Impact ]
* Users experience brick SEGFAULTs under certain not-yet-understood scenarios. Some reports include a high percentage of small file I/O. I encountered the issue roughly every hour with Minio backed by GlusterFS on ZFS.
* This bug introduces an increased risk of data loss or corruption depending on the user's configuration and timing of brick crashes.
* Core dumps from multiple users revealed that the SEGFAULTs are caused by a stack overflow when namespaced inodes are destroyed.
* The patch removes the recursive call to inode_unref when a namespaced inode is destroyed.
[ Test Plan ]
* The triggers for the issue are not well understood. I have compiled the patch against version 11.1-4build3 of glusterfs and it resolved the issue on my system.
[ Where problems could occur ]
* It's conceivable that this patch introduces undesired behavior when inodes are destroyed, however I highly doubt this scenario as __inode_destroy was not recursive before the change which introduced the bug.
[ Other Info ]
* PR which introduced the bug: https:/
* PR which added this patch: https:/
* Issue discussion: https:/
description: | updated |
description: | updated |
description: | updated |
description: | updated |
description: | updated |
description: | updated |
description: | updated |
Changed in glusterfs (Ubuntu Noble): | |
assignee: | nobody → Nick O'Connor (nick-oconnor) |
status: | Triaged → In Progress |
summary: |
- Gluster 11.1 brick SEGFAULT + Brick SEGFAULTs in 11.1 |
Changed in glusterfs (Ubuntu Noble): | |
assignee: | Nick O'Connor (nick-oconnor) → nobody |
Changed in glusterfs (Ubuntu Oracular): | |
assignee: | nobody → Athos Ribeiro (athos-ribeiro) |
Changed in glusterfs (Ubuntu Noble): | |
assignee: | nobody → Athos Ribeiro (athos-ribeiro) |
I've recompiled glusterfs locally with the changes. I can confirm the fix linked above addresses the issue.