Brick SEGFAULTs in 11.1

Bug #2064843 reported by Nick O'Connor
This bug affects 1 person
Affects              Status        Importance   Assigned to     Milestone
glusterfs (Ubuntu)   Status tracked in Oracular
  Noble              In Progress   Undecided    Athos Ribeiro
  Oracular           Triaged       Undecided    Athos Ribeiro

Bug Description

[ Impact ]

 * Users experience brick SEGFAULTs under conditions that are not yet understood. Some reports involve a high proportion of small-file I/O. I encountered the issue roughly every hour with Minio backed by GlusterFS on ZFS.

 * This bug increases the risk of data loss or corruption, depending on the user's configuration and the timing of brick crashes.

 * Core dumps from multiple users revealed that the SEGFAULTs are caused by a stack overflow when namespaced inodes are destroyed.

 * The patch removes the recursive call to inode_unref when a namespaced inode is destroyed.
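
To illustrate the failure mode described above, here is a minimal toy model in C. It is not the GlusterFS code and not the upstream patch: toy_inode_t, toy_inode_unref_recursive, toy_inode_unref_iterative and toy_chain_new are hypothetical names made up for this sketch. It assumes each inode holds one reference on a namespace inode, so destroying an inode drops that reference. If the drop is done by calling unref from inside the destroy path, every link adds a stack frame; a loop does the same work in constant stack space.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Toy stand-in for an inode. This is NOT the real GlusterFS inode_t. */
typedef struct toy_inode {
    int ref;                    /* reference count */
    struct toy_inode *ns_inode; /* namespace inode we hold a ref on, or NULL */
} toy_inode_t;

size_t toy_destroyed = 0; /* number of inodes freed so far (demo only) */

/* Create an inode with one reference owned by the caller.
 * If ns_inode is non-NULL, the new inode takes its own reference on it. */
toy_inode_t *toy_inode_new(toy_inode_t *ns_inode)
{
    toy_inode_t *inode = calloc(1, sizeof(*inode));
    if (!inode)
        return NULL;
    inode->ref = 1;
    inode->ns_inode = ns_inode;
    if (ns_inode)
        ns_inode->ref++;
    return inode;
}

/* Recursive shape: destroying an inode re-enters unref for its namespace
 * inode, so stack usage can grow with the number of inodes destroyed. */
void toy_inode_unref_recursive(toy_inode_t *inode)
{
    if (!inode)
        return;
    assert(inode->ref > 0);
    if (--inode->ref > 0)
        return;
    toy_inode_t *ns = inode->ns_inode;
    free(inode);
    toy_destroyed++;
    toy_inode_unref_recursive(ns);
}

/* Iterative shape: same reference semantics, constant stack usage. */
void toy_inode_unref_iterative(toy_inode_t *inode)
{
    while (inode) {
        assert(inode->ref > 0);
        if (--inode->ref > 0)
            break;
        toy_inode_t *ns = inode->ns_inode;
        free(inode);
        toy_destroyed++;
        inode = ns;
    }
}

/* Build a chain of `depth` inodes where each one is kept alive only by
 * the next. Returns the last inode, which the caller owns. */
toy_inode_t *toy_chain_new(size_t depth)
{
    toy_inode_t *prev = NULL;
    for (size_t i = 0; i < depth; i++) {
        toy_inode_t *cur = toy_inode_new(prev);
        if (!cur)
            break;
        if (prev)
            prev->ref--; /* hand our reference on prev over to cur */
        prev = cur;
    }
    return prev;
}
```

Dropping the last reference on a chain built with toy_chain_new(1000000) through toy_inode_unref_iterative frees every inode without deepening the stack. The recursive variant is only safe for short chains unless the compiler happens to turn the tail call into a loop.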

[ Test Plan ]

 * The triggers for the issue are not well understood. I have compiled the patch against version 11.1-4build3 of glusterfs and it resolved the issue on my system.

[ Where problems could occur ]

 * It's conceivable that this patch introduces undesired behavior when inodes are destroyed. However, I consider this unlikely, as __inode_destroy was not recursive before the change that introduced the bug.

[ Other Info ]

 * PR which introduced the bug: https://github.com/gluster/glusterfs/pull/1763
 * PR which added this patch: https://github.com/gluster/glusterfs/pull/4302
 * Issue discussion: https://github.com/gluster/glusterfs/issues/4295

Tags: server-todo
description: updated
Nick O'Connor (nick-oconnor) wrote :

I've recompiled glusterfs locally with the changes. I can confirm the fix linked above addresses the issue.

description: updated
Athos Ribeiro (athos-ribeiro) wrote :

Thanks, Nick.

I am adding this to the server team backlog so someone can start working on this one soon.

If you are willing to drive this one, please let us know so we can aid you through the SRU process and sponsor uploads on your behalf in case it is needed.

Otherwise, someone in the server team will start checking/driving this one soon.

Changed in glusterfs (Ubuntu Oracular):
status: New → Triaged
Changed in glusterfs (Ubuntu Noble):
status: New → Triaged
tags: added: server-todo
Nick O'Connor (nick-oconnor) wrote :

SGTM. I can drive this. Let me know what needs to be done.

description: updated
Changed in glusterfs (Ubuntu Noble):
assignee: nobody → Nick O'Connor (nick-oconnor)
status: Triaged → In Progress
summary: - Gluster 11.1 brick SEGFAULT
+ Brick SEGFAULTs in 11.1
Nick O'Connor (nick-oconnor) wrote :

This is ready for review/sponsorship.

Changed in glusterfs (Ubuntu Noble):
assignee: Nick O'Connor (nick-oconnor) → nobody
Changed in glusterfs (Ubuntu Oracular):
assignee: nobody → Athos Ribeiro (athos-ribeiro)
Changed in glusterfs (Ubuntu Noble):
assignee: nobody → Athos Ribeiro (athos-ribeiro)