ZFS Pool hangs for more than 120 seconds under high I/O load
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
zfs-linux (Ubuntu) |
Confirmed
|
Undecided
|
Unassigned |
Bug Description
I see the issue happening when:
1.- deleting a FS --> sudo zfs destroy -r data/VMs
2.- then renaming a new FS as the (old) deleted --> sudo zfs rename data/VMsNew data/VMs
3.- removing data in other FS in the same zpool --> rm -rf * data/BackupsNew/*
4.- Also I've seen this messages when running a rsync operation between two FS in the same pool.
[Dec 5 13:42] INFO: task txg_sync:5468 blocked for more than 120 seconds.
[ +0,000016] Tainted: P O 5.15.0-56-generic #62-Ubuntu
[ +0,000006] "echo 0 > /proc/sys/
[ +0,000004] task:txg_sync state:D stack: 0 pid: 5468 ppid: 2 flags:0x00004000
[ +0,000013] Call Trace:
[ +0,000005] <TASK>
[ +0,000007] __schedule+
[ +0,000016] schedule+0x69/0x110
[ +0,000009] schedule_
[ +0,000011] ? __bpf_trace_
[ +0,000011] io_schedule_
[ +0,000011] __cv_timedwait_
[ +0,000031] ? wait_woken+
[ +0,000011] __cv_timedwait_
[ +0,000027] zio_wait+
[ +0,000477] ? spa_sync_
[ +0,000404] spa_sync_
[ +0,000396] spa_sync_
[ +0,000399] spa_sync+
[ +0,000393] txg_sync_
[ +0,000418] ? txg_dispatch_
[ +0,000416] thread_
[ +0,000035] ? __thread_
[ +0,000034] kthread+0x127/0x150
[ +0,000011] ? set_kthread_
[ +0,000012] ret_from_
[ +0,000017] </TASK>
[Dec 5 13:44] INFO: task rsync:165214 blocked for more than 120 seconds.
[ +0,000015] Tainted: P O 5.15.0-56-generic #62-Ubuntu
[ +0,000006] "echo 0 > /proc/sys/
[ +0,000004] task:rsync state:D stack: 0 pid:165214 ppid: 1 flags:0x00000006
[ +0,000013] Call Trace:
[ +0,000004] <TASK>
[ +0,000007] __schedule+
[ +0,000014] ? kmem_cache_
[ +0,000015] schedule+0x69/0x110
[ +0,000012] cv_wait_
[ +0,000029] ? wait_woken+
[ +0,000012] __cv_wait+0x15/0x20 [spl]
[ +0,000027] dmu_tx_
[ +0,000347] dmu_tx_
[ +0,000322] zfs_write+
[ +0,000465] ? unix_stream_
[ +0,000010] ? generic_
[ +0,000011] zpl_iter_
[ +0,000443] new_sync_
[ +0,000011] vfs_write+
[ +0,000008] ksys_write+
[ +0,000008] __x64_sys_
[ +0,000008] do_syscall_
[ +0,000009] ? syscall_
[ +0,000010] ? __x64_sys_
[ +0,000007] ? do_syscall_
[ +0,000007] ? do_syscall_
[ +0,000006] ? do_syscall_
[ +0,000008] entry_SYSCALL_
[ +0,000012] RIP: 0033:0x7f6b39c5fa37
[ +0,000008] RSP: 002b:00007fff8f
[ +0,000009] RAX: ffffffffffffffda RBX: 00007f6b390f2010 RCX: 00007f6b39c5fa37
[ +0,000006] RDX: 0000000000040000 RSI: 00007f6b390f2010 RDI: 0000000000000001
[ +0,000005] RBP: 0000000000000001 R08: 00007f6b3912a010 R09: 0000000000000010
[ +0,000005] R10: 000000000000000f R11: 0000000000000246 R12: 0000000000040000
[ +0,000005] R13: 0000000000008000 R14: 0000000000008000 R15: 0000557bb34da730
[ +0,000010] </TASK>
[Dec 6 01:45] INFO: task txg_sync:5468 blocked for more than 120 seconds.
[ +0,000016] Tainted: P O 5.15.0-56-generic #62-Ubuntu
[ +0,000006] "echo 0 > /proc/sys/
[ +0,000004] task:txg_sync state:D stack: 0 pid: 5468 ppid: 2 flags:0x00004000
[ +0,000013] Call Trace:
[ +0,000004] <TASK>
[ +0,000007] __schedule+
[ +0,000016] schedule+0x69/0x110
[ +0,000009] schedule_
[ +0,000011] ? __bpf_trace_
[ +0,000011] io_schedule_
[ +0,000012] __cv_timedwait_
[ +0,000030] ? wait_woken+
[ +0,000012] __cv_timedwait_
[ +0,000027] zio_wait+
[ +0,000477] ? spa_sync_
[ +0,000405] spa_sync_
[ +0,000399] spa_sync_
[ +0,000397] spa_sync+
[ +0,000393] txg_sync_
[ +0,000417] ? txg_dispatch_
[ +0,000417] thread_
[ +0,000036] ? __thread_
[ +0,000032] kthread+0x127/0x150
[ +0,000012] ? set_kthread_
[ +0,000011] ret_from_
[ +0,000017] </TASK>
[Dec 6 01:47] INFO: task txg_sync:5468 blocked for more than 241 seconds.
[ +0,000015] Tainted: P O 5.15.0-56-generic #62-Ubuntu
[ +0,000006] "echo 0 > /proc/sys/
[ +0,000005] task:txg_sync state:D stack: 0 pid: 5468 ppid: 2 flags:0x00004000
[ +0,000012] Call Trace:
[ +0,000007] <TASK>
[ +0,000007] __schedule+
[ +0,000017] schedule+0x69/0x110
[ +0,000008] schedule_
[ +0,000012] ? __bpf_trace_
[ +0,000010] io_schedule_
[ +0,000012] __cv_timedwait_
[ +0,000031] ? wait_woken+
[ +0,000011] __cv_timedwait_
[ +0,000028] zio_wait+
[ +0,000508] ? spa_sync_
[ +0,000419] spa_sync_
[ +0,000398] spa_sync_
[ +0,000398] spa_sync+
[ +0,000391] txg_sync_
[ +0,000418] ? txg_dispatch_
[ +0,000417] thread_
[ +0,000036] ? __thread_
[ +0,000033] kthread+0x127/0x150
[ +0,000012] ? set_kthread_
[ +0,000011] ret_from_
[ +0,000017] </TASK>
OS Info: RELEASE= 22.04 CODENAME= jammy DESCRIPTION= "Ubuntu Jammy Jellyfish"
DISTRIB_ID=Ubuntu
DISTRIB_
DISTRIB_
DISTRIB_