2016-11-28 03:38:13 |
john |
bug |
|
|
added bug |
2016-11-28 03:42:50 |
john |
attachment added |
|
hung_kernel_messages.txt https://bugs.launchpad.net/ubuntu/+source/linux-lts-xenial/+bug/1645187/+attachment/4783929/+files/hung_kernel_messages.txt |
|
2016-11-28 03:44:12 |
john |
description |
Server load becomes high with tasks stuck in D state, leaving no choice but to reboot the system.
Could this be related to the following?
https://bugzilla.kernel.org/show_bug.cgi?id=119841
https://www.redhat.com/archives/dm-devel/2016-June/msg00399.html
https://patchwork.kernel.org/patch/9223697/
https://xen.crc.id.au/bugs/view.php?id=75
ii xen-hypervisor-4.4-amd64 4.4.2-0ubuntu0.14.04.7 amd64 Xen Hypervisor on AMD64
ii linux-image-extra-4.4.0-47-generic 4.4.0-47.68~14.04.1 amd64 Linux kernel extra modules for version 4.4.0 on 64 bit x86 SMP
ii linux-image-4.4.0-47-generic 4.4.0-47.68~14.04.1 amd64 Linux kernel image for version 4.4.0 on 64 bit x86 SMP
kernel messages:
----------------
Nov 28 07:03:31 server1 kernel: [890070.994700] INFO: task blkback.3.xvda2:5756 blocked for more than 120 seconds.
Nov 28 07:03:31 server1 kernel: [890070.994758] Not tainted 4.4.0-47-generic #68~14.04.1-Ubuntu
Nov 28 07:03:31 server1 kernel: [890070.994806] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Nov 28 07:03:31 server1 kernel: [890070.994884] blkback.3.xvda2 D ffff8800b7ff3928 0 5756 2 0x00000000
Nov 28 07:03:31 server1 kernel: [890070.994890] ffff8800b7ff3928 ffffffff81e13500 ffff8800b6a4be80 ffff8800b7ff4000
Nov 28 07:03:31 server1 kernel: [890070.994895] ffff88013a83cc18 ffff88013a83cc00 ffffffff00000000 fffffffe00000001
Nov 28 07:03:31 server1 kernel: [890070.994898] ffff8800b7ff3940 ffffffff817fafc5 ffff8800b6a4be80 ffff8800b7ff39c0
Nov 28 07:03:31 server1 kernel: [890070.994902] Call Trace:
Nov 28 07:03:31 server1 kernel: [890070.994912] [<ffffffff817fafc5>] schedule+0x35/0x80
Nov 28 07:03:31 server1 kernel: [890070.994917] [<ffffffff817fd46a>] rwsem_down_write_failed+0x1da/0x320
Nov 28 07:03:31 server1 kernel: [890070.994923] [<ffffffff81689577>] ? push+0x47/0x50
Nov 28 07:03:31 server1 kernel: [890070.994927] [<ffffffff81689f07>] ? dm_kcopyd_copy+0x147/0x1f0
Nov 28 07:03:31 server1 kernel: [890070.994931] [<ffffffff813e6a53>] call_rwsem_down_write_failed+0x13/0x20
Nov 28 07:03:31 server1 kernel: [890070.994933] [<ffffffff817fcd7d>] ? down_write+0x2d/0x40
Nov 28 07:03:31 server1 kernel: [890070.994939] [<ffffffffc0314dae>] __origin_write+0x6e/0x210 [dm_snapshot]
Nov 28 07:03:31 server1 kernel: [890070.994944] [<ffffffff81185625>] ? mempool_alloc_slab+0x15/0x20
Nov 28 07:03:31 server1 kernel: [890070.994946] [<ffffffff8118574f>] ? mempool_alloc+0x5f/0x150
Nov 28 07:03:31 server1 kernel: [890070.994949] [<ffffffffc0314fb7>] do_origin.isra.14+0x67/0x90 [dm_snapshot]
Nov 28 07:03:31 server1 kernel: [890070.994952] [<ffffffffc0315042>] origin_map+0x62/0x80 [dm_snapshot]
Nov 28 07:03:31 server1 kernel: [890070.994955] [<ffffffff8167f2da>] __map_bio+0x3a/0x110
Nov 28 07:03:31 server1 kernel: [890070.994957] [<ffffffff816809c0>] __split_and_process_bio+0x240/0x3c0
Nov 28 07:03:31 server1 kernel: [890070.994960] [<ffffffff81680baa>] dm_make_request+0x6a/0xd0
Nov 28 07:03:31 server1 kernel: [890070.994964] [<ffffffff813aa221>] generic_make_request+0xe1/0x1a0
Nov 28 07:03:31 server1 kernel: [890070.994968] [<ffffffff813aa357>] submit_bio+0x77/0x150
Nov 28 07:03:31 server1 kernel: [890070.994971] [<ffffffff813a1c71>] ? bio_alloc_bioset+0x181/0x2a0
Nov 28 07:03:31 server1 kernel: [890070.994977] [<ffffffffc02c6a1d>] dispatch_rw_block_io+0x4fd/0x9b0 [xen_blkback]
Nov 28 07:03:31 server1 kernel: [890070.994981] [<ffffffff8101c244>] ? xen_load_sp0+0x84/0x180
Nov 28 07:03:31 server1 kernel: [890070.994985] [<ffffffffc02c70c5>] __do_block_io_op+0x1f5/0x650 [xen_blkback]
Nov 28 07:03:31 server1 kernel: [890070.994990] [<ffffffff810e5e18>] ? del_timer_sync+0x48/0x50
Nov 28 07:03:31 server1 kernel: [890070.994993] [<ffffffff817fd8ab>] ? schedule_timeout+0x16b/0x2d0
Nov 28 07:03:31 server1 kernel: [890070.994997] [<ffffffffc02c7880>] xen_blkif_schedule+0xd0/0x820 [xen_blkback]
Nov 28 07:03:31 server1 kernel: [890070.995002] [<ffffffff810a4e1a>] ? finish_task_switch+0x7a/0x290
Nov 28 07:03:31 server1 kernel: [890070.995004] [<ffffffff817fa969>] ? __schedule+0x359/0x980
Nov 28 07:03:31 server1 kernel: [890070.995010] [<ffffffff810bde70>] ? prepare_to_wait_event+0xf0/0xf0
Nov 28 07:03:31 server1 kernel: [890070.995014] [<ffffffffc02c77b0>] ? xen_blkif_be_int+0x30/0x30 [xen_blkback]
Nov 28 07:03:31 server1 kernel: [890070.995018] [<ffffffff8109ba29>] kthread+0xc9/0xe0
Nov 28 07:03:31 server1 kernel: [890070.995021] [<ffffffff8109b960>] ? kthread_park+0x60/0x60
Nov 28 07:03:31 server1 kernel: [890070.995025] [<ffffffff817febcf>] ret_from_fork+0x3f/0x70
Nov 28 07:03:31 server1 kernel: [890070.995027] [<ffffffff8109b960>] ? kthread_park+0x60/0x60
Nov 28 07:03:31 server1 kernel: [890070.995033] INFO: task kworker/u4:1:8922 blocked for more than 120 seconds.
Nov 28 07:03:31 server1 kernel: [890070.995106] Not tainted 4.4.0-47-generic #68~14.04.1-Ubuntu
Nov 28 07:03:31 server1 kernel: [890070.995148] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Nov 28 07:03:31 server1 kernel: [890070.995200] kworker/u4:1 D ffff8800a7fbb628 0 8922 2 0x00000000
Nov 28 07:03:31 server1 kernel: [890070.995208] Workqueue: writeback wb_workfn (flush-252:12)
Nov 28 07:03:31 server1 kernel: [890070.995210] ffff8800a7fbb628 ffff880139ad3200 ffff8800b7dfcb00 ffff8800a7fbc000
Nov 28 07:03:31 server1 kernel: [890070.995212] ffff88013a83cc18 ffff88013a83cc00 ffffffff00000000 fffffffe00000001
Nov 28 07:03:31 server1 kernel: [890070.995215] ffff8800a7fbb640 ffffffff817fafc5 ffff8800b7dfcb00 ffff8800a7fbb6c0
Nov 28 07:03:31 server1 kernel: [890070.995217] Call Trace:
Nov 28 07:03:31 server1 kernel: [890070.995220] [<ffffffff817fafc5>] schedule+0x35/0x80
Nov 28 07:03:31 server1 kernel: [890070.995223] [<ffffffff817fd46a>] rwsem_down_write_failed+0x1da/0x320
Nov 28 07:03:31 server1 kernel: [890070.995229] [<ffffffff811fad2a>] ? __slab_alloc+0x4d/0x5c
Nov 28 07:03:31 server1 kernel: [890070.995232] [<ffffffff811dc3eb>] ? kmem_cache_alloc+0x1bb/0x200
Nov 28 07:03:31 server1 kernel: [890070.995236] [<ffffffff813e6a53>] call_rwsem_down_write_failed+0x13/0x20
Nov 28 07:03:31 server1 kernel: [890070.995239] [<ffffffff817fcd7d>] ? down_write+0x2d/0x40
Nov 28 07:03:31 server1 kernel: [890070.995244] [<ffffffffc0315d12>] snapshot_map+0x62/0x390 [dm_snapshot]
Nov 28 07:03:31 server1 kernel: [890070.995272] [<ffffffff8167f2da>] __map_bio+0x3a/0x110
Nov 28 07:03:31 server1 kernel: [890070.995275] [<ffffffff816809c0>] __split_and_process_bio+0x240/0x3c0
Nov 28 07:03:31 server1 kernel: [890070.995278] [<ffffffff81680baa>] dm_make_request+0x6a/0xd0
Nov 28 07:03:31 server1 kernel: [890070.995282] [<ffffffff813aa221>] generic_make_request+0xe1/0x1a0
Nov 28 07:03:31 server1 kernel: [890070.995284] [<ffffffff813aa357>] submit_bio+0x77/0x150
Nov 28 07:03:31 server1 kernel: [890070.995286] [<ffffffff813a1c71>] ? bio_alloc_bioset+0x181/0x2a0
Nov 28 07:03:31 server1 kernel: [890070.995289] [<ffffffff812347ef>] submit_bh_wbc+0x12f/0x160
Nov 28 07:03:31 server1 kernel: [890070.995292] [<ffffffff81236615>] __block_write_full_page.constprop.39+0x125/0x360
Nov 28 07:03:31 server1 kernel: [890070.995293] [<ffffffff81236af0>] ? I_BDEV+0x20/0x20
Nov 28 07:03:31 server1 kernel: [890070.995295] [<ffffffff8123692e>] block_write_full_page+0xde/0x100
Nov 28 07:03:31 server1 kernel: [890070.995298] [<ffffffff81237378>] blkdev_writepage+0x18/0x20
Nov 28 07:03:31 server1 kernel: [890070.995300] [<ffffffff8118d253>] __writepage+0x13/0x40
Nov 28 07:03:31 server1 kernel: [890070.995302] [<ffffffff8118e5c1>] write_cache_pages+0x241/0x4c0
Nov 28 07:03:31 server1 kernel: [890070.995304] [<ffffffff8118d240>] ? wb_update_dirty_ratelimit+0x1c0/0x1c0
Nov 28 07:03:31 server1 kernel: [890070.995307] [<ffffffff8118e883>] generic_writepages+0x43/0x60
Nov 28 07:03:31 server1 kernel: [890070.995310] [<ffffffff8118d3ff>] ? __wb_calc_thresh+0x2f/0x120
Nov 28 07:03:31 server1 kernel: [890070.995313] [<ffffffff8118f46e>] do_writepages+0x1e/0x30
Nov 28 07:03:31 server1 kernel: [890070.995316] [<ffffffff8122b215>] __writeback_single_inode+0x45/0x340
Nov 28 07:03:31 server1 kernel: [890070.995319] [<ffffffff8122ba4b>] writeback_sb_inodes+0x26b/0x5c0
Nov 28 07:03:31 server1 kernel: [890070.995322] [<ffffffff8122be26>] __writeback_inodes_wb+0x86/0xc0
Nov 28 07:03:31 server1 kernel: [890070.995325] [<ffffffff8122c0b2>] wb_writeback+0x252/0x2e0
Nov 28 07:03:31 server1 kernel: [890070.995328] [<ffffffff8122c818>] wb_workfn+0x238/0x3d0
Nov 28 07:03:31 server1 kernel: [890070.995332] [<ffffffff810c4e11>] ? __raw_callee_save___pv_queued_spin_unlock+0x11/0x20
Nov 28 07:03:31 server1 kernel: [890070.995337] [<ffffffff81095b40>] process_one_work+0x150/0x3f0
Nov 28 07:03:31 server1 kernel: [890070.995341] [<ffffffff810962ba>] worker_thread+0x11a/0x470
Nov 28 07:03:31 server1 kernel: [890070.995345] [<ffffffff817fa969>] ? __schedule+0x359/0x980
Nov 28 07:03:31 server1 kernel: [890070.995347] [<ffffffff810961a0>] ? rescuer_thread+0x310/0x310
Nov 28 07:03:31 server1 kernel: [890070.995349] [<ffffffff8109ba29>] kthread+0xc9/0xe0
Nov 28 07:03:31 server1 kernel: [890070.995351] [<ffffffff8109b960>] ? kthread_park+0x60/0x60
Nov 28 07:03:31 server1 kernel: [890070.995354] [<ffffffff817febcf>] ret_from_fork+0x3f/0x70
Nov 28 07:03:31 server1 kernel: [890070.995356] [<ffffffff8109b960>] ? kthread_park+0x60/0x60
Nov 28 07:03:31 server1 kernel: [890070.995361] INFO: task fsck.ext4:11082 blocked for more than 120 seconds.
Nov 28 07:03:31 server1 kernel: [890070.995433] Not tainted 4.4.0-47-generic #68~14.04.1-Ubuntu
Nov 28 07:03:31 server1 kernel: [890070.995475] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Nov 28 07:03:31 server1 kernel: [890070.995519] fsck.ext4 D ffff8800a7caf7d8 0 11082 11081 0x00000000
Nov 28 07:03:31 server1 kernel: [890070.995522] ffff8800a7caf7d8 ffff88013b000000 ffff8800b8463200 ffff8800a7cb0000
Nov 28 07:03:31 server1 kernel: [890070.995525] ffff88013a83cc18 ffff88013a83cc00 ffffffff00000000 fffffffe00000001
Nov 28 07:03:31 server1 kernel: [890070.995527] ffff8800a7caf7f0 ffffffff817fafc5 ffff8800b8463200 ffff8800a7caf870
Nov 28 07:03:31 server1 kernel: [890070.995529] Call Trace:
Nov 28 07:03:31 server1 kernel: [890070.995532] [<ffffffff817fafc5>] schedule+0x35/0x80
Nov 28 07:03:31 server1 kernel: [890070.995535] [<ffffffff817fd46a>] rwsem_down_write_failed+0x1da/0x320
Nov 28 07:03:31 server1 kernel: [890070.995537] [<ffffffff813e6a53>] call_rwsem_down_write_failed+0x13/0x20
Nov 28 07:03:31 server1 kernel: [890070.995539] [<ffffffff817fcd7d>] ? down_write+0x2d/0x40
Nov 28 07:03:31 server1 kernel: [890070.995543] [<ffffffffc0315d12>] snapshot_map+0x62/0x390 [dm_snapshot]
Nov 28 07:03:31 server1 kernel: [890070.995545] [<ffffffff8167f2da>] __map_bio+0x3a/0x110
Nov 28 07:03:31 server1 kernel: [890070.995547] [<ffffffff816809c0>] __split_and_process_bio+0x240/0x3c0
Nov 28 07:03:31 server1 kernel: [890070.995550] [<ffffffff81680baa>] dm_make_request+0x6a/0xd0
Nov 28 07:03:31 server1 kernel: [890070.995553] [<ffffffff813aa221>] generic_make_request+0xe1/0x1a0
Nov 28 07:03:31 server1 kernel: [890070.995555] [<ffffffff813aa357>] submit_bio+0x77/0x150
Nov 28 07:03:31 server1 kernel: [890070.995558] [<ffffffff81191245>] ? release_pages+0xc5/0x260
Nov 28 07:03:31 server1 kernel: [890070.995560] [<ffffffff810c4e11>] ? __raw_callee_save___pv_queued_spin_unlock+0x11/0x20
Nov 28 07:03:31 server1 kernel: [890070.995563] [<ffffffff8123d3c9>] do_mpage_readpage+0x2d9/0x6d0
Nov 28 07:03:31 server1 kernel: [890070.995565] [<ffffffff81191ade>] ? lru_cache_add+0xe/0x10
Nov 28 07:03:31 server1 kernel: [890070.995567] [<ffffffff8123d8c3>] mpage_readpages+0x103/0x150
Nov 28 07:03:31 server1 kernel: [890070.995569] [<ffffffff81236af0>] ? I_BDEV+0x20/0x20
Nov 28 07:03:31 server1 kernel: [890070.995571] [<ffffffff81236af0>] ? I_BDEV+0x20/0x20
Nov 28 07:03:31 server1 kernel: [890070.995573] [<ffffffff8123733d>] blkdev_readpages+0x1d/0x20
Nov 28 07:03:31 server1 kernel: [890070.995575] [<ffffffff8118fb94>] __do_page_cache_readahead+0x174/0x200
Nov 28 07:03:31 server1 kernel: [890070.995577] [<ffffffff8118fd55>] ondemand_readahead+0x135/0x260
Nov 28 07:03:31 server1 kernel: [890070.995579] [<ffffffff8168033a>] ? dm_any_congested+0x4a/0x50
Nov 28 07:03:31 server1 kernel: [890070.995581] [<ffffffff8118feec>] page_cache_async_readahead+0x6c/0x70
Nov 28 07:03:31 server1 kernel: [890070.995584] [<ffffffff811846a0>] generic_file_read_iter+0x390/0x5c0
Nov 28 07:03:31 server1 kernel: [890070.995586] [<ffffffff812375e7>] blkdev_read_iter+0x37/0x40
Nov 28 07:03:31 server1 kernel: [890070.995589] [<ffffffff811fdc55>] new_sync_read+0x85/0xb0
Nov 28 07:03:31 server1 kernel: [890070.995591] [<ffffffff811fdca7>] __vfs_read+0x27/0x40
Nov 28 07:03:31 server1 kernel: [890070.995593] [<ffffffff811fe24f>] vfs_read+0x7f/0x130
Nov 28 07:03:31 server1 kernel: [890070.995595] [<ffffffff811ff026>] SyS_read+0x46/0xa0
Nov 28 07:03:31 server1 kernel: [890070.995597] [<ffffffff811fdf27>] ? SyS_lseek+0x87/0xb0
Nov 28 07:03:31 server1 kernel: [890070.995599] [<ffffffff817fe836>] entry_SYSCALL_64_fastpath+0x16/0x75
Nov 28 07:04:29 server4 kernel: [504268.435113] INFO: task blkback.44.xvda:8031 blocked for more than 120 seconds.
Nov 28 07:04:29 server4 kernel: [504268.435173] Not tainted 4.4.0-47-generic #68~14.04.1-Ubuntu
Nov 28 07:04:29 server4 kernel: [504268.435233] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Nov 28 07:04:29 server4 kernel: [504268.435295] blkback.44.xvda D ffff8801709b7928 0 8031 2 0x00000000
Nov 28 07:04:29 server4 kernel: [504268.435334] ffff8801709b7928 ffff88017ccf8dc0 ffff880119c30000 ffff8801709b8000
Nov 28 07:04:29 server4 kernel: [504268.435339] ffff8800047c0c18 ffff8800047c0c00 ffffffff00000000 fffffffe00000001
Nov 28 07:04:29 server4 kernel: [504268.435343] ffff8801709b7940 ffffffff817fafc5 ffff880119c30000 ffff8801709b79c0
Nov 28 07:04:29 server4 kernel: [504268.435347] Call Trace:
Nov 28 07:04:29 server4 kernel: [504268.435358] [<ffffffff817fafc5>] schedule+0x35/0x80
Nov 28 07:04:29 server4 kernel: [504268.435362] [<ffffffff817fd46a>] rwsem_down_write_failed+0x1da/0x320
Nov 28 07:04:29 server4 kernel: [504268.435367] [<ffffffff813e6a53>] call_rwsem_down_write_failed+0x13/0x20
Nov 28 07:04:29 server4 kernel: [504268.435369] [<ffffffff817fcd7d>] ? down_write+0x2d/0x40
Nov 28 07:04:29 server4 kernel: [504268.435377] [<ffffffffc0397dae>] __origin_write+0x6e/0x210 [dm_snapshot]
Nov 28 07:04:29 server4 kernel: [504268.435382] [<ffffffff81185625>] ? mempool_alloc_slab+0x15/0x20
Nov 28 07:04:29 server4 kernel: [504268.435385] [<ffffffff8118574f>] ? mempool_alloc+0x5f/0x150
Nov 28 07:04:29 server4 kernel: [504268.435388] [<ffffffffc0397fb7>] do_origin.isra.14+0x67/0x90 [dm_snapshot]
Nov 28 07:04:29 server4 kernel: [504268.435391] [<ffffffffc0398042>] origin_map+0x62/0x80 [dm_snapshot]
Nov 28 07:04:29 server4 kernel: [504268.435395] [<ffffffff8167f2da>] __map_bio+0x3a/0x110
Nov 28 07:04:29 server4 kernel: [504268.435398] [<ffffffff816809c0>] __split_and_process_bio+0x240/0x3c0
Nov 28 07:04:29 server4 kernel: [504268.435401] [<ffffffff81680baa>] dm_make_request+0x6a/0xd0
Nov 28 07:04:29 server4 kernel: [504268.435406] [<ffffffff813aa221>] generic_make_request+0xe1/0x1a0
Nov 28 07:04:29 server4 kernel: [504268.435409] [<ffffffff813aa357>] submit_bio+0x77/0x150
Nov 28 07:04:29 server4 kernel: [504268.435411] [<ffffffff813a1c71>] ? bio_alloc_bioset+0x181/0x2a0
Nov 28 07:04:29 server4 kernel: [504268.435417] [<ffffffffc03a8a1d>] dispatch_rw_block_io+0x4fd/0x9b0 [xen_blkback]
Nov 28 07:04:29 server4 kernel: [504268.435420] [<ffffffff8101c244>] ? xen_load_sp0+0x84/0x180
Nov 28 07:04:29 server4 kernel: [504268.435423] [<ffffffffc03a90c5>] __do_block_io_op+0x1f5/0x650 [xen_blkback]
Nov 28 07:04:29 server4 kernel: [504268.435427] [<ffffffff810e5e18>] ? del_timer_sync+0x48/0x50
Nov 28 07:04:29 server4 kernel: [504268.435429] [<ffffffff817fd8ab>] ? schedule_timeout+0x16b/0x2d0
Nov 28 07:04:29 server4 kernel: [504268.435432] [<ffffffffc03a9880>] xen_blkif_schedule+0xd0/0x820 [xen_blkback]
Nov 28 07:04:29 server4 kernel: [504268.435436] [<ffffffff810a4e1a>] ? finish_task_switch+0x7a/0x290
Nov 28 07:04:29 server4 kernel: [504268.435438] [<ffffffff817fa969>] ? __schedule+0x359/0x980
Nov 28 07:04:29 server4 kernel: [504268.435443] [<ffffffff810bde70>] ? prepare_to_wait_event+0xf0/0xf0
Nov 28 07:04:29 server4 kernel: [504268.435446] [<ffffffffc03a97b0>] ? xen_blkif_be_int+0x30/0x30 [xen_blkback]
Nov 28 07:04:29 server4 kernel: [504268.435449] [<ffffffff8109ba29>] kthread+0xc9/0xe0
Nov 28 07:04:29 server4 kernel: [504268.435451] [<ffffffff8109b960>] ? kthread_park+0x60/0x60
Nov 28 07:04:29 server4 kernel: [504268.435455] [<ffffffff817febcf>] ret_from_fork+0x3f/0x70
Nov 28 07:04:29 server4 kernel: [504268.435457] [<ffffffff8109b960>] ? kthread_park+0x60/0x60
Nov 28 07:04:29 server4 kernel: [504268.435473] INFO: task kworker/u4:2:10156 blocked for more than 120 seconds.
Nov 28 07:04:29 server4 kernel: [504268.435523] Not tainted 4.4.0-47-generic #68~14.04.1-Ubuntu
Nov 28 07:04:29 server4 kernel: [504268.435568] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Nov 28 07:04:29 server4 kernel: [504268.435617] kworker/u4:2 D ffff880160633628 0 10156 2 0x00000000
Nov 28 07:04:29 server4 kernel: [504268.435625] Workqueue: writeback wb_workfn (flush-252:36)
Nov 28 07:04:29 server4 kernel: [504268.435627] ffff880160633628 ffff880007366040 ffff880076470dc0 ffff880160634000
Nov 28 07:04:29 server4 kernel: [504268.435630] ffff8800047c0c18 ffff8800047c0c00 ffffffff00000000 fffffffe00000001
Nov 28 07:04:29 server4 kernel: [504268.435632] ffff880160633640 ffffffff817fafc5 ffff880076470dc0 ffff8801606336c0
Nov 28 07:04:29 server4 kernel: [504268.435635] Call Trace:
Nov 28 07:04:29 server4 kernel: [504268.435637] [<ffffffff817fafc5>] schedule+0x35/0x80
Nov 28 07:04:29 server4 kernel: [504268.435640] [<ffffffff817fd46a>] rwsem_down_write_failed+0x1da/0x320
Nov 28 07:04:29 server4 kernel: [504268.435642] [<ffffffff813a1a5c>] ? bvec_alloc+0x5c/0xf0
Nov 28 07:04:29 server4 kernel: [504268.435645] [<ffffffff813e6a53>] call_rwsem_down_write_failed+0x13/0x20
Nov 28 07:04:29 server4 kernel: [504268.435647] [<ffffffff817fcd7d>] ? down_write+0x2d/0x40
Nov 28 07:04:29 server4 kernel: [504268.435650] [<ffffffffc0398d12>] snapshot_map+0x62/0x390 [dm_snapshot]
Nov 28 07:04:29 server4 kernel: [504268.435653] [<ffffffff8167f2da>] __map_bio+0x3a/0x110
Nov 28 07:04:29 server4 kernel: [504268.435656] [<ffffffff816809c0>] __split_and_process_bio+0x240/0x3c0
Nov 28 07:04:29 server4 kernel: [504268.435658] [<ffffffff810c4e11>] ? __raw_callee_save___pv_queued_spin_unlock+0x11/0x20
Nov 28 07:04:29 server4 kernel: [504268.435661] [<ffffffff81680baa>] dm_make_request+0x6a/0xd0
Nov 28 07:04:29 server4 kernel: [504268.435664] [<ffffffff813aa221>] generic_make_request+0xe1/0x1a0
Nov 28 07:04:29 server4 kernel: [504268.435667] [<ffffffff813aa357>] submit_bio+0x77/0x150
Nov 28 07:04:29 server4 kernel: [504268.435669] [<ffffffff813a1c71>] ? bio_alloc_bioset+0x181/0x2a0
Nov 28 07:04:29 server4 kernel: [504268.435673] [<ffffffff812347ef>] submit_bh_wbc+0x12f/0x160
Nov 28 07:04:29 server4 kernel: [504268.435675] [<ffffffff81236615>] __block_write_full_page.constprop.39+0x125/0x360
Nov 28 07:04:29 server4 kernel: [504268.435677] [<ffffffff81236af0>] ? I_BDEV+0x20/0x20
Nov 28 07:04:29 server4 kernel: [504268.435679] [<ffffffff8123692e>] block_write_full_page+0xde/0x100
Nov 28 07:04:29 server4 kernel: [504268.435682] [<ffffffff81237378>] blkdev_writepage+0x18/0x20
Nov 28 07:04:29 server4 kernel: [504268.435684] [<ffffffff8118d253>] __writepage+0x13/0x40
Nov 28 07:04:29 server4 kernel: [504268.435687] [<ffffffff8118e5c1>] write_cache_pages+0x241/0x4c0
Nov 28 07:04:29 server4 kernel: [504268.435689] [<ffffffff8118d240>] ? wb_update_dirty_ratelimit+0x1c0/0x1c0
Nov 28 07:04:29 server4 kernel: [504268.435692] [<ffffffff817fe23a>] ? _raw_spin_unlock_irqrestore+0x1a/0x20
Nov 28 07:04:29 server4 kernel: [504268.435701] [<ffffffffc0126226>] ? _base_get_chain_buffer_tracker+0x86/0xd0 [mpt3sas]
Nov 28 07:04:29 server4 kernel: [504268.435703] [<ffffffff8118e883>] generic_writepages+0x43/0x60
Nov 28 07:04:29 server4 kernel: [504268.435706] [<ffffffff8118f46e>] do_writepages+0x1e/0x30
Nov 28 07:04:29 server4 kernel: [504268.435708] [<ffffffff8122b215>] __writeback_single_inode+0x45/0x340
Nov 28 07:04:29 server4 kernel: [504268.435710] [<ffffffff8122ba4b>] writeback_sb_inodes+0x26b/0x5c0
Nov 28 07:04:29 server4 kernel: [504268.435713] [<ffffffff8122be26>] __writeback_inodes_wb+0x86/0xc0
Nov 28 07:04:29 server4 kernel: [504268.435715] [<ffffffff8122c0b2>] wb_writeback+0x252/0x2e0
Nov 28 07:04:29 server4 kernel: [504268.435717] [<ffffffff8122c8a2>] wb_workfn+0x2c2/0x3d0
Nov 28 07:04:29 server4 kernel: [504268.435719] [<ffffffff810c4e11>] ? __raw_callee_save___pv_queued_spin_unlock+0x11/0x20
Nov 28 07:04:29 server4 kernel: [504268.435724] [<ffffffff81095b40>] process_one_work+0x150/0x3f0
Nov 28 07:04:29 server4 kernel: [504268.435727] [<ffffffff810962ba>] worker_thread+0x11a/0x470
Nov 28 07:04:29 server4 kernel: [504268.435729] [<ffffffff817fa969>] ? __schedule+0x359/0x980
Nov 28 07:04:29 server4 kernel: [504268.435732] [<ffffffff810961a0>] ? rescuer_thread+0x310/0x310
Nov 28 07:04:29 server4 kernel: [504268.435734] [<ffffffff8109ba29>] kthread+0xc9/0xe0
Nov 28 07:04:29 server4 kernel: [504268.435736] [<ffffffff8109b960>] ? kthread_park+0x60/0x60
Nov 28 07:04:29 server4 kernel: [504268.435739] [<ffffffff817febcf>] ret_from_fork+0x3f/0x70
Nov 28 07:04:29 server4 kernel: [504268.435741] [<ffffffff8109b960>] ? kthread_park+0x60/0x60
Nov 28 07:04:29 server4 kernel: [504268.435750] INFO: task fsck.ext4:17037 blocked for more than 120 seconds.
Nov 28 07:04:29 server4 kernel: [504268.435797] Not tainted 4.4.0-47-generic #68~14.04.1-Ubuntu
Nov 28 07:04:29 server4 kernel: [504268.435842] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Nov 28 07:04:29 server4 kernel: [504268.435890] fsck.ext4 D ffff88016b6a7888 0 17037 17035 0x00000000
Nov 28 07:04:29 server4 kernel: [504268.435893] ffff88016b6a7888 ffff880004569b80 ffff880010cb1b80 ffff88016b6a8000
Nov 28 07:04:29 server4 kernel: [504268.435895] ffff8800047c0c18 ffff8800047c0c00 ffffffff00000000 fffffffe00000001
Nov 28 07:04:29 server4 kernel: [504268.435898] ffff88016b6a78a0 ffffffff817fafc5 ffff880010cb1b80 ffff88016b6a7918
Nov 28 07:04:29 server4 kernel: [504268.435901] Call Trace:
Nov 28 07:04:29 server4 kernel: [504268.435903] [<ffffffff817fafc5>] schedule+0x35/0x80
Nov 28 07:04:29 server4 kernel: [504268.435905] [<ffffffff817fd46a>] rwsem_down_write_failed+0x1da/0x320
Nov 28 07:04:29 server4 kernel: [504268.435908] [<ffffffff813e6a53>] call_rwsem_down_write_failed+0x13/0x20
Nov 28 07:04:29 server4 kernel: [504268.435910] [<ffffffff817fcd7d>] ? down_write+0x2d/0x40
Nov 28 07:04:29 server4 kernel: [504268.435913] [<ffffffffc0398d12>] snapshot_map+0x62/0x390 [dm_snapshot]
Nov 28 07:04:29 server4 kernel: [504268.435916] [<ffffffff8167f2da>] __map_bio+0x3a/0x110
Nov 28 07:04:29 server4 kernel: [504268.435919] [<ffffffff816809c0>] __split_and_process_bio+0x240/0x3c0
Nov 28 07:04:29 server4 kernel: [504268.435921] [<ffffffff81680baa>] dm_make_request+0x6a/0xd0
Nov 28 07:04:29 server4 kernel: [504268.435924] [<ffffffff813aa221>] generic_make_request+0xe1/0x1a0
Nov 28 07:04:29 server4 kernel: [504268.435927] [<ffffffff813aa357>] submit_bio+0x77/0x150
Nov 28 07:04:29 server4 kernel: [504268.435930] [<ffffffff8123cf6a>] mpage_bio_submit+0x2a/0x40
Nov 28 07:04:29 server4 kernel: [504268.435932] [<ffffffff8123d8f4>] mpage_readpages+0x134/0x150
Nov 28 07:04:29 server4 kernel: [504268.435934] [<ffffffff81236af0>] ? I_BDEV+0x20/0x20
Nov 28 07:04:29 server4 kernel: [504268.435936] [<ffffffff81236af0>] ? I_BDEV+0x20/0x20
Nov 28 07:04:29 server4 kernel: [504268.435938] [<ffffffff8123733d>] blkdev_readpages+0x1d/0x20
Nov 28 07:04:29 server4 kernel: [504268.435940] [<ffffffff8118fb94>] __do_page_cache_readahead+0x174/0x200
Nov 28 07:04:29 server4 kernel: [504268.435943] [<ffffffff8118fd55>] ondemand_readahead+0x135/0x260
Nov 28 07:04:29 server4 kernel: [504268.435945] [<ffffffff8168033a>] ? dm_any_congested+0x4a/0x50
Nov 28 07:04:29 server4 kernel: [504268.435948] [<ffffffff8118feec>] page_cache_async_readahead+0x6c/0x70
Nov 28 07:04:29 server4 kernel: [504268.435951] [<ffffffff811846a0>] generic_file_read_iter+0x390/0x5c0
Nov 28 07:04:29 server4 kernel: [504268.435953] [<ffffffff812375e7>] blkdev_read_iter+0x37/0x40
Nov 28 07:04:29 server4 kernel: [504268.435957] [<ffffffff811fdc55>] new_sync_read+0x85/0xb0
Nov 28 07:04:29 server4 kernel: [504268.435959] [<ffffffff811fdca7>] __vfs_read+0x27/0x40
Nov 28 07:04:29 server4 kernel: [504268.435961] [<ffffffff811fe24f>] vfs_read+0x7f/0x130
Nov 28 07:04:29 server4 kernel: [504268.435964] [<ffffffff811ff026>] SyS_read+0x46/0xa0
Nov 28 07:04:29 server4 kernel: [504268.435966] [<ffffffff811fdf27>] ? SyS_lseek+0x87/0xb0
Nov 28 07:04:29 server4 kernel: [504268.435969] [<ffffffff817fe836>] entry_SYSCALL_64_fastpath+0x16/0x75 |
Server load becomes high with tasks stuck in D state, leaving no choice but to reboot the system.
Could this be related to the following?
https://bugzilla.kernel.org/show_bug.cgi?id=119841
https://www.redhat.com/archives/dm-devel/2016-June/msg00399.html
https://patchwork.kernel.org/patch/9223697/
https://xen.crc.id.au/bugs/view.php?id=75
ii xen-hypervisor-4.4-amd64 4.4.2-0ubuntu0.14.04.7 amd64 Xen Hypervisor on AMD64
ii linux-image-extra-4.4.0-47-generic 4.4.0-47.68~14.04.1 amd64 Linux kernel extra modules for version 4.4.0 on 64 bit x86 SMP
ii linux-image-4.4.0-47-generic 4.4.0-47.68~14.04.1 amd64 Linux kernel image for version 4.4.0 on 64 bit x86 SMP
kernel messages:
----------------
[890070.994700] INFO: task blkback.3.xvda2:5756 blocked for more than 120 seconds.
[890070.994758] Not tainted 4.4.0-47-generic #68~14.04.1-Ubuntu
[890070.994806] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[890070.994884] blkback.3.xvda2 D ffff8800b7ff3928 0 5756 2 0x00000000
[890070.994890] ffff8800b7ff3928 ffffffff81e13500 ffff8800b6a4be80 ffff8800b7ff4000
[890070.994895] ffff88013a83cc18 ffff88013a83cc00 ffffffff00000000 fffffffe00000001
[890070.994898] ffff8800b7ff3940 ffffffff817fafc5 ffff8800b6a4be80 ffff8800b7ff39c0
[890070.994902] Call Trace:
[890070.994912] [<ffffffff817fafc5>] schedule+0x35/0x80
[890070.994917] [<ffffffff817fd46a>] rwsem_down_write_failed+0x1da/0x320
[890070.994923] [<ffffffff81689577>] ? push+0x47/0x50
[890070.994927] [<ffffffff81689f07>] ? dm_kcopyd_copy+0x147/0x1f0
[890070.994931] [<ffffffff813e6a53>] call_rwsem_down_write_failed+0x13/0x20
[890070.994933] [<ffffffff817fcd7d>] ? down_write+0x2d/0x40
[890070.994939] [<ffffffffc0314dae>] __origin_write+0x6e/0x210 [dm_snapshot]
[890070.994944] [<ffffffff81185625>] ? mempool_alloc_slab+0x15/0x20
[890070.994946] [<ffffffff8118574f>] ? mempool_alloc+0x5f/0x150
[890070.994949] [<ffffffffc0314fb7>] do_origin.isra.14+0x67/0x90 [dm_snapshot]
[890070.994952] [<ffffffffc0315042>] origin_map+0x62/0x80 [dm_snapshot]
[890070.994955] [<ffffffff8167f2da>] __map_bio+0x3a/0x110
[890070.994957] [<ffffffff816809c0>] __split_and_process_bio+0x240/0x3c0
[890070.994960] [<ffffffff81680baa>] dm_make_request+0x6a/0xd0
[890070.994964] [<ffffffff813aa221>] generic_make_request+0xe1/0x1a0
[890070.994968] [<ffffffff813aa357>] submit_bio+0x77/0x150
[890070.994971] [<ffffffff813a1c71>] ? bio_alloc_bioset+0x181/0x2a0
[890070.994977] [<ffffffffc02c6a1d>] dispatch_rw_block_io+0x4fd/0x9b0 [xen_blkback]
[890070.994981] [<ffffffff8101c244>] ? xen_load_sp0+0x84/0x180
[890070.994985] [<ffffffffc02c70c5>] __do_block_io_op+0x1f5/0x650 [xen_blkback]
[890070.994990] [<ffffffff810e5e18>] ? del_timer_sync+0x48/0x50
[890070.994993] [<ffffffff817fd8ab>] ? schedule_timeout+0x16b/0x2d0
[890070.994997] [<ffffffffc02c7880>] xen_blkif_schedule+0xd0/0x820 [xen_blkback]
[890070.995002] [<ffffffff810a4e1a>] ? finish_task_switch+0x7a/0x290
[890070.995004] [<ffffffff817fa969>] ? __schedule+0x359/0x980
[890070.995010] [<ffffffff810bde70>] ? prepare_to_wait_event+0xf0/0xf0
[890070.995014] [<ffffffffc02c77b0>] ? xen_blkif_be_int+0x30/0x30 [xen_blkback]
[890070.995018] [<ffffffff8109ba29>] kthread+0xc9/0xe0
[890070.995021] [<ffffffff8109b960>] ? kthread_park+0x60/0x60
[890070.995025] [<ffffffff817febcf>] ret_from_fork+0x3f/0x70
[890070.995027] [<ffffffff8109b960>] ? kthread_park+0x60/0x60 |
|
2016-11-29 01:58:32 |
john |
bug watch added |
|
http://bugzilla.kernel.org/show_bug.cgi?id=119841 |
|
2016-11-29 01:58:32 |
john |
attachment added |
|
fullstack.txt.gz https://bugs.launchpad.net/ubuntu/+source/linux-lts-xenial/+bug/1645187/+attachment/4784464/+files/fullstack.txt.gz |
|
2016-11-29 13:36:12 |
Stefan Bader |
bug |
|
|
added subscriber Stefan Bader |
2016-11-29 17:51:14 |
Joseph Salisbury |
affects |
linux-lts-xenial (Ubuntu) |
linux (Ubuntu) |
|
2016-11-29 17:51:14 |
Joseph Salisbury |
linux (Ubuntu): importance |
Undecided |
Medium |
|
2016-11-29 17:51:14 |
Joseph Salisbury |
linux (Ubuntu): status |
New |
Triaged |
|
2016-11-29 17:51:42 |
Joseph Salisbury |
linux (Ubuntu): importance |
Medium |
High |
|
2016-11-29 17:52:05 |
Joseph Salisbury |
nominated for series |
|
Ubuntu Xenial |
|
2016-11-29 17:52:05 |
Joseph Salisbury |
bug task added |
|
linux (Ubuntu Xenial) |
|
2016-11-29 17:52:13 |
Joseph Salisbury |
linux (Ubuntu Xenial): status |
New |
Triaged |
|
2016-11-29 17:52:16 |
Joseph Salisbury |
linux (Ubuntu Xenial): importance |
Undecided |
High |
|
2016-11-29 17:52:23 |
Joseph Salisbury |
tags |
|
kernel-da-key xenial |
|
2017-03-31 03:59:14 |
john |
cve linked |
|
2017-7184 |
|