XFS: Corruption of in-memory data detected
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
Expired
|
Low
|
Unassigned |
Bug Description
I've got 2 NFS servers that have been running Ubuntu 12.04 for a number of years. For the last 9 months or so they've been happily running kernel 3.8.0-44 with no problem. As soon as I upgraded them to 3.13.0-53 the both experienced the same problem on an XFS filesystem when it started to have a heavy write load:
[267127.795322] XFS (sda1): Internal error xfs_attr3_
[267127.811202] CPU: 10 PID: 28744 Comm: xfsaild/sda1 Tainted: G I 3.13.0-53-generic #89~precise1-Ubuntu
[267127.811206] Hardware name: IBM System x3690 X5 -[7148AC1]-/Node 1, System Card, BIOS -[MLE170CUS-1.70]- 09/23/2011
[267127.811211] 0000000000000001 ffff881016b7bb38 ffffffff8175a3e4 ffff88103f34f030
[267127.811221] ffff881023d9c000 ffff881016b7bb58 ffffffffa025317f ffffffffa0250ada
[267127.811228] ffffffff00000001 ffff881016b7bb98 ffffffffa02531ee 00000fe000200000
[267127.811236] Call Trace:
[267127.811251] [<ffffffff8175a
[267127.811311] [<ffffffffa0253
[267127.811337] [<ffffffffa0250
[267127.811363] [<ffffffffa0253
[267127.811396] [<ffffffffa0271
[267127.811421] [<ffffffffa0250
[267127.811445] [<ffffffffa0250
[267127.811470] [<ffffffffa0250
[267127.811494] [<ffffffffa0250
[267127.811505] [<ffffffff8109f
[267127.811530] [<ffffffffa0250
[267127.811554] [<ffffffffa0250
[267127.811579] [<ffffffffa0251
[267127.811604] [<ffffffffa0251
[267127.811630] [<ffffffffa0251
[267127.811673] [<ffffffffa02b4
[267127.811716] [<ffffffffa02b5
[267127.811757] [<ffffffffa02b5
[267127.811766] [<ffffffff8108f
[267127.811772] [<ffffffff8108f
[267127.811781] [<ffffffff8176f
[267127.811787] [<ffffffff8108f
[267127.811791] XFS (sda1): Corruption detected. Unmount and run xfs_repair
[267127.818679] XFS (sda1): xfs_do_
[267127.818927] XFS (sda1): xfs_imap_to_bp: xfs_trans_
[267127.820914] XFS (sda1): Corruption of in-memory data detected. Shutting down filesystem
[267127.829131] XFS (sda1): Please umount the filesystem and rectify the problem(s)
I've not been able to reproduce this on demand outside our production workload. Downgrading the machines back to 3.8.0-44 has returned us to stability.
I' having a similar problem with trusty 3.13.0-55:
[601201.817664] ffff88016e03e000: 00 00 00 00 00 00 00 00 fb ee 00 00 00 00 00 00 ................ leaf_write_ verify at line 216 of file /build/ buildd/ linux-3. 13.0/fs/ xfs/xfs_ attr_leaf. c. Caller 0xffffffffa00996f0 294>] dump_stack+ 0x45/0x56 6fb>] xfs_error_ report+ 0x3b/0x40 [xfs] 6f0>] ? _xfs_buf_ ioapply+ 0x70/0x3a0 [xfs] 755>] xfs_corruption_ error+0x55/ 0x80 [xfs] b50>] xfs_attr3_ leaf_write_ verify+ 0x100/0x120 [xfs] 6f0>] ? _xfs_buf_ ioapply+ 0x70/0x3a0 [xfs] 3d5>] ? xfs_bdstrat_ cb+0x55/ 0xb0 [xfs] 6f0>] _xfs_buf_ ioapply+ 0x70/0x3a0 [xfs] bc0>] ? wake_up_ state+0x20/ 0x20 3d5>] ? xfs_bdstrat_ cb+0x55/ 0xb0 [xfs] 336>] xfs_buf_ iorequest+ 0x46/0x90 [xfs] 3d5>] xfs_bdstrat_ cb+0x55/ 0xb0 [xfs] 56b>] __xfs_buf_ delwri_ submit+ 0x13b/0x210 [xfs] 000>] ? xfs_buf_ delwri_ submit_ nowait+ 0x20/0x30 [xfs] aa0>] ? xfs_trans_ ail_cursor_ first+0x90/ 0x90 [xfs] 000>] xfs_buf_ delwri_ submit_ nowait+ 0x20/0x30 [xfs] cd7>] xfsaild+0x237/0x5c0 [xfs] aa0>] ? xfs_trans_ ail_cursor_ first+0x90/ 0x90 [xfs] 702>] kthread+0xd2/0xf0 630>] ? kthread_ create_ on_node+ 0x1c0/0x1c0 ca8>] ret_from_ fork+0x58/ 0x90 630>] ? kthread_ create_ on_node+ 0x1c0/0x1c0 force_shutdown( 0x8) called from line 1320 of file /build/ buildd/ linux-3. 13.0/fs/ xfs/xfs_ buf.c. Return address = 0xffffffffa009971c
[601201.818224] ffff88016e03e010: 10 00 00 00 00 20 0f e0 00 00 00 00 00 00 00 00 ..... ..........
[601201.818827] ffff88016e03e020: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
[601201.819429] ffff88016e03e030: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
[601201.820013] XFS (dm-0): Internal error xfs_attr3_
[601201.820910] CPU: 1 PID: 421 Comm: xfsaild/dm-0 Not tainted 3.13.0-55-generic #92-Ubuntu
[601201.820913] Hardware name: Dell Inc. PowerEdge R310/05XKKK, BIOS 1.8.2 08/17/2011
[601201.820914] 0000000000000001 ffff880035cc3bd0 ffffffff81723294 ffff8802ac71d800
[601201.820918] ffff880035cc3be8 ffffffffa009d6fb ffffffffa00996f0 ffff880035cc3c20
[601201.820919] ffffffffa009d755 000000d800205500 ffff88001aa2a400 ffff880050acc2b8
[601201.820922] Call Trace:
[601201.820930] [<ffffffff81723
[601201.820964] [<ffffffffa009d
[601201.820974] [<ffffffffa0099
[601201.820984] [<ffffffffa009d
[601201.820997] [<ffffffffa00ba
[601201.821007] [<ffffffffa0099
[601201.821016] [<ffffffffa009b
[601201.821026] [<ffffffffa0099
[601201.821030] [<ffffffff8109a
[601201.821040] [<ffffffffa009b
[601201.821050] [<ffffffffa009b
[601201.821060] [<ffffffffa009b
[601201.821070] [<ffffffffa009b
[601201.821081] [<ffffffffa009c
[601201.821100] [<ffffffffa00fa
[601201.821110] [<ffffffffa009c
[601201.821127] [<ffffffffa00fa
[601201.821145] [<ffffffffa00fa
[601201.821148] [<ffffffff8108b
[601201.821150] [<ffffffff8108b
[601201.821153] [<ffffffff81733
[601201.821155] [<ffffffff8108b
[601201.821158] XFS (dm-0): Corruption detected. Unmount and run xfs_repair
[601201.821581] XFS (dm-0): xfs_do_
[601201.832780] XFS (dm-0): Corruption of in-memory data detected. Shutting down filesystem
[601201.833292] XFS (dm-0): Please umount the filesystem and rectify the problem(s)
[601201.843375] ffff880238b00000: 58 44 32 42 01 00 0e c0 00 00 00 00 00 00 00 00 XD2B............