Bug #198125 “suspend and hibernate may cause data corruption bec...” : Bugs : pm-utils package : Ubuntu

sibidiba (sibidiba) wrote on 2008-03-03:

#1

/etc/acpi/suspend.d/99-sync.sh (26 bytes, text/plain)

TerryG (tgalati4) wrote on 2008-03-03:

#2

Thanks for your bug submission. Sorry for your loss. This sounds serious. Marking as Confirmed.

What version of Hardy and what make/model of laptop?

What does the following say when this happened?

dmesg | tail -100

or perhaps

tail -100 /var/log/syslog
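
To narrow either log down to the lines that matter for this kind of report, a small filter can help. This is only a sketch: the function name fs_errors is made up for illustration, and the pattern list is an assumption based on typical ext3 and block-layer messages, not an exhaustive set.

```shell
#!/bin/sh
# fs_errors: hypothetical helper that keeps only filesystem and block I/O
# error lines from a kernel log read on stdin.
# The patterns are assumptions; extend them as needed.
fs_errors() {
    grep -E 'EXT[234]-fs error|Buffer I/O error|lost page write|reset high speed USB device'
}
```

It could be used as "dmesg | fs_errors" or "fs_errors < /var/log/syslog".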

Changed in acpid:
status:	New → Confirmed

sibidiba (sibidiba) wrote on 2008-03-03:

#3

HW: ThinkPad R61i
SW: Hardy, updated daily; until now I have been running kernel 2.6.24-8-generic

I have to apologize, because further examination of the logs revealed that there were I/O errors probably before I first suspended the box:

Mar  3 07:39:58 Kamorka kernel: [38410.059414] usb 2-2: reset high speed USB device using ehci_hcd and address 3
Mar  3 07:40:08 Kamorka kernel: [38412.832572] usb 2-2: reset high speed USB device using ehci_hcd and address 3
Mar  3 07:40:14 Kamorka kernel: [38413.130585] EXT3-fs error (device sdb2): htree_dirblock_to_tree: bad entry in directory #884738: rec_len is smaller than minimal - offset=0, inode=0, rec_len=0, name_len=0
Mar  3 07:40:14 Kamorka kernel: [38413.137189] EXT3-fs error (device sdb2): htree_dirblock_to_tree: bad entry in directory #14123009: rec_len is smaller than minimal - offset=0, inode=0, rec_len=0, name_len=0
Mar  3 07:40:14 Kamorka kernel: [38413.149699] EXT3-fs error (device sdb2): ext3_readdir: bad entry in directory #11: rec_len is smaller than minimal - offset=0, inode=0, rec_len=0, name_len=0
Mar  3 07:40:14 Kamorka kernel: [38413.155155] EXT3-fs error (device sdb2): htree_dirblock_to_tree: bad entry in directory #15040513: rec_len is smaller than minimal - offset=0, inode=0, rec_len=0, name_len=0
Mar  3 07:40:14 Kamorka kernel: [38413.163752] EXT3-fs error (device sdb2): htree_dirblock_to_tree: bad entry in directory #28327937: rec_len % 4 != 0 - offset=0, inode=1919240992, rec_len=25966, name_len=108
Mar  3 07:40:15 Kamorka kernel: [38413.174590] EXT3-fs error (device sdb2): htree_dirblock_to_tree: bad entry in directory #14483457: rec_len % 4 != 0 - offset=0, inode=1684628289, rec_len=28535, name_len=108
Mar  3 07:40:15 Kamorka kernel: [38413.183440] EXT3-fs error (device sdb2): htree_dirblock_to_tree: bad entry in directory #20447233: rec_len is smaller than minimal - offset=0, inode=0, rec_len=0, name_len=0
(...)
Mar  3 07:40:16 Kamorka kernel: [38413.779429] EXT3-fs error (device sdb2): ext3_readdir: bad entry in directory #11: rec_len is smaller than minimal - offset=0, inode=0, rec_len=0, name_len=0

Also, my flatmate said there was a short power outage in the morning, just before I left, but I did not notice it at all.
It is possible that the file-system corruption occurred when the laptop kept running on battery while the external disk shut down.

When first resumed without the disk attached:
Mar  3 09:46:43 Kamorka hald[6155]: forcibly attempting to lazy unmount /dev/sdb2 as enclosing drive was disconnected
Mar  3 09:46:43 Kamorka kernel: [40226.138935] Buffer I/O error on device sdb2, logical block 1545
Mar  3 09:46:43 Kamorka kernel: [40226.138941] lost page write due to I/O error on sdb2 
Mar  3 09:46:43 Kamorka kernel: [40226.138968] WARNING: at /build/buildd/linux-2.6.24/fs/buffer.c:1169 mark_buffer_dirty()
Mar  3 09:46:43 Kamorka kernel: [40226.138972] Pid: 22078, comm: umount Not tainted 2.6.24-8-generic #1
Mar  3 09:46:43 Kamorka kernel: [40226.139006]  [ext3:mark_buffer_dirty+0x7a/0x150] mark_buffer_dirty+0x7a/0x90
Mar  3 09:46:43 Kamorka kernel: [40226.139029]  [<f89ab8e0>] journal_update_superblock+0x70/0xd0 [jbd]
Mar  3 09:46:43 Kamorka kernel: [40226.139051]  [<f89aa446>] cleanup_journal_tail+0x86/0xd0 [jbd]
Mar  3 09:46:43 Kamorka kernel: [40226.139070]  [<f89aa758>] log_do_checkpoint+0x288/0x360 [jbd]
Mar  3 09:46:43 Kamorka kernel: [40226.139103]  [sched_clock+0x13/0x40] sched_clock+0x13/0x40
Mar  3 09:46:43 Kamorka kernel: [40226.139118]  [update_curr+0x103/0x110] update_curr+0x103/0x110
Mar  3 09:46:43 Kamorka kernel: [40226.139136]  [set_next_entity+0x1c/0x50] set_next_entity+0x1c/0x50
Mar  3 09:46:43 Kamorka kernel: [40226.139147]  [pick_next_task_fair+0x2b/0x40] pick_next_task_fair+0x2b/0x40
Mar  3 09:46:43 Kamorka kernel: [40226.139159]  [dm_mod:schedule+0x27e/0x650] schedule+0x27e/0x600
Mar  3 09:46:43 Kamorka kernel: [40226.139176]  [load_balance_start_fair+0x0/0x10] load_balance_start_fair+0x0/0x10
Mar  3 09:46:43 Kamorka kernel: [40226.139203]  [__cond_resched+0x13/0x40] __cond_resched+0x13/0x40
Mar  3 09:46:43 Kamorka kernel: [40226.139207]  [scsi_mod:cond_resched+0x27/0x1a0] cond_resched+0x27/0x30
Mar  3 09:46:43 Kamorka kernel: [40226.139210]  [__reacquire_kernel_lock+0x1c/0x3c] __reacquire_kernel_lock+0x1c/0x3c
Mar  3 09:46:43 Kamorka kernel: [40226.139223]  [dm_mod:schedule+0x51b/0x650] schedule+0x51b/0x600
Mar  3 09:46:43 Kamorka kernel: [40226.139259]  [<f89ac46b>] journal_destroy+0xfb/0x1b0 [jbd]
Mar  3 09:46:43 Kamorka kernel: [40226.139276]  [<c0141bb0>] autoremove_wake_function+0x0/0x40
Mar  3 09:46:43 Kamorka kernel: [40226.139294]  [<f89f44b2>] ext3_put_super+0x22/0x1e0 [ext3]
Mar  3 09:46:43 Kamorka kernel: [40226.139316]  [invalidate_inodes+0xc4/0xd0] invalidate_inodes+0xc4/0xd0
Mar  3 09:46:43 Kamorka kernel: [40226.139334]  [generic_shutdown_super+0x55/0xf0] generic_shutdown_super+0x55/0xf0
Mar  3 09:46:43 Kamorka kernel: [40226.139339]  [fuse:mntput_no_expire+0x3b/0x3ed0] mntput_no_expire+0x3b/0x70
Mar  3 09:46:43 Kamorka kernel: [40226.139351]  [fuse:kill_block_super+0xc/0x20] kill_block_super+0xc/0x20
Mar  3 09:46:43 Kamorka kernel: [40226.139358]  [deactivate_super+0x5d/0x80] deactivate_super+0x5d/0x80
Mar  3 09:46:43 Kamorka kernel: [40226.139368]  [sys_umount+0x46/0x250] sys_umount+0x46/0x250
Mar  3 09:46:43 Kamorka kernel: [40226.139392]  [do_munmap+0x180/0x1f0] do_munmap+0x180/0x1f0
Mar  3 09:46:43 Kamorka kernel: [40226.139422]  [sysenter_past_esp+0x6b/0xa9] sysenter_past_esp+0x6b/0xa9
Mar  3 09:46:43 Kamorka kernel: [40226.139457]  =======================
Mar  3 09:46:43 Kamorka kernel: [40226.139465] Buffer I/O error on device sdb2, logical block 1545 
Mar  3 09:46:43 Kamorka kernel: [40226.139467] lost page write due to I/O error on sdb2 
Mar  3 09:46:43 Kamorka kernel: [40226.139534] Buffer I/O error on device sdb2, logical block 1545
Mar  3 09:46:43 Kamorka kernel: [40226.139537] lost page write due to I/O error on sdb2
Mar  3 09:46:43 Kamorka kernel: [40226.139725] Buffer I/O error on device sdb2, logical block 0
Mar  3 09:46:43 Kamorka kernel: [40226.139727] lost page write due to I/O error on sdb2
(...)

I updated the bug report's description. I think it is still a bug, because the second log shows that the disk was not unmounted before entering suspend.
I also couldn't find any acpid scripts that would do any umount/mount.

Current state is:
Mar  3 18:36:44 Kamorka kernel: [    7.967199] EXT2-fs: corrupt root inode, run e2fsck

The loss is not a big deal; I mostly kept backups there.

description:	updated
Changed in acpid:
status:	Confirmed → New

TerryG (tgalati4) wrote on 2008-03-04:

#4

I assume that a 500 GB drive has a wall-wart for power. Loss of power to the drive with the laptop still running could be problematic. I'm going to plug mine into a spare UPS that I have lying around. You can track the problems by following the timestamps in the syslog file from when you booted to when the problems occurred, and see whether that corresponds to the time of the power outage. Any VCRs or microwave clocks flashing?

Theodore Ts'o (tytso) wrote on 2008-03-04:

#5

Given the I/O errors reported by the user, the filesystem was probably very badly damaged before the power loss event. Normally ext3 recovers from power failures without a hitch. Enabling laptop mode may increase the number of files whose data might be lost, but power failures will not result in the kind of damage reported by the user here and in bug #198131.

Jim Braux-Zin (j-brauxzin) wrote on 2008-03-09:

#6

This bug may be related to bug #108854.

I said there:

Hardy amd64 on a Lenovo 3000 N200 laptop (Core 2 Duo)

My external hard disks aren't switched off either.

What is more problematic to me is that they seem not to be unmounted, so when I unplugged a drive before resume, there still was its icon on the desktop. More problematic, when I plugged it again it would be mounted to a different location ("WD Passport_" instead of "WD Passport" the system thought was already in use), making all my bookmarks nonfunctional until reboot.

Jim Braux-Zin (j-brauxzin) wrote on 2008-04-13:

#7

Please, it's getting worse and worse! The other day I noticed my external hard drive was mounted at "/media/WD Passport____" and there were three empty folders starting with "WD Passport". All my bookmarks are disabled and rhythmbox can't find my music.

Also, I don't understand why there aren't more people complaining about this issue since it requires a manual removal of the empty folders.

DaveAbrahams (boostpro) wrote on 2008-04-17:

#8

This is a serious problem, and it applies to the internal disks as well. I am using JFS on LVM and have been testing suspend-to-RAM lately. Every time it failed, I ended up with really bad disk corruption (often couldn't boot or couldn't "touch /forcefsck").

Puhleeeze fix it. The cost is so low and the benefits so high!

Hendy Irawan (ceefour) wrote on 2008-04-25:

#9

Probably related to bug:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/203537

Does this bug still exist on release Hardy?

If so this is *VERY* dangerous people!!!!!!!!!!!!

sibidiba (sibidiba) wrote on 2008-04-25:

#10

It is still not fixed.

Removable media is mounted asynchronously, and there is no sign of any attempt to umount/remount it upon suspend/resume.

alex941021 (alex941021) wrote on 2008-05-08:

#11

Confirmed on an ACER Ferrari 5000 with Hardy. Suspend does suspend the machine properly; however, upon resume it fails--the screen stays blank and a hard power off is required to reboot. Upon normal reboot I receive a GRUB error 17, indicative of a file system corruption. Even after an fsck, the disk is unrecoverable.

Same problem occurs if non-proprietary video drivers are not installed.

Seppe De Loore (seppe) wrote on 2008-05-09: Re: suspend and hibernate may cause DISK corruption because it doesn't syncs nor umounts external drives previously

#12

Confirmed on an ACER Ferrari 1100 with Hardy 64. Suspend does suspend the machine properly; however, upon resume it fails--the screen stays blank.
After a hard power off the system prompts GRUB error 17, indicating that the file system is corrupt.
As my data are on a separate home partition, not all was lost.

alex941021 (alex941021) wrote on 2008-06-20:

#13

I've found a workaround for this problem: passing "iommu=soft" to the kernel!!! Amazing, everything works!

It has something to do with the AMD iommu module and kernel incompatibilities.
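
Whether the parameter actually reached the running kernel can be checked from /proc/cmdline. A minimal sketch: the function name has_kernel_param is made up for illustration, and the optional second argument exists only so the check can be tried against a sample file.

```shell
#!/bin/sh
# has_kernel_param PARAM [FILE]
# Succeeds if PARAM appears as a whole word on the kernel command line.
# FILE defaults to /proc/cmdline; the override is an assumption for testing.
has_kernel_param() {
    for word in $(cat "${2:-/proc/cmdline}"); do
        [ "$word" = "$1" ] && return 0
    done
    return 1
}
```

For example, "has_kernel_param iommu=soft && echo active" prints "active" only when the option is present on the booted command line.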

Daniel T Chen (crimsun) on 2008-11-30

Changed in acpid:
importance:	Undecided → Low
status:	New → Confirmed

Loïc Minier (lool) wrote on 2009-01-07:

#14

Most comments here allude to the FS not being "sync"-ed; however /usr/lib/pm-utils/bin/pm-action (pm-suspend) in pm-utils does sync. I'm reassigning to pm-utils for now, but I rather suspect that this is a driver/hardware issue as we got a relatively low number of such reports.

These logs also point at drivers/hardware issues rather than userspace issues:
Mar 3 09:46:43 Kamorka kernel: [40226.138935] Buffer I/O error on device sdb2, logical block 1545
Mar 3 09:46:43 Kamorka kernel: [40226.138941] lost page write due to I/O error on sdb2
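
For context, pm-utils runs hook scripts around sleep and passes the action (suspend, hibernate, resume or thaw) as the first argument. An extra sync could be expressed as such a hook. This is a minimal sketch, assuming a hook placed in /etc/pm/sleep.d under a hypothetical name such as 00-extra-sync; since pm-action already syncs, it is unlikely to change the outcome.

```shell
#!/bin/sh
# Hypothetical pm-utils sleep hook; the file name and function name are
# assumptions for illustration. pm-utils passes the action as $1.
sync_hook() {
    case "$1" in
        suspend|hibernate)
            sync    # flush dirty buffers to disk before sleeping
            ;;
        resume|thaw)
            :       # nothing to do on wake-up
            ;;
    esac
    return 0
}

sync_hook "${1:-}"
```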

Steve Lemke (steve-lemkeville) wrote on 2009-03-31:

#15

Any chance this might be related to disk image corruption issues when running Hardy in VMware Fusion?

I have lost numerous VMware Fusion images with what seems to be a similar problem. Typically in the middle of a large project build, everything will come to a grinding halt with (something like) the following error:

[ 3380.587304] EXT3-fs error (device sda1): htree_dirblock_to_tree: bad entry in directory #1777878: rec_len is smaller than minimal - offset=0, inode=0, rec_len=0, name_len=0
[ 3380.587411] Aborting journal on device sda1.
[ 3380.588457] Remounting filesystem read-only
[ 3380.664339] __journal_remove_journal_head: freeing b_committed_data

After rebooting the virtual Hardy machine, the disk is generally unbootable. I use suspend/resume all the time in VMware, but thought this was some strange VMware I/O problem happening during my build. Now I'm wondering if it's a Hardy bug?

Steve (stevenm86) wrote on 2011-10-30:

#16

Seeing this exact problem in Ocelot. The filesystem on the external storage device is extremely upset that the device was ripped out from under it, and wasn't there yet (due to delay in the drive spinning up / enumerating) when the system was resumed. The system needs to delay suspending until all external storage devices are unmounted, and needs to NOT suspend if unmounting fails (due to a program/terminal/etc still being open somewhere pointing to these devices).

The current behavior of suspending regardless of external device state is WRONG and DOES result in data corruption. I don't understand why this is marked as 'Low priority'.
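
The behaviour asked for above could be sketched as a pm-utils sleep hook. This is an illustration only: MOUNTS_FILE and UMOUNT are hypothetical knobs so the logic can be tried without touching real mounts, limiting the check to /media is an assumption, and it relies on pm-utils treating a failing hook as a reason to abort the sleep request, which should be checked against the installed version.

```shell
#!/bin/sh
# Hypothetical hook: unmount everything under /media before sleeping and
# report failure if any unmount fails. All names here are assumptions.
MOUNTS_FILE="${MOUNTS_FILE:-/proc/mounts}"
UMOUNT="${UMOUNT:-umount}"

# Print mount points under /media, one per line.
# Mount points containing spaces appear as \040 in /proc/mounts and are
# not decoded by this sketch.
media_mounts() {
    awk '$2 ~ /^\/media\// { print $2 }' "$MOUNTS_FILE"
}

# Try to unmount each one; stop at the first failure.
unmount_media() {
    for mp in $(media_mounts); do
        "$UMOUNT" "$mp" || return 1
    done
    return 0
}

hook_main() {
    case "$1" in
        suspend|hibernate)
            sync
            unmount_media || return 1   # non-zero is meant to veto the sleep
            ;;
    esac
    return 0
}

hook_main "${1:-}"
```

A real hook would also need to remount the devices on resume, which this sketch leaves out.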

heckheck (jinfo) wrote on 2012-01-28:

#18

I too have experienced disk corruption following suspend to RAM, which I use on my NAS storage server in conjunction with PowerNap. I have observed this working in both Lucid and Natty over the last year. In my experience, this problem is not limited to external drives. I have observed corruption of my boot drive on three different internal SATA controller cards in a Nehalem-class x86 server. These include:

* Highpoint Rocket 620 PCIe add in card running Bios Version 1.1
* LSI MPT SAS controller running in IT mode on a Supermicro S8DA3 motherboard running MPTSAS BIOS v 6.30.00.00 2009_11_12
* Old Promise SATA300 PCI add in card

Note that I am using all 6 of the ICH10 SATA ports for a RAID5 array, so I do not know if this corruption ever occurs using the standard Intel ICH10 SATA ports. Perhaps that is why it is not reported more widely.

The corruption occurred most often when using the LSI SAS controller (about 1 in 5 boots). It occurs much less frequently on the Highpoint Rocket 620 card, but it just happened for the first time yesterday after about 2 months of testing.

I'm sorry I don't have fresh logs to post, but I had to get my system back on-line ASAP. I'll add logs the next time it happens if I can scrape them out of the corrupted filesystem.

This is a very serious problem, and I am baffled that it is marked Low priority. It doesn't get much more grave than when your boot drive gets corrupted every few months due to something not being right in the syncing of disks going into and out of suspend to ram.

If Ubuntu is serious about power management in the upcoming Precise release, this MUST be addressed.

Best Regards,

-Jim Heck

Tomasz 'Zen' Napierala (tzn) wrote on 2012-03-06:

#19

I cannot find how it might be related, but we are seeing massive filesystem corruptions in virtual guests on kvm in Lucid.
The host was running several kernels, from stock Lucid up to 3.0.0-14-server. Guests were booted with several different kernels as well. We also changed the storage backend from qcow to raw and eventually to LVM-based, to no avail.
The usual messages just after going read-only:
[2012-03-01 04:39:06]  EXT4-fs (vda): error count: 10
[2012-03-01 04:39:06]  EXT4-fs (vda): initial error at 1323754623: htree_dirblock_to_tree:586: inode 371080: block 1229922
[2012-03-01 04:39:06]  EXT4-fs (vda): last error at 1329327878: ext4_remount:3754: inode 170313: block 543763
[2012-03-02 04:40:50]  EXT4-fs (vda): error count: 10
[2012-03-02 04:40:50]  EXT4-fs (vda): initial error at 1323754623: htree_dirblock_to_tree:586: inode 371080: block 1229922
[2012-03-02 04:40:50]  EXT4-fs (vda): last error at 1329327878: ext4_remount:3754: inode 170313: block 543763
[2012-03-03 04:42:38]  EXT4-fs (vda): error count: 10
[2012-03-03 04:42:38]  EXT4-fs (vda): initial error at 1323754623: htree_dirblock_to_tree:586: inode 371080: block 1229922
[2012-03-03 04:42:38]  EXT4-fs (vda): last error at 1329327878: ext4_remount:3754: inode 170313: block 543763
[2012-03-04 04:44:25]  EXT4-fs (vda): error count: 10
[2012-03-04 04:44:25]  EXT4-fs (vda): initial error at 1323754623: htree_dirblock_to_tree:586: inode 371080: block 1229922
[2012-03-04 04:44:25]  EXT4-fs (vda): last error at 1329327878: ext4_remount:3754: inode 170313: block 543763
[2012-03-04 20:34:20]  EXT4-fs error (device vda): htree_dirblock_to_tree:587: inode #171186: block 546842: comm chown: bad entry in directory: rec_len is smaller than minimal - offset=0(0), inode=4210740, rec_len=0, name_len=0
[2012-03-04 20:34:20]  Aborting journal on device vda-8.
[2012-03-04 20:34:20]  EXT4-fs (vda): Remounting filesystem read-only
[2012-03-05 04:46:13]  EXT4-fs (vda): error count: 11
[2012-03-05 04:46:13]  EXT4-fs (vda): initial error at 1323754623: htree_dirblock_to_tree:586: inode 371080: block 1229922
[2012-03-05 04:46:13]  EXT4-fs (vda): last error at 1330893259: htree_dirblock_to_tree:587: inode 171186: block 546842

Or

[20768.343508] EXT3-fs error (device vda): htree_dirblock_to_tree: bad entry in directory #837494: rec_len is smaller than minimal - offset=0, inode=4210740, rec_len=0, name_len=0
[20768.348149] Aborting journal on device vda.
[20768.352064] EXT3-fs (vda): error: remounting filesystem read-only
[20768.396397] __journal_remove_journal_head: freeing b_committed_data
[20768.396405] __journal_remove_journal_head: freeing b_committed_data
[20768.396407] __journal_remove_journal_head: freeing b_committed_data
[20769.700102] ------------[ cut here ]------------
[20769.700125] WARNING: at /build/buildd/linux-lts-backport-oneiric-3.0.0/fs/ext3/inode.c:1571 ext3_ordered_writepage+0x223/0x250()
[20769.700127] Hardware name: Bochs
[20769.700128] Modules linked in: nfs lockd fscache auth_rpcgss nfs_acl sunrpc psmouse serio_raw virtio_balloon i2c_piix4 raid10 raid456 async_pq async_xor xor async_memcpy async_raid6_recov floppy raid6_pq async_tx raid1 raid0 multipath linear
[20769.700146] Pid: 2496, comm: flush-253:0 Not tainted 3.0.0-14-server #23~lucid1-Ubuntu
[20769.700147] Call Trace:
[20769.700155]  [<ffffffff81061bcf>] warn_slowpath_common+0x7f/0xc0
[20769.700158]  [<ffffffff81061c2a>] warn_slowpath_null+0x1a/0x20
[20769.700160]  [<ffffffff811eaf33>] ext3_ordered_writepage+0x223/0x250
[20769.700166]  [<ffffffff81118cf7>] __writepage+0x17/0x40
[20769.700169]  [<ffffffff8111a001>] write_cache_pages+0x241/0x4d0
[20769.700171]  [<ffffffff81118ce0>] ? set_page_dirty+0x70/0x70
[20769.700173]  [<ffffffff8111a2e1>] generic_writepages+0x51/0x80
[20769.700176]  [<ffffffff8111a345>] do_writepages+0x35/0x40
[20769.700180]  [<ffffffff8119659e>] writeback_single_inode+0x10e/0x280
[20769.700183]  [<ffffffff81196af3>] writeback_sb_inodes+0xe3/0x1b0
[20769.700185]  [<ffffffff81196d64>] writeback_inodes_wb+0xa4/0x170
[20769.700187]  [<ffffffff81197943>] wb_writeback+0x2f3/0x430
[20769.700191]  [<ffffffff815fbf8f>] ? _raw_spin_lock_irqsave+0x2f/0x40
[20769.700194]  [<ffffffff81197c9f>] wb_do_writeback+0x21f/0x270
[20769.700196]  [<ffffffff81197d9a>] bdi_writeback_thread+0xaa/0x270
[20769.700199]  [<ffffffff81197cf0>] ? wb_do_writeback+0x270/0x270
[20769.700203]  [<ffffffff810843c6>] kthread+0x96/0xa0
[20769.700206]  [<ffffffff816053a4>] kernel_thread_helper+0x4/0x10
[20769.700208]  [<ffffffff81084330>] ? kthread_worker_fn+0x190/0x190
[20769.700211]  [<ffffffff816053a0>] ? gs_change+0x13/0x13
[20769.700212] ---[ end trace 787438b409e1b580 ]---

[2012-02-23 10:01:54] plackup[1101]: segfault at 404034 ip 0000000000404034 sp 00007fff61aad508 error 14 in perl[601000+1000]
[2012-02-23 10:03:57] plackup[1092]: segfault at 404034 ip 0000000000404034 sp 00007fff61aad458 error 14 in perl[601000+1000]
[2012-02-23 10:09:53] EXT4-fs error (device vda): htree_dirblock_to_tree:587: inode #135110: block 533749: comm grep: bad entry in directory: rec_len is smaller than minimal - offset=0(0), inode=4210740, rec_len=0, name_len=0
[2012-02-23 10:09:53] Aborting journal on device vda-8.
[2012-02-23 10:09:53] EXT4-fs (vda): Remounting filesystem read-only
[2012-02-23 10:11:41] EXT4-fs error (device vda): htree_dirblock_to_tree:587: inode #135110: block 533749: comm grep: bad entry in directory: rec_len is smaller than minimal - offset=0(0), inode=4210740, rec_len=0, name_len=0
[2012-02-23 10:12:15] EXT4-fs error (device vda): htree_dirblock_to_tree:587: inode #135110: block 533749: comm grep: bad entry in directory: rec_len is smaller than minimal - offset=0(0), inode=4210740, rec_len=0, name_len=0
[2012-02-23 10:13:15] EXT4-fs error (device vda): htree_dirblock_to_tree:587: inode #135110: block 533749: comm grep: bad entry in directory: rec_len is smaller than minimal - offset=0(0), inode=4210740, rec_len=0, name_len=0

Interestingly, inode 4210740 is referenced in all of the crashes (although I don't know why).
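
One observation, offered without any claim about the cause: 4210740 written in hexadecimal is 404034, the same value as the fault address in the plackup segfault lines above. The "initial error at" and "last error at" numbers in the EXT4-fs lines are Unix epoch seconds and can be converted the same way. A small sketch; the function names are made up for illustration, and epoch_to_utc assumes GNU date.

```shell
#!/bin/sh
# to_hex: print a decimal number in hexadecimal (hypothetical helper).
to_hex() {
    printf '%x\n' "$1"
}

# epoch_to_utc: print epoch seconds as a UTC timestamp.
# Assumes GNU date, which accepts -d @SECONDS.
epoch_to_utc() {
    date -u -d "@$1" '+%Y-%m-%d %H:%M:%S'
}

to_hex 4210740            # inode value from the EXT3/EXT4 error lines
epoch_to_utc 1323754623   # "initial error at" value from the EXT4 lines
```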

Affects		Status	Importance	Assigned to	Milestone
	pm-utils (Ubuntu)	Confirmed	Low	Unassigned
Nominated for Hardy by Alain Baeckeroot

suspend and hibernate may cause data corruption because it doesn't syncs nor umounts external drives previously
