Bug #509180 “ecryptfs sometimes seems to add trailing garbage to...” : Bugs : eCryptfs

Revision history for this message

Dustin Kirkland  (kirkland) wrote on 2010-01-18:

#1

Tyler-

You've chased down a few of these git-and-bzr-on-ecryptfs issues before... Any chance you can take a look at this?

Changed in ecryptfs:
importance:	Undecided → High
assignee:	nobody → Dustin Kirkland (kirkland)
assignee:	Dustin Kirkland (kirkland) → Tyler Hicks (tyhicks)

deja_vu (deja-vu) wrote on 2010-01-18:

#2

I've also run into this a few times (and again just now). The files in question were padded up to 12K, and the padding also contained original file data (plus zeros).

I'm using Debian testing with a 2.6.32.3 kernel, also on an ext4 partition.

Dustin Kirkland  (kirkland) wrote on 2010-01-18: Re: [Bug 509180] Re: ecryptfs sometimes seems to add trailing garbage to encrypted files

#3

Oh, wait ... are we talking about encrypted files?

All encrypted files are padded by ecryptfs. That's by design.

Are you seeing any bad data in your cleartext?

deja_vu (deja-vu) wrote on 2010-01-19:

#4

The cleartext is affected, which in my case Unison catches when I try to synchronise two computers.

I haven't tried remounting the partition, but copying the lower file somewhere else allows me to recover the original file.

Dustin Kirkland  (kirkland) wrote on 2010-01-19:

#5

Okay, then I am interested in Tyler's take.

Tyler Hicks (tyhicks) wrote on 2010-01-22:

#6

Erik and deja_vu - next time you see this, please do the following and report your findings:

1.) Run `stat` on the decrypted file and paste the results.
2.) Run `stat` on the encrypted file and paste the results.
3.) Run `hexdump -Cn 8` against the encrypted file and paste the results.
4.) Remount the eCryptfs mount.
5.) Repeat steps 1 through 3.
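
The before/after comparison in the steps above can also be scripted. The following is only a sketch: the helper names `snapshot` and `changed_fields` are made up for illustration, and it records roughly what `stat` and `hexdump -Cn 8` would show rather than reproducing their output format.

```python
import os


def snapshot(path, header_bytes=8):
    """Record the stat fields of interest plus the first bytes of the file."""
    st = os.stat(path)
    with open(path, "rb") as f:
        head = f.read(header_bytes)
    return {
        "size": st.st_size,
        "blocks": st.st_blocks,
        "inode": st.st_ino,
        "mtime_ns": st.st_mtime_ns,
        # Space-separated hex, similar to the byte column of `hexdump -C`.
        "head_hex": head.hex(" "),
    }


def changed_fields(before, after):
    """Return {field: (old, new)} for every field that differs."""
    return {k: (before[k], after[k]) for k in before if before[k] != after[k]}
```

Taking one snapshot of the decrypted file and one of the encrypted file before the remount, and again afterwards, then passing each pair to `changed_fields`, should show at a glance which values the remount altered.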

Unfortunately, if you're using encrypted filenames, it is going to be difficult to work out which encrypted file corresponds to the decrypted one in order to run stat and hexdump against it.

Also, if you don't mind running a bleeding edge kernel, I rewrote the previously buggy truncate path and that patch (http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=5f3ef64f4da1c587cdcfaaac72311225b7df094c) was released in 2.6.33-rc5. If I had to put my money on it, I'd guess that this is happening after truncating the eCryptfs inode.

I ran the git test suite in an eCryptfs mount several times and it only complained about the same tests that failed on plain ext4.

Dustin Kirkland  (kirkland) wrote on 2010-01-27:

#7

If you're trying to map encrypted -> decrypted filenames, I use this nasty little hack...

Chmod the file to a really odd permission, like "chmod 123 foo".

Then use find to locate your oddly permissioned file:
find . -perm 123

Nasty, yes, but it works quite well.

:-Dustin
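
For anyone who prefers to script the search, here is a rough Python equivalent of `find . -perm 123`. It is a sketch only; `find_by_mode` is a name invented for this example, and it assumes (as the hack above does) that the odd permission set on the decrypted file shows up on the matching encrypted file.

```python
import os
import stat


def find_by_mode(root, mode=0o123):
    """Yield paths under root whose permission bits equal mode exactly."""
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            path = os.path.join(dirpath, name)
            # lstat so that symlinks are inspected themselves, not followed.
            if stat.S_IMODE(os.lstat(path).st_mode) == mode:
                yield path
```

Run it against the lower (encrypted) directory after the chmod, and remember to restore the original permissions afterwards.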

Erik Carstensen (sandberg) wrote on 2010-02-05:

#8

I got the error again today; here's the output of the commands you suggested:

Before remount:

$ stat decrypted
  File: `/home/sandberg/4.2/.git/objects/49/776103dc27a460c5210dc388c98f11658a272c'
  Size: 12288     	Blocks: 24         IO Block: 4096   regular file
Device: 19h/25d	Inode: 474495      Links: 1
Access: (0444/-r--r--r--)  Uid: ( 7654/sandberg)   Gid: ( 1000/sandberg)
Access: 2010-02-05 11:45:12.000000000 +0100
Modify: 2010-02-02 18:12:07.844575172 +0100
Change: 2010-02-05 11:44:59.413928291 +0100
$ stat encrypted
  File: `/home/.ecryptfs/sandberg/.Private/ECRYPTFS_FNEK_ENCRYPTED.FWaZJpbmp..tZUREe43P.qrEcyavEC2QZFw1ykQVXjRxZSzscBFVoMzQsk--/ECRYPTFS_FNEK_ENCRYPTED.FWaZJpbmp..tZUREe43P.qrEcyavEC2QZFw10yDq6kkbjn9lr7qrnxlAvk--/ECRYPTFS_FNEK_ENCRYPTED.FWaZJpbmp..tZUREe43P.qrEcyavEC2QZFw1la465N-wDANSG0ml4M8DpU--/ECRYPTFS_FNEK_ENCRYPTED.FWaZJpbmp..tZUREe43P.qrEcyavEC2QZFw107I-eQq24En6jzyC.APn1E--/ECRYPTFS_FNEK_ENCRYPTED.FYaZJpbmp..tZUREe43P.qrEcyavEC2QZFw1zNb4HZf1uQT2mFtWKsF287cTiKR8wMOGzmCY2IvqAtFOyyN3siOt83eXT.ooIMCf'
  Size: 12288     	Blocks: 24         IO Block: 4096   regular file
Device: 806h/2054d	Inode: 474495      Links: 1
Access: (0444/-r--r--r--)  Uid: ( 7654/sandberg)   Gid: ( 1000/sandberg)
Access: 2010-02-05 11:45:12.873957850 +0100
Modify: 2010-02-02 18:12:07.844575172 +0100
Change: 2010-02-05 11:44:59.413928291 +0100
$ hexdump -Cn 8 encrypted
00000000  00 00 00 00 00 00 00 cd                           |........|
00000008

After remount:
$ stat decrypted
  File: `/home/sandberg/4.2/.git/objects/49/776103dc27a460c5210dc388c98f11658a272c'
  Size: 205       	Blocks: 24         IO Block: 4096   regular file
Device: 16h/22d	Inode: 474495      Links: 1
Access: (0444/-r--r--r--)  Uid: ( 7654/sandberg)   Gid: ( 1000/sandberg)
Access: 2010-02-05 11:45:12.873957850 +0100
Modify: 2010-02-02 18:12:07.844575172 +0100
Change: 2010-02-05 11:44:59.413928291 +0100
$ stat encrypted
  File: `/home/.ecryptfs/sandberg/.Private/ECRYPTFS_FNEK_ENCRYPTED.FWaZJpbmp..tZUREe43P.qrEcyavEC2QZFw1ykQVXjRxZSzscBFVoMzQsk--/ECRYPTFS_FNEK_ENCRYPTED.FWaZJpbmp..tZUREe43P.qrEcyavEC2QZFw10yDq6kkbjn9lr7qrnxlAvk--/ECRYPTFS_FNEK_ENCRYPTED.FWaZJpbmp..tZUREe43P.qrEcyavEC2QZFw1la465N-wDANSG0ml4M8DpU--/ECRYPTFS_FNEK_ENCRYPTED.FWaZJpbmp..tZUREe43P.qrEcyavEC2QZFw107I-eQq24En6jzyC.APn1E--/ECRYPTFS_FNEK_ENCRYPTED.FYaZJpbmp..tZUREe43P.qrEcyavEC2QZFw1zNb4HZf1uQT2mFtWKsF287cTiKR8wMOGzmCY2IvqAtFOyyN3siOt83eXT.ooIMCf'
  Size: 12288     	Blocks: 24         IO Block: 4096   regular file
Device: 806h/2054d	Inode: 474495      Links: 1
Access: (0444/-r--r--r--)  Uid: ( 7654/sandberg)   Gid: ( 1000/sandberg)
Access: 2010-02-05 11:45:12.873957850 +0100
Modify: 2010-02-02 18:12:07.844575172 +0100
Change: 2010-02-05 11:44:59.413928291 +0100
$ hexdump -Cn 8 encrypted
00000000  00 00 00 00 00 00 00 cd                           |........|
00000008

I also saved copies of the decrypted and encrypted files from before and after the remount, if you are interested (it's a rather uninteresting git tree object).
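
In the output above, the decrypted and the encrypted file report the same inode number (474495). Assuming that correspondence holds on a given setup (it is an observation from this report, not a documented guarantee), the encrypted counterpart of a file can be located by inode number, much like `find <lower dir> -inum <n>`. A sketch, with `find_by_inode` as a made-up helper name:

```python
import os


def find_by_inode(root, inode):
    """Yield regular directory entries under root whose inode number matches."""
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.lstat(path).st_ino == inode:
                yield path
```

The inode to search for would come from `os.stat()` (or `stat`) on the decrypted file, and `root` would be the lower directory, for example the `.Private` directory shown above.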

Tyler Hicks (tyhicks) wrote on 2010-02-06:

#9

Hey Erik - thanks for the *great* report!

So the eCryptfs inode's i_size is out of sync with what is stored in the eCryptfs metadata (the first 8 bytes of the encrypted file). Some initial guesses: we're missing an i_size_write() somewhere (maybe down an error path?), or we're incorrectly passing the lower inode's i_size to i_size_write(), since the upper and lower i_sizes in your output are the same.
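
The mismatch can be checked mechanically. The sketch below assumes the first 8 bytes of the lower file hold the size as a big-endian integer, which is inferred from the report (`00 00 00 00 00 00 00 cd` is 205, the size shown after the remount) rather than taken from the eCryptfs format documentation. The function names are invented for this example.

```python
import os


def header_size(lower_path):
    """Read the first 8 bytes of the lower file as a big-endian integer."""
    with open(lower_path, "rb") as f:
        raw = f.read(8)
    if len(raw) != 8:
        raise ValueError("lower file is shorter than 8 bytes")
    return int.from_bytes(raw, "big")


def size_mismatch(upper_path, lower_path):
    """Return (reported, stored) if the two sizes differ, else None."""
    reported = os.stat(upper_path).st_size
    stored = header_size(lower_path)
    return None if reported == stored else (reported, stored)
```

With the values from the report, the check would return (12288, 205) before the remount and None afterwards.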

Are you seeing any eCryptfs error messages in the logs? I'll begin looking at the suspect code paths.

Changed in ecryptfs:
status:	New → Confirmed

Erik Carstensen (sandberg) wrote on 2010-02-07:

#10

ecryptfs-syslog.bz2 (2.3 KiB, application/octet-stream)

OK, I found a few ecryptfs errors in the logs. I'm attaching the output of 'grep ryptfs syslog'.

During the logged period, the computer was never rebooted, but it was suspended a couple of times.

It seems that the errors started on Feb 3, and that a pair of messages (valid ecryptfs headers not found etc) appears about every 5 minutes except when the computer was suspended. It might be of interest that the computer was turned on between Feb 4 09:50 and Feb 4 16:12:20, with no messages logged.

There is a bunch of different messages from ecryptfs_read_lower/ecryptfs_readpage/ecryptfs_decrypt_page on Feb 4. If I recall correctly, I didn't notice the broken git files until Feb 5, though.

Erik Carstensen (sandberg) wrote on 2010-02-17:

#11

I got the error again today. This time I know the error appeared within a window of a couple of hours; during that window I could only see the following messages in the logs (they appear in both syslog and kern.log, and in no other log):

Feb 17 13:36:37 ockeghem kernel: [549471.301068] ecryptfs_read_lower: octets_read = [-4]; expected [4096]
Feb 17 13:36:37 ockeghem kernel: [549471.301098] ecryptfs_read_and_validate_header_region: Error reading header region; rc = [-22]

Another possible clue: the only command I ran around 13:36 was a 'git status', and when repeating the same 'git status' under strace, the only system call related to the affected file is an lstat:
lstat("src/devices/PI7C9X1XX/pi7c9x110.dml", {st_mode=S_IFREG|0644, st_size=12288, ...}) = 0
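
The bracketed negative numbers in the two kernel messages look like negated errno codes, which is the usual kernel convention; treating them that way is an assumption here, not something the log states. Under that assumption, a small sketch (with an invented helper name, `explain`) can translate them:

```python
import errno
import os
import re

# Matches "rc = [-22]" and "octets_read = [-4]" as seen in the messages above.
RC_PATTERN = re.compile(r"(?:rc|octets_read) = \[(-\d+)\]")


def explain(line):
    """Return [(value, errno_name, message)] for each negative code in a line."""
    out = []
    for m in RC_PATTERN.finditer(line):
        code = -int(m.group(1))
        out.append((-code, errno.errorcode.get(code, "?"), os.strerror(code)))
    return out
```

On Linux this maps -4 to EINTR (interrupted system call) and -22 to EINVAL (invalid argument). Whether an interrupted read is what actually triggers the wrong size is not established by this alone.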

Tyler Hicks (tyhicks) wrote on 2010-02-17:

#12

Hi Erik - Once again, thanks for the excellent work on this bug. I've found the problem in the code, found a consistent way to reproduce the problem and will write a fix soon.

Changed in ecryptfs:
status:	Confirmed → Triaged

Erik Carstensen (sandberg) wrote on 2010-04-09:

#13

Do you know of any way to remount an ecryptfs mount without first having to log out? That would suffice as a workaround most of the time, and would make this bug a lot less annoying for me.

Serge Hallyn (serge-hallyn) wrote on 2010-04-09:

#14

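Well, if you're just trying to reproduce in a testcase, you can probably just do a custom ecryptfs mount as root:

# create the lower (encrypted) and upper (plaintext) directories
mkdir testcrypt testplain
# write a helper that mounts, appends to a file, and unmounts;
# <options> is a placeholder for your eCryptfs mount options
cat > testme.sh << EOF
#!/bin/sh
mount -t ecryptfs <options> testcrypt testplain
echo ab >> testplain/ab
umount testplain
EOF
chmod ugo+x testme.sh

# repeat the mount/append/unmount cycle 100 times
for i in $(seq 1 100); do
  ./testme.sh
done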

On Fri, Apr 9, 2010 at 2:38 AM, Erik Carstensen <email address hidden> wrote:
> Do you know of any way to remount an ecryptfs mount without first having
> to log out? That would suffice as a workaround most of the time, and
> would make this bug a lot less annoying for me.
>
> Bug description:
> Quite frequently (about once per month), a file in my ecryptfs-encrypted home directory gets a few KiBs of extra trailing garbage bytes (it's usually padded up to about 12 KiB). I have only noticed the error in git repositories so far, probably because git creates a huge number of files, and because it doesn't tend to ignore trailing garbage anywhere.
>
> The trailing garbage usually consists mostly of zero bytes; sometimes I have also seen it contain a copy of parts of the original file.
>
> If I re-mount the ecryptfs volume (by logging out and logging in again), the trailing garbage always disappears; this is why I think it's caused by an ecryptfs bug. I cannot rule out a faulty RAM, either (I have only reproduced it on my laptop, which doesn't have ECC RAM).
>
> I'm using x86-64 Ubuntu 9.10, my ecryptfs volume resides on an ext4 partition.
>
> I understand that it's impossible for you to reproduce the problem given this report, but I'm willing to put some effort in tracking down the cause of this. Do you have any ideas on how I can extract useful debugging information the next time the problem occurs?

Jeremy Foshee (jeremyfoshee) on 2010-04-25
tags:	added: kj-triage

Jeremy Foshee (jeremyfoshee) wrote on 2010-05-17:

#15

Hi Erik,

This bug was reported a while ago and there hasn't been any activity in it recently. We were wondering if this is still an issue? Can you try with the latest development release of Ubuntu? ISO CD images are available from http://cdimage.ubuntu.com/releases/ .

If it remains an issue, could you run the following command from a Terminal (Applications->Accessories->Terminal). It will automatically gather and attach updated debug information to this report.

apport-collect -p linux 509180

Also, if you could test the latest upstream kernel available that would be great. It will allow additional upstream developers to examine the issue. Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag. This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text. Please let us know your results.

Thanks in advance.

[This is an automated message. Apologies if it has reached you inappropriately; please just reply to this message indicating so.]

tags:	added: needs-kernel-logs
tags:	added: needs-upstream-testing
Changed in linux (Ubuntu):
status:	New → Incomplete

Erik Carstensen (sandberg) wrote on 2010-05-17: apport-collect data

#16

Architecture: amd64
ArecordDevices:
**** List of CAPTURE Hardware Devices ****
card 0: Intel [HDA Intel], device 0: STAC92xx Analog [STAC92xx Analog]
   Subdevices: 2/2
   Subdevice #0: subdevice #0
   Subdevice #1: subdevice #1
AudioDevicesInUse:
USER PID ACCESS COMMAND
/dev/snd/controlC0: sandberg 1987 F.... pulseaudio
CRDA: Error: [Errno 2] No such file or directory
Card0.Amixer.info:
Card hw:0 'Intel'/'HDA Intel at 0xfe9fc000 irq 21'
   Mixer name : 'SigmaTel STAC9205'
   Components : 'HDA:838476a0,102801fe,00100204 HDA:14f12c06,14f1000f,00100000'
   Controls : 25
   Simple ctrls : 16
DistroRelease: Ubuntu 9.10
HibernationDevice: RESUME=UUID=5a83ce18-fd90-4c2c-ba30-96445496a9f9
InstallationMedia: Ubuntu 9.10 "Karmic Koala" - Release amd64 (20091027)
MachineType: Dell Inc. Latitude D830
Package: linux (not installed)
PccardctlIdent:
Socket 0:
   no product info available
PccardctlStatus:
Socket 0:
   no card
ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.31-20-generic root=UUID=5f04b461-c266-4082-80b5-36054e46688e ro quiet splash
ProcEnviron:
SHELL=/bin/bash
PATH=(custom, user)
LANG=en_US.UTF-8
ProcVersionSignature: Ubuntu 2.6.31-20.58-generic
RelatedPackageVersions:
linux-backports-modules-2.6.31-20-generic N/A
linux-firmware 1.26
Uname: Linux 2.6.31-20-generic x86_64
UserGroups: adm admin cdrom dialout fuse lpadmin mail sambashare www-data
dmi.bios.date: 06/07/2007
dmi.bios.vendor: Dell Inc.
dmi.bios.version: A02
dmi.board.name: 0HN341
dmi.board.vendor: Dell Inc.
dmi.chassis.type: 8
dmi.chassis.vendor: Dell Inc.
dmi.modalias: dmi:bvnDellInc.:bvrA02:bd06/07/2007:svnDellInc.:pnLatitudeD830:pvr:rvnDellInc.:rn0HN341:rvr:cvnDellInc.:ct8:cvr:
dmi.product.name: Latitude D830
dmi.sys.vendor: Dell Inc.

Erik Carstensen (sandberg) wrote on 2010-05-17: apport-collect attachments

#17 to #35

#17 AlsaDevices.txt (517 bytes, text/plain)
#18 AplayDevices.txt (281 bytes, text/plain)
#19 BootDmesg.txt (59.5 KiB, text/plain)
#20 Card0.Amixer.values.txt (2.7 KiB, text/plain)
#21 Card0.Codecs.codec.0.txt (7.1 KiB, text/plain)
#22 Card0.Codecs.codec.1.txt (146 bytes, text/plain)
#23 CurrentDmesg.txt (246.7 KiB, text/plain)
#24 IwConfig.txt (618 bytes, text/plain)
#25 Lspci.txt (14.3 KiB, text/plain)
#26 Lsusb.txt (792 bytes, text/plain)
#27 PciMultimedia.txt (586 bytes, text/plain)
#28 ProcCpuinfo.txt (1.4 KiB, text/plain)
#29 ProcInterrupts.txt (1.6 KiB, text/plain)
#30 ProcModules.txt (4.5 KiB, text/plain)
#31 RfKill.txt (113 bytes, text/plain)
#32 UdevDb.txt (119.7 KiB, text/plain)
#33 UdevLog.txt (231.1 KiB, text/plain)
#34 WifiSyslog.txt (4.0 KiB, text/plain)
#35 XsessionErrors.txt (1.6 MiB, text/plain)

Changed in linux (Ubuntu):
status:	Incomplete → New
tags:	added: apport-collected

Erik Carstensen (sandberg) wrote on 2010-05-17:

#36

The above data was posted with my current kernel, which is the standard one from karmic. With this setup the problem still happens. I will upgrade to lucid shortly; we'll see if the problem still exists there.

Erik Carstensen (sandberg) wrote on 2010-06-21:

#37

I got the same problem again today, with a new lucid kernel.

uname -a
Linux ockeghem 2.6.32-22-generic #36-Ubuntu SMP Thu Jun 3 19:31:57 UTC 2010 x86_64 GNU/Linux

The following error message is probably related:
Jun 21 14:22:07 ockeghem kernel: [191186.234951] ecryptfs_read_and_validate_header_region: Error reading header region; rc = [-4]

Tyler, you mentioned in February that you were working on a patch; what's its status?

Erik Carstensen (sandberg) wrote on 2010-06-22:

#38

I got this traceback in syslog today, together with a different set of symptoms: no signs of padded files, but a process is hopelessly hung; when I try to inspect it with gdb or strace, they hang too and have to be killed with SIGKILL. I also cannot suspend. ecryptfs appears in the traceback, so I thought it might be another symptom of the same problem. I'm using the standard lucid kernel.

Jun 22 09:12:05 ockeghem kernel: [220132.893407] general protection fault: 0000 [#1] SMP
Jun 22 09:12:05 ockeghem kernel: [220132.893412] last sysfs file: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0C0A:00/power_supply/BAT0/charge_full
Jun 22 09:12:05 ockeghem kernel: [220132.893414] CPU 0
Jun 22 09:12:05 ockeghem kernel: [220132.893416] Modules linked in: ppp_deflate zlib_deflate bsd_comp ppp_async crc_ccitt option usbserial nls_utf8 isofs usb_storage usbhid hid cryptd aes_x86_64 aes_generic binfmt_misc ppdev dm_crypt snd_hda_codec_idt snd_hda_intel snd_hda_codec snd_hwdep snd_pcm_oss snd_mixer_oss snd_pcm snd_seq_dummy snd_seq_oss snd_seq_midi snd_rawmidi snd_seq_midi_event joydev snd_seq snd_timer snd_seq_device arc4 psmouse snd pcmcia soundcore snd_page_alloc serio_raw dell_wmi iwl3945 dell_laptop dcdbas iwlcore mac80211 led_class cfg80211 lp parport yenta_socket rsrc_nonstatic pcmcia_core fbcon tileblit font bitblit softcursor vga16fb vgastate i915 drm_kms_helper ohci1394 tg3 ieee1394 ahci intel_agp drm i2c_algo_bit video output
Jun 22 09:12:05 ockeghem kernel: [220132.893466] Pid: 28411, comm: simics-common Not tainted 2.6.32-22-generic #36-Ubuntu Latitude D830
Jun 22 09:12:05 ockeghem kernel: [220132.893469] RIP: 0010:[<ffffffff810f3859>]  [<ffffffff810f3859>] find_get_page+0x39/0xa0
Jun 22 09:12:05 ockeghem kernel: [220132.893477] RSP: 0018:ffff880001c81918  EFLAGS: 00010203
Jun 22 09:12:05 ockeghem kernel: [220132.893479] RAX: 07371f01fffe00ff RBX: ffff88009b8be600 RCX: ffff8800bf6813c8
Jun 22 09:12:05 ockeghem kernel: [220132.893481] RDX: 0000000000000000 RSI: 0000000000000040 RDI: 07371f01fffe0100
Jun 22 09:12:05 ockeghem kernel: [220132.893483] RBP: ffff880001c81928 R08: 0000000000001000 R09: 0000000000000001
Jun 22 09:12:05 ockeghem kernel: [220132.893485] R10: ffff880001c81fd8 R11: 0000000000000000 R12: 0000000000000040
Jun 22 09:12:05 ockeghem kernel: [220132.893487] R13: 0000000000000040 R14: ffff880073b69d80 R15: 0000000000000000
Jun 22 09:12:05 ockeghem kernel: [220132.893490] FS:  00007f905ffaa700(0000) GS:ffff880028200000(0000) knlGS:0000000000000000
Jun 22 09:12:05 ockeghem kernel: [220132.893492] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Jun 22 09:12:05 ockeghem kernel: [220132.893494] CR2: 00007f905b31b008 CR3: 000000009228d000 CR4: 00000000000006f0
Jun 22 09:12:05 ockeghem kernel: [220132.893496] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Jun 22 09:12:05 ockeghem kernel: [220132.893498] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Jun 22 09:12:05 ockeghem kernel: [220132.893501] Process simics-common (pid: 28411, threadinfo ffff880001c80000, task ffff88008fd2dbc0)
Jun 22 09:12:05 ockeghem kernel: [220132.893502] Stack:
Jun 22 09:12:05 ockeghem kernel: [220132.893504]  0000000000000000 ffff88009b8be5f8 ffff880001c819b8 ffffffff810f4a11
Jun 22 09:12:05 ockeghem kernel: [220132.893507] <0> 00000000ffffffff 0000000000001000 ffff880001c81ad8 000000008153e798
Jun 22 09:12:05 ockeghem kernel: [220132.893511] <0> 0000000000000041 ffff880073b69df0 000000000000003f ffff88009b8be4e0
Jun 22 09:12:05 ockeghem kernel: [220132.893515] Call Trace:
Jun 22 09:12:05 ockeghem kernel: [220132.893519]  [<ffffffff810f4a11>] T.804+0x151/0x410
Jun 22 09:12:05 ockeghem kernel: [220132.893523]  [<ffffffff810f4d86>] generic_file_aio_read+0xb6/0x1d0
Jun 22 09:12:05 ockeghem kernel: [220132.893527]  [<ffffffff8114284a>] do_sync_read+0xfa/0x140
Jun 22 09:12:05 ockeghem kernel: [220132.893531]  [<ffffffff81085320>] ? autoremove_wake_function+0x0/0x40
Jun 22 09:12:05 ockeghem kernel: [220132.893536]  [<ffffffff81250ba6>] ? security_file_permission+0x16/0x20
Jun 22 09:12:05 ockeghem kernel: [220132.893539]  [<ffffffff81143165>] vfs_read+0xb5/0x1a0
Jun 22 09:12:05 ockeghem kernel: [220132.893544]  [<ffffffff81229df2>] ecryptfs_read_lower+0x82/0xc0
Jun 22 09:12:05 ockeghem kernel: [220132.893547]  [<ffffffff8122bdcc>] ecryptfs_decrypt_page+0x10c/0x190
Jun 22 09:12:05 ockeghem kernel: [220132.893550]  [<ffffffff81229498>] ecryptfs_readpage+0xe8/0x150
Jun 22 09:12:05 ockeghem kernel: [220132.893554]  [<ffffffff810fd6a2>] __do_page_cache_readahead+0x172/0x210
Jun 22 09:12:05 ockeghem kernel: [220132.893557]  [<ffffffff810fd761>] ra_submit+0x21/0x30
Jun 22 09:12:05 ockeghem kernel: [220132.893560]  [<ffffffff810fdaf5>] ondemand_readahead+0x115/0x240
Jun 22 09:12:05 ockeghem kernel: [220132.893563]  [<ffffffff810fdd1e>] page_cache_sync_readahead+0x2e/0x40
Jun 22 09:12:05 ockeghem kernel: [220132.893565]  [<ffffffff810f559c>] filemap_fault+0x42c/0x460
Jun 22 09:12:05 ockeghem kernel: [220132.893569]  [<ffffffff81111a44>] __do_fault+0x54/0x500
Jun 22 09:12:05 ockeghem kernel: [220132.893573]  [<ffffffff8101179c>] ? __switch_to+0x1ac/0x320
Jun 22 09:12:05 ockeghem kernel: [220132.893576]  [<ffffffff81114f88>] handle_mm_fault+0x1a8/0x3c0
Jun 22 09:12:05 ockeghem kernel: [220132.893581]  [<ffffffff810397a9>] ? default_spin_lock_flags+0x9/0x10
Jun 22 09:12:05 ockeghem kernel: [220132.893586]  [<ffffffff8154386a>] do_page_fault+0x12a/0x3b0
Jun 22 09:12:05 ockeghem kernel: [220132.893589]  [<ffffffff815411c5>] page_fault+0x25/0x30
Jun 22 09:12:05 ockeghem kernel: [220132.893591] Code: 5f 08 49 89 f4 4c 89 e6 48 89 df e8 62 20 1c 00 48 85 c0 48 89 c1 74 4a 48 8b 38 40 f6 c7 01 75 e4 48 8d 47 ff 48 83 f8 fd 77 da <8b> 57 08 85 d2 74 d3 44 8d 42 01 48 63 c2 4c 8d 4f 08 4d 63 c0 
Jun 22 09:12:05 ockeghem kernel: [220132.893621] RIP  [<ffffffff810f3859>] find_get_page+0x39/0xa0
Jun 22 09:12:05 ockeghem kernel: [220132.893624]  RSP <ffff880001c81918>
Jun 22 09:12:05 ockeghem kernel: [220132.893627] ---[ end trace fff8f06f577692af ]---

Revision history for this message

Andres Jaan Tack (ajtack) wrote on 2010-07-13:

#39

I am running into this problem as well, running Ubuntu 10.04. I have git objects being padded up to 12kB.

Is there a formulaic workaround?

Is there some information I can provide about the incidence on my system?

Revision history for this message

Andres Jaan Tack (ajtack) wrote on 2010-07-13:

#40

No need for a workaround: I think I have it, for my particular situation.

http://superuser.com/questions/162589/problem-with-git-corrupted-files/163034#163034

The offer for more details still stands, as I imagine this will occur in the future.

Revision history for this message

Erik Carstensen (sandberg) wrote on 2010-07-14:

#41

Script to automatically repair the damage caused by this bug (1.7 KiB, text/x-python)

One painful workaround is to not touch the file and remount the ecryptfs partition (which usually means that you have to log out and re-login).

Another workaround is to use a script that I just wrote, which automatically tries to drop the trailing garbage (kind of like the link you posted, but automated).

The link you posted didn't observe these things:
- In some cases the trailing garbage can contain non-zeros; if the original object size was s this is typically seen in the intervals [4096, 4096 + s) and [8192, 8192+s).
- For non-blobs, there is a small probability (1/256) that the original object ends with a 0 byte.

My script will therefore try all 12288 possible lengths, but in a smart order so the above cases are tried early.

To use it, go to the working tree's root directory and run the script, with the sha1 sum as only argument.
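
As a rough illustration of the approach described in this comment (not the attached script itself), the sketch below tries candidate lengths for a padded loose git object until the prefix is a complete zlib stream whose inflated content hashes to the expected SHA-1. The names `BLOCK`, `candidate_lengths` and `repair_object`, and the exact ordering of candidates, are assumptions made for this example.

```python
import hashlib
import zlib

BLOCK = 4096  # padding granularity reported in this thread


def candidate_lengths(data):
    """Yield every length in 1..len(data) once, likeliest candidates first."""
    seen = set()
    likely = []
    # Strip trailing zeros from the whole file and from each shorter
    # block-aligned prefix; the latter skips non-zero garbage in later blocks.
    ends = sorted({len(data), *range(BLOCK, len(data), BLOCK)}, reverse=True)
    for end in ends:
        stripped = len(data[:end].rstrip(b"\0"))
        # Allow for a couple of genuine zero bytes at the end of the object.
        likely.extend(range(stripped, stripped + 3))
    # Fall back to every remaining length so nothing is missed.
    for n in likely + list(range(1, len(data) + 1)):
        if 0 < n <= len(data) and n not in seen:
            seen.add(n)
            yield n


def repair_object(data, sha1_hex):
    """Return the prefix of data that is the original object, or None."""
    for n in candidate_lengths(data):
        inflater = zlib.decompressobj()
        try:
            raw = inflater.decompress(data[:n])
        except zlib.error:
            continue
        # The zlib stream must end exactly at n: not truncated, no leftovers.
        if inflater.eof and not inflater.unused_data:
            if hashlib.sha1(raw).hexdigest() == sha1_hex:
                return data[:n]
    return None
```

The expected SHA-1 is the object's name, which for a loose object is the two-character directory name followed by the file name under `.git/objects`. Work on a copy of the object file rather than the original.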

Revision history for this message

Gioele Barabucci (gioele) wrote on 2010-10-28:

#42

The comments in <http://comments.gmane.org/gmane.comp.version-control.git/146847> say that this bug is probably related to
<https://bugs.launchpad.net/ubuntu/+source/linux/+bug/490005>, especially <https://bugs.launchpad.net/ubuntu/+source/linux/+bug/490005/comments/23>.

Revision history for this message

disabled (disabled-deactivatedaccount) wrote on 2010-11-30:

#43

The git corruption symptom still occurs in 10.10 with ext4 + ecryptfs home.

And Erik's solution worked perfectly for me:
https://bugs.launchpad.net/ecryptfs/+bug/509180/comments/41

So, thank you very much for the workaround !

Brad Figg (brad-figg) on 2010-12-03

tags:

added: acpi-method-return

Revision history for this message

Hanno Stock (hefe_bia) (hanno-stock) wrote on 2011-01-13:

#44

From my dmesg:

[111706.640947] ecryptfs_read_and_validate_header_region: Error reading header region; rc = [-4]
[111706.640952] Valid eCryptfs headers not found in file header region or xattr region
[111706.640953] Either the lower file is not in a valid eCryptfs format, or the key could not be retrieved. Plaintext passthrough mode is not enabled; returning -EIO

Error occurred in a git repository. Don't know exactly what I did.

Revision history for this message

Gioele Barabucci (gioele) wrote on 2011-01-21:

#45

This problem is still present in Ubuntu Lucid 10.04.1 LTS.

A completely up-to-date installation with eCryptFS just trashed some of the files stored under $HOME.

Revision history for this message

Thomas Perl (thp) wrote on 2011-02-05:

#46

I still have this issue on Ubuntu 10.10, versions of ecryptfs packages:

ii ecryptfs-utils 83-0ubuntu3 ecryptfs cryptographic filesystem (utilities
ii libecryptfs0 83-0ubuntu3 ecryptfs cryptographic filesystem (library)

Kernel version: (uname -r) 2.6.35-25-generic

The git-remove-trailing-garbage.py script from comment 41 works as a workaround. This was in a Git repository inside my ecryptfs-mounted $HOME. The "outer" filesystem (i.e. that of "/") is ext3.

Revision history for this message

Paolo Bonzini (bonzini) wrote on 2011-02-23:

#47

Patch http://launchpadlibrarian.net/64378182/ecryptfs-fix-eintr.patch from bug 521523 seems to work for me.

Revision history for this message

Tyler Hicks (tyhicks) wrote on 2011-02-24:

#48

Hi Paolo - that patch is just masking the real problem behind this bug. This one is caused by a bad error path after failing to read the crypto metadata. I'll attach a fix for that.

We may want to still use something like the patch you linked to, in combination with the following fix, though.

Changed in ecryptfs:
status:	Triaged → In Progress

Revision history for this message

Tyler Hicks (tyhicks) wrote on 2011-02-24:

#49

[PATCH] eCryptfs: Fix error paths when failing to read metadata (4.0 KiB, text/plain)

I may end up breaking this one up into two patches before upstreaming it. The -EIO piece is a bit unrelated to the rest of the patch.

Revision history for this message

Tyler Hicks (tyhicks) wrote on 2011-02-24:

#50

Just so I don't forget, a decent way of reproducing this is by doing the following:

(ext4 is mounted at /lower and eCryptfs is mounted at /upper and foo isn't created until the truncate below)

# truncate -s 1 /upper/foo
# hexedit /lower/foo
      Note: increment the 9th byte by 1 so that the eCryptfs marker fails validation
# umount /upper/foo
# mount -i /upper/foo
# hexdump -C /upper/foo
hexdump: /upper/foo: Invalid argument
hexdump: /upper/foo: Bad file descriptor
# hexedit /lower/foo
      Note: decrement the 9th byte by 1 so that the eCryptfs marker is correct again
# hexdump -C /upper/foo
      Note: You should see extra zeroes at the end of the file
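
The two hexedit steps above can also be scripted. The following is a minimal sketch, assuming (as the notes say) that changing the 9th byte of the lower file is enough to break the marker check; `bump_byte` is a hypothetical helper, not part of any eCryptfs tool.

```python
def bump_byte(path, offset=8, delta=1):
    """Add delta (mod 256) to the byte at offset; return (old, new) values."""
    with open(path, "r+b") as f:
        f.seek(offset)
        old = f.read(1)
        if len(old) != 1:
            raise ValueError("file is shorter than offset + 1 bytes")
        new = bytes([(old[0] + delta) % 256])
        f.seek(offset)
        f.write(new)
    return old[0], new[0]
```

Calling `bump_byte("/lower/foo")` would stand in for the first hexedit step and `bump_byte("/lower/foo", delta=-1)` for the second; offset 8 is the 9th byte counting from zero.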

Revision history for this message

Dustin Kirkland  (kirkland) wrote on 2011-03-09:

#51

Tyler,

When you upstream this patch, would you please note that here with the git commits? We're going to want to pull this into the Ubuntu kernel for Natty.

Thanks,
Dustin

Revision history for this message

Gioele Barabucci (gioele) wrote on 2011-03-09:

#52

Can you please also backport this fix to the 10.04.x LTS kernel?

With this problem eCryptfs for $HOME (advertised at installation time) is basically unusable in Lucid.

Revision history for this message

Lealcy B. Junior (lealcy) wrote on 2011-03-15:

#53

I've got a bunch of these messages in my syslog:

Mar 15 16:54:40 lealcy kernel: [448054.760153] Valid eCryptfs headers not found in file header region or xattr region
Mar 15 16:54:40 lealcy kernel: [448054.760158] Either the lower file is not in a valid eCryptfs format, or the key could not be retrieved. Plaintext passthrough mode is not enabled; returning -EIO
Mar 15 16:54:41 lealcy kernel: [448054.824814] Valid eCryptfs headers not found in file header region or xattr region
Mar 15 16:54:41 lealcy kernel: [448054.824819] Either the lower file is not in a valid eCryptfs format, or the key could not be retrieved. Plaintext passthrough mode is not enabled; returning -EIO
Mar 15 16:54:41 lealcy kernel: [448054.842427] Valid eCryptfs headers not found in file header region or xattr region
Mar 15 16:54:41 lealcy kernel: [448054.842434] Either the lower file is not in a valid eCryptfs format, or the key could not be retrieved. Plaintext passthrough mode is not enabled; returning -EIO
Mar 15 16:54:41 lealcy kernel: [448054.898242] Valid eCryptfs headers not found in file header region or xattr region
Mar 15 16:54:41 lealcy kernel: [448054.898247] Either the lower file is not in a valid eCryptfs format, or the key could not be retrieved. Plaintext passthrough mode is not enabled; returning -EIO

My linux: Linux version 2.6.35-27-generic (buildd@crested) (gcc version 4.4.5 (Ubuntu/Linaro 4.4.4-14ubuntu5) ) #48-Ubuntu SMP Tue Feb 22 20:25:46 UTC 2011 x86_64 GNU/Linux

I don't have any 0 byte files on my ~/.Private folder.

Revision history for this message

Tyler Hicks (tyhicks) wrote on 2011-03-15:

#54

eCryptfs: Handle failed metadata read in lookup (5.5 KiB, text/plain)

After doing some more testing, I realized my previous fix is incorrect. It doesn't allow lookups of lower files that don't have proper metadata (plaintext files, 0 length files, etc.).

I'm attaching another fix, which will likely go upstream for 2.6.39-rc1.

Revision history for this message

Roland Dreier (roland.dreier) wrote on 2011-03-16:

#55

Does this latest patch address all the cases of signals interrupting ecryptfs operations? I don't know enough about ecryptfs to know whether this metadata problem is the only place the issue hits.

I've started seeing problems where I get

ecryptfs_decrypt_page: Error attempting to read lower page; rc = [-4]
ecryptfs_write_begin: Error decrypting page at index [15315]; rc = [-4]

in the kernel log, and then later tasks hang in sync_page called from truncate_inode_pages from ecryptfs_evict_inode.

Revision history for this message

Dustin Kirkland  (kirkland) wrote on 2011-03-16: Re: [Bug 509180] Re: ecryptfs sometimes seems to add trailing garbage to encrypted files

#56

Tyler,

Should we probably carry this fix in Ubuntu's 2.6.38 kernel for 11.04?

Dustin

Revision history for this message

Tyler Hicks (tyhicks) wrote on 2011-03-16:

#57

On Wed Mar 16, 2011 at 12:19:04AM -0000, Roland Dreier <email address hidden> wrote:
> Does this latest patch address all the cases of signals interrupting
> ecryptfs operations? I don't know enough about ecryptfs to know whether
> this metadata problem is the only place the issue hits.

The latest patch allows userspace to handle any interrupted eCryptfs
operations. When we're trying to read from the lower filesystem,
vfs_read() sometimes returns with -EINTR, so we'll just propagate that
to userspace and let the app deal with it.

>
> I've started seeing problems where I get
>
> ecryptfs_decrypt_page: Error attempting to read lower page; rc = [-4]
> ecryptfs_write_begin: Error decrypting page at index [15315]; rc = [-4]
>
> in the kernel log, and then later tasks hang in sync_page called from
> truncate_inode_pages from ecryptfs_evict_inode.

That's another bug that I just stumbled across myself and will have
fixed in the 2.6.39-rc1 time frame. We're not unlocking the page in the
ecryptfs_write_begin() error path.
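
One way an application can "deal with" an -EINTR that reaches userspace is simply to retry the call. The sketch below is only an illustration of that idea; `retry_on_eintr` and its `attempts` limit are made up for this example. Note that Python 3.5 and later already retry most interrupted system calls internally (PEP 475), so the pattern matters mainly for C programs and older runtimes.

```python
import errno


def retry_on_eintr(func, *args, attempts=5):
    """Call func(*args), retrying while it fails with EINTR."""
    for _ in range(attempts - 1):
        try:
            return func(*args)
        except OSError as exc:
            if exc.errno != errno.EINTR:
                raise
    # Last attempt: let any error, including EINTR, propagate to the caller.
    return func(*args)
```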

Revision history for this message

Tyler Hicks (tyhicks) wrote on 2011-03-16:

#58

On Wed Mar 16, 2011 at 02:48:37PM -0500, Dustin Kirkland <email address hidden> wrote:
> Should we probably carry this fix in Ubuntu's 2.6.38 kernel for 11.04?

Yes, once it goes upstream. I kept it simple for easy back porting.
While I don't think there's really any on-disk data corruption here,
this is an annoying problem for those who are affected.

BTW, I normally tag any important patches, which I feel are
backport-worthy, with 'Cc: <email address hidden>' and this one will get
that tag when it goes upstream.

Revision history for this message

Shahar Or (mightyiam) wrote on 2011-03-17:

#59

Would it be possible to backport this to all supported releases?

One reason to do so is that users are getting scared about their data being corrupted, even if it is not so.

Revision history for this message

Sam Liddicott (sam-liddicott) wrote on 2011-03-17:

#60

If the user copies or archives the "corrupted looking" file then their copy has become corrupted.

Copying files to a memory stick should be a good backup, not a corrupt backup.

The excuse "your data is OK.... until you try to read it" isn't much of a distinction....

Andy Whitcroft (apw) on 2011-03-17

tags:

added: kernel-key
removed: needs-kernel-logs needs-upstream-testing

Andy Whitcroft (apw) on 2011-03-17

Changed in linux (Ubuntu):
status:	New → Triaged
importance:	Undecided → Medium
assignee:	nobody → Andy Whitcroft (apw)

Revision history for this message

John Johansen (jjohansen) wrote on 2011-03-17:

#61

Shahar Or,

We will pick this patch up for older releases when Tyler submits it upstream to the stable kernel trees.

Changed in linux (Ubuntu):
assignee:	Andy Whitcroft (apw) → John Johansen (jjohansen)

Revision history for this message

Andy Whitcroft (apw) wrote on 2011-03-17:

#62

OK, I have pulled Tyler's patch into the Natty kernel for testing. Could those of you who are hitting this regularly please try these kernels and let us know if they resolve the issue for you? Please report any testing here. Kernels are at the URL below:

http://people.canonical.com/~apw/lp509180-natty/

Thanks!

Changed in linux (Ubuntu):
status:	Triaged → Incomplete

Revision history for this message

Shahar Or (mightyiam) wrote on 2011-03-17:

#63

Thanks, John,

Good work!

Revision history for this message

reliable-robin-22 (nicolasdiogo) wrote on 2011-03-29:

#64

same error here

had my $HOME full at one point and there was no error warning about it.

after deleting some files to make room.
and rebooting. i can no longer login using KDE.

i have run:

find $HOME/.Private/ -size 0c -exec ls '{}' \; | wc -l

and got:

203

i also noticed that my $HOME/.ecryptfs is corrupted.
but i am able to access all my files still.

so what is the next option here? delete all these (as they are empty anyhow)?
how should i proceed?

i suppose we should try to fix the BUG of no alert when running out of space here
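
For anyone who wants the paths rather than just the count, here is a small sketch of the same zero-length search. `empty_files` is a name invented for this example, and unlike `find -size 0c` it only reports regular files.

```python
import os
import stat


def empty_files(root):
    """Return paths of regular files under root whose size is 0 bytes."""
    found = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                st = os.lstat(path)  # do not follow symlinks
            except OSError:
                continue  # vanished or inaccessible; skip it
            if stat.S_ISREG(st.st_mode) and st.st_size == 0:
                found.append(path)
    return found
```

`len(empty_files(os.path.expanduser("~/.Private")))` should correspond to the count above, assuming the lower directory is at that path.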

Revision history for this message

Paolo Bonzini (bonzini) wrote on 2011-03-30:

#67

Can you please update the patch so that it avoids spamming the kernel log upon EINTR?

Revision history for this message

Tomi Hukkalainen (tpievila) wrote on 2011-04-13:

#68

While it doesn't seem to be the focus of this bug, the relevant bugs have been marked as duplicates of this one.

I'm running 2.6.38-8 in Natty beta, and still getting a lot of

Apr 12 14:46:32 puppy-ubuntu kernel: [31106.234038] Valid eCryptfs headers not found in file header region or xattr region
Apr 12 14:46:32 puppy-ubuntu kernel: [31106.234047] Either the lower file is not in a valid eCryptfs format, or the key could not be retrieved. Plaintext passthrough mode is not enabled; returning -EIO

so it would seem that either the patch hasn't fixed this, or #372014 is not actually a duplicate.

Revision history for this message

Serge Hallyn (serge-hallyn) wrote on 2011-04-13: Re: [Bug 509180] Re: ecryptfs sometimes seems to add trailing garbage to encrypted files

#69

Quoting Tomi Pieviläinen (<email address hidden>):
> While it doesn't seem to be the focus of this bug, the relevant bugs
> have been marked as duplicates of this one.
>
> I'm running 2.6.38-8 in Natty beta, and still getting a lot of
>
> Apr 12 14:46:32 puppy-ubuntu kernel: [31106.234038] Valid eCryptfs headers not found in file header region or xattr region
> Apr 12 14:46:32 puppy-ubuntu kernel: [31106.234047] Either the lower file is not in a valid eCryptfs format, or the key could not be retrieved. Plaintext passthrough mode is not enabled; returning -EIO
>
> so it would seem that either the patch hasn't fixed this, or #372014 is
> not actually a duplicate.

Is this with clean underlying fs? If non-ecryptfs or corrupted files are
already there, you'll keep getting those warnings until you remove them.
The bug would only be the creation of new files.

It seems like it might be helpful to log the underlying inode number
in these printks.

Revision history for this message

Tyler Hicks (tyhicks) wrote on 2011-04-26:

#70

Linus has merged 3aeb86ea4cd15f728147a3bd5469a205ada8c767, which is the fix for this bug.

Changed in ecryptfs:
status:	In Progress → Fix Committed

Revision history for this message

Tyler Hicks (tyhicks) wrote on 2011-04-29:

#71

This fix was released in 2.6.39-rc5 as commit 3aeb86ea4cd15f728147a3bd5469a205ada8c767

Changed in ecryptfs:
status:	Fix Committed → Fix Released

Brad Figg (brad-figg) on 2011-05-05

tags:

added: b73a1py79

Revision history for this message

Shahar Or (mightyiam) wrote on 2011-05-06:

#72

What about the damage that's already been done to files? This should be pretty common amongst Ubuntu users who've enabled their home directory encryption feature.

Mine is. I've upgraded to natty and my dmesg is still full of those errors.

Revision history for this message

Erik Carstensen (sandberg) wrote on 2011-05-06:

#73

> What about the damage that's already been done to files

The damage is temporary and disappears as soon as you unmount the ecryptfs volume, unless you for some reason read the file and write it back to disk.

I wrote a script that fixes up git objects broken by this bug: https://bugs.launchpad.net/ecryptfs/+bug/509180/comments/41
If you have any non-git files that are affected by this, you might want to do something similar based on the script.

> Mine is. I've upgraded to natty and my dmesg is still full of those errors.

I think the dmesg errors are just one symptom of the bug; trailing garbage is another. Can you still see trailing garbage in any of your files?

Revision history for this message

Sam Liddicott (sam-liddicott) wrote on 2011-05-06: Re: [Bug 509180] Re: ecryptfs sometimes seems to add trailing garbage to encrypted files

#74

I'm still fixing corrupt git repositories caused by this.

Revision history for this message

Sam Liddicott (sam-liddicott) wrote on 2011-05-12:

#75

Repair script works great and is painless! Thanks very much for posting that.

Revision history for this message

Thorsten Zachmann (t-zachmann) wrote on 2011-05-25:

#76

I have hit the error twice more since I updated to natty. From the bug messages I'm not sure whether the fix is included in natty. Has it been included in the kernel available on natty?

Revision history for this message

Michael Rodríguez-Torrent (mrtorrent) wrote on 2011-06-04:

#77

Another request for clarification on the status of the fix: has it been released for Maverick?

I'm continuing to get these error messages very frequently, followed immediately by the system becoming totally unresponsive, at which point I have to hard power off. This can't be good for the hardware nor the data and, I'm sorry, but the lack of clear information on such a critical bug is very, very frustrating.

I notice there is no mention of freezes/hangs on this bug, which is maybe why this has been considered as important, but that's consistently what I'm experiencing and there are several reports of that on this duplicate: https://bugs.launchpad.net/ecryptfs/+bug/372014

Revision history for this message

agent 8131 (agent-8131) wrote on 2011-06-17:

#78

I found this to be helpful for finding the corrupted files. They may not be 0 length:

find . -type f -exec cat {} \; > /dev/null

and look for "Input/output error"

taken from:

https://bugs.launchpad.net/ubuntu/+source/ecryptfs-utils/+bug/372014/comments/68
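
The same check can be done with a short script that records which files fail, instead of scanning the output of cat by eye. This is a sketch only; `unreadable_files` is a hypothetical helper that reads every regular file under a directory and collects the error number of any read failure.

```python
import os


def unreadable_files(root, chunk=1 << 20):
    """Return (path, errno) for each regular file that cannot be read fully."""
    failures = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # Mirror `find -type f`: skip symlinks, FIFOs, sockets and so on.
            if os.path.islink(path) or not os.path.isfile(path):
                continue
            try:
                with open(path, "rb") as f:
                    while f.read(chunk):
                        pass
            except OSError as exc:
                failures.append((path, exc.errno))
    return failures
```

Entries whose error number equals `errno.EIO` are the ones that would have printed "Input/output error".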

Revision history for this message

Shahar Or (mightyiam) wrote on 2011-06-18:

#79

Thanks for the tip, agent!

Revision history for this message

Ville Ranki (ville-ranki) wrote on 2011-06-20:

#80

I just got this issue on an up-to-date Natty x86_32. Ubuntu crashed totally while I was compiling, and after reboot I got Input/output errors on files modified by the build (Makefiles, .o files, etc.). Modifying the files (for example with make) doesn't work; only deleting helps. This bug may cause loss of data, so I'd treat it with high priority.

Revision history for this message

avdd (avdd) wrote on 2011-07-06:

#81

Will this fix be released for lucid? When?

Revision history for this message

avdd (avdd) wrote on 2011-07-06:

#82

I found this bug because I am observing the padding effects (although to other multiples of 4K). To me this is filesystem corruption plain and simple. The only conclusion is that ecryptfs in lucid (LTS) is buggy and should not be used. This is somewhat disappointing as it was stable on karmic.

Incidentally, to confirm that I am seeing the same bug, I tried the steps from comment #50. First, I don't know what this means:

# umount /upper/foo
# mount -i /upper/foo

Is it a typo, or am I missing something about mounting files?

Second, assuming a typo, I cannot reproduce the padding effect.

Revision history for this message

Brad Figg (brad-figg) wrote on 2011-07-14: Unsupported series, setting status to "Won't Fix".

#83

This bug was filed against a series that is no longer supported and so is being marked as Won't Fix. If this issue still exists in a supported series, please file a new bug.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status:	Incomplete → Won't Fix

Revision history for this message

avdd (avdd) wrote on 2011-07-15:

#84

What series? Where is the series stated? How do I report a bug against a series?

Revision history for this message

Tuomas Heino (iheino+ub) wrote on 2011-07-15:

#85

Where should I report a bug on mentioned "automated script" ignoring duplicates' series?
This one was originally reported against Karmic, but duplicates include all later releases and some earlier ones as well.
Several other bug reports besides this one may have been marked "Won't Fix" by said script.

Revision history for this message

avdd (avdd) wrote on 2011-07-15:

#86

As requested, new bug report filed here: https://bugs.launchpad.net/ecryptfs/+bug/810860

Revision history for this message

Sam Liddicott (sam-liddicott) wrote on 2011-07-15:

#87

How did launchpad get to be so rubbish?

The typical ubuntu scenario is like this:

1. file a bug
2. wait 1-2 years
3. bug gets marked as invalid
4. if you are lucky it was fixed upstream and is fixed in a future release

the use of the system seems designed to dissuade people from using launchpad.
Rather than the point of "report all bugs" it is the point of "being insulted for reporting any bug".

And this is a serious bug - ubuntu is corrupting users' files if they use an offered feature.

Mint is where it is at; while ubuntu is "debian done nicely" - mint seems to be: "ubuntu with the bugs fixed"

I'm still on ubuntu but I'm not expecting to stay, I've had enough of being insulted by launchpad scripts.

Revision history for this message

elementz (memetical) wrote on 2011-07-15:

#88

+1 for Sam Liddicott's comment!
I already tried the ecryptfs mailing list quite some time ago, but have not heard anything back from the developers to this day.
I believe our best option is simply NOT USING ecryptfs in a production environment. Especially since developers in ubuntu and ecryptfs itself are so darn unresponsive in this matter.

Revision history for this message

Sam Liddicott (sam-liddicott) wrote on 2011-07-15:

#89

What makes it worse is that Shuttleworth abandoned his bounty idea; so it is impossible for individual users to financially support Canonical or Ubuntu, or get support or contribute to support on issues that matter.

Individual users (non-enterprise customers) are reduced to the level of beggars who can contribute if they don't mind being insulted by launchpad. And any value in the contributions will leak away.

Revision history for this message

Sam Liddicott (sam-liddicott) wrote on 2011-07-15:

#90

I know I'm filing in the wrong forum. I did open a question on this: https://answers.launchpad.net/ubuntu/+question/157752
but someone re-filed it wrongly and left it.

I try to point out what is going wrong, but people actively don't want to hear.

Revision history for this message

avdd (avdd) wrote on 2011-07-15:

#91

reddit bomb?

Revision history for this message

Sam Liddicott (sam-liddicott) wrote on 2011-07-15:

#92

what does "reddit bomb" mean?

Revision history for this message

elementz (memetical) wrote on 2011-07-15:

#93

This will be too off-topic, but since this bug is abandoned anyway:
I guess avdd refers to the fact that it could help to raise awareness of this topic and the shortcomings of launchpad/ubuntu bugfixing over at reddit. If upvoted enough, this could lead to high exposure and gain traction simply by the sheer mass of users interested in the topic.
But I highly doubt that users over at reddit will really care. Maybe hackernews could be a better place for this.

Revision history for this message

Tyler Hicks (tyhicks) wrote on 2011-07-15: Re: [Bug 509180] Re: ecryptfs sometimes seems to add trailing garbage to encrypted files

#94

On Fri Jul 15, 2011 at 09:34:47AM -0000, memetical <email address hidden> wrote:
> I already tried the ecryptfs mailing list quite some time ago, but did
> not hear anything back from the developers til this day. I believe
> our best option is simply NOT USING ecryptfs in a production
> environment. Especially since developers in ubuntu and ecryptfs itself
> are so darn unresponsive in this matter.

Sorry - I must have missed your message on the mailing list. This bug
has been fixed upstream (I'm the upstream maintainer) and will need to
be backported to Ubuntu kernels. I'll alert a member of the Ubuntu
kernel team about the severity of this bug.

Tim Gardner (timg-tpi) on 2011-07-15

Changed in linux (Ubuntu Oneiric):
status:	Won't Fix → Fix Released
Changed in linux (Ubuntu Lucid):
assignee:	nobody → John Johansen (jjohansen)
status:	New → In Progress
Changed in linux (Ubuntu Maverick):
assignee:	nobody → John Johansen (jjohansen)
status:	New → In Progress
Changed in linux (Ubuntu Natty):
assignee:	nobody → John Johansen (jjohansen)
status:	New → In Progress

Revision history for this message

Tim Gardner (timg-tpi) wrote on 2011-07-15:

#95

Tyler - is there a backportable commit? I see 3 with the word 'corrupt' in them.

Revision history for this message

Tyler Hicks (tyhicks) wrote on 2011-07-15:

#96

Tim - upstream git commit is 3aeb86ea4cd15f728147a3bd5469a205ada8c767

Revision history for this message

Tim Gardner (timg-tpi) wrote on 2011-07-15:

#97

Natty: eCryptfs: Handle failed metadata read in lookup (5.9 KiB, text/plain)

Tyler - please review this backport to 2.6.38 (natty). The only thing I had to think about was the value to assign ECRYPTFS_I_SIZE_INITIALIZED. I chose to simply increment the mask by <<1.

Changed in linux (Ubuntu Natty):
assignee:	John Johansen (jjohansen) → Tim Gardner (timg-tpi)

Revision history for this message

Tim Gardner (timg-tpi) wrote on 2011-07-15:

#98

Maverick: eCryptfs: Handle failed metadata read in lookup (5.8 KiB, text/plain)

Changed in linux (Ubuntu Maverick):
assignee:	John Johansen (jjohansen) → Tim Gardner (timg-tpi)
Changed in linux (Ubuntu Oneiric):
assignee:	John Johansen (jjohansen) → nobody

Revision history for this message

Dustin Kirkland  (kirkland) wrote on 2011-07-15:

#99

From Bug #810860:

Tyler Hicks (tyhicks) wrote:
The upstream fix for this is http://git.kernel.org/linus/3aeb86ea4cd15f728147a3bd5469a205ada8c767

Revision history for this message

Tyler Hicks (tyhicks) wrote on 2011-07-15:

#100

On Fri Jul 15, 2011 at 05:15:08PM -0000, Tim Gardner <email address hidden> wrote:
> Tyler - please review this backport to 2.6.38 (natty). The only thing
> I had to think about was the value to assign
> ECRYPTFS_I_SIZE_INITIALIZED. I chose to simply increment the mask by
> <<1.

Ack - looks good to me.

FYI, this trivial patch is why you had to use different flag values:
http://git.kernel.org/linus/fed8859b3ab94274c986cbdf7d27130e0545f02c

Those flags never hit the disk (only live in memory), so there is no
harm in the patch that you applied.

Revision history for this message

Tim Gardner (timg-tpi) wrote on 2011-07-19:

#101

Lucid: eCryptfs: Handle failed metadata read in lookup (5.8 KiB, text/plain)

Tim Gardner (timg-tpi) on 2011-07-20

Changed in linux (Ubuntu Natty):
status:	In Progress → Fix Committed
Changed in linux (Ubuntu Maverick):
status:	In Progress → Fix Committed
Changed in linux (Ubuntu Lucid):
status:	In Progress → Fix Committed

Revision history for this message

Tim Gardner (timg-tpi) wrote on 2011-07-20:

#102

SRU Justification

Impact: files mounted under ecryptfs can be corrupted.

Patch Description: backport upstream 3aeb86ea4cd15f728147a3bd5469a205ada8c767, eCryptfs: Handle failed metadata read in lookup

Revision history for this message

Tim Gardner (timg-tpi) wrote on 2011-07-21:

#103

Lucid V2: eCryptfs: Handle failed metadata read in lookup (5.9 KiB, text/plain)

Tyler - I updated the patch for Lucid (2.6.32) to use num_header_bytes_at_front instead of metadata_size in ecryptfs_i_size_init(). Would you have another look to make sure this version isn't going to toast the on-disk format? Thanks.

Revision history for this message

Ian! D. Allen (idallen) wrote on 2011-07-28:

#104

Linux linux 2.6.38-11-generic #47-Ubuntu SMP Fri Jul 15 19:27:09 UTC 2011 x86_64 x86_64 x86_64 GNU/Linux

Description: Ubuntu 11.04
Codename: natty

After months of kernel and application crashes and corrupt files, I
was ready to abandon Ubuntu 11.04 and go to some other distribution.
Before doing that, I wrote a script to repeatedly md5sum the files in
my ecryptfs directory and compare the results. I went single-user to
ensure that nothing else was running and, sure enough, the script showed
that md5sums changed randomly on files that I never touched.

I went looking today and found this launchpad entry indicating that you
have known for 18 months (since 9.10!) that ecryptfs is broken and is
not suitable for production use.

Why didn't you disable it in recent releases? My 11.04 install offered
to encrypt my home directory during installation, yet you've known for
18 months that to do so would corrupt my system and crash it.

I am frustrated that nobody took prompt action to disable the use of
ecryptfs and to notify those of us using it.
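
The script mentioned in this comment was not attached. A minimal sketch of that kind of check, with invented names `md5_tree` and `changed`, could hash every file under a directory, hash again later, and report paths whose digest differs even though nothing wrote to them.

```python
import hashlib
import os


def md5_tree(root):
    """Map each regular file's path (relative to root) to its MD5 hex digest."""
    digests = {}
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.islink(path) or not os.path.isfile(path):
                continue
            rel = os.path.relpath(path, root)
            h = hashlib.md5()
            try:
                with open(path, "rb") as f:
                    for block in iter(lambda: f.read(1 << 20), b""):
                        h.update(block)
            except OSError as exc:
                # Record read failures too; they are part of the symptom.
                digests[rel] = "error: %s" % exc.strerror
                continue
            digests[rel] = h.hexdigest()
    return digests


def changed(before, after):
    """Return sorted paths whose entry differs between two snapshots."""
    return sorted(p for p in before.keys() | after.keys()
                  if before.get(p) != after.get(p))
```

Taking `md5_tree` snapshots a few minutes apart on an otherwise idle system and passing them to `changed` should return an empty list on a healthy filesystem.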

Revision history for this message

Dustin Kirkland  (kirkland) wrote on 2011-07-28:

#105

Ian-

I'd love to understand why some users are affected by this, and other
users aren't. Some people, such as myself, have hundreds of GB,
thousands of files, using them all day, every day, and do not suffer
from this problem.

--
:-Dustin

Revision history for this message

Ian! D. Allen (idallen) wrote on 2011-08-07:

#106

@Dustin
Things that might make a difference: 64-bit SMP kernel on AMD quad-core, 120GB SSD hard disk, 8GB memory.

Revision history for this message

Gioele Barabucci (gioele) wrote on 2011-08-07:

#107

@idallen:

This also happens on a somewhat old Dell Optiplex SD280: HT Pentium 4, 1 GiB of RAM, rotational HD.

Revision history for this message

Mario Vukelic (kreuzsakra) wrote on 2011-08-07:

#108

@Dustin: I have a 300 GB internal traditional HD and am running the amd64+mac version (Natty, fresh install) on an Intel Core 2 Duo P8700, 4 GB RAM

Revision history for this message

Félim Whiteley (felimwhiteley) wrote on 2011-08-09:

#109

I've just been bitten by this. Something ate 500GB (yes GB!) one night when my machine was on. I couldn't find out what had done it; the machine eventually restarted and the culprit was gone, so it must have been some crazy cache issue. It has made a right mess of KDE for me; Akonadi looks to be broken because of it. I continually get messages like:

[32223.735781] Valid eCryptfs headers not found in file header region or xattr region
[32223.735788] Either the lower file is not in a valid eCryptfs format, or the key could not be retrieved. Plaintext passthrough mode is not enabled; returning -EIO

Kernel is: 2.6.38-11-generic #48-Ubuntu SMP Fri Jul 29 19:02:55 UTC 2011 x86_64 x86_64 x86_64 GNU/Linux

Running find ../.ecryptfs/felim/.Private -xdev -size 0c -exec ls '{}' \; | wc -l

Gives me 351 files... I've tried following this thread but it's hard to know what can be done. Is a restore from backup the only real option?

Revision history for this message

Tim Gardner (timg-tpi) wrote on 2011-08-09:

#110

According to the upstream maintainer (Tyler Hicks) the patch to fix this issue is https://bugs.launchpad.net/ecryptfs/+bug/509180/comments/96 which has been applied to Lucid/Maverick/Natty. You can get an experimental kernel containing this patch (and others) from https://launchpad.net/~kernel-ppa/+archive/pre-proposed

Félim Whiteley (felimwhiteley) wrote on 2011-08-09:

#111

Hmmm I did that and I am running:

2.6.38-11-generic #49~pre201108030903-Ubuntu SMP Wed Aug 3 09:34:11 UTC 2011 x86_64 x86_64 x86_64 GNU/Linux

But just got:

[ 155.829002] Valid eCryptfs headers not found in file header region or xattr region
[ 155.829008] Either the lower file is not in a valid eCryptfs format, or the key could not be retrieved. Plaintext passthrough mode is not enabled; returning -EIO

Herton R. Krzesinski (herton) wrote on 2011-08-12:

#112

This bug is awaiting verification that the kernel for maverick in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-maverick' to 'verification-done-maverick'.

If verification is not done by one week from today, this fix will be dropped from the source code, and this bug will be closed.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation on how to enable and use -proposed. Thank you!

tags:

added: verification-needed-maverick

reliable-robin-22 (nicolasdiogo) wrote on 2011-08-18:

#113

hello

just joining the choir

it is still present on Natty x64

should this not have a high level

do we require further info?

======================================================
# mount
/dev/sda3 on / type ext4 (rw,noatime,commit=0)
proc on /proc type proc (rw,noexec,nosuid,nodev)
none on /sys type sysfs (rw,noexec,nosuid,nodev)
fusectl on /sys/fs/fuse/connections type fusectl (rw)
none on /sys/kernel/debug type debugfs (rw)
none on /sys/kernel/security type securityfs (rw)
none on /dev type devtmpfs (rw,mode=0755)
none on /dev/pts type devpts (rw,noexec,nosuid,gid=5,mode=0620)
none on /dev/shm type tmpfs (rw,nosuid,nodev)
none on /var/run type tmpfs (rw,nosuid,mode=0755)
none on /var/lock type tmpfs (rw,noexec,nosuid,nodev)
/dev/sda6 on /home type ext4 (rw,commit=0)
binfmt_misc on /proc/sys/fs/binfmt_misc type binfmt_misc (rw,noexec,nosuid,nodev)
/home/MYUSER/.Private on /home/MYUSER type ecryptfs (ecryptfs_check_dev_ruid,ecryptfs_cipher=aes,ecryptfs_key_bytes=16,ecryptfs_unlink_sigs,ecryptfs_sig=<KEY>,ecryptfs_fnek_sig=<KEY2>)
gvfs-fuse-daemon on /home/MYUSER/.gvfs type fuse.gvfs-fuse-daemon (rw,nosuid,nodev,user=MYUSER)
/dev/sda1 on /media/System Reserved type fuseblk (rw,nosuid,nodev,allow_other,blksize=4096,default_permissions)
======================================================

dmesg
======================================================
[34279.235317] Either the lower file is not in a valid eCryptfs format, or the key could not be retrieved. Plaintext passthrough mode is not enabled; returning -EIO
[34279.235390] Valid eCryptfs headers not found in file header region or xattr region
======================================================

please let me know if you want more info

thanks,

reliable-robin-22 (nicolasdiogo) wrote on 2011-08-19:

#114

added the option for 'proposed' updates in Synaptic

and the problem persists.

=======================================================
[ 481.927093] Valid eCryptfs headers not found in file header region or xattr region
[ 481.927099] Either the lower file is not in a valid eCryptfs format, or the key could not be retrieved. Plaintext passthrough mode is not enabled; returning -EIO
[ 481.927312] Valid eCryptfs headers not found in file header region or xattr region
[ 481.927317] Either the lower file is not in a valid eCryptfs format, or the key could not be retrieved. Plaintext passthrough mode is not enabled; returning -EIO
=======================================================

my question is: how reliable is this method of storing my personal data encrypted?

besides keeping a backup of everything I have stored in my $HOME, is there an alternative to this?

thanks,

Steve Conklin (sconklin) wrote on 2011-08-19:

#115

This bug is awaiting verification that the kernel for Lucid in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-lucid' to 'verification-done-lucid'.

If verification is not done by one week from today, this fix will be dropped from the source code, and this bug will be closed.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation on how to enable and use -proposed. Thank you!

tags:

added: verification-needed-lucid

Steve Conklin (sconklin) wrote on 2011-08-23:

#116

For the people reporting that this is still present after updating the kernel from -proposed, can you absolutely confirm for us which version of the kernel you are running by pasting the output from 'uname -a' into this bug?

Thanks. It's not that we doubt you, we just want to double check.

Steve Conklin (sconklin) wrote on 2011-08-29:

#117

This bug has not been verified as fixed in the -proposed kernels for Lucid or Maverick, and the patch will be reverted from those series.

Herton R. Krzesinski (herton) on 2011-08-31

tags:

added: verification-reverted-lucid
removed: verification-needed-lucid

Herton R. Krzesinski (herton) wrote on 2011-08-31:

#118

This bug is awaiting verification that the kernel for Natty in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-natty' to 'verification-done-natty'.

If verification is not done by one week from today, this fix will be dropped from the source code, and this bug will be closed.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation on how to enable and use -proposed. Thank you!

tags:

added: verification-needed-natty

Tomi Hukkalainen (tpievila) wrote on 2011-09-01:

#119

$ uname -a
Linux puppy-ubuntu 2.6.39-020639rc5-generic #201105041556 SMP Wed May 4 15:59:47 UTC 2011 x86_64 x86_64 x86_64 GNU/Linux

I enabled proposed on natty, but still get the messages. Does the new kernel fix a problem that has already occurred, or should I remove all the offending files and see if the problem reappears?

Herton R. Krzesinski (herton) wrote on 2011-09-01:

#120

@Tomi: since the original problem is corruption of files (extra trailing garbage bytes added), I would expect that you have to ignore already corrupted files and see if it happens again with non-corrupted ones.

tags:

added: verification-reverted-maverick
removed: verification-needed-maverick

Tomi Hukkalainen (tpievila) wrote on 2011-09-01:

#121

I have deleted (again) the unreadable files, but I think a week is an unreasonably short time to verify the fix, as this bug has no reliable way to reproduce it. Also, I don't see how it makes sense to remove the fix AND close the bug without verification, leaving it certainly not fixed.

Tomi Hukkalainen (tpievila) wrote on 2011-09-01:

#122

I tried "cat /dev/zero > zerofile" since heavy writing / disk exhaustion seemed to be related to the problem. Sure enough, once free space dropped to zero, the same errors appeared again in dmesg. After deleting the zerofile I logged out and tried to log in, but it failed. Then I tried to shut down cleanly, but that failed too. After a hard reboot the disk space had been freed, and .Private again contained zero-length files. The perm trick reveals that the zeroed files include gnome applet files and other stuff that could logically have been written to without user action. All of them are now completely unreadable due to input/output errors.

I'd say that ecryptfs is still extremely dangerous to use due to data corruption in a common scenario: writing too much. It simply cannot be offered to users without some kind of warning that this happens.

Tomi Hukkalainen (tpievila) wrote on 2011-09-07:

#123

I'm now getting the same errors in dmesg. In fact, I'm not doing anything and more and more keep appearing (about two errors every two seconds). *But* there are no zero-sized files under .Private, so I can no longer even remove them and cannot get rid of the errors even temporarily...

Verneri Åberg (verneri-aberg) wrote on 2011-09-09:

#124

I'm getting the same errors constantly in dmesg (several times a second) with 64-bit natty and no zero-length files in .Private. It first started to happen a bit after the upgrade to natty, then disappeared for a few days, and started again yesterday. I don't seem to be able to find the offending files, so even if the proposed kernel would help, I couldn't fix the existing problem...

As this is hanging nautilus constantly, it makes normal usage almost impossible. Oddly, nautilus runs fine when run with gksudo.

Launchpad Janitor (janitor) wrote on 2011-09-13:

#125

This bug was fixed in the package linux - 2.6.35-30.59

---------------
linux (2.6.35-30.59) maverick-proposed; urgency=low

[ Herton R. Krzesinski ]

  * Release Tracking Bug
    - LP: #837449

[ Upstream Kernel Changes ]

  * Revert "drm/nv50-nvc0: work around an evo channel hang that some people
    see"
  * Revert "eCryptfs: Handle failed metadata read in lookup"

linux (2.6.35-30.58) maverick-proposed; urgency=low

[ Herton R. Krzesinski ]

  * Release Tracking Bug
    - LP: #828376

[ Upstream Kernel Changes ]

  * proc: fix oops on invalid /proc/<pid>/maps access, CVE-2011-1020
    - LP: #813026
    - CVE-2011-1020

linux (2.6.35-30.57) maverick-proposed; urgency=low

[ Herton R. Krzesinski ]

  * Release Tracking Bug
    - LP: #823306

[ Tim Gardner ]

  * SAUCE: rtl8192se: Force a build for a 2.6/3.0 kernel
    - LP: #805494
  * [Config] Add enic/fnic to udebs
    - LP: #801610

[ Upstream Kernel Changes ]

  * taskstats: don't allow duplicate entries in listener mode,
    CVE-2011-2484
    - LP: #806390
    - CVE-2011-2484
  * dccp: handle invalid feature options length, CVE-2011-1770
    - LP: #806375
    - CVE-2011-1770
  * eCryptfs: Handle failed metadata read in lookup
    - LP: #509180
  * pagemap: close races with suid execve, CVE-2011-1020
    - LP: #813026
    - CVE-2011-1020
  * report errors in /proc/*/*map* sanely, CVE-2011-1020
    - LP: #813026
    - CVE-2011-1020
  * close race in /proc/*/environ, CVE-2011-1020
    - LP: #813026
    - CVE-2011-1020
  * auxv: require the target to be tracable (or yourself), CVE-2011-1020
    - LP: #813026
    - CVE-2011-1020
  * deal with races in /proc/*/{syscall, stack, personality}, CVE-2011-1020
    - LP: #813026
    - CVE-2011-1020
  * rose: Add length checks to CALL_REQUEST parsing, CVE-2011-1493
    - LP: #816550
    - CVE-2011-1493
  * Bluetooth: l2cap and rfcomm: fix 1 byte infoleak to userspace.
    - LP: #819569
    - CVE-2011-2492
  * drm/nv50-nvc0: work around an evo channel hang that some people see
    - LP: #583760
-- Herton Ronaldo Krzesinski <email address hidden> Tue, 30 Aug 2011 12:11:13 -0300

Changed in linux (Ubuntu Maverick):
status:	Fix Committed → Fix Released

Herton R. Krzesinski (herton) wrote on 2011-09-13:

#126

Please ignore the fixed messages from the janitor. The included fix was reverted because verification was not done or failed on the three releases (natty, maverick, lucid).

tags:	added: verification-reverted-natty removed: verification-needed-natty
Changed in linux (Ubuntu Maverick):
status:	Fix Released → Incomplete
Changed in linux (Ubuntu Lucid):
status:	Fix Committed → Incomplete
Changed in linux (Ubuntu Natty):
status:	Fix Committed → Incomplete

Launchpad Janitor (janitor) wrote on 2011-09-21:

#127

This bug was fixed in the package linux - 2.6.38-11.50

---------------
linux (2.6.38-11.50) natty-proposed; urgency=low

[ Herton R. Krzesinski ]

  * Release Tracking Bug
    - LP: #848246

[ Upstream Kernel Changes ]

  * Revert "eCryptfs: Handle failed metadata read in lookup"
  * Revert "KVM: fix kvmclock regression due to missing clock update"
  * Revert "ath9k: use split rx buffers to get rid of order-1 skb
    allocations"

linux (2.6.38-11.49) natty-proposed; urgency=low

[ Herton R. Krzesinski ]

  * Release Tracking Bug
    - LP: #836903

[ Adam Jackson ]

  * SAUCE: drm/i915/pch: Fix integer math bugs in panel fitting
    - LP: #753994

[ Keng-Yu Lin ]

  * SAUCE: Input: ALPS - Enable Intellimouse mode for Lenovo Zhaoyang E47
    - LP: #632884, #803005

[ Stefan Bader ]

  * [Config] Force perf to use libiberty for demangling
    - LP: #783660

[ Tim Gardner ]

  * [Config] Add enic/fnic to udebs
    - LP: #801610

[ Upstream Kernel Changes ]

  * eeepc-wmi: add keys found on EeePC 1215T
    - LP: #812644
  * eCryptfs: Handle failed metadata read in lookup
    - LP: #509180
  * pagemap: close races with suid execve, CVE-2011-1020
    - LP: #813026
    - CVE-2011-1020
  * report errors in /proc/*/*map* sanely, CVE-2011-1020
    - LP: #813026
    - CVE-2011-1020
  * close race in /proc/*/environ, CVE-2011-1020
    - LP: #813026
    - CVE-2011-1020
  * auxv: require the target to be tracable (or yourself), CVE-2011-1020
    - LP: #813026
    - CVE-2011-1020
  * deal with races in /proc/*/{syscall, stack, personality}, CVE-2011-1020
    - LP: #813026
    - CVE-2011-1020
  * vmscan: fix a livelock in kswapd
    - LP: #813797
  * mmc: Add PCI fixup quirks for Ricoh 1180:e823 reader
    - LP: #773524
  * mmc: Added quirks for Ricoh 1180:e823 lower base clock frequency
    - LP: #773524
  * rose: Add length checks to CALL_REQUEST parsing, CVE-2011-1493
    - LP: #816550
    - CVE-2011-1493
  * pata_marvell: Add support for 88SE91A0, 88SE91A4
    - LP: #777325
  * GFS2: make sure fallocate bytes is a multiple of blksize, CVE-2011-2689
    - LP: #819572
    - CVE-2011-2689
  * Bluetooth: l2cap and rfcomm: fix 1 byte infoleak to userspace.
    - LP: #819569
    - CVE-2011-2492
  * drm/nv50-nvc0: work around an evo channel hang that some people see
    - LP: #583760
  * KVM: fix kvmclock regression due to missing clock update
    - LP: #795717
  * Add mount option to check uid of device being mounted = expect uid,
    CVE-2011-1833
    - LP: #732628
    - CVE-2011-1833
  * proc: fix oops on invalid /proc/<pid>/maps access, CVE-2011-1020
    - LP: #813026
    - CVE-2011-1020
  * ipv6: make fragment identifications less predictable, CVE-2011-2699
    - LP: #827685
    - CVE-2011-2699
  * ath9k: use split rx buffers to get rid of order-1 skb allocations
    - LP: #728835
  * perf: Fix software event overflow, CVE-2011-2918
    - LP: #834121
    - CVE-2011-2918
-- Herton Ronaldo Krzesinski <email address hidden> Mon, 12 Sep 2011 17:23:38 -0300

Changed in linux (Ubuntu Natty):
status:	Incomplete → Fix Released

Ian! D. Allen (idallen) wrote on 2011-09-23:

#129

See also comments #104 and #128.

In parallel with the looping md5sum of everything in the 17G ecryptfs
read-only partition described above, I picked one of the files that
had produced different md5sums on different runs and wrote this script
to repeatedly md5sum the same file over and over (filename shortened
for clarity):

#!/bin/sh -u
# -Ian! D. Allen - <email address hidden> - www.idallen.com

    f=filename
    set -- $( md5sum "$f" )
    start=$1
    while : ; do
        set -- $( md5sum "$f" )
        if [ "$1" != "$start" ] ; then
            echo "$start != $*"
        fi
    done

Over the course of two days running my tests, the above script produced
two outputs, indicating two times when that one file was read and had
a different checksum from when the script started:

c68292cceb13f21de7375c46c0ffdf9a != 58a90406649b9e795f9e1e6b34b806f8 filename
c68292cceb13f21de7375c46c0ffdf9a != 58a90406649b9e795f9e1e6b34b806f8 filename

This file is on a read-only ecryptfs partition. The md5sum should not
be changing.

I note that when the md5sum does change, it always changes to the same
"other" value; it's not random.

In my tests, the ecryptfs corruption seems to happen only when reading
from files with more than one link.

The file itself is 1132 bytes. Its encrypted base file is 12288 bytes.

Rich Wales (richw) wrote on 2011-09-24:

#130

Since you have identified a specific file which seems to be read in two different ways, it might be useful to capture the actual content of the file in each case (not just the md5sum values) and see if there is any discernible pattern to the corruption (e.g., extraneous trailing nulls in one of the two versions).

And if the corruption happens only with files that have more than one hard link, this might help explain why some people have not reported seeing the problem.

I've upgraded all five Ubuntu systems under my control to the latest Natty kernel (2.6.38-11.50), but I'm obviously still nervous as long as this bug appears to be evident.
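
One possible way to capture the differing content suggested above is to read the file once into a temporary copy, checksum the copy, and keep it only when the checksum differs from the known-good value. This is a rough sketch, not a tested reproducer: the function name, the argument order, and the output directory are all hypothetical, and it assumes GNU md5sum and mktemp.

```shell
#!/bin/sh
# Hypothetical helper: copy a file once, and keep the copy only if its
# md5sum differs from the expected (known-good) checksum.
# Returns 0 and prints the path of the kept copy when the content differs,
# 1 when the content matches, 2 if the temporary file cannot be created.
capture_if_changed() {
    f=$1          # file on the eCryptfs mount
    expected=$2   # known-good md5sum of that file
    outdir=$3     # directory (outside the mount) to hold captured copies
    tmp=$(mktemp "$outdir/capture.XXXXXX") || return 2
    cat "$f" > "$tmp"
    # Checksum the copy, so the bytes that were hashed are the bytes saved.
    set -- $(md5sum "$tmp")
    if [ "$1" != "$expected" ]; then
        echo "$tmp"
        return 0
    fi
    rm -f "$tmp"
    return 1
}
```

A kept copy can then be compared with a good copy using `cmp -l` or `hexdump -C` to see whether the difference is only trailing bytes.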

Ian! D. Allen (idallen) wrote on 2011-09-26:

#131

Ubuntu 11.04 natty
Linux ubuntu 2.6.38-11-generic #50-Ubuntu SMP Mon Sep 12 21:17:25 UTC 2011 x86_64 x86_64 x86_64 GNU/Linux

I'm now reproducing the problem in a VMware virtual machine by repeatedly
md5sum scanning a subset of files on that 17GB directory as small as
119 MB. (If I scan too small a directory, some sort of caching seems
to happen and I don't see any errors.)

What is happening is that the sizes of some files are changing on some
subsequent md5sum passes. Usually when one stat()s the file, one gets
the actual size of the unencrypted file. Occasionally, the size of the
underlying *encrypted* file is substituted for the real file size and
so the md5sum sums some extra bytes that it shouldn't.
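
A minimal sketch of how that substitution could be detected without hashing anything: look up the lower file by inode number and compare the two reported sizes. The function name is hypothetical. It assumes GNU stat and find, and that a file seen through the mount shares its inode number with its lower file, as the output further down suggests.

```shell
#!/bin/sh
# Hypothetical check: succeed (return 0) when the size reported through the
# eCryptfs mount equals the size of the lower file with the same inode
# number. Returns 1 when the sizes differ, 2 when the lookup fails.
size_leaked() {
    upper=$1      # file as seen through the eCryptfs mount
    lower_dir=$2  # lower directory, for example the .Private directory
    inum=$(stat -c %i "$upper") || return 2
    lower=$(find "$lower_dir" -xdev -inum "$inum" -print | head -n 1)
    [ -n "$lower" ] || return 2
    # Normally the lower file is larger, since it carries a header and
    # padding; equal sizes mean the encrypted size is being reported.
    [ "$(stat -c %s "$upper")" -eq "$(stat -c %s "$lower")" ]
}
```

Running this in a loop over the files being scanned would flag the bad passes directly, though it still says nothing about why the wrong size is returned.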

It's tricky to get at the bad content of the file, since to do so I'd
have to actually copy every file to a temporary location, check to
see if the size of the copy matched the original, and complain if not.
I'll work on doing that. Until then, here is an example.

Here is the correct file, md5sum, and size (61440):

9a601197629e5c0b68ecad8d039d1b51 23 1575321 33056 2 777 777 0 61440 1316934939 1163069184 1316934615 4096 136 /home/idallen-ecryptfs/[filename]

When I repeated the md5sum scanning over and over, suddenly I got this:
a wrong md5sum and wrong file size (69632):

e26ef8caef4c1eca7672a6e0678b3190 23 1575321 33056 2 777 777 0 69632 1316934939 1163069184 1316934615 4096 136 /home/idallen-ecryptfs/[filename]

Where does 69632 come from? Well, look at the encrypted file size that
corresponds to that inode:

# find /mnt/sdb1/idallen-ecryptfs/.Private -inum 1575321 -ls
1575321 68 -r--r----- 2 idallen idallen 69632 Nov 9 2006 /mnt/sdb1/idallen-ecryptfs/.Private/[...]

There it is! The underlying *encrypted* file size is bleeding up to be
the (incorrect) size of the *unencrypted* file on some passes.
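
The two sizes differ by exactly 8192 bytes, which would be consistent with an 8192-byte eCryptfs header in front of data padded to 4096-byte extents. Those two constants are assumptions that fit the numbers in this thread (they also fit the 1132-byte file with a 12288-byte encrypted base file mentioned earlier); they have not been checked here against the kernel source. A sketch of the arithmetic, with a hypothetical function name:

```shell
#!/bin/sh
# Hypothetical helper: estimate the lower (encrypted) file size for a given
# plaintext size, ASSUMING an 8192-byte header and 4096-byte extents.
expected_lower_size() {
    # $1: plaintext size in bytes
    header=8192   # assumed header size
    extent=4096   # assumed extent size
    # Round the plaintext size up to a whole number of extents, then add
    # the header.
    echo $(( header + ( ($1 + extent - 1) / extent ) * extent ))
}

# expected_lower_size 61440   prints 69632
# expected_lower_size 1132    prints 12288
```

If a file reports a size equal to this value instead of its real plaintext size, that matches the symptom described here.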

I'm also seeing these errors in kern.log, but these messages don't always
appear when I'm seeing corruption, so I don't know how to relate them:

[...]
Sep 25 20:43:12 ubuntu kernel: [59348.543107] ecryptfs_decrypt_page: Error attempting to read lower page; rc = [-4]
Sep 25 20:43:12 ubuntu kernel: [59348.543113] ecryptfs_readpage: Error decrypting page; rc = [-4]
Sep 25 21:59:10 ubuntu kernel: [63906.460167] ecryptfs_read_and_validate_header_region: Error reading header region; rc = [-4]
Sep 25 21:59:10 ubuntu kernel: [63906.460334] Valid eCryptfs headers not found in file header region or xattr region
Sep 25 21:59:10 ubuntu kernel: [63906.460336] Either the lower file is not in a valid eCryptfs format, or the key could not be retrieved. Plaintext passthrough mode is not enabled; returning -EIO

Ian! D. Allen (idallen) wrote on 2011-09-26:

#132

See also comments #104, #128, #129, #131.

Corruption (file size incorrect for ecryptfs files) also happens with this kernel from ppa:kernel-ppa/pre-proposed:

Linux ubuntu 2.6.38-11-generic #51~pre201109230902-Ubuntu SMP Fri Sep 23 09:15:48 UTC 2011 x86_64 x86_64 x86_64 GNU/Linux

dmesg shows:

[ 245.530576] ecryptfs_decrypt_page: Error attempting to read lower page; rc = [-4]
[ 245.530600] ecryptfs_readpage: Error decrypting page; rc = [-4]
[ 245.531485] ecryptfs_decrypt_page: Error attempting to read lower page; rc = [-4]
[ 245.531494] ecryptfs_readpage: Error decrypting page; rc = [-4]

As noted previously, corruption of file sizes happens even when no errors show in dmesg or kern.log.

Launchpad Janitor (janitor) wrote on 2011-09-29:

#133

This bug was fixed in the package linux - 2.6.32-34.77

---------------
linux (2.6.32-34.77) lucid-proposed; urgency=low

[ Steve Conklin ]

  * Release Tracking Bug
    - LP: #849228

[ Upstream Kernel Changes ]

  * Revert "drm/i915: Remove BUG_ON from i915_gem_evict_something"
  * Revert "drm/i915: Periodically flush the active lists and requests"
  * Revert "drm/i915/evict: Ensure we completely cleanup on failure"
  * Revert "drm/i915: Maintain LRU order of inactive objects upon access by
    CPU (v2)"
  * Revert "drm/i915: Implement fair lru eviction across both rings. (v2)"
  * Revert "drm/i915: Move the eviction logic to its own file."
  * Revert "drm/i915: prepare for fair lru eviction"

linux (2.6.32-34.76) lucid-proposed; urgency=low

[ Steve Conklin ]

  * Release Tracking Bug
    - LP: #836914

[ Upstream Kernel Changes ]

  * Revert "drm/nv50-nvc0: work around an evo channel hang that some people
    see"
  * Revert "eCryptfs: Handle failed metadata read in lookup"
  * Revert "tunnels: fix netns vs proto registration ordering"

linux (2.6.32-34.75) lucid-proposed; urgency=low

[ Herton R. Krzesinski ]

  * Release Tracking Bug
    - LP: #832332

[ Upstream Kernel Changes ]

  * drm/i915: Remove BUG_ON from i915_gem_evict_something
    - LP: #828550

linux (2.6.32-34.74) lucid-proposed; urgency=low

[ Herton R. Krzesinski ]

  * Release Tracking Bug
    - LP: #828375

[ Upstream Kernel Changes ]

  * proc: fix oops on invalid /proc/<pid>/maps access, CVE-2011-1020
    - LP: #813026
    - CVE-2011-1020

linux (2.6.32-34.73) lucid-proposed; urgency=low

[ Herton R. Krzesinski ]

  * Release Tracking Bug
    - LP: #824148

[ Tim Gardner ]

  * SAUCE: rtl8192se: Force a build for a 2.6/3.0 kernel
    - LP: #805494
  * [Config] Add enic/fnic to udebs
    - LP: #801610

[ Upstream Kernel Changes ]

  * tty: icount changeover for other main devices, CVE-2010-4076,
    CVE-2010-4077
    - LP: #720189
    - CVE-2010-4077
  * fs/partitions/efi.c: corrupted GUID partition tables can cause kernel
    oops
    - LP: #795418
    - CVE-2011-1577
  * ftrace: Only update the function code on write to filter files
    - LP: #802383
  * kmemleak: Do not return a pointer to an object that kmemleak did not
    get
    - LP: #802383
  * CPU hotplug, re-create sysfs directory and symlinks
    - LP: #802383
  * Fix memory leak in cpufreq_stat
    - LP: #802383
  * powerpc/kexec: Fix memory corruption from unallocated slaves
    - LP: #802383
  * powerpc/oprofile: Handle events that raise an exception without
    overflowing
    - LP: #802383
  * mtd: mtdconcat: fix NAND OOB write
    - LP: #802383
  * x86, 64-bit: Fix copy_[to/from]_user() checks for the userspace address
    limit
    - LP: #802383
  * ext3: Fix fs corruption when make_indexed_dir() fails
    - LP: #802383
  * jbd: Fix forever sleeping process in do_get_write_access()
    - LP: #802383
  * jbd: fix fsync() tid wraparound bug
    - LP: #802383
  * ext4: release page cache in ext4_mb_load_buddy error path
    - LP: #802383
  * Fix Ultrastor asm snippet
    - LP: #802383
  * x86, amd: Do not enable ARAT feature on AMD processors below family
    0x12
    - LP: #802383
  * x86, amd: Use _safe() msr access for GartTlbWlk disable code
    - LP: #802383
  * rcu: Fix unpaired rcu_irq_enter() from locking selftests
    - LP: #802383
  * staging: usbip: fix wrong endian conversion
    - LP: #802383
  * Fix for buffer overflow in ldm_frag_add not sufficient
    - LP: #802383
  * seqlock: Don't smp_rmb in seqlock reader spin loop
    - LP: #802383
  * ALSA: HDA: Use one dmic only for Dell Studio 1558
    - LP: #731706, #802383
  * ASoC: Ensure output PGA is enabled for line outputs in wm_hubs
    - LP: #802383
  * ASoC: Add some missing volume update bit sets for wm_hubs devices
    - LP: #802383
  * mm/page_alloc.c: prevent unending loop in __alloc_pages_slowpath()
    - LP: #802383
  * loop: limit 'max_part' module param to DISK_MAX_PARTS
    - LP: #802383
  * loop: handle on-demand devices correctly
    - LP: #802383
  * USB: CP210x Add 4 Device IDs for AC-Services Devices
    - LP: #802383
  * USB: moto_modem: Add USB identifier for the Motorola VE240.
    - LP: #802383
  * USB: serial: ftdi_sio: adding support for TavIR STK500
    - LP: #802383
  * USB: gamin_gps: Fix for data transfer problems in native mode
    - LP: #802383
  * usb/gadget: at91sam9g20 fix end point max packet size
    - LP: #802383
  * usb: gadget: rndis: don't test against req->length
    - LP: #802383
  * OHCI: fix regression caused by nVidia shutdown workaround
    - LP: #802383
  * p54usb: add zoom 4410 usbid
    - LP: #802383
  * eCryptfs: Allow 2 scatterlist entries for encrypted filenames
    - LP: #802383
  * UBIFS: fix a rare memory leak in ro to rw remounting path
    - LP: #802383
  * i8k: Avoid lahf in 64-bit code
    - LP: #802383
  * cpuidle: menu: fixed wrapping timers at 4.294 seconds
    - LP: #802383
  * dm table: reject devices without request fns
    - LP: #802383
  * atm: expose ATM device index in sysfs
    - LP: #802383
  * brd: limit 'max_part' module param to DISK_MAX_PARTS
    - LP: #802383
  * brd: handle on-demand devices correctly
    - LP: #802383
  * SUNRPC: Deal with the lack of a SYN_SENT sk->sk_state_change
    callback...
    - LP: #802383
  * PCI: Add quirk for setting valid class for TI816X Endpoint
    - LP: #802383
  * xen mmu: fix a race window causing leave_mm BUG()
    - LP: #802383
  * netfilter: nf_conntrack_reasm: properly handle packets fragmented into
    a single fragment
    - LP: #802383
  * fix memory leak in scsi_report_lun_scan
    - LP: #802383
  * fix refcounting bug in scsi_get_host_dev
    - LP: #802383
  * fix duplicate removal on error path in scsi_sysfs_add_sdev
    - LP: #802383
  * UBIFS: fix shrinker object count reports
    - LP: #802383
  * UBIFS: fix memory leak on error path
    - LP: #802383
  * nbd: limit module parameters to a sane value
    - LP: #802383
  * mm: fix ENOSPC returned by handle_mm_fault()
    - LP: #802383
  * PCI: Set PCIE maxpayload for card during hotplug insertion
    - LP: #802383
  * nl80211: fix check for valid SSID size in scan operations
    - LP: #802383
  * lockdep: Fix lock_is_held() on recursion
    - LP: #802383
  * drm/i915: Add a no lvds quirk for the Asus EeeBox PC EB1007
    - LP: #802383
  * drm/radeon/kms: fix for radeon on systems >4GB without hardware iommu
    - LP: #802383
  * fat: Fix corrupt inode flags when remove ATTR_SYS flag
    - LP: #802383
  * xen: off by one errors in multicalls.c
    - LP: #802383
  * x86/amd-iommu: Fix 3 possible endless loops
    - LP: #802383
  * USB: cdc-acm: Adding second ACM channel support for Nokia E7 and C7
    - LP: #802383
  * USB: core: Tolerate protocol stall during hub and port status read
    - LP: #802383
  * USB: serial: add another 4N-GALAXY.DE PID to ftdi_sio driver
    - LP: #802383
  * ALSA: hda: Fix quirk for Dell Inspiron 910
    - LP: #792712, #802383
  * oprofile, dcookies: Fix possible circular locking dependency
    - LP: #802383
  * CPUFREQ: Remove cpufreq_stats sysfs entries on module unload.
    - LP: #802383
  * md: check ->hot_remove_disk when removing disk
    - LP: #802383
  * md/raid5: fix raid5_set_bi_hw_segments
    - LP: #802383
  * md/raid5: fix FUA request handling in ops_run_io()
    - LP: #802383
  * ata: use pci_dev->revision
    - LP: #802383
  * pata_cmd64x: fix PIO setup
    - LP: #802383
  * pata_cmd64x: cmd648_bmdma_stop() fix
    - LP: #802383
  * pata_cmd64x: remove unused definitions
    - LP: #802383
  * pata_cm64x: fix boot crash on parisc
    - LP: #802383
  * ACPI: use _HID when supplied by root-level devices
    - LP: #802383
  * xfs: properly account for reclaimed inodes
    - LP: #802383
  * exec: delay address limit change until point of no return
    - LP: #802383
  * netfilter: IPv6: initialize TOS field in REJECT target module
    - LP: #802383
  * netfilter: IPv6: fix DSCP mangle code
    - LP: #802383
  * genirq: Add IRQF_FORCE_RESUME
    - LP: #802383
  * xen: Use IRQF_FORCE_RESUME
    - LP: #802383
  * time: Compensate for rounding on odd-frequency clocksources
    - LP: #802383
  * Linux 2.6.32.42
    - LP: #802383
  * taskstats: don't allow duplicate entries in listener mode,
    CVE-2011-2484
    - LP: #806390
    - CVE-2011-2484
  * drm_mm: extract check_free_mm_node
    - LP: #599017, #807508
  * drm: implement helper functions for scanning lru list
    - LP: #599017, #807508
  * drm/i915: prepare for fair lru eviction
    - LP: #599017, #807508
  * drm/i915: Move the eviction logic to its own file.
    - LP: #599017, #807508
  * drm/i915: Implement fair lru eviction across both rings. (v2)
    - LP: #599017, #807508
  * drm/i915: Maintain LRU order of inactive objects upon access by CPU
    (v2)
    - LP: #599017, #807508
  * drm/i915/evict: Ensure we completely cleanup on failure
    - LP: #599017, #807508
  * drm/i915: Periodically flush the active lists and requests
    - LP: #599017, #807508
  * Linux 2.6.32.42+drm33.19
    - LP: #807508
  * net: add limit for socket backlog CVE-2010-4251
    - LP: #807462
  * tcp: use limited socket backlog CVE-2010-4251
    - LP: #807462
  * ipv6: udp: Optimise multicast reception
    - LP: #807462
  * ipv4: udp: Optimise multicast reception
    - LP: #807462
  * udp: multicast RX should increment SNMP/sk_drops counter in allocation
    failures CVE-2010-4251
    - LP: #807462
  * udp: use limited socket backlog CVE-2010-4251
    - LP: #807462
  * llc: use limited socket backlog CVE-2010-4251
    - LP: #807462
  * sctp: use limited socket backlog CVE-2010-4251
    - LP: #807462
  * tipc: use limited socket backlog CVE-2010-4251
    - LP: #807462
  * x25: use limited socket backlog CVE-2010-4251
    - LP: #807462
  * net: backlog functions rename CVE-2010-4251
    - LP: #807462
  * net: sk_add_backlog() take rmem_alloc into account CVE-2010-4805
    - LP: #809318
  * ksm: fix NULL pointer dereference in scan_get_next_rmap_item()
    - LP: #810425
  * migrate: don't account swapcache as shmem
    - LP: #810425
  * clocksource: Make watchdog robust vs. interruption
    - LP: #810425
  * TTY: ldisc, do not close until there are readers
    - LP: #810425
  * xhci: Reject double add of active endpoints.
    - LP: #810425
  * PM: Free memory bitmaps if opening /dev/snapshot fails
    - LP: #810425
  * ath5k: fix memory leak when fewer than N_PD_CURVES are in use
    - LP: #810425
  * mm: fix negative commitlimit when gigantic hugepages are allocated
    - LP: #810425
  * uvcvideo: Remove buffers from the queues when freeing
    - LP: #810425
  * watchdog: mtx1-wdt: request gpio before using it
    - LP: #810425
  * debugobjects: Fix boot crash when kmemleak and debugobjects enabled
    - LP: #810425
  * cfq-iosched: fix locking around ioc->ioc_data assignment
    - LP: #810425
  * cfq-iosched: fix a rcu warning
    - LP: #810425
  * i2c-taos-evm: Fix log messages
    - LP: #810425
  * md: avoid endless recovery loop when waiting for fail device to
    complete.
    - LP: #810425
  * SUNRPC: Ensure the RPC client only quits on fatal signals
    - LP: #810425
  * 6pack,mkiss: fix lock inconsistency
    - LP: #810425
  * USB: don't let errors prevent system sleep
    - LP: #810425
  * USB: don't let the hub driver prevent system sleep
    - LP: #810425
  * uml: fix CONFIG_STATIC_LINK=y build failure with newer glibc
    - LP: #810425
  * um: os-linux/mem.c needs sys/stat.h
    - LP: #810425
  * inet_diag: fix inet_diag_bc_audit()
    - LP: #810425
  * PM / Hibernate: Avoid hitting OOM during preallocation of memory
    - LP: #810425
  * PM / Hibernate: Fix free_unnecessary_pages()
    - LP: #810425
  * bug.h: Add WARN_RATELIMIT
    - LP: #810425
  * net: filter: Use WARN_RATELIMIT
    - LP: #810425
  * af_packet: prevent information leak
    - LP: #810425
  * net/ipv4: Check for mistakenly passed in non-IPv4 address
    - LP: #810425
  * ipv6/udp: Use the correct variable to determine non-blocking condition
    - LP: #810425
  * udp/recvmsg: Clear MSG_TRUNC flag when starting over for a new packet
    - LP: #810425
  * mm: prevent concurrent unmap_mapping_range() on the same inode
    - LP: #810425
  * xen: set max_pfn_mapped to the last pfn mapped
    - LP: #810425
  * xen: partially revert "xen: set max_pfn_mapped to the last pfn mapped"
    - LP: #810425
  * Linux 2.6.32.43
    - LP: #810425
  * eCryptfs: Handle failed metadata read in lookup
    - LP: #509180
  * pagemap: close races with suid execve, CVE-2011-1020
    - LP: #813026
    - CVE-2011-1020
  * report errors in /proc/*/*map* sanely, CVE-2011-1020
    - LP: #813026
    - CVE-2011-1020
  * close race in /proc/*/environ, CVE-2011-1020
    - LP: #813026
    - CVE-2011-1020
  * auxv: require the target to be tracable (or yourself), CVE-2011-1020
    - LP: #813026
    - CVE-2011-1020
  * deal with races in /proc/*/{syscall, stack, personality}, CVE-2011-1020
    - LP: #813026
    - CVE-2011-1020
  * rose_loopback_timer sets VC number <= ROSE_DEFAULT_MAXVC, CVE-2011-1493
    - LP: #816550
    - CVE-2011-1493
  * rose: Add length checks to CALL_REQUEST parsing, CVE-2011-1493
    - LP: #816550
    - CVE-2011-1493
  * Bluetooth: l2cap and rfcomm: fix 1 byte infoleak to userspace.
    - LP: #819569
    - CVE-2011-2492
  * drm/nv50-nvc0: work around an evo channel hang that some people see
    - LP: #583760
  * ASoC: Fix Blackfin I2S _pointer() implementation return in bounds
    values
    - LP: #823296
  * v4l2-ioctl.c: prefill tuner type for g_frequency and g/s_tuner
    - LP: #823296
  * pvrusb2: fix g/s_tuner support
    - LP: #823296
  * bttv: fix s_tuner for radio
    - LP: #823296
  * gro: Only reset frag0 when skb can be pulled
    - LP: #823296
  * NFSv4.1: update nfs4_fattr_bitmap_maxsz
    - LP: #823296
  * SUNRPC: Fix a race between work-queue and rpc_killall_tasks
    - LP: #823296
  * SUNRPC: Fix use of static variable in rpcb_getport_async
    - LP: #823296
  * si4713-i2c: avoid potential buffer overflow on si4713
    - LP: #823296
  * hwmon: (max1111) Fix race condition causing NULL pointer exception
    - LP: #823296
  * bridge: send proper message_age in config BPDU
    - LP: #823296
  * davinci: DM365 EVM: fix video input mux bits
    - LP: #823296
  * libata: fix unexpectedly frozen port after ata_eh_reset()
    - LP: #823296
  * x86: Make Dell Latitude E5420 use reboot=pci
    - LP: #823296
  * USB: pl2303: add AdLink ND-6530 USB IDs
    - LP: #823296
  * USB: pl2303.h: checkpatch cleanups
    - LP: #823296
  * USB: serial: add IDs for WinChipHead USB->RS232 adapter
    - LP: #823296
  * staging: comedi: fix infoleak to userspace
    - LP: #823296
  * USB: OHCI: fix another regression for NVIDIA controllers
    - LP: #823296
  * usb: musb: restore INDEX register in resume path
    - LP: #823296
  * USB: dummy-hcd needs the has_tt flag
    - LP: #823296
  * ARM: pxa/cm-x300: fix V3020 RTC functionality
    - LP: #823296
  * jme: Fix unmap error (Causing system freeze)
    - LP: #823296
  * libsas: remove expander from dev list on error
    - LP: #823296
  * mac80211: Restart STA timers only on associated state
    - LP: #823296
  * Blacklist Traxdata CDR4120 and IOMEGA Zip drive to avoid lock ups.
    - LP: #823296
  * ses: requesting a fault indication
    - LP: #823296
  * pmcraid: reject negative request size
    - LP: #823296
  * kexec, x86: Fix incorrect jump back address if not preserving context
    - LP: #823296
  * powerpc/kdump: Fix timeout in crash_kexec_wait_realmode
    - LP: #823296
  * PCI: ARI is a PCIe v2 feature
    - LP: #823296
  * cciss: do not attempt to read from a write-only register
    - LP: #823296
  * xtensa: prevent arbitrary read in ptrace
    - LP: #823296
  * ext3: Fix oops in ext3_try_to_allocate_with_rsv()
    - LP: #823296
  * svcrpc: fix list-corrupting race on nfsd shutdown
    - LP: #823296
  * EHCI: only power off port if over-current is active
    - LP: #823296
  * EHCI: fix direction handling for interrupt data toggles
    - LP: #823296
  * powerpc/pseries/hvconsole: Fix dropped console output
    - LP: #823296
  * x86: Hpet: Avoid the comparator readback penalty
    - LP: #823296
  * x86: HPET: Chose a paranoid safe value for the ETIME check
    - LP: #823296
  * cifs: clean up cifs_find_smb_ses (try #2)
    - LP: #823296
  * cifs: fix NULL pointer dereference in cifs_find_smb_ses
    - LP: #823296
  * cifs: check for NULL session password
    - LP: #823296
  * gre: fix netns vs proto registration ordering
    - LP: #823296
  * netns xfrm: fixup xfrm6_tunnel error propagation
    - LP: #823296
  * tunnels: fix netns vs proto registration ordering
    - LP: #823296
  * alpha: fix several security issues
    - LP: #823296
  * proc: restrict access to /proc/PID/io
    - LP: #823296
  * ALSA: sound/core/pcm_compat.c: adjust array index
    - LP: #823296
  * dm mpath: fix potential NULL pointer in feature arg processing
    - LP: #823296
  * dm: fix idr leak on module removal
    - LP: #823296
  * perf: overflow/perf_count_sw_cpu_clock crashes recent kernels
    - LP: #823296
  * atm: [br2684] allow routed mode operation again
    - LP: #823296
  * Linux 2.6.32.44
    - LP: #823296
 -- Steve Conklin <sconklin@canonical.com>   Tue, 13 Sep 2011 13:04:10 -0500

Changed in linux (Ubuntu Lucid):
status:	Incomplete → Fix Released

Revision history for this message

Ian! D. Allen (idallen) wrote on 2011-09-30:

#134

> Launchpad Janitor (janitor) wrote 18 hours ago: #133
> This bug was fixed in the package linux - 2.6.32-34.77

It's broken in stock Ubuntu 11.04 using 2.6.38-11-generic #50-Ubuntu SMP Mon Sep 12 21:17:25 UTC 2011 x86_64.

Revision history for this message

Paolo Bonzini (bonzini) wrote on 2011-10-06:

#135

I still see occasional errors:

[33945.269075] ecryptfs_decrypt_page: Error attempting to read lower page; rc = [-4]
[33945.269084] ecryptfs_readpage: Error decrypting page; rc = [-4]

that are reported as "short read: Success" when doing a "git grep" on an encrypted repository. Redoing the grep fixes the problems for those files and may transfer it to others, until all of them are in the cache and the grep succeeds.

Revision history for this message

Dustin Kirkland  (kirkland) wrote on 2011-10-06:

#136

Please, anyone responding to this report, please please please tell us
exactly which kernel you're running.

uname -a

Revision history for this message

tankdriver (stoneraider-deactivatedaccount) wrote on 2011-10-06:

#137

kernel: [18393.960955] ecryptfs_encrypt_page: Error attempting to write lower page; rc = [-5]
kernel: [18393.960962] ecryptfs_writepage: Error encrypting page (upper index [0x000000000000006c])

$ uname -a
Linux thomas-VPCF13J0E 3.0.0-12-generic #19-Ubuntu SMP Fri Sep 23 21:23:39 UTC 2011 x86_64 x86_64 x86_64 GNU/Linux

Revision history for this message

Tyler Hicks (tyhicks) wrote on 2011-10-06:

#138

On 2011-10-06 15:18:11, tankdriver wrote:
> kernel: [18393.960955] ecryptfs_encrypt_page: Error attempting to write lower page; rc = [-5]

This is an -EIO error returned from the lower filesystem. It isn't
related to this bug and isn't an eCryptfs bug. If anything, it is
eCryptfs being too verbose when it sees an error code.
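
For anyone decoding the `rc = [-N]` values in these kernel messages: the number is a negated Linux errno, so -5 is EIO and -4 is EINTR. A minimal Python sketch follows. `decode_rc` is a hypothetical helper name, and the pattern assumes log lines shaped like the ones quoted in this thread.

```python
import errno
import os
import re

# Matches the "rc = [-5]" suffix used by the eCryptfs messages quoted above.
RC_PATTERN = re.compile(r"rc = \[(-?\d+)\]")


def decode_rc(log_line):
    """Return (errno name, description) for a kernel log line, or None."""
    match = RC_PATTERN.search(log_line)
    if match is None:
        return None
    # Kernel return codes are negated errno values, so drop the sign.
    code = abs(int(match.group(1)))
    name = errno.errorcode.get(code, "UNKNOWN")
    return name, os.strerror(code)
```

Calling `decode_rc` on the `ecryptfs_encrypt_page` line from comment #137 gives a tuple whose first element is `EIO`; the description text depends on the system's C library.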

Revision history for this message

Tyler Hicks (tyhicks) wrote on 2011-10-06:

#139

On 2011-10-06 14:34:12, Paolo Bonzini wrote:
> I still see occasional errors:
>
> [33945.269075] ecryptfs_decrypt_page: Error attempting to read lower page; rc = [-4]
> [33945.269084] ecryptfs_readpage: Error decrypting page; rc = [-4]
>
> that are reported as "short read: Success" when doing a "git grep" on an
> encrypted repository. Redoing the grep fixes the problems for those
> files and may transfer it to others, until all of them are in the cache
> and the grep succeeds.

This is what the fix was intended to do. When eCryptfs is reading from
the lower filesystem and gets interrupted, it passes the EINTR error
code on to userspace. However, eCryptfs doesn't need to write a log
message about it.

From your description, I can't tell if grep handled the EINTR error
correctly. If you can reproduce this, can you please strace grep and
attach the output so that I can take a better look at what grep is
seeing returned from system calls?

$ strace -o grep.strace grep [GREP ARGS]

Oh, and please listen to kirkland's request and give us `uname -a`
output.
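
As an illustration of what handling EINTR and partial reads in userspace can look like, here is a minimal Python sketch. It is not taken from grep or git, and `read_full` is a hypothetical helper name. Note that Python 3.5 and later already retry interrupted system calls automatically (PEP 475), so the `InterruptedError` branch is mostly there to show the idea.

```python
import os


def read_full(fd, count):
    """Read up to `count` bytes from `fd`, looping over partial reads.

    Returns the bytes actually read. A result shorter than `count` means
    end of file was reached first (a "short read").
    """
    chunks = []
    remaining = count
    while remaining > 0:
        try:
            chunk = os.read(fd, remaining)
        except InterruptedError:
            # EINTR: a signal interrupted the call before data arrived,
            # so retrying is safe. Kept for illustration (see PEP 475).
            continue
        if not chunk:
            # An empty result from os.read means end of file.
            break
        chunks.append(chunk)
        remaining -= len(chunk)
    return b"".join(chunks)
```

A caller that expects exactly `count` bytes can compare `len(result)` with `count` and report a short read itself, instead of relying on a stale errno value.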

Revision history for this message

harcesz (harcesz) wrote on 2011-10-13:

#140

bump, one more guinea pig happy to test possible solutions (whole system drive encryption btw)

Linux parafia 2.6.38-11-generic #50-Ubuntu SMP Mon Sep 12 21:17:25 UTC 2011 x86_64 x86_64 x86_64 GNU/Linux

Revision history for this message

Ian! D. Allen (idallen) wrote on 2011-10-17:

#141

See also comments #104, #128, #129, #131, #132, #134.

Still seeing ecryptfs file size corruption (leading to file corruption) in latest Ubuntu 11.04 kernel update:

Linux linux 2.6.38-12-generic #51-Ubuntu SMP Wed Sep 28 14:27:32 UTC 2011 x86_64 x86_64 x86_64 GNU/Linux

eCryptfs occasionally returns the (larger) size of the underlying encrypted file, not the size of the
decrypted file being accessed in the decrypted and mounted partition.
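
A rough way to check for this symptom, sketched in Python. `reports_lower_size` is a hypothetical helper, and it assumes you already know which lower (encrypted) file corresponds to the decrypted one, which is hard when filenames are encrypted. It also assumes the lower file carries eCryptfs padding, so a non-empty decrypted file should be strictly smaller.

```python
import os


def reports_lower_size(upper_path, lower_path):
    """Return True if the decrypted file reports the encrypted file's size."""
    upper_size = os.stat(upper_path).st_size
    lower_size = os.stat(lower_path).st_size
    # Assumption: the lower file is padded, so equal non-zero sizes
    # suggest the upper file is reporting the lower file's size.
    return upper_size > 0 and upper_size == lower_size
```

A `True` result is only a hint that the size is wrong; comparing `stat` output before and after remounting, as asked for in comment #6, is still the more reliable check.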

Revision history for this message

Joe Edmonds (joee) wrote on 2011-10-19:

#142

Ever since moving to an ext4+ecryptfs oneiric system, I receive this message about 6,000 times a day in /var/log/kern.log:

Oct 16 01:56:39 hostname kernel: [32384.222198] Valid eCryptfs headers not found in file header region or xattr region
Oct 16 01:56:39 hostname kernel: [32384.222211] Either the lower file is not in a valid eCryptfs format, or the key could not be retrieved. Plaintext passthrough mode is not enabled; returning -EIO

I don't know whether trailing garbage is being added to my encrypted files. But the bug about the kernel message is marked as a duplicate of this one:

https://bugs.launchpad.net/ubuntu/+source/ecryptfs-utils/+bug/372014

$ uname -srvmo
Linux 3.0.0-12-generic #20-Ubuntu SMP Fri Oct 7 14:56:25 UTC 2011 x86_64 GNU/Linux

Revision history for this message

Ian! D. Allen (idallen) wrote on 2011-10-22:

#143

See also my comments #104, #128, #129, #131, #132, #134, #141.

The above ecryptfs corruption is reduced but not gone in Ubuntu 11.10 Oneiric:

Linux ubuntu 3.0.0-12-generic #20-Ubuntu SMP Fri Oct 7 14:56:25 UTC 2011 x86_64 x86_64 x86_64 GNU/Linux

I repeatedly ran md5sum on a read-only ecryptfs partition with 18GB of files (many with multiple hard links) and the md5sum results *changed* on a few files on some of the runs.

It took many more runs (several dozen) to discover the corruption than with the 2.6.38 kernel, and *only* the md5sums changed between runs; the file sizes did not also change, as they did with 2.6.38. I also saw the corruption on files with only one link, where under 2.6.38 it seemed to happen only on files with more than one link.

I'm running these tests using a VMware install of a fully-updated Ubuntu 11.10 Oneiric with a separate 32GB virtual disk containing the 18GB ecryptfs partition.

I'm looping a full md5sum of all 18GB in one terminal window and in a second window I'm looping a simultaneous md5sum of a 4.2GB subdirectory. When corruption happens, both windows show it happening on the same file(s). Out of 48 runs over the full 18GB so far, four runs turned up files with different checksums. In the other loop, out of 171 runs over the 4.2GB subdirectory so far, 11 runs turned up files with different checksums.
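
For anyone who wants to repeat this kind of test, the loop can be sketched in Python as below. This is not the script used for the numbers above. `checksum_tree`, `changed_files`, and `repeat_check` are hypothetical names, and the sketch assumes a read-only tree so that any difference from the first pass is unexpected.

```python
import hashlib
import os


def checksum_tree(root):
    """Map each regular file under `root` to its (size, md5 hex digest)."""
    results = {}
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if not os.path.isfile(path):
                continue
            digest = hashlib.md5()
            with open(path, "rb") as handle:
                # Hash in 1 MiB blocks so large files are not loaded whole.
                for block in iter(lambda: handle.read(1 << 20), b""):
                    digest.update(block)
            results[path] = (os.path.getsize(path), digest.hexdigest())
    return results


def changed_files(baseline, current):
    """Return paths whose size or digest differs between two passes."""
    return sorted(
        path for path in baseline
        if path in current and baseline[path] != current[path]
    )


def repeat_check(root, runs=10):
    """Hash the tree `runs` more times and print files that differ."""
    baseline = checksum_tree(root)
    for run in range(1, runs + 1):
        for path in changed_files(baseline, checksum_tree(root)):
            print(f"run {run}: {path} differs from the first pass")
```

Running `repeat_check("/home/idallen-ecryptfs", runs=48)` would mimic the full-tree loop. Keeping the size next to the digest makes it possible to tell the 2.6.38 behaviour (size changes) from the 3.0.0 behaviour (digest changes only).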

I did see some kern.log errors early in the testing process:

Oct 20 14:57:49 ubuntu kernel: [ 65.128562] EXT4-fs (sdb1): mounted filesystem with ordered data mode. Opts: (null)
Oct 20 16:04:30 ubuntu kernel: [ 4066.054573] ecryptfs_decrypt_page: Error attempting to read lower page; rc = [-4]
Oct 20 16:04:30 ubuntu kernel: [ 4066.054580] ecryptfs_readpage: Error decrypting page; rc = [-4]
Oct 20 16:47:30 ubuntu kernel: [ 6645.362178] ecryptfs_decrypt_page: Error attempting to read lower page; rc = [-4]
Oct 20 16:47:30 ubuntu kernel: [ 6645.362263] ecryptfs_readpage: Error decrypting page; rc = [-4]

Those errors all appeared long before the runs showing corruption.

/dev/sdb1 on /mnt/sdb1 type ext4 (ro)
/mnt/sdb1/idallen-ecryptfs/.Private on /home/idallen-ecryptfs type ecryptfs (ro,ecryptfs_unlink_sigs,ecryptfs_cipher=aes,ecryptfs_key_bytes=16,ecryptfs_sig=xxxxxxxxxxxxxxxx,ecryptfs_fnek_sig=yyyyyyyyyyyyyyyy)