[Upstream] Saving in DOCX format truncates the file

Bug #1225556 reported by Vaclav Petras
8
This bug affects 1 person
Affects Status Importance Assigned to Milestone
LibreOffice
Invalid
High
libreoffice (Ubuntu)
Invalid
Undecided
Unassigned

Bug Description

1) Ubuntu 12.04.2 LTS

2) LibreOffice 3.5.7.2, Build ID: 350m1(Build:2), probably package 1:3.5.7-0ubuntu4 but apt-cache reports none

3) Edit and save DOCX file
open DOCX file
edit DOCX file
save DOCX file
close
open the file

4) the file is truncated
e.g., 9 pages instead of 16 but it can truncate anywhere on the page

It is actually known bug at
https://bugs.freedesktop.org/show_bug.cgi?id=47782
but I think that it is important to also backported into 12.04 LTS once it is fixed (or at least some warning that 'this really does not works' is added) since data loss is critical. File to reproduce it is provided at freedesktop ticket.

The latest version of LibrreOffice is 4.1, I haven't tested if this happen there too.
---
ApportVersion: 2.0.1-0ubuntu17.6
Architecture: i386
DistroRelease: Ubuntu 12.04
InstallationMedia: Ubuntu 11.04 "Natty Narwhal" - Release i386 (20110427.1)
MarkForUpload: True
Package: libreoffice 1:3.5.7-0ubuntu5
PackageArchitecture: i386
ProcEnviron:
 TERM=xterm
 PATH=(custom, no user)
 LANG=en_US.UTF-8
 SHELL=/bin/bash
ProcVersionSignature: Ubuntu 3.2.0-57.87-generic-pae 3.2.52
Tags: precise running-unity
Uname: Linux 3.2.0-57-generic-pae i686
UpgradeStatus: Upgraded to precise on 2012-05-10 (606 days ago)
UserGroups: adm admin cdrom dialout lpadmin plugdev sambashare

Revision history for this message
In , Colemanjj (colemanjj) wrote :

when the a MS .docx file received by email is opened and then saved as MS .docx the 10 page document is truncated to 4 pages.
if the document is saved as a MS .doc file, the file is not truncated.

Revision history for this message
In , Sasha-libreoffice (sasha-libreoffice) wrote :

Thanks for bugreport
Please, do not save documents in docx format using LibreOffcie. Or information will be lost.

Revision history for this message
In , spaetz (spaetz) wrote :

Dear John, DATA LOSS is the worst of all possible failures.

You will have to attach such a document so that the error can be reproduced and tested, otherwise this bug is of little help to get this fixed.

Revision history for this message
In , Gr306 (gr306) wrote :

Created attachment 72790
docx file that exhibits the bug

This document clearly reveals the truncation bug with docx files.

Revision history for this message
In , Gr306 (gr306) wrote :

This bug also exist in 3.6.1.2 on MacOSX

Revision history for this message
In , Vaclav Petras (wenzeslaus) wrote :

Also true for LibreOffice 3.5.7.2 (Build ID: 350m1(Build:2)) on Ubuntu 12.04 LTS.

Confirmed with the file attached here (from 16 pages to 9 after edit and save (+close and open)) and also one other file I cannot share created in not specified version of MS Word.

Current version of LibreOffice is 4.1, can someone test it with this version?

The other comments should probably go to one of these reports at LibreOffice Bugzilla:

https://www.libreoffice.org/bugzilla/show_bug.cgi?id=43780
https://www.libreoffice.org/bugzilla/show_bug.cgi?id=46025

penalvch (penalvch)
summary: - Saving in DOCX format truncates the file
+ [Upstream] Saving in DOCX format truncates the file
Changed in df-libreoffice:
importance: Unknown → High
status: Unknown → Incomplete
Revision history for this message
In , Sasha-libreoffice (sasha-libreoffice) wrote :

in 4.0 and 4.1.1 on Fedora RFR 64 bit
document is not truncated after saving into docx and reloading, as I can see.
But document formatting in some places becomes corrupted.

Revision history for this message
In , Benjamin Herr (ben0x539) wrote :

I am using "Version: 4.1.4.2", "Build ID: Gentoo official package". The document in comment 3 does not seem to get truncated for me. However, libreoffice just truncated my ~22 pages document to seven pages. I believe I cannot share the .docx file, unfortunately.

I am not familiar with the .docx format but it appears that after unzipping it and stripping all the markup from word/document.xml, most of the text is still kind of there. Meanwhile, other .docx consumers refuse to open the file at all.

Revision history for this message
In , Benjamin Herr (ben0x539) wrote :

Ignore me, I think I wanted #55820

Revision history for this message
In , Foss-4 (foss-4) wrote :

Hi all, while I see some odd behavior and slight differences when comparing the test document in Word2010 and LO 4.2.0.1 pages are the same amount.

Next I tried resaving to a new docx file. Then compared the two LO files (existing docx and newly created docx from test file), they are identical.

Thus setting to WORKSFORME.

I know similar issues do appear occasionally but with 4.2.0.1 and this test file it seems to no longer happen, so I suggest, if any of you run into a similar problem please make sure to create a new bug and we can start a new investigation of what's going on.

Changed in df-libreoffice:
status: Incomplete → Invalid
Revision history for this message
penalvch (penalvch) wrote :

Vaclav Petras, thank you for taking the time to report this bug and helping to make Ubuntu better. Please execute the following command, as it will automatically gather debugging information, in a terminal:
apport-collect 1225556
When reporting bugs in the future please use apport by using 'ubuntu-bug' and the name of the package affected. You can learn more about this functionality at https://wiki.ubuntu.com/ReportingBugs.

Changed in libreoffice (Ubuntu):
status: New → Incomplete
Revision history for this message
Vaclav Petras (wenzeslaus) wrote : Dependencies.txt

apport information

tags: added: apport-collected precise running-unity
description: updated
Revision history for this message
Vaclav Petras (wenzeslaus) wrote :

I did apport-collect 1225556 but before I has to install libreoffice package because apport-collect said:

 Package libreoffice not installed and no hook available, ignoring

I had only several libreoffice-* packages installed. But I guess, it chanes nothing.

penalvch (penalvch)
Changed in libreoffice (Ubuntu):
status: Incomplete → New
Revision history for this message
zblace (zblace) wrote :

I just experienced same problem with 4.2 Libre Office / super frustrated as I lost over hour of work

Changed in libreoffice (Ubuntu):
status: New → Invalid
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.