corrupted files not reported

Bug #608950 reported by Brian Z
This bug affects 1 person
Affects               Status        Importance  Assigned to  Milestone
pdfshuffler (Debian)  Fix Released  Unknown
pdfshuffler (Fedora)  Fix Released  Undecided
pdfshuffler (Ubuntu)  New           Undecided   Unassigned

Bug Description

Binary package hint: pdfshuffler

A corrupted file can be opened and edited, but not saved. When I try to save it, the "Export..." dialog window simply stays open, with no error message or other indication of why nothing is happening.

I only understood that this was because the file was corrupted after I filed invalid bug report #608870.

Revision history for this message
exactt (giesbert) wrote :

Please start pdfshuffler from the command line to obtain further information!

Revision history for this message
In , Lofton (lofton-redhat-bugs) wrote :

abrt version: 1.1.17
architecture: x86_64
cmdline: /usr/bin/python /usr/bin/pdfshuffler
component: pdfshuffler
executable: /usr/bin/pdfshuffler
kernel: 2.6.35.11-83.fc14.x86_64
package: pdfshuffler-0.5-2.fc14
reason: pdf.py:651:read:ValueError: too many values to unpack
release: Fedora release 14 (Laughlin)
time: 1298295502
uid: 500

backtrace
-----
pdf.py:651:read:ValueError: too many values to unpack

Traceback (most recent call last):
  File "/usr/bin/pdfshuffler", line 411, in choose_export_pdf_name
    self.export_to_file(file_out)
  File "/usr/bin/pdfshuffler", line 432, in export_to_file
    pdfdoc_inp = PdfFileReader(file(pdfdoc.copyname, 'rb'))
  File "/usr/lib/python2.7/site-packages/pyPdf/pdf.py", line 277, in __init__
    self.read(stream)
  File "/usr/lib/python2.7/site-packages/pyPdf/pdf.py", line 651, in read
    offset, generation = line[:16].split(" ")
ValueError: too many values to unpack

Local variables in innermost frame:
cnt: 0
stream: <open file '/tmp/tmpYMc9c5pdfshuffler/02_StudentLES1.pdf', mode 'rb' at 0x1989540>
self: <pyPdf.pdf.PdfFileReader object at 0x19bc910>
startxref: 3879238
num: 1
x: 'x'
line: '0000000009 0000 n\n00'
ref: 'ref\n'
size: 6

How to reproduce
-----
1. Created a PDF by importing in the GUI.
2. Clicked on Export; the program crashed.
3.
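
The `line` value in the locals above hints at why the unpacking fails. The PDF specification gives each cross-reference entry a 10-digit offset, a 5-digit generation number, a keyword (`n` or `f`) and a two-byte end-of-line. The entry in the dump has only a 4-digit generation field, so its first 16 characters end in a space and split into three parts instead of two. The following is a minimal Python 3 sketch of that reading, not pyPdf's actual code beyond the one statement quoted in the traceback; `parse_like_traceback`, `is_well_formed`, `GOOD_ENTRY` and `BAD_ENTRY` are illustrative names.

```python
import re

# Entry copied from the `line` local in the backtrace above.
BAD_ENTRY = "0000000009 0000 n\n00"
# A well-formed entry: 10-digit offset, 5-digit generation, keyword, 2-byte EOL.
GOOD_ENTRY = "0000000009 00000 n \n"


def parse_like_traceback(line):
    """Mimic the statement shown at pdf.py line 651 in the traceback."""
    offset, generation = line[:16].split(" ")
    return int(offset), int(generation)


# Strict pattern for the fixed-width part of a cross-reference entry.
XREF_ENTRY = re.compile(r"\d{10} \d{5} [nf]")


def is_well_formed(line):
    """Hypothetical helper: True if the line starts with a fixed-width entry."""
    return XREF_ENTRY.match(line) is not None


print(parse_like_traceback(GOOD_ENTRY))
try:
    parse_like_traceback(BAD_ENTRY)
except ValueError as exc:
    # The short generation field leaves a trailing space inside line[:16],
    # so split(" ") yields three items and the two-name unpacking fails.
    print("ValueError:", exc)
print(is_well_formed(GOOD_ENTRY), is_well_formed(BAD_ENTRY))
```

A check like `is_well_formed` would let a caller name the malformed entry in an error message instead of surfacing a bare `ValueError`.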

Revision history for this message
In , Lofton (lofton-redhat-bugs) wrote :

Package: pdfshuffler-0.5-2.fc14
Architecture: x86_64
OS Release: Fedora release 14 (Laughlin)

How to reproduce
-----
1. Created a PDF by importing in the GUI.
2. Clicked on Export; the program crashed.
3.

Revision history for this message
In , Duane (duane-redhat-bugs) wrote :

Package: pdfshuffler-0.5-2.fc14
Architecture: x86_64
OS Release: Fedora release 14 (Laughlin)

How to reproduce
-----
1.
2.
3.
Imported 3 PDF documents created by Simple Scan. File 1 had 4 pages, file 2 had 2 pages, and file 3 had 2 pages. Tried to export entire document of 8 pages.

Revision history for this message
Tony Whelan (tony-whelan) wrote :

Running pdfshuffler 0.5.1 from a terminal window, I imported several PDFs. The terminal output shows an error for one of them, so pdfshuffler evidently regards that file as corrupt.

If I then delete the corrupt file from the pdfshuffler window, the remaining files still cannot be exported. The same error message occurs in both cases.

It seems that a corrupted file spoils the entire session and you need to close the program and start again.

The error I get when I try to export the attached file is:
Traceback (most recent call last):
  File "/usr/bin/pdfshuffler", line 417, in choose_export_pdf_name
    self.export_to_file(file_out)
  File "/usr/bin/pdfshuffler", line 438, in export_to_file
    pdfdoc_inp = PdfFileReader(file(pdfdoc.copyname, 'rb'))
  File "/usr/lib/pymodules/python2.6/pyPdf/pdf.py", line 277, in __init__
    self.read(stream)
  File "/usr/lib/pymodules/python2.6/pyPdf/pdf.py", line 651, in read
    offset, generation = line[:16].split(" ")
ValueError: too many values to unpack

Interestingly, the 'corrupted' file I have just tested this on was produced by Ubuntu's Simple Scan (in text mode, 150dpi), and it displays fine in Adobe Reader 9 or Document Viewer 2.32.0. I've rescanned it 3 times and it still breaks pdfshuffler.

I'm using Ubuntu 10.10, 32-bit version.
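
One way an export step could avoid failing silently is to open each input separately, collect the failures, and report them to the user. The following is only a sketch under stated assumptions and not pdfshuffler's code: `open_all` and `fake_open` are hypothetical names, and `fake_open` stands in for a real reader (for example a callable wrapping pyPdf's `PdfFileReader`) so that the snippet runs without pyPdf installed.

```python
def open_all(paths, open_pdf):
    """Try to open every path; return (readers, errors) instead of raising.

    `open_pdf` is any callable that returns a reader object or raises.
    """
    readers, errors = {}, []
    for path in paths:
        try:
            readers[path] = open_pdf(path)
        except Exception as exc:
            # Broad on purpose: the traceback above shows a plain ValueError,
            # and other malformed files may fail in different ways.
            errors.append("%s: %s" % (path, exc))
    return readers, errors


def fake_open(path):
    # Stand-in for a real PDF reader (assumption, for illustration only).
    if "broken" in path:
        raise ValueError("too many values to unpack")
    return "reader for " + path


readers, errors = open_all(["a.pdf", "broken.pdf"], fake_open)
for message in errors:
    # A GUI would show this in a dialog rather than print it.
    print("Could not read", message)
```

Collecting errors per file, rather than aborting on the first one, would also make it possible to tell the user which document to remove before exporting the rest.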

Revision history for this message
In , Fabian (fabian-redhat-bugs) wrote :

An update to 0.5.1 was provided. Please retest and reopen this bug if the problem still occurs.

Revision history for this message
In , Fabian (fabian-redhat-bugs) wrote :

*** Bug 715427 has been marked as a duplicate of this bug. ***

Changed in pdfshuffler (Debian):
status: Unknown → New
Revision history for this message
In , Rafał (rafa-redhat-bugs) wrote :

I doubt, with good reason, that any reasonable change in pdfshuffler would help.

The problem, in a way, lies in pyPdf, or more precisely in Simple Scan producing corrupted PDF files.

See my comment in http://bugs.gentoo.org/show_bug.cgi?id=380429 for a complete explanation. Anyone with a Launchpad account should feel free to comment, e.g., in https://bugs.launchpad.net/simple-scan/+bug/662144.

Changed in pdfshuffler (Debian):
status: New → Fix Released
Changed in pdfshuffler (Fedora):
importance: Unknown → Undecided
status: Unknown → Fix Released