PDF to MOBI Conversion Crashes on MAC

Bug #959651 reported by Faraz Jasimi
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone
calibre | Won't Fix | Undecided | Unassigned |

Bug Description

I am using Mac OS X 10.6.8.

When I try to convert a PDF to a MOBI file to read on my Kindle, the conversion crashes. The file can be found here:

http://www.soc.ucsb.edu/sites/default/files/Fragile%20Resistance%20e-book.pdf

The following information comes up after the unsuccessful conversion:

calibre Debug log
calibre 0.8.43
Darwin-10.8.0-i386-64bit
Darwin
('Darwin', '10.8.0', 'Darwin Kernel Version 10.8.0: Tue Jun 7 16:33:36 PDT 2011; root:xnu-1504.15.3~1/RELEASE_I386')
Python 2.7.1
OSX: ('10.6.8', ('', '', ''), 'i386')
Starting up...
Started up in 22.26 seconds with 4 books
Worker Launch took: 0.0968151092529
Worker Launch took: 0.150835037231
Job: 1 Convert book 1 of 1 (Fragile Resistance e-book) finished
Convert book 1 of 1 (Fragile Resistance e-book)
 Resolved conversion options
 calibre version: 0.8.43
 {'asciiize': False,
  'author_sort': None,
  'authors': None,
  'base_font_size': 0.0,
  'book_producer': None,
  'change_justification': u'original',
  'chapter': u"//*[((name()='h1' or name()='h2') and re:test(., '\\s*((chapter|book|section|part)\\s+)|((prolog|prologue|epilogue)(\\s+|$))', 'i')) or @class = 'chapter']",
  'chapter_mark': u'pagebreak',
  'comments': None,
  'cover': u'/var/folders/HE/HEzNBG5jFemEniJnnnydC++++TI/-Tmp-/calibre_0.8.43_tmp_SOXvzr/ItsCdt.jpeg',
  'debug_pipeline': None,
  'dehyphenate': True,
  'delete_blank_paragraphs': True,
  'disable_font_rescaling': False,
  'dont_compress': False,
  'duplicate_links_in_toc': False,
  'enable_heuristics': False,
  'extra_css': None,
  'extract_to': None,
  'filter_css': u'',
  'fix_indents': True,
  'font_size_mapping': None,
  'format_scene_breaks': True,
  'html_unwrap_factor': 0.4,
  'input_encoding': None,
  'input_profile': <calibre.customize.profiles.InputProfile object at 0x1066ddc10>,
  'insert_blank_line': False,
  'insert_blank_line_size': 0.5,
  'insert_metadata': False,
  'isbn': None,
  'italicize_common_cases': True,
  'keep_ligatures': False,
  'language': None,
  'level1_toc': None,
  'level2_toc': None,
  'level3_toc': None,
  'line_height': 0.0,
  'linearize_tables': False,
  'margin_bottom': 5.0,
  'margin_left': 5.0,
  'margin_right': 5.0,
  'margin_top': 5.0,
  'markup_chapter_headings': True,
  'max_toc_links': 50,
  'minimum_line_height': 120.0,
  'mobi_ignore_margins': False,
  'mobi_keep_original_images': False,
  'mobi_toc_at_start': False,
  'new_pdf_engine': False,
  'no_chapters_in_toc': False,
  'no_images': False,
  'no_inline_navbars': True,
  'no_inline_toc': False,
  'output_profile': <calibre.customize.profiles.KindleOutput object at 0x1066de2d0>,
  'page_breaks_before': u"//*[name()='h1' or name()='h2']",
  'personal_doc': u'[PDOC]',
  'prefer_author_sort': False,
  'prefer_metadata_cover': False,
  'pretty_print': False,
  'pubdate': None,
  'publisher': None,
  'rating': None,
  'read_metadata_from_opf': u'/var/folders/HE/HEzNBG5jFemEniJnnnydC++++TI/-Tmp-/calibre_0.8.43_tmp_SOXvzr/Ydvo4b.opf',
  'remove_fake_margins': True,
  'remove_first_image': False,
  'remove_paragraph_spacing': False,
  'remove_paragraph_spacing_indent_size': 1.5,
  'renumber_headings': True,
  'replace_scene_breaks': u'',
  'series': None,
  'series_index': None,
  'share_not_sync': False,
  'smarten_punctuation': False,
  'sr1_replace': None,
  'sr1_search': None,
  'sr2_replace': None,
  'sr2_search': None,
  'sr3_replace': None,
  'sr3_search': None,
  'tags': None,
  'timestamp': None,
  'title': None,
  'title_sort': None,
  'toc_filter': None,
  'toc_threshold': 6,
  'toc_title': None,
  'unsmarten_punctuation': False,
  'unwrap_factor': 0.45,
  'unwrap_lines': True,
  'use_auto_toc': False,
  'verbose': 2}

Job: 2 Convert book 1 of 1 (Fragile Resistance e-book) finished
Convert book 1 of 1 (Fragile Resistance e-book)
 Resolved conversion options
 calibre version: 0.8.43
 {'asciiize': False,
  'author_sort': None,
  'authors': None,
  'base_font_size': 0.0,
  'book_producer': None,
  'change_justification': u'original',
  'chapter': u"//*[((name()='h1' or name()='h2') and re:test(., '\\s*((chapter|book|section|part)\\s+)|((prolog|prologue|epilogue)(\\s+|$))', 'i')) or @class = 'chapter']",
  'chapter_mark': u'pagebreak',
  'comments': None,
  'cover': u'/var/folders/HE/HEzNBG5jFemEniJnnnydC++++TI/-Tmp-/calibre_0.8.43_tmp_SOXvzr/HHu1FC.jpeg',
  'debug_pipeline': None,
  'dehyphenate': True,
  'delete_blank_paragraphs': True,
  'disable_font_rescaling': False,
  'dont_compress': False,
  'duplicate_links_in_toc': False,
  'enable_heuristics': False,
  'extra_css': None,
  'extract_to': None,
  'filter_css': u'',
  'fix_indents': True,
  'font_size_mapping': None,
  'format_scene_breaks': True,
  'html_unwrap_factor': 0.4,
  'input_encoding': None,
  'input_profile': <calibre.customize.profiles.InputProfile object at 0x1066ddc10>,
  'insert_blank_line': False,
  'insert_blank_line_size': 0.5,
  'insert_metadata': False,
  'isbn': None,
  'italicize_common_cases': True,
  'keep_ligatures': False,
  'language': None,
  'level1_toc': None,
  'level2_toc': None,
  'level3_toc': None,
  'line_height': 0.0,
  'linearize_tables': False,
  'margin_bottom': 5.0,
  'margin_left': 5.0,
  'margin_right': 5.0,
  'margin_top': 5.0,
  'markup_chapter_headings': True,
  'max_toc_links': 50,
  'minimum_line_height': 120.0,
  'mobi_ignore_margins': False,
  'mobi_keep_original_images': False,
  'mobi_toc_at_start': False,
  'new_pdf_engine': False,
  'no_chapters_in_toc': False,
  'no_images': False,
  'no_inline_navbars': True,
  'no_inline_toc': False,
  'output_profile': <calibre.customize.profiles.KindleOutput object at 0x1066de2d0>,
  'page_breaks_before': u"//*[name()='h1' or name()='h2']",
  'personal_doc': u'[PDOC]',
  'prefer_author_sort': False,
  'prefer_metadata_cover': False,
  'pretty_print': False,
  'pubdate': None,
  'publisher': None,
  'rating': None,
  'read_metadata_from_opf': u'/var/folders/HE/HEzNBG5jFemEniJnnnydC++++TI/-Tmp-/calibre_0.8.43_tmp_SOXvzr/jbVFkT.opf',
  'remove_fake_margins': True,
  'remove_first_image': False,
  'remove_paragraph_spacing': False,
  'remove_paragraph_spacing_indent_size': 1.5,
  'renumber_headings': True,
  'replace_scene_breaks': u'',
  'series': None,
  'series_index': None,
  'share_not_sync': False,
  'smarten_punctuation': False,
  'sr1_replace': None,
  'sr1_search': None,
  'sr2_replace': None,
  'sr2_search': None,
  'sr3_replace': None,
  'sr3_search': None,
  'tags': None,
  'timestamp': None,
  'title': None,
  'title_sort': None,
  'toc_filter': None,
  'toc_threshold': 6,
  'toc_title': None,
  'unsmarten_punctuation': False,
  'unwrap_factor': 0.45,
  'unwrap_lines': True,
  'use_auto_toc': False,
  'verbose': 2}

Any help will be much appreciated.

Thanks :)

Kovid Goyal (kovid) wrote : Re: calibre bug 959651

PDF is a very bad format. Some PDF files are corrupted in ways that cause the
PDF library calibre uses to crash. You are out of luck using calibre to convert
such PDF files, sorry.

 status wontfix

Changed in calibre:
status: New → Won't Fix
Faraz Jasimi (farazss) wrote : RE: [Bug 959651] Re: calibre bug 959651

Thank you for your response.
Is there any way that I can convert this ebook into MOBI format for Kindle?
Any suggestion will be much appreciated.
Thanks

> Date: Tue, 20 Mar 2012 03:33:10 +0000
> From: <email address hidden>
> To: <email address hidden>
> Subject: [Bug 959651] Re: calibre bug 959651
>
> PDF is a very bad format. Some PDF files are corrupted in ways that cause the
> PDF library calibre uses to crash. You are out of luck using calibre to convert
> such PDF files, sorry.
>
> status wontfix
>
> ** Changed in: calibre
> Status: New => Won't Fix

Kovid Goyal (kovid) wrote :

That PDF is a collection of page scans (images of the actual paper book's
pages). It cannot be converted into an ebook. You can try using OCR software on
it.
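
As a rough illustration of the distinction drawn above, the following Python sketch guesses whether a PDF is made up of page scans by counting image objects and font objects in the raw file bytes. This is not how calibre inspects PDFs; the function name `looks_like_page_scans` and the two patterns are assumptions made for this example, and the check is only a crude heuristic: PDFs that store their objects in compressed object streams can hide both markers, so a real PDF library would be needed for a reliable answer.

```python
import re

# Crude heuristic, not a real PDF parser: look for image XObjects and font
# objects in the raw bytes. Compressed object streams can hide both, so a
# negative or positive result here is only a hint.
IMAGE_RE = re.compile(rb"/Subtype\s*/Image\b")
FONT_RE = re.compile(rb"/Type\s*/Font\b")


def looks_like_page_scans(pdf_bytes: bytes) -> bool:
    """Return True if the raw bytes show image objects but no font objects."""
    images = len(IMAGE_RE.findall(pdf_bytes))
    fonts = len(FONT_RE.findall(pdf_bytes))
    # A text-based PDF normally declares at least one font; a pure scan
    # usually declares only images.
    return images > 0 and fonts == 0
```

To try it, read a file in binary mode (for example `open(path, "rb").read()`, where `path` is whatever PDF you want to check) and pass the bytes to the function. If it returns True, OCR software is the more likely route, as suggested above.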
