[private] Mobi to Epub conversion fails on large files

Bug #908835 reported by Ana
This bug affects 2 people
Affects: calibre
Status: Won't Fix
Importance: Undecided
Assigned to: Unassigned
Milestone: (none listed)

Bug Description

Version: 0.8.29

Going from mobi to epub (largish file -- 41MB) yields the following error:

calibre, version 0.8.29
ERROR: Conversion Error: <b>Failed</b>: Convert book 1 of 1 (NIV Study Bible)

Convert book 1 of 1 (NIV Study Bible)
Resolved conversion options
calibre version: 0.8.29
{'asciiize': False,
 'author_sort': None,
 'authors': None,
 'base_font_size': 0.0,
 'book_producer': None,
 'change_justification': u'original',
 'chapter': u"//*[((name()='h1' or name()='h2') and re:test(., 'chapter|book|section|part|prologue|epilogue\\s+', 'i')) or @class = 'chapter']",
 'chapter_mark': u'pagebreak',
 'comments': None,
 'cover': 'c:\\users\\erin\\appdata\\local\\temp\\calibre_0.8.29_tmp_0qygcs\\y6l2im.jpeg',
 'debug_pipeline': None,
 'dehyphenate': True,
 'delete_blank_paragraphs': True,
 'disable_font_rescaling': False,
 'dont_split_on_page_breaks': False,
 'duplicate_links_in_toc': False,
 'enable_heuristics': False,
 'epub_flatten': False,
 'extra_css': None,
 'extract_to': None,
 'filter_css': u'',
 'fix_indents': True,
 'flow_size': 260,
 'font_size_mapping': None,
 'format_scene_breaks': True,
 'html_unwrap_factor': 0.4,
 'input_encoding': None,
 'input_profile': <calibre.customize.profiles.InputProfile object at 0x058EA530>,
 'insert_blank_line': False,
 'insert_blank_line_size': 0.5,
 'insert_metadata': False,
 'isbn': None,
 'italicize_common_cases': True,
 'keep_ligatures': False,
 'language': None,
 'level1_toc': None,
 'level2_toc': None,
 'level3_toc': None,
 'line_height': 0.0,
 'linearize_tables': False,
 'margin_bottom': 5.0,
 'margin_left': 5.0,
 'margin_right': 5.0,
 'margin_top': 5.0,
 'markup_chapter_headings': True,
 'max_toc_links': 50,
 'minimum_line_height': 120.0,
 'no_chapters_in_toc': False,
 'no_default_epub_cover': False,
 'no_inline_navbars': False,
 'no_svg_cover': False,
 'output_profile': <calibre.customize.profiles.NookOutput object at 0x058EA890>,
 'page_breaks_before': u"//*[name()='h1' or name()='h2']",
 'prefer_metadata_cover': False,
 'preserve_cover_aspect_ratio': False,
 'pretty_print': True,
 'pubdate': None,
 'publisher': None,
 'rating': None,
 'read_metadata_from_opf': 'c:\\users\\erin\\appdata\\local\\temp\\calibre_0.8.29_tmp_0qygcs\\fslcc4.opf',
 'remove_fake_margins': True,
 'remove_first_image': False,
 'remove_paragraph_spacing': False,
 'remove_paragraph_spacing_indent_size': 1.5,
 'renumber_headings': True,
 'replace_scene_breaks': u'',
 'series': None,
 'series_index': None,
 'smarten_punctuation': False,
 'sr1_replace': None,
 'sr1_search': None,
 'sr2_replace': None,
 'sr2_search': None,
 'sr3_replace': None,
 'sr3_search': None,
 'tags': None,
 'timestamp': None,
 'title': None,
 'title_sort': None,
 'toc_filter': None,
 'toc_threshold': 6,
 'unsmarten_punctuation': False,
 'unwrap_lines': True,
 'use_auto_toc': False,
 'verbose': 2}
InputFormatPlugin: MOBI Input running
on c:\users\erin\appdata\local\temp\calibre_0.8.29_tmp_0qygcs\braasx.mobi
Extracting text...
Adding anchors...
Extracting images...
Cleaning up HTML...
Parsing HTML...
Converting style information to CSS...
Creating OPF...
Parsing all content...
Parsing styles.css ...
Parsing CR%21NWS9NNXY812WF4RY66Y6CDKRRQN1.html ...
Failed to parse content in CR%21NWS9NNXY812WF4RY66Y6CDKRRQN1.html
Traceback (most recent call last):
  File "site-packages\calibre\ebooks\oeb\reader.py", line 165, in _manifest_prune_invalid
  File "site-packages\calibre\ebooks\oeb\base.py", line 1167, in fget
  File "site-packages\calibre\ebooks\oeb\base.py", line 857, in _parse_xhtml
  File "site-packages\calibre\ebooks\conversion\preprocess.py", line 546, in __call__
MemoryError

Spine item 'id1' not found
Python function terminated unexpectedly
  Spine is empty (Error Code: 1)
Traceback (most recent call last):
  File "site.py", line 132, in main
  File "site.py", line 109, in run_entry_point
  File "site-packages\calibre\utils\ipc\worker.py", line 187, in main
  File "site-packages\calibre\gui2\convert\gui_conversion.py", line 31, in gui_convert_override
  File "site-packages\calibre\gui2\convert\gui_conversion.py", line 25, in gui_convert
  File "site-packages\calibre\ebooks\conversion\plumber.py", line 968, in run
  File "site-packages\calibre\ebooks\conversion\plumber.py", line 1114, in create_oebbook
  File "site-packages\calibre\ebooks\oeb\reader.py", line 71, in __call__
  File "site-packages\calibre\ebooks\oeb\reader.py", line 612, in _all_from_opf
  File "site-packages\calibre\ebooks\oeb\reader.py", line 307, in _spine_from_opf
calibre.ebooks.oeb.base.OEBError: Spine is empty

Any/all help much appreciated. :(

Kovid Goyal (kovid) wrote : Re: calibre bug 908835

That file has rather pathological markup; it's not going to be convertible on
an ordinary computer. You'd need to run the conversion on a 64-bit machine with
a 64-bit version of calibre and oodles of RAM.

 status wontfix

Changed in calibre:
status: New → Won't Fix
Ana (anamardoll) wrote :

Well, I have a 64-bit machine and oodles of RAM. I presume I'm running the 64-bit Calibre install..?

Kovid Goyal (kovid) wrote :

Not on Windows; you'd need to run the Linux or OS X calibre builds. And by
oodles I mean about 12 GB.
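
For readers who want to check which kind of build they are running, here is a minimal sketch in plain Python 3. It is not calibre code, and the helper names `pointer_bits` and `max_address_space_gib` are made up for illustration. It reports the pointer width of whichever interpreter executes it, so it only describes calibre's bundled interpreter if it is run inside calibre, which is an assumption about your setup.

```python
import struct


def pointer_bits() -> int:
    """Return the pointer width of the running interpreter in bits."""
    # struct format "P" is a native void*; its size is 4 bytes on a
    # 32-bit build and 8 bytes on a 64-bit build.
    return struct.calcsize("P") * 8


def max_address_space_gib(bits: int) -> float:
    """Theoretical upper bound on addressable memory, in GiB."""
    # A process with N-bit pointers can address at most 2**N bytes.
    # The operating system usually reserves part of that range, so the
    # usable amount is smaller in practice.
    return 2 ** bits / 2 ** 30


if __name__ == "__main__":
    bits = pointer_bits()
    print(f"Interpreter is {bits}-bit")
    print(f"Theoretical address space: {max_address_space_gib(bits):g} GiB")
```

A 32-bit process tops out at a theoretical 4 GiB of address space, well short of the roughly 12 GB mentioned above. That is consistent with the MemoryError in the log, however much RAM the machine has installed.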

Ana (anamardoll) wrote :

Ah, that does change things a bit. :(

Thank you for letting me know. You Calibre folks really are the best. :)
