conversion from docx cannot handle images in footnotes
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
calibre |
Fix Released
|
Undecided
|
Unassigned |
Bug Description
I have problems converting ebooks that contain images in footnotes. The attached test file contains an inline image in a footnote and when I try to convert it to epub, the conversion fails. When I remove the image from the source file, the conversion runs fine.
Tried with Calibre 1.1.0 and 1.2.0 on Windows. (And probably on some older version on Linux, but that was with a different file and I cannot confirm that the footnote images were the sole reason; the error was the same or very similar, though.)
Another file (which I cannot share but which contained several images in footnotes) showed a different behaviour. It successfully converted, but the images in footnotes were broken, seemed to be xml files instead of image files and therefore would not display in the book.
This is from conversion to epub, but I tried several other formats and they all failed. Here are the details:
Convert book 1 of 1 (Test file)
C:\Program Files\Calibre2\
Resolved conversion options
calibre version: 1.2.0
{'asciiize': False,
'author_sort': None,
'authors': None,
'base_font_size': 0.0,
'book_producer': None,
'change_
'chapter': u"//*[((name()='h1' or name()='h2') and re:test(., '\\s*((
'chapter_mark': u'pagebreak',
'comments': None,
'cover': None,
'debug_pipeline': None,
'dehyphenate': True,
'delete_
'disable_
'docx_no_cover': False,
'dont_
'duplicate_
'embed_all_fonts': False,
'embed_
'enable_
'epub_flatten': False,
'epub_inline_toc': False,
'epub_toc_at_end': False,
'extra_css': None,
'extract_to': None,
'filter_css': u'',
'fix_indents': True,
'flow_size': 260,
'font_
'format_
'html_
'input_encoding': None,
'input_profile': <calibre.
'insert_
'insert_
'insert_metadata': False,
'isbn': None,
'italicize_
'keep_ligatures': False,
'language': None,
'level1_toc': None,
'level2_toc': None,
'level3_toc': None,
'line_height': 0.0,
'linearize_
'margin_bottom': 5.0,
'margin_left': 5.0,
'margin_right': 5.0,
'margin_top': 5.0,
'markup_
'max_toc_links': 50,
'minimum_
'no_chapters_
'no_default_
'no_inline_
'no_svg_cover': False,
'output_profile': <calibre.
'page_
'prefer_
'preserve_
'pretty_print': True,
'pubdate': None,
'publisher': None,
'rating': None,
'read_
'remove_
'remove_
'remove_
'remove_
'renumber_
'replace_
'search_replace': '[]',
'series': None,
'series_index': None,
'smarten_
'sr1_replace': None,
'sr1_search': None,
'sr2_replace': None,
'sr2_search': None,
'sr3_replace': None,
'sr3_search': None,
'start_
'subset_
'tags': None,
'timestamp': None,
'title': None,
'title_sort': None,
'toc_filter': None,
'toc_threshold': 6,
'toc_title': None,
'unsmarten_
'unwrap_lines': True,
'use_auto_toc': False,
'verbose': 2}
InputFormatPlugin: DOCX Input running
on C:\Users\
Converting Word markup to HTML
Python function terminated unexpectedly
u'word/
Traceback (most recent call last):
File "site.py", line 132, in main
File "site.py", line 109, in run_entry_point
File "site-packages\
File "site-packages\
File "site-packages\
File "site-packages\
File "site-packages\
File "site-packages\
File "site-packages\
File "site-packages\
File "site-packages\
File "site-packages\
File "site-packages\
File "site-packages\
File "site-packages\
File "site-packages\
KeyError: u'word/
Fixed in branch master. The fix will be in the next release. calibre is usually released every Friday.
status fixreleased