chm->pdf conversion german umlauts TOC problem

Bug #2021413 reported by Michael Thummerer
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
calibre
Fix Released
Undecided
Unassigned

Bug Description

calibre-64bit-6.18.1
Windows 11 x64

when converting a german CHM file to PDF, the german umlauts from the CHM-TOC-entrys are not correctly converted and wrongly displayed at the PDF bookmarks. see attached screenshot (left toc: chm, right toc: pdf).

the codepage of the hhc-file from which the CHM file is generated is 1252. i changed it to UTF8 but it doesnt help.

i tested another conversion tool and the german umlauts are correctly displayed at the PDF bookmarks.

Tags: umlauts
Revision history for this message
Michael Thummerer (michla) wrote :
Revision history for this message
Kovid Goyal (kovid) wrote :

This is fixed in the next calibre release

Changed in calibre:
status: New → Fix Released
Revision history for this message
Michael Thummerer (michla) wrote :

i installed calibre-64bit-6.19.1 and tried to convert the CHM to PDF again; but now i always get an error report and the conversion is stopped. i tried different CHM-files and i tried to convert to EPUB, but i always got the same error report:

Normalizing filename cases
Rewriting HTML links
Parsing toc.html ...
Forcing toc.html into XHTML namespace
File toc.html missing <head/> element
Traceback (most recent call last):
  File "runpy.py", line 196, in _run_module_as_main
  File "runpy.py", line 86, in _run_code
  File "site.py", line 83, in <module>
  File "site.py", line 78, in main
  File "site.py", line 50, in run_entry_point
  File "calibre\utils\ipc\worker.py", line 215, in main
  File "calibre\gui2\convert\gui_conversion.py", line 38, in gui_convert_override
  File "calibre\gui2\convert\gui_conversion.py", line 25, in gui_convert
  File "calibre\ebooks\conversion\plumber.py", line 1108, in run
  File "calibre\customize\conversion.py", line 242, in __call__
  File "calibre\ebooks\conversion\plugins\chm_input.py", line 77, in convert
  File "calibre\ebooks\conversion\plugins\chm_input.py", line 106, in _create_oebbook_html
  File "calibre\ebooks\conversion\plugins\html_input.py", line 202, in create_oebbook
  File "calibre\ebooks\oeb\base.py", line 267, in rewrite_links
  File "calibre\ebooks\conversion\plugins\html_input.py", line 274, in resource_adder
  File "calibre\ebooks\conversion\plugins\html_input.py", line 266, in link_to_local_path
AttributeError: 'HTMLInput' object has no attribute 'root_dir_of_input'

Revision history for this message
Kovid Goyal (kovid) wrote :

Ah, that's an unrelated bug caused by the restriction of the HTML input
plugin to the root dir of the input file. I forgot the CHM input plugin
uses the HTML input plugin, and neglects to set the root dir. This will
be fixed in the next release. In the meantime you can always downgrade
to 6.16 if you need to convert CHM files.

Revision history for this message
Kovid Goyal (kovid) wrote :

Fixed in branch master. The fix will be in the next release. calibre is usually released every alternate Friday.

Revision history for this message
Michael Thummerer (michla) wrote :

Is there any beta download available so i can test the fix?

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Bug attachments

Remote bug watches

Bug watches keep track of this bug in other bug trackers.