Private Named Entities problems

Bug #1772157 reported by Hans Lederer on 2018-05-19
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
calibre
Undecided
Unassigned

Bug Description

I came across a German classic EPUB, Grimms Märchen, using lots of private named entities for ligatures in its Fraktur font:
http://users.physik.fu-berlin.de/~jtt/ebooks/Kinder-%20und%20Hausm%C3%A4rchen.epub
from http://users.physik.fu-berlin.de/~jtt/ebooks/ like defining —

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN"
                      "http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd"
[
  <!ENTITY ll "&#xf4f9;">
]>

and then using them like   ſo&ll;en   in the XHTML texts.
As far as I can see this is coded correctly though not recommendable; my two Pocketbooks have different problems displaying it.

Calibre's current viewer (3.23 on Linux Mint Debian Ed.) displays those ligatures correctly, but also displays the closing ]> from <!DOCTYPE … . 3.13 did not display the ligatures.

Calibre's current editor (3.23) flags >70k errors and then ≈200 warnings about that, offering to replace the named entities with the referenced characters — but then does NOT replace the   &ll;   by its definition   &#xf4f9;   but by ≪ , others not at all, removing only the warnings but not the errors.
Beautifying just removes the above definitions, thus ruining their display.

Fixed in branch master. The fix will be in the next release. calibre is usually released every alternate Friday.

 status fixreleased

Changed in calibre:
status: New → Fix Released
Hans Lederer (hanspl) wrote :

Wow. That was fast! Could you possibly lend a hand to the LibreOffice team? ;-)

It was an easy fix :)

To post a comment you must log in.
This report contains Public information  Edit
Everyone can see this information.

Other bug subscribers