Comment 8 for bug 324256

Revision history for this message
adriverhoef (a3) wrote :

I'm running Cuneiform 1.0 and the behaviour that I'm observing is also, like others already have mentioned, that a dash gets translated into three characters: —
(that's U+00E2, U+20AC, U+201D).
This happens not only when I use "smarttext" for format, also with "html", "hocr" and "text".
When using Czech, Dutch, English, French, German, etc. Cuneiform will produce —.
However, when using Bulgarian, Russian, etc. Cuneiform will produce: — (that's U+0432, U+0402, U+201D).