I'm running Cuneiform 1.0 and the behaviour that I'm observing is also, like others already have mentioned, that a dash gets translated into three characters: —
(that's U+00E2, U+20AC, U+201D).
This happens not only when I use "smarttext" for format, also with "html", "hocr" and "text".
When using Czech, Dutch, English, French, German, etc. Cuneiform will produce —.
However, when using Bulgarian, Russian, etc. Cuneiform will produce: — (that's U+0432, U+0402, U+201D).
I'm running Cuneiform 1.0 and the behaviour that I'm observing is also, like others already have mentioned, that a dash gets translated into three characters: —
(that's U+00E2, U+20AC, U+201D).
This happens not only when I use "smarttext" for format, also with "html", "hocr" and "text".
When using Czech, Dutch, English, French, German, etc. Cuneiform will produce —.
However, when using Bulgarian, Russian, etc. Cuneiform will produce: — (that's U+0432, U+0402, U+201D).