Comment 53 for bug 623438

Revision history for this message
Emmanuel Pirsch (emmanuel-pirsch) wrote : Re: Font size not correct in merged sandvich PDF

I'm having similar issue. I can confirm that it is not related to Cuneiform. I'm using ocropus (ocroscript recognize) (which uses Tesseract) and I have check the resulting .html (hocr) which seems valid and pixel perfect.

However, hocr2pdf misalign the text with their related bounding boxes. I've tried ocroscript recognize with and without the --charboxes options and the result is always wrong (the text has an offset on the Y axis).

This is with exactimage 0.8.1-3build1 on Ubuntu Natty.