Comment 0 for bug 710586

Revision history for this message
RaiMan (raimund-hocke) wrote : X 1.0rc1: Region.text() -- known problems and needed improvements

******* this report is a summary of known problems

The text recognition feature (OCR - Region.text()) together with the possibility to find text in an image is still experimental and under developement.

This are currently reported bugs:
bug 695616: Inconsistency in text recognition and matching, especially with integers-as-text!
bug 695650: find(text).text() does not return same text
bug 701005: text() always returns text with trailing x'200A20'
bug 701012: text() does not return all intervening blanks, add's others

Other experienced oddities
-- there are problems with text, that is not in english language
-- very small and very large fonts may not work
-- multiline text makes problems
-- intervening/preceding/trailing grafics and symbols are tried to be interpreted as text

Tip when using Region.text():
Currently you get the best results, when the region represents only one line of text and only contains text (no graphics/symbols) in english language.