Comment 1 for bug 1224811

Revision history for this message
niknah (hankin0) wrote :

Tesseract works better if you enlarge the image.
I think it uses black and white for OCR, not even grayscale. So it gets confused if the gaps between the letters are gray and not black or white.

It'd be useful if a scale argument can be added to text() or Settings

Right now, I can't see a way to OCR on an enlarged image. text() only works on a Region. Can it be used with an Image?

Here're some tests with the attached image.
The same image, 2x, 3x, scaled using gimp (cubic) and the results from tesseract 3.02

------Results from tesseract 3.02----

mm lnruwzk hmvm raxmmvm D>l’V nu Wm dafl
um mequlci. hmwn fox lunllkduier u. my an;

u..n-umuuu hvrmn miuumu over uu ht} dug
::»,un. quick hmvm fax ilmlped u... nu lazy dug

um-1.. quick hmmu (ax jumped rner Ilue nu, dug
lfipt The quick brown fox jumped over the lazy dog

ispc The quick brown fox jumped over the lazy dog

32pt The quick brown fox jumped over the lazy dog

Same image resized 2x cubic with gimp...
lllpl Thequick hmwn foxjumped ovtrlhel-.I1_v dog

llpl'l1Ir quick bmwn [ox jumped over lhr lazy dog

12])! The quick hmwn foxjumpcd over the hazy dog
l3pt The quick brown fox jumped over the lazy dog

I-lpt The quick brown fox jumped over the lazy dog
l6pt The quick brown fox jumped over the lazy dog

l8pt The quick brown fox jumped over the lazy (log

32pt The quick brown fox jumped over the lazy dog

Same image resized 3x cubic with gimp...

llilpl The quick brmm foxjumpcd over the lazy dog
Hp! 11:‘ quick brown fox jumped over the Buy dog

I211! The qulek brown foxjumped over the lazy dog
l3pl The quick brown fox jumped over the lazy dog

l-lpt The quick brown fox jumped over the lazy dog