SikuliX

[request] OCR/tesseract: allow new training sets for other languages and more tesseract features

Bug #795391 reported by RaiMan on 2011-06-10

This bug affects 5 people

Affects		Status	Importance	Assigned to	Milestone
	SikuliX	In Progress	Medium	RaiMan

--- implementation of other languages
A training set can be created for tesseract 2.04 as described here:
http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract2
and implemented in sikuli-script.jar.

If it contains non english characters, Sikuli crashes (e.g. norwegian).

--- Tesseracts bold-feature is not used properly by Sikuli: only @ is returned instead of @x for bold characters.

Tags:

RaiMan (raimund-hocke) on 2011-06-10

Changed in sikuli:
importance:	Undecided → Wishlist

RaiMan (raimund-hocke) on 2013-02-22

This report contains Public information

Everyone can see this information.

You are

Subscribing...

Bug watches keep track of this bug in other bug trackers.