[request] OCR/tesseract: allow new training sets for other languages and more tesseract features

Bug #795391 reported by RaiMan
This bug affects 5 people
Affects: SikuliX
Status: In Progress
Importance: Medium
Assigned to: RaiMan
Milestone: (none)

Bug Description

--- implementation of other languages
A training set can be created for Tesseract 2.04 as described here:
http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract2
and then added to sikuli-script.jar.

If the training set contains non-English characters (e.g. Norwegian), Sikuli crashes.

--- Tesseract's bold feature is not used properly by Sikuli: only @ is returned instead of @x for bold characters.

Tags: fkt-text
RaiMan (raimund-hocke)
Changed in sikuli:
importance: Undecided → Wishlist
RaiMan (raimund-hocke)
Changed in sikuli:
status: New → Incomplete
status: Incomplete → In Progress
importance: Wishlist → Medium
assignee: nobody → RaiMan (raimund-hocke)
tags: added: fkt-text