[request] tesseract: want to use tessedit_char_whitelist

Bug #1365575 reported by RaiMan
8
This bug affects 1 person
Affects Status Importance Assigned to Milestone
SikuliX
Fix Released
Medium
RaiMan

Bug Description

I'm trying to override the tessedit_char_whitelist Tesseract config parameter. I want to tell Tesseract to only match on AlphaNumberic characters (don't include punctuation etc). The full parameter definition is:

tessedit_char_whitelist abcdefghijklmnopqrtsuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789

Problem is I don't know where to put this in the tessdata folder. Someone said use the "bazaar" pattern matching file. In other words define a file in the config directory called "bazaar_test" and put that line in it.

First I tried it with a stand alone install of Tesseract and it actually worked! Then I tried it in Sekuli's tessdata directory but didn't have any luck.

Has anyone ever tried to do this before?

Revision history for this message
RaiMan (raimund-hocke) wrote :

a solution see related question

Changed in sikuli:
status: New → In Progress
importance: Undecided → Medium
assignee: nobody → RaiMan (raimund-hocke)
milestone: none → 1.2.0
RaiMan (raimund-hocke)
Changed in sikuli:
status: In Progress → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Related questions

Remote bug watches

Bug watches keep track of this bug in other bug trackers.