Add --nopictures, --tables=n to cli
Bug #395351 reported by
Ben Jackson
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Cuneiform for Linux | New | Undecided | Unassigned |
Bug Description
I found that cuneiform would not OCR anything inside an outline (any kind of box); it treats the area as either a picture or a table. With '--nopictures' alone it seems to just ignore those same areas. If you also add '--tables=n' it will successfully OCR inside the table.
I've never used bzr before this so I've probably botched the patch somehow, but it is fairly trivial.
I see now in puma.h there are constants related to the values I set:
# define PUMA_TABLE_NONE 0
# define PUMA_TABLE_DEFAULT 1
# define PUMA_TABLE_ONLY_LINE 2
# define PUMA_TABLE_ONLY_TEXT 3
# define PUMA_TABLE_LINE_TEXT 4
# define PUMA_PICTURE_NONE 0
# define PUMA_PICTURE_ALL 1
I tried all the table settings, but the output did not really differ between them. Any value other than 0 got it to OCR things inside a box outline (as opposed to turning the box into a picture).
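For anyone trying to follow along, here is a rough sketch of how the two options could map onto those constants. This is not the attached patch and not Cuneiform's actual CLI code: `struct layout_opts`, `parse_layout_opts`, and the default values are hypothetical names and choices made up for illustration. Only the `PUMA_*` values come from the puma.h lines quoted above.

```c
#include <stdlib.h>
#include <string.h>

/* Values copied from the puma.h constants quoted in this report. */
#define PUMA_TABLE_NONE      0
#define PUMA_TABLE_DEFAULT   1
#define PUMA_TABLE_ONLY_LINE 2
#define PUMA_TABLE_ONLY_TEXT 3
#define PUMA_TABLE_LINE_TEXT 4
#define PUMA_PICTURE_NONE    0
#define PUMA_PICTURE_ALL     1

/* Hypothetical container for the two settings; not a Cuneiform type. */
struct layout_opts {
    int pictures;
    int tables;
};

/*
 * Hypothetical helper: scans argv for --nopictures and --tables=n.
 * Returns 0 on success, -1 if the --tables value is not an integer
 * in the range PUMA_TABLE_NONE..PUMA_TABLE_LINE_TEXT.
 */
static int parse_layout_opts(int argc, char **argv, struct layout_opts *out)
{
    static const char tables_prefix[] = "--tables=";
    const size_t prefix_len = sizeof tables_prefix - 1;

    /* Defaults are assumptions for this sketch only. */
    out->pictures = PUMA_PICTURE_ALL;
    out->tables = PUMA_TABLE_NONE;

    for (int i = 1; i < argc; i++) {
        if (strcmp(argv[i], "--nopictures") == 0) {
            out->pictures = PUMA_PICTURE_NONE;
        } else if (strncmp(argv[i], tables_prefix, prefix_len) == 0) {
            const char *value = argv[i] + prefix_len;
            char *end;
            long n = strtol(value, &end, 10);

            /* Reject empty values, trailing junk, and out-of-range modes. */
            if (end == value || *end != '\0' ||
                n < PUMA_TABLE_NONE || n > PUMA_TABLE_LINE_TEXT)
                return -1;
            out->tables = (int)n;
        }
        /* Any other argument is ignored in this sketch. */
    }
    return 0;
}
```

With something like this, `--nopictures --tables=4` would yield `PUMA_PICTURE_NONE` and `PUMA_TABLE_LINE_TEXT`, and the resulting values would then be handed to whatever call the CLI uses to configure the engine (not shown here, since I have not checked its signature).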