Tesseract OCR package is missing training commands

Bug #1310136 reported by Darryl Hamilton
This bug affects 3 people
Affects: tesseract (Ubuntu)
Status: Fix Released
Importance: Undecided
Assigned to: Unassigned

Bug Description

Ubuntu Version:
Description: Ubuntu 14.04 LTS
Release: 14.04

Package: tesseract-ocr:
  Installed: 3.03.02-3
  Candidate: 3.03.02-3
  Version table:
 *** 3.03.02-3 0
        500 http://nz.archive.ubuntu.com/ubuntu/ trusty/universe amd64 Packages
        100 /var/lib/dpkg/status

The package is missing a number of programs used to train the recognition engine, but the man pages for these are in the archive.

The upstream Debian version has fixed this - https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=742029
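
A quick way to confirm which training programs are absent from an installed system is to look each one up on the PATH. The sketch below is only illustrative: the tool names are an assumption (this report does not list them; they are the commonly used Tesseract 3.x training programs), and `missing_tools` is a hypothetical helper name.

```python
import shutil

# Assumed tool names: not taken from this report, but from the usual
# Tesseract 3.x training workflow. Adjust the list to match the man pages
# actually shipped in the package.
TRAINING_TOOLS = [
    "cntraining",
    "mftraining",
    "combine_tessdata",
    "unicharset_extractor",
    "wordlist2dawg",
]


def missing_tools(tools=TRAINING_TOOLS, path=None):
    """Return the names in `tools` that are not executable on `path`.

    `path` is a PATH-style string; None means the current $PATH.
    """
    return [name for name in tools if shutil.which(name, path=path) is None]


if __name__ == "__main__":
    absent = missing_tools()
    if absent:
        print("Missing training tools: " + ", ".join(absent))
    else:
        print("All listed training tools were found.")
```

Running this on an affected 14.04 install should list whichever of the assumed tools the package does not ship.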

ProblemType: Bug
DistroRelease: Ubuntu 14.04
Package: tesseract-ocr 3.03.02-3
ProcVersionSignature: Ubuntu 3.13.0-24.46-generic 3.13.9
Uname: Linux 3.13.0-24-generic x86_64
ApportVersion: 2.14.1-0ubuntu3
Architecture: amd64
Date: Sun Apr 20 12:35:16 2014
InstallationDate: Installed on 2014-04-02 (17 days ago)
InstallationMedia: Ubuntu-Server 14.04 LTS "Trusty Tahr" - Beta amd64 (20140326)
SourcePackage: tesseract
UpgradeStatus: No upgrade log present (probably fresh install)

Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in tesseract (Ubuntu):
status: New → Confirmed
David Agudo (dagudoj) wrote :

Yes, they are missing in tesseract-ocr 3.03.02-3, although they were present in Ubuntu 12.04.

It is not possible to train tesseract without those tools.

David Agudo (dagudoj) wrote :

I have found several problems...

1) The training tools are not built by default because the training/ directory is not included in SUBDIRS

2) make training fails because some .o files were left in the training/ directory of the source tarball, where they should not be. To make things worse, make clean does not delete them. This is explained here:

https://groups.google.com/forum/#!topic/tesseract-dev/ARKOSV3zpWo

I propose patching the main Makefile.am to solve these issues.
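
For anyone who wants to check an unpacked source tree for the second problem before building, here is a minimal sketch. It assumes the leftover object files sit somewhere under training/ as described above; `find_stale_objects` is a hypothetical helper, not part of Tesseract or its build system.

```python
import sys
from pathlib import Path


def find_stale_objects(source_root, subdir="training"):
    """List .o files under source_root/subdir.

    Returns sorted POSIX-style paths relative to source_root, or an empty
    list if the subdirectory does not exist.
    """
    root = Path(source_root)
    target = root / subdir
    if not target.is_dir():
        return []
    return sorted(p.relative_to(root).as_posix() for p in target.rglob("*.o"))


if __name__ == "__main__":
    # Usage: python find_stale.py /path/to/unpacked/tesseract-source
    tree = sys.argv[1] if len(sys.argv) > 1 else "."
    for name in find_stale_objects(tree):
        print(name)
```

Any paths it prints in a freshly unpacked tarball are candidates for removal before running make training.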

Ubuntu Foundations Team Bug Bot (crichton) wrote :

The attachment "make_clean.patch" seems to be a patch. If it isn't, please remove the "patch" flag from the attachment, remove the "patch" tag, and if you are a member of the ~ubuntu-reviewers, unsubscribe the team.

[This is an automated message performed by a Launchpad user owned by ~brian-murray, for any issues please contact him.]

tags: added: patch
Jeff Breidenbach (jeff-jab) wrote :

This problem was resolved in Ubuntu 14.10 and later. Sorry for the glitch, totally my fault.

Changed in tesseract (Ubuntu):
status: Confirmed → Fix Released