ocrmypdf binary package in Ubuntu Noble amd64
OCRmyPDF generates a searchable PDF/A file from a regular PDF
containing only images, allowing it to be searched.
.
It uses the Tesseract OCR engine and so supports all the languages
that Tesseract does.
.
Some other main features:
.
* Places OCR text accurately below the image to ease copy / paste
* Keeps the exact resolution of the original embedded images
* When possible, inserts OCR information as a lossless operation
without rendering vector information
* Keeps file size about the same
* If requested deskews and/or cleans the image before performing OCR
* Validates input and output files
* Provides debug mode to enable easy verification of the OCR results
* Processes pages in parallel when more than one CPU core is
available
* Battle-tested on thousands of PDFs, a test suite and continuous
integration.
Publishing history
Date | Status | Target | Component | Section | Priority | Phased updates | Version | ||
---|---|---|---|---|---|---|---|---|---|
2023-10-28 23:39:49 UTC | Published | Ubuntu Noble amd64 | release | universe | graphics | Optional | 15.2.0+dfsg1-1 | ||
|
|||||||||
Deleted | Ubuntu Noble amd64 | proposed | universe | graphics | Optional | 15.2.0+dfsg1-1 | |||
|
|||||||||
2023-10-28 23:40:41 UTC | Superseded | Ubuntu Noble amd64 | release | universe | graphics | Optional | 14.0.1+dfsg1-1 | ||
|