pdftotext extraction of accented characters
Bug #1527318 reported by yaser
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
poppler (Ubuntu) | New | Undecided | Unassigned |
Bug Description
When extracting text from a PDF file written in Spanish with pdftotext, accented characters (ü, á) come out incorrectly: the base letter is lost and only a standalone accent mark (¨ or ´) remains in its place.
Example:
Original text => Extracted text
Facultad de Matemática y Computación => Facultad de Matem´tica y Computaci´n
Analizadores Multilingües en FreeLing => Analizadores Multiling¨es en FreeLing
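
To check other extracted text for the same symptom, a minimal Python sketch follows. It assumes the stray marks are the spacing characters U+00B4 (ACUTE ACCENT) and U+00A8 (DIAERESIS), as they appear in the example above. `find_standalone_diacritics` is a hypothetical helper written for illustration, not part of poppler, and it only detects the problem; it cannot restore the missing base letters.

```python
# Spacing diacritics seen in the extracted text above (assumption:
# the stray marks are U+00B4 ACUTE ACCENT and U+00A8 DIAERESIS).
STANDALONE_DIACRITICS = {"\u00b4", "\u00a8"}


def find_standalone_diacritics(text):
    """Return (index, character) pairs for each standalone diacritic in text."""
    return [(i, ch) for i, ch in enumerate(text) if ch in STANDALONE_DIACRITICS]


if __name__ == "__main__":
    # The two extracted lines from the example above.
    samples = [
        "Facultad de Matem\u00b4tica y Computaci\u00b4n",
        "Analizadores Multiling\u00a8es en FreeLing",
    ]
    for line in samples:
        print(line, find_standalone_diacritics(line))
```

A correctly extracted line such as "Facultad de Matemática y Computación" yields an empty list, since á and ó are distinct code points from the spacing accent.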