I have found incomplete character tables (missing some Central European (CE) characters) in these files: preprocess.py (the PDFTOHTML list), unsmarten.py, and utils.py. Possibly only the first one is critical for the automatic line unwrapping of PDF documents.
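
To check whether a given character table covers the CE letters, something along these lines might help. It is only a rough sketch: `CE_LOWER`, `missing_ce_chars`, and the sample table are hypothetical names and data of my own, not taken from those files, and the letter list is illustrative rather than exhaustive.

```python
# Hypothetical helper: report which Central European letters a table lacks.
# Lowercase Polish, Czech, Slovak and Hungarian letters (illustrative only).
CE_LOWER = "ąćęłńóśźż" "áčďéěíňóřšťúůýž" "äĺľôŕ" "öőüű"


def missing_ce_chars(table: str) -> list[str]:
    """Return the CE letters (both cases) absent from `table`, sorted by code point."""
    wanted = set(CE_LOWER) | set(CE_LOWER.upper())
    return sorted(wanted - set(table))


# Made-up stand-in for a Western-European-only table, not the real list.
western_only = "àáâãäåæçèéêëìíîïðñòóôõöøùúûüýþÿ"

if __name__ == "__main__":
    print("".join(missing_ce_chars(western_only)))
```

Running the same check against each real table would show which of them actually need extending.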