AFAIK the bounding box of every single letter is written to the HOCR file, so generating proper info from that is the PDF generator's job.
AFAIK the bounding box of every single letter is written to the HOCR file, so generating proper info from that is the PDF generator's job.