Import of PDF changes uppercase to lowercase

Bug #918075 reported by David H
This bug affects 1 person
Affects    Status     Importance  Assigned to  Milestone
Inkscape   Confirmed  Medium      Unassigned

Bug Description

When I import the attached PDF in Inkscape 0.48.2 r9819, some letters are converted to lowercase. One might think that the included font files are to blame (a lowercase letter mapping to an uppercase shape), but when I extract them with Ghostscript I do not see fonts that do that.

David H (davidh-blinker) wrote :
description: updated
su_v (suv-lp)
tags: added: importing pdf
removed: lowercase uppercase

su_v (suv-lp) wrote :

Please provide information about your OS/platform.

su_v (suv-lp) wrote :

PDF info of the attached PDF file:
$ pdfinfo 918075-C356_ANVvj12_HR.pdf
Creator: Adobe InDesign CS3 (5.0)
Producer: Adobe PDF Library 8.0
CreationDate: Thu Nov 24 17:12:22 2011
ModDate: Thu Nov 24 17:12:51 2011
Tagged: no
Pages: 2
Encrypted: no
Page size: 1190.55 x 850.394 pts
File size: 10714502 bytes
Optimized: yes
PDF version: 1.4
$

Font info of the attached PDF file:
$ pdffonts 918075-C356_ANVvj12_HR.pdf
name                                 type              emb sub uni object ID
------------------------------------ ----------------- --- --- --- ---------
WTJWTU+EuroSans-Regular              CID Type 0C       yes yes yes     94  0
VLZQFW+Helvetica                     TrueType          yes yes no     108  0
XBTCHS+OfficinaSans-Book             Type 1C           yes yes yes     93  0
XBTCHS+OfficinaSans-Bold             Type 1C           yes yes yes     92  0
WTJWTU+DINMittelschrift              Type 1C           yes yes yes     97  0
WTJWTU+ChaparralPro-BoldIt           Type 1C           yes yes no      96  0
WTJWTU+ChaparralPro-BoldIt           Type 1C           yes yes yes    104  0
$
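
The `uni` column of `pdffonts` reports whether a font carries an explicit ToUnicode map. A minimal Python sketch for picking out the fonts that lack one, assuming the rows are pasted in as text and that font names contain no spaces (the `FontRow` and `parse_row` names are made up for this illustration):

```python
from typing import NamedTuple

# Rows copied from the pdffonts output above (header lines omitted).
PDFFONTS_ROWS = """\
WTJWTU+EuroSans-Regular CID Type 0C yes yes yes 94 0
VLZQFW+Helvetica TrueType yes yes no 108 0
XBTCHS+OfficinaSans-Book Type 1C yes yes yes 93 0
XBTCHS+OfficinaSans-Bold Type 1C yes yes yes 92 0
WTJWTU+DINMittelschrift Type 1C yes yes yes 97 0
WTJWTU+ChaparralPro-BoldIt Type 1C yes yes no 96 0
WTJWTU+ChaparralPro-BoldIt Type 1C yes yes yes 104 0
"""


class FontRow(NamedTuple):
    name: str
    type: str
    embedded: bool
    subset: bool
    to_unicode: bool
    object_id: str


def parse_row(line: str) -> FontRow:
    """Parse one pdffonts row; the type may contain spaces, so split from the right."""
    tokens = line.split()
    name = tokens[0]  # assumption: no spaces in the font name
    emb, sub, uni, obj, gen = tokens[-5:]
    font_type = " ".join(tokens[1:-5])
    return FontRow(name, font_type, emb == "yes", sub == "yes",
                   uni == "yes", f"{obj} {gen}")


rows = [parse_row(line) for line in PDFFONTS_ROWS.splitlines() if line.strip()]
without_map = [(r.name, r.object_id) for r in rows if not r.to_unicode]
print(without_map)
# [('VLZQFW+Helvetica', '108 0'), ('WTJWTU+ChaparralPro-BoldIt', '96 0')]
```

Note that OfficinaSans-Book is listed with `uni` = yes, so a map is present for it; `pdffonts` does not tell whether the entries in that map are correct.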

The font in question seems to be 'ITC Officina Sans Book':
<http://www.itcfonts.com/fonts/Detail.htm?ProductId=169914>
Do you have it installed on your system?

  Note: Inkscape cannot reuse fonts embedded in the PDF file:
  if a matching font is found among the installed fonts, that
  one is used; otherwise a generic fallback font (Sans) is substituted.

I don't own the font in question and can't confirm if upper/lower case imports and renders correctly if it is installed locally, or whether the PDF file makes use of extended OTF features which are not supported by Inkscape (alternate glyphs, small-caps, etc).

I can confirm that for some text objects, upper and lower case seem to be mixed randomly on import:
<text
           transform="matrix(0.9,0,0,-1,42.5197,758.5334)"
           id="text128"><tspan
             style="fill:#c4161d;fill-opacity:1;fill-rule:nonzero;stroke:none;font-family:ITC Officina Sans Book;font-variant:normal;font-weight:normal;font-stretch:normal;font-size:34;writing-mode:lr;-inkscape-font-specification:OfficinaSans-Book"
             x="0 21.964 42.67 65.246 89.08 107.882 133.62 148.648 171.224 186.864 205.02 230.758 249.56 270.878 289.68 312.256 325.38 345.44 367.404 386.206 404.362 425.68 447.644 466.446"
             y="0"
             sodipodi:role="line"
             id="tspan130">aRNHEm-NIJmEgEN VaStgoEd</tspan></text>

(tested with Inkscape 0.48.2 and 0.48+devel r10903 on OS X Lion).
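
To find other affected text objects in an imported file, something like the following Python sketch could be used. It is only a heuristic: it flags any `tspan` whose text has a lowercase letter directly followed by an uppercase one, so camelCase words would be false positives. The sample SVG is a cut-down stand-in for the real import, and the second text object (`text200`) is invented for contrast.

```python
import re
import xml.etree.ElementTree as ET

SVG_NS = "http://www.w3.org/2000/svg"

# Cut-down stand-in for the imported document; text200/tspan202 are hypothetical.
SAMPLE = """<svg xmlns="http://www.w3.org/2000/svg">
  <text id="text128"><tspan id="tspan130">aRNHEm-NIJmEgEN VaStgoEd</tspan></text>
  <text id="text200"><tspan id="tspan202">Ordinary Title Text</tspan></text>
</svg>"""

# Heuristic: a lowercase ASCII letter immediately followed by an uppercase one.
LOWER_THEN_UPPER = re.compile(r"[a-z][A-Z]")


def suspicious_tspans(svg_source: str):
    """Return (id, text) for every tspan whose text matches the heuristic."""
    root = ET.fromstring(svg_source)
    hits = []
    for tspan in root.iter(f"{{{SVG_NS}}}tspan"):
        text = tspan.text or ""
        if LOWER_THEN_UPPER.search(text):
            hits.append((tspan.get("id"), text))
    return hits


print(suspicious_tspans(SAMPLE))
# [('tspan130', 'aRNHEm-NIJmEgEN VaStgoEd')]
```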

Changed in inkscape:
importance: Undecided → Medium
status: New → Confirmed

su_v (suv-lp) wrote :

Similar issue reported in
Bug #199689 in Inkscape: “PDF import changes font case randomly”
<https://bugs.launchpad.net/inkscape/+bug/199689>

(See screenshot attached in comment #1: <https://bugs.launchpad.net/inkscape/+bug/199689/+attachment/224341/+files/Inkscape-PDF-problem.png>)

su_v (suv-lp) wrote :

If I copy text from the PDF file displayed in Apple's Preview.app, from the first title, displayed as
"ARNHEM-NIJMEGEN VASTGOED"
I get the same "incorrect" string seen in Inkscape when pasted elsewhere, e.g. into the edit field of this bug report in Firefox:
"aRNHEm-NIJmEgEN VaStgoEd"

The issue does not seem to be specific to Inkscape.
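
A quick way to see which letters are affected is to compare the two strings character by character. The Python sketch below assumes the displayed title reads "ARNHEM-NIJMEGEN VASTGOED" (24 characters, the same length as the pasted string); the `case_mismatches` helper is written just for this comparison.

```python
def case_mismatches(displayed: str, extracted: str):
    """Return (index, displayed_char, extracted_char) where only the case differs."""
    if len(displayed) != len(extracted):
        raise ValueError("strings differ in length")
    result = []
    for i, (d, e) in enumerate(zip(displayed, extracted)):
        if d != e:
            if d.lower() != e.lower():
                raise ValueError(f"position {i}: {d!r} vs {e!r} is not a case change")
            result.append((i, d, e))
    return result


displayed = "ARNHEM-NIJMEGEN VASTGOED"   # as rendered (assumed spelling)
extracted = "aRNHEm-NIJmEgEN VaStgoEd"   # as pasted / imported

mismatches = case_mismatches(displayed, extracted)
changed = {d for _, d, _ in mismatches}
unchanged = {d for d, e in zip(displayed, extracted) if d == e and d.isalpha()}
affected = sorted(changed)

print(affected)             # ['A', 'D', 'G', 'M', 'O', 'T']
print(sorted(unchanged))    # ['E', 'H', 'I', 'J', 'N', 'R', 'S', 'V']
print(changed & unchanged)  # set()
```

In this one sample, every occurrence of a given letter is treated the same way (the two sets do not overlap), so the case change looks like a fixed per-letter mapping rather than a random one. That would be consistent with the text mapping stored in the PDF being at fault, though a single title is not enough to be sure.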

su_v (suv-lp) wrote :

Text from the PDF file imports correctly in trunk (r10903) with the experimental 'Adobe PDF via cairo-poppler (*.pdf)' input format (using the embedded fonts outlined as cloned paths).

Note: with this import format, text is not editable as text anymore (outlined instead).

David H (davidh-blinker) wrote :

I'm on Linux / Ubuntu maverick-backports. I do not have the fonts installed (either).

For me it is imperative that I can edit the text in the .svg file; importing the text as cloned paths would not help me.

Beluga (buovjaga) wrote :

Still reproducible with the internal import.

Win 7 64-bit
Inkscape 0.92pre1_64bit r15016
