Comment 7 for bug 1440304

Revision history for this message
iostrym (armandooooo) wrote : Re : [Bug 1440304] Re: [Enhancement] configure metadata import when importing pdf file in calibre

Thanks a lot for your time. I did not understand that calibre use also PDF info dic. So calibre use PDF info dic, xmp Dublin core and also xmp calibre meta data (only for custom metadata not available in DC metadata) ?
PDF info dic are not OK in the wrong PDF file ? Because PDF xchange change both PDF info dic and xmp Dublin core in same manipulation. So even reading info dic, it should be OK...
I think I start to understand in KO file info dic are read, in OK file DC xmp are read. Even if info dic are identical in both. But there is something calibre don't like in info dic. But what...

--- Message initial ---

De : "Kovid Goyal" <email address hidden>
Envoyé : 7 avril 2015 08:35
A : <email address hidden>
Objet : [Bug 1440304] Re: [Enhancement] configure metadata import when importing pdf file in calibre

To be precise, calibre compares the ModDate from the PDF Info dictionary
to the MetadataData in the XMP block. In your problem PDF, the ModDate
is Mon Apr 6 23:24:42 2015 and the MetadataDate is
2014-04-22T00:53:01+02:00

so calibre will use the information from the Info block rather than the
XMP, since the Info block is marked as being newer.

--
You received this bug notification because you are subscribed to the bug
report.
https://bugs.launchpad.net/bugs/1440304

Title:
  [Enhancement] configure metadata import when importing pdf file in
  calibre

Status in calibre: e-book management:
  Won't Fix

Bug description:
  In 2.23 on win7 64 bits, when importing a pdf in calibre, some common metadata in pdf file can be read by calibre to be imported in calibre metadata.
  for example : title, author and tag are imported. Also subject metadata is put in comment.

  By testing I saw that :

  - first line of subject is put in calibre tag (pdf subject can set in many lines using some pdf editor)
  - full subject (including others lines) are put in calibre comment
  - tag must be separated by comma.

  But maybe this import feature is described somewhere ?

  Something great would be for example
  - to configure the "separator" used between tags because some pdf editor don't support comma and want ";"
  - to be able to disable de first line import in subject for tags
  - to be able to customize which calibre metadata is written using which pdf metadata :
     ie : published date is first line of subject
            isbn is second line of subject
            others lines of subject are comment

  I would be happy to help if I was showed where this is done in code...

To manage notifications about this bug go to:
https://bugs.launchpad.net/calibre/+bug/1440304/+subscriptions