Comment 6 for bug 158435

Revision history for this message
danh (danh-archive) wrote :

For reference, i'm coding as outlined above, and anticipate getting the c portion done shortly (day or two).

When wrapping in php, we'll need to create an xml file according to the national digital newspaper project (which will be saved as an xml box in the output jp2).

This file is described in part 5 of the JPEG 2000 Profile for the National Digital Newspaper Program (with an example in appendix F): http://www.loc.gov/ndnp/pdf/NDNP_JP2HistNewsProfile.pdf

I think the basic intent is to have embedded in each jp2 page image the:
    newspaper name
    location of publication
    date of publication
    page label
    Library of Congress (LOC) Catalog number

The profile makes it sound like this is all available from the marc information ("Dublin Core"). Some of this (e.g., the page label, which i think could be strings like "B2") i need some advice on how to proceed.

Just for reference, the details of the proposal from the LOC include these items:

   (1) Library of Congress catalog number for the serial ("normalized")
   (2) Date of publication (CCCC-MM-DD)
   (3) "Edition Order"
   (4) "Page Sequence Number"
   (5) Title
   (6) "Page label" (which i think would typically be a page number)
   (7) "Responsible Organization" (looks like sponsor)
   (8) Reel number
   (9) Reel sequence number

SO: i will need some advice on this a little later this week, and thanks in advance everybody for your suggestions or other help.