Toronto needs two language tags available in biblios metaform
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Scribe2 |
New
|
Undecided
|
Unassigned |
Bug Description
(from <email address hidden> - toronto book loader)
We have been working on a bilingual (english and french) collection for a
government sponsor and experimented with adding a second language code. It
is very important that the Canadian government stuff works in both
languages.
We did a bit of research on valid bilingual codes and found that using
engfre is allowable in a MARC record.
http://
We tried a test by adding engfre at the Biblio/Metaform stage and while it
derived properly, the accents are not correct on the french sections of
the txt file.
http://
So, we have been running an xml task to add the second language code
before the book is scanned and derived. The OCR works well on both
languages.
http://
My question (and my hope) is that we can add the second code on the
metaform page during loading so that we don't have to run an extra task on
the books. Is there a way to enter a second language code and have it
work? We have also tried eng;fre entered in the language field on the
metaform and it did not OCR properly.
Is there any way that the code engfre can OCR in both languages?
The best solution here is for your sponsor to correct their MARC records to indicate the second language. It is true, as stated in the bug report, that multiple language codes are allowed in MARC, and our code will extract them from any of the 3-4 standard places where they might be found in the MARC record, but both of the test books referred to in the bug report have MARC records that specify only English.
If those records can't be corrected for whatever reason, and the second language code is to be added manually to meta.xml, it has to be added as a second <language> element, not appended to the first element. In other words, "engfre" is correct in the MARC, but would be converted to:
<language> eng</language> fre</language>
<language>
in meta.xml. Currently the metaform page in the biblio tool offers no way to specify additional elements, so it's not possible to manually add languages there; perhaps the biblio tool could be modified to allow adding elements, like the Metadata Editor in the Item Manager, and the QA page in the metamgr, now do.
It looks as though this experimentation has been going on for a month or so. Did anyone consider just emailing me during that time?