marc death dates

Bug #237099 reported by Aaron Swartz
2
Affects Status Importance Assigned to Milestone
Open Library
Confirmed
Wishlist
Edward Betts

Bug Description

LC/OCLC are putting out after-the-fact author death dates which would be nice to add:

http://www.oclc.org/rss/feeds/authorityrecords/default.htm

Revision history for this message
Aaron Swartz (aaronsw) wrote :

[from the mailing list]

I have a parsing script/program (in Perl) for the Death Dates lists [1].

It asks for a directory containing the Death Date lists in html
format, cleans them into tab-delimited text files for each plus one
compling all names into a single file, contained in a /cleaned/
directory under the initially-specified directory.

Then, after installing Global_Replace.pm [2] (tar.gz up is in the
/inprocess/ directory, but with an older version of
parsedeathdateslists.pl than that found above), the all.txt file
created by the parser may be used with globalreplaceidentnames.pl.
This asks for an input MARC file, output text file, and the all.txt
file created by the parsedeathdateslists.pl program. As written, it
uses the 001 field for reporting the control number, the old name(s),
and the new name(s). I've found it useful for my own purposes, but
have been working with a fairly clean set of MARC records. It may
have difficulty/issues with diacritics (which are stripped).

The other .pl in the /bin/ file is global_replace_ident.pl, which
takes the results of LCSHchangesparser.pl (current version 1.11,
version integrated into name), which takes a directory of LCSH Weekly
Lists (saved in text format) and parses them into tab-delimited text
files (including all.txt (compilation of all changed headings that
could be parsed) and allhash.txt (same, but sorted alphabetically) of
a format old_tag \t old_heading \t new_tag \t new_heading \t
thesaurus), plus bad.txt (unparsed/problematic headings).
global_replace_ident.pl then takes the allhash.txt or all.txt and a
file of MARC records and reports to an export text file the control
number and any changed headings that may need to be revised (it
doesn't work 100% of the time).

#######

[1] <http://home.inwave.com/eija/inprocess/parsedeathdateslists.pl.txt>

[2] <http://home.inwave.com/eija/inprocess/MARC-Global_Replace0.06/>

[3]
<http://home.inwave.com/eija/inprocess/MARC-Global_Replace0.06/bin/globalreplaceidentnames.pl.txt>

[4] <http://home.inwave.com/eija/inprocess/LCSHchangesparserpl111.txt>

I hope this helps,

Bryan Baldus
<email address hidden>
<email address hidden>
http://home.inwave.com/eija

Changed in openlibrary:
assignee: nobody → edward-debian
importance: Undecided → Wishlist
status: New → Confirmed
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.