Old Marc Data Not Valid - Results in FixMe Error
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Evergreen |
Triaged
|
Wishlist
|
Unassigned |
Bug Description
Evergreen 2.2.2
Postgres 9.1
Debian Squeeze
Brief History:
We have older bib entries (2009-2010 time frame) that have invalid characters within the marc data. As a result, various activities against the marc data cause an Unhandled Error popup FIXME. Since the errors encountered are non-specific in nature, the library ends up creating a ticket, support spends some time researching including reviewing lengthy log files before finally determining there is an invalid character that was allowed to be loaded into the existing marc data causing the error.
Desired Outcome:
A method to identify marc records with invalid data so that something like regexp_replace could be used to clean up the errant data. An example that was recently found was quote marcs around a word copied from MS Word: “English”
tags: | added: marc wishlist |
tags: |
added: cat-marc removed: marc |
tags: | removed: wishlist |
I'm inclined to mark this bug invalid since it is really a data problem and not a software problem. However, I see the utility in a potential MARC scrubbing routine and having that available in the software, so I'll set the status to triaged and the importance to wishlist.