Books not importing correctly or opening to wrong book

Bug #689809 reported by Xephyr Inkpen
This bug affects 1 person
Affects: Open Library
Status: New
Importance: Undecided
Assigned to: Unassigned
Milestone: none listed

Bug Description

These may be two separate issues, but I am reporting them together in case they are related (see possible theory below).

First, if you click 'read online' from this record, http://www.archive.org/details/gazetadebuenosay12131810ram, and watch the title, it changes from Gazeta de Buenosay to some German title. The correct book (the Gazeta) is displayed, but clicking on the title goes to a totally wrong book. There is an entire collection of 117 books, and they all seem to have the same problem. This is a serial publication, and all the issues have the same title, author and MARC information. We differentiated them by putting the dates in the IA id as mmddyyyy.
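To make the id convention concrete, here is a minimal sketch of how the issue date could be read back out of an IA id. It assumes the date is the first run of eight digits in the id, in mmddyyyy order as described above; the function name is made up for illustration and is not part of any IA or Open Library code.

```python
import re
from datetime import date


def issue_date_from_identifier(identifier):
    """Return the issue date embedded in an IA id, or None if there is none.

    Assumption: the date is the first run of eight digits, laid out as mmddyyyy.
    """
    match = re.search(r"(\d{2})(\d{2})(\d{4})", identifier)
    if match is None:
        return None
    month, day, year = (int(part) for part in match.groups())
    try:
        return date(year, month, day)
    except ValueError:
        # Eight digits that do not form a valid calendar date.
        return None


print(issue_date_from_identifier("gazetadebuenosay12131810ram"))
print(issue_date_from_identifier("vidadejjdessalin00dubr"))
```

For the Gazeta id above this gives 13 December 1810; ids without an eight-digit run, such as vidadejjdessalin00dubr, give None.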

Second, there are two editions of a book with slight variations in the title. They both open with 'read online' just fine. For the first, http://www.archive.org/details/vidadejjdessalin00dubr, clicking on the title in the book reader opens an entirely different version of the book, which looks like a Google version. As far as I can tell, the book vidadejjdessalin00dubr has no page on Open Library. The second book, http://www.archive.org/details/vidadejjdessalin01dubr, opens ok, reads ok, and clicking on the title goes to the right page. I did notice that one has a : and the other has a space between the J.J. if you look at the two titles in the new reader.

I also noticed that the titles are cropped slightly, which led me to wonder about the Open Library xsl filter. It seems to be picking up more of the MARC record than the usual IA filter does. For example, the reference numbers for Sabin are included under 'Edition Notes' at http://openlibrary.org/books/OL24349968M/Vida_de_J.J._Dessalines, whereas they are deleted in the Internet Archive version, http://www.archive.org/details/vidadejjdessalin01dubr . This is subfield |c of the MARC record, which is usually deleted, along with the rest of the title, according to LOC standards. Though it is displayed as a subtitle in Open Library, I have been working on changing the xsl filter for the library here at David Rumsey & the Library's Request (see IA bug 676670).

Possible theory: until the xsl filter could be put in place, we decided I should copy/paste the rest of the title into the metadata of each and every record that needed it as I entered the picklists. Consequently there are a lot of titles here with hand-edited metadata. I wonder if the effect of these longer-than-usual titles may be messing up the import of the books into Open Library? I think the Gazeta problem may just be a serial issue, with too many books under the same metadata, but I'm not sure about the others.

paul.n (paul-n) wrote :

Just an update: I described the first part of this bug to OL (and linked them to this report)

Sent the email today, so we'll wait for a reply. Sorry about the delay!

Paul

Edward Betts (edwardbetts) wrote :

This book has many source_records: http://openlibrary.org/books/OL24377285M

Might be a problem with the edition merge algorithm.

We need to add support for multi-volume works, so that the source records are displayed on Open Library.
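A rough sketch of why identical serial metadata could trip up a merge step. This is not Open Library's actual merge algorithm; the key functions, the record layout, the second id and the author value are all hypothetical, chosen only to show how issues that share a title and author collapse into one group unless something issue-specific (here, the IA id) is part of the key.

```python
from collections import defaultdict


def group_by_key(records, key_func):
    """Group record ids by whatever key_func returns for each record."""
    groups = defaultdict(list)
    for record in records:
        groups[key_func(record)].append(record["ia_id"])
    return dict(groups)


def title_author_key(record):
    # Hypothetical match key: title and author only.
    return (record["title"].lower(), record["author"].lower())


def title_author_id_key(record):
    # Same key, extended with the IA id so each serial issue stays separate.
    return title_author_key(record) + (record["ia_id"],)


# The first id is from this report; the second id and the author are placeholders.
records = [
    {"ia_id": "gazetadebuenosay12131810ram",
     "title": "Gazeta de Buenosay", "author": "placeholder author"},
    {"ia_id": "hypotheticalissue01011811xyz",
     "title": "Gazeta de Buenosay", "author": "placeholder author"},
]

print(len(group_by_key(records, title_author_key)))     # both issues in one group
print(len(group_by_key(records, title_author_id_key)))  # one group per issue
```

With the title/author key both issues land in a single group, which would be consistent with one edition accumulating many source_records; with the id in the key they stay apart.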
