remote z39.50 search returns no results for terms with diacritics
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Evergreen |
Confirmed
|
Undecided
|
Unassigned |
Bug Description
Evergreen version: Master as of 2014-07-21
OpenSRF version: Master as of 2014-07-14 (2.3.0)
Postgres version: 9.1.[something]
When searching via z39.50, I get no remote results for terms containing diacritics. For example, Slavoj Žižek return 0 results if I only search remote sources, but returns 4 from the local catalog. The same is true if I search his name as Slavoj Zizek.
In the attached screenshots, I have limited the searches to remotes that are also running Evergreen, but I get identical results if I choose Library of Congress or biblios.net.
I have also tested against the other Evergreen catalogs and my own using SRU. If I search for the authors last name, without or without diacritics, I get results that way.
I assume this has something to do with Z39.50 going through YAZ. It would seem to me that the characters are being double encoded, converted to an ISO8859-1 string, or similar. Note the way the author's name appears in the screen shot for the remote search by title.
Changed in evergreen: | |
status: | New → Confirmed |
I did this search because I knew it would return results from the NOBLE catalog.
Note how the author's name appears in the results list. This suggests some kind of encoding problem as the data passes through Evergreen, yaz, and back.