incorrect self link to bookserver.archive.org/catalog

Bug #548454 reported by mangtronix
This bug affects 1 person
Affects: Internet Archive BookServer
Status: Fix Released
Importance: High
Assigned to: mangtronix

Bug Description

User report:

Just noticed while testing something that this link:

 <link rel="self" type="application/atom+xml"
href="http://bookserver.archive.org/catalog"/>

in http://bookserver.archive.org/catalog/downloads.xml

will 404, because it needs to have a trailing / to work:

http://bookserver.archive.org/catalog -> fails
http://bookserver.archive.org/catalog/ -> works

The @rel=self is correct in the main catalog.
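
The check described in this report can be sketched in Python. This is a minimal illustration, not BookServer code: the function name find_self_href and the trimmed sample feed are hypothetical, and the only facts taken from the report are the link element itself and the URL that fails without a trailing slash.

```python
import xml.etree.ElementTree as ET

ATOM_NS = "{http://www.w3.org/2005/Atom}"


def find_self_href(feed_xml):
    """Return the href of the feed-level rel="self" link, or None if absent."""
    root = ET.fromstring(feed_xml)
    for link in root.findall(ATOM_NS + "link"):
        if link.get("rel") == "self":
            return link.get("href")
    return None


# Hypothetical, trimmed-down feed containing the link quoted in the report.
SAMPLE_FEED = """<feed xmlns="http://www.w3.org/2005/Atom">
  <title>Sample</title>
  <link rel="self" type="application/atom+xml"
        href="http://bookserver.archive.org/catalog"/>
</feed>"""

href = find_self_href(SAMPLE_FEED)
# Per the report, the catalog root only resolves with a trailing slash.
needs_fix = href == "http://bookserver.archive.org/catalog"
print(href, "needs trailing slash" if needs_fix else "ok")
```

Running the same function over each published feed would show which ones still carry the slash-less link.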

mangtronix (mang)
Changed in bookserver:
assignee: nobody → mangtronix (mang)
importance: Undecided → High
status: New → Confirmed
mangtronix (mang) wrote :

Not really related, but the search engine timed out the first time I tried to access /downloads.xml:

Traceback (most recent call last):
  File "/petabox/sw/lib/python/web/application.py", line 242, in process
    return self.handle()
  File "/petabox/sw/lib/python/web/application.py", line 233, in handle
    return self._delegate(fn, self.fvars, args)
  File "/petabox/sw/lib/python/web/application.py", line 412, in _delegate
    return handle_class(cls)
  File "/petabox/sw/lib/python/web/application.py", line 387, in handle_class
    return tocall(*args)
  File "/home/mang/bookserver/opds.py", line 249, in GET
    ingestor = catalog.ingest.IASolrToCatalog(pubInfo, solrUrl, urn, titleFragment=titleFragment)
  File "/home/mang/bookserver/bookserver/catalog/ingest/SolrToCatalog.py", line 150, in __init__
    f = urllib.urlopen(self.url)
  File "urllib.py", line 82, in urlopen
    return opener.open(url)
  File "urllib.py", line 190, in open
    return getattr(self, name)(url)
  File "urllib.py", line 328, in open_http
    errcode, errmsg, headers = h.getreply()
  File "httplib.py", line 1195, in getreply
    response = self._conn.getresponse()
  File "httplib.py", line 924, in getresponse
    response.begin()
  File "httplib.py", line 385, in begin
    version, status, reason = self._read_status()
  File "httplib.py", line 343, in _read_status
    line = self.fp.readline()
  File "socket.py", line 331, in readline
    data = recv(1)
IOError: [Errno socket error] timed out

I added a dump of the URL. The second time I tried to access it, the request succeeded.

$ ./opds.py 9090
http://0.0.0.0:9090/
url - http://se.us.archive.org:8983/solr/select?fl=identifier,title,creator,publicdate,date,contributor,publisher,subject,language,format,month&wt=json&q=mediatype%3Atexts+AND+(format%3A(LuraTech+PDF)+OR+scanner:google)&sort=month+desc&rows=50
71.202.38.39:61501 - - [25/Mar/2010 23:48:28] "HTTP/1.1 GET /downloads.xml" - 200 OK
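
Since the request succeeded on the second try, one possible mitigation is to retry the Solr fetch when it times out. The following is a minimal sketch under stated assumptions, not what SolrToCatalog.py does: fetch_with_retry and its parameters are hypothetical names, and it uses Python 3's urllib.request rather than the Python 2 urllib.urlopen seen in the traceback.

```python
import time
import urllib.error
import urllib.request


def fetch_with_retry(url, attempts=3, timeout=10.0, delay=1.0,
                     opener=urllib.request.urlopen):
    """Fetch url and return the body, retrying on timeouts and connection errors."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    last_error = None
    for attempt in range(attempts):
        try:
            with opener(url, timeout=timeout) as response:
                return response.read()
        except urllib.error.HTTPError:
            # The server answered with an error status (e.g. 404);
            # retrying will not help, so pass it on unchanged.
            raise
        except OSError as exc:
            # Covers socket timeouts and urllib.error.URLError.
            last_error = exc
            if attempt < attempts - 1:
                time.sleep(delay)
    raise last_error


# Usage (performs a real network request, so it is left commented out):
# body = fetch_with_retry("http://bookserver.archive.org/catalog/")
```

The opener parameter exists only so the retry logic can be exercised without a network connection; HTTP error responses are deliberately not retried, since a 404 like the one in this bug would fail the same way every time.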

mangtronix (mang) wrote :

The fix to the link is live on the site.

Changed in bookserver:
status: Confirmed → Fix Released