Internet Archive BookServer

incorrect self link to bookserver.archive.org/catalog

Bug #548454 reported by mangtronix on 2010-03-25

This bug affects 1 person

Affects                      Status        Importance  Assigned to  Milestone
Internet Archive BookServer  Fix Released  High        mangtronix

Bug Description

User report:

Just noticed while testing something that the self link in http://bookserver.archive.org/catalog/downloads.xml will 404, because it needs a trailing / to work:

http://bookserver.archive.org/catalog -> fails
http://bookserver.archive.org/catalog/ -> works

The @rel=self is correct in the main catalog.
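
One way to guard against this class of bug is to normalize collection-style hrefs before they are written into the feed. The following is a minimal sketch in Python 3, not the fix that was actually deployed. The helper name ensure_trailing_slash and the rule that a last path segment containing a dot counts as a file are assumptions made for illustration.

```python
from urllib.parse import urlsplit, urlunsplit


def ensure_trailing_slash(url):
    """Return url with a trailing slash on its path, leaving file-like paths alone.

    Hypothetical helper: a last path segment containing a dot
    (for example downloads.xml) is assumed to be a file and is not changed.
    """
    parts = urlsplit(url)
    path = parts.path
    last_segment = path.rsplit("/", 1)[-1]
    if not path.endswith("/") and "." not in last_segment:
        path += "/"
    return urlunsplit((parts.scheme, parts.netloc, path, parts.query, parts.fragment))
```

Applied to the two URLs in the report, the failing form http://bookserver.archive.org/catalog is rewritten to the working form with the trailing slash, while http://bookserver.archive.org/catalog/downloads.xml is left untouched.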

mangtronix (mang) on 2010-03-25

Changed in bookserver:
assignee:	nobody → mangtronix (mang)
importance:	Undecided → High
status:	New → Confirmed

mangtronix (mang) wrote on 2010-03-25:

Not really related, but the search engine timed out the first time I tried to access /downloads.xml:

Traceback (most recent call last):
  File "/petabox/sw/lib/python/web/application.py", line 242, in process
    return self.handle()
  File "/petabox/sw/lib/python/web/application.py", line 233, in handle
    return self._delegate(fn, self.fvars, args)
  File "/petabox/sw/lib/python/web/application.py", line 412, in _delegate
    return handle_class(cls)
  File "/petabox/sw/lib/python/web/application.py", line 387, in handle_class
    return tocall(*args)
  File "/home/mang/bookserver/opds.py", line 249, in GET
    ingestor = catalog.ingest.IASolrToCatalog(pubInfo, solrUrl, urn, titleFragment=titleFragment)
  File "/home/mang/bookserver/bookserver/catalog/ingest/SolrToCatalog.py", line 150, in __init__
    f = urllib.urlopen(self.url)
  File "urllib.py", line 82, in urlopen
    return opener.open(url)
  File "urllib.py", line 190, in open
    return getattr(self, name)(url)
  File "urllib.py", line 328, in open_http
    errcode, errmsg, headers = h.getreply()
  File "httplib.py", line 1195, in getreply
    response = self._conn.getresponse()
  File "httplib.py", line 924, in getresponse
    response.begin()
  File "httplib.py", line 385, in begin
    version, status, reason = self._read_status()
  File "httplib.py", line 343, in _read_status
    line = self.fp.readline()
  File "socket.py", line 331, in readline
    data = recv(1)
IOError: [Errno socket error] timed out
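
The traceback shows the Solr request timing out inside urllib.urlopen and the error propagating up to the GET handler. If such timeouts are transient, one option is to retry the fetch a bounded number of times. The following is a minimal Python 3 sketch under that assumption, not code from bookserver. The name fetch_with_retry and its parameters are hypothetical, and the original code ran on Python 2, where the failure surfaced as IOError rather than socket.timeout.

```python
import socket
import time


def fetch_with_retry(fetch, attempts=2, delay=0.0):
    """Call fetch() up to `attempts` times, retrying only on timeout errors.

    Hypothetical helper: `fetch` is any zero-argument callable that performs
    the request and returns its result.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    last_error = None
    for attempt in range(attempts):
        try:
            return fetch()
        except (socket.timeout, TimeoutError) as exc:
            last_error = exc
            # Pause before the next try, but not after the final one.
            if attempt < attempts - 1:
                time.sleep(delay)
    # Every attempt timed out: re-raise the last timeout seen.
    raise last_error
```

In Python 3 the callable could wrap something like urllib.request.urlopen(url, timeout=10).read(). Depending on where the timeout occurs, urllib may raise urllib.error.URLError instead, so the except clause would need adjusting for real use.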

I added a dump of the URL. When I tried accessing it a second time, it was successful.

$ ./opds.py 9090
http://0.0.0.0:9090/
url - http://se.us.archive.org:8983/solr/select?fl=identifier,title,creator,publicdate,date,contributor,publisher,subject,language,format,month&wt=json&q=mediatype%3Atexts+AND+(format%3A(LuraTech+PDF)+OR+scanner:google)&sort=month+desc&rows=50
71.202.38.39:61501 - - [25/Mar/2010 23:48:28] "HTTP/1.1 GET /downloads.xml" - 200 OK
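
For reference, a query URL of the shape logged above could be assembled from its parts roughly as follows. This is a Python 3 sketch for illustration only, not the code in SolrToCatalog.py. The function name build_solr_url and its parameters are assumptions, and the sketch leaves every colon unescaped where the logged URL escapes some as %3A; both forms decode to the same query.

```python
from urllib.parse import urlencode

# Endpoint copied from the logged URL above.
SOLR_SELECT = "http://se.us.archive.org:8983/solr/select"


def build_solr_url(query, fields, sort, rows, base=SOLR_SELECT):
    """Build a Solr select URL (hypothetical helper, parameters are assumptions)."""
    params = [
        ("fl", ",".join(fields)),  # fields to return
        ("wt", "json"),            # response format
        ("q", query),
        ("sort", sort),
        ("rows", str(rows)),
    ]
    # Keep commas, parentheses and colons literal; spaces are encoded as '+'.
    return base + "?" + urlencode(params, safe=",():")
```

Passing the field list, query, sort order, and row count visible in the log reproduces the same request apart from the colon escaping.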

mangtronix (mang) wrote on 2010-03-26:

Fix to link is live on the site.

Changed in bookserver:
status:	Confirmed → Fix Released
