Crude duplicate checking causes some URLs to be updated several times

Bug #740637 reported by Vallery Lancey
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
OpenSearch
New
Undecided
Unassigned

Bug Description

Some but very few URLs were showing up as duplicates. This was solved by the implementation of URLs updating their entires if entries already exist. However, the issue needs cleaning up.

I suspect the problem is that a url like /about is checked against the previously downloaded example.com/about, and is seen to be different.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.