QuestionableContent Archives bug

Bug #535438 reported by Curtis
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Dosage
Fix Committed
Undecided
Unassigned

Bug Description

The scraper doesn't see the image link within archive pages when trying to do a catch-up. (This may be the same as the 'current link' scraper bug, but I wasn't sure.) Thanks for looking, and for the great app.

$ mainline -v -v -v -v -c -c QuestionableContent
18:25:36 QuestionableContent> Catching up...
18:25:36 QuestionableContent> Matching regex(es) ['<base\\s+href="([^"]*)"\\s+/?>', '<img src="((?:http://www\\.questionablecontent\\.net|\\.{1,2})/comics/\\d+\\.png)"', '<a href="(view.php\\?comic=\\d+)">Previous'] multiple times against http://www.questionablecontent.net/...
18:25:38 QuestionableContent> ...found [[], [], ['view.php?comic=1616', 'view.php?comic=1616']]
18:25:38 QuestionableContent> Matching regex(es) ['<base\\s+href="([^"]*)"\\s+/?>', '<img src="((?:http://www\\.questionablecontent\\.net|\\.{1,2})/comics/\\d+\\.png)"', '<a href="(view.php\\?comic=\\d+)">Previous'] multiple times against http://www.questionablecontent.net/view.php?comic=1616...
18:25:38 QuestionableContent> ...found [[], [], ['view.php?comic=1615', 'view.php?comic=1615']]
18:25:38 QuestionableContent> Matching regex(es) ['<base\\s+href="([^"]*)"\\s+/?>', '<img src="((?:http://www\\.questionablecontent\\.net|\\.{1,2})/comics/\\d+\\.png)"', '<a href="(view.php\\?comic=\\d+)">Previous'] multiple times against http://www.questionablecontent.net/view.php?comic=1615...
18:25:38 QuestionableContent> ...found [[], [], ['view.php?comic=1614', 'view.php?comic=1614']]

<snip>

Revision history for this message
Stuart I. Lumgair (stuart-lumgair) wrote :

I just ran mainline -vcc QuestionableContent locally, crawled it back through over 100 comics (1676 - 1576) ... all showing
' Skipping existing file "Comics/QuestionableContent/1###.png" ' . Did a catchup-scrape into a fresh directory, got a dozen or so before I jumped out. You may want to check the QuestionableContent data being used in my branch, or the newest main branch, if you are using the original plugins which came with 1.6.0

Revision history for this message
Stuart I. Lumgair (stuart-lumgair) wrote :

This problem was addressed with revision 591 in the trunk branch.

Changed in dosage:
status: New → Fix Committed
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.