Dosage

QuestionableContent Archives bug

Bug #535438 reported by Curtis on 2010-03-10

This bug affects 1 person

Affects		Status	Importance	Assigned to	Milestone
	Dosage	Fix Committed	Undecided	Unassigned

Bug Description

The scraper doesn't see the image link within archive pages when trying to do a catch-up. (This may be the same as the 'current link' scraper bug, but I wasn't sure.) Thanks for looking, and for the great app.

$ mainline -v -v -v -v -c -c QuestionableContent
18:25:36 QuestionableContent> Catching up...
18:25:36 QuestionableContent> Matching regex(es) ['<base\\s+href="([^"]*)"\\s+/?>', '<img src="((?:http://www\\.questionablecontent\\.net|\\.{1,2})/comics/\\d+\\.png)"', '<a href="(view.php\\?comic=\\d+)">Previous'] multiple times against http://www.questionablecontent.net/...
18:25:38 QuestionableContent> ...found [[], [], ['view.php?comic=1616', 'view.php?comic=1616']]
18:25:38 QuestionableContent> Matching regex(es) ['<base\\s+href="([^"]*)"\\s+/?>', '<img src="((?:http://www\\.questionablecontent\\.net|\\.{1,2})/comics/\\d+\\.png)"', '<a href="(view.php\\?comic=\\d+)">Previous'] multiple times against http://www.questionablecontent.net/view.php?comic=1616...
18:25:38 QuestionableContent> ...found [[], [], ['view.php?comic=1615', 'view.php?comic=1615']]
18:25:38 QuestionableContent> Matching regex(es) ['<base\\s+href="([^"]*)"\\s+/?>', '<img src="((?:http://www\\.questionablecontent\\.net|\\.{1,2})/comics/\\d+\\.png)"', '<a href="(view.php\\?comic=\\d+)">Previous'] multiple times against http://www.questionablecontent.net/view.php?comic=1615...
18:25:38 QuestionableContent> ...found [[], [], ['view.php?comic=1614', 'view.php?comic=1614']]

<snip>

Revision history for this message

Stuart I. Lumgair (stuart-lumgair) wrote on 2010-06-01:

I just ran mainline -vcc QuestionableContent locally, crawled it back through over 100 comics (1676 - 1576) ... all showing
' Skipping existing file "Comics/QuestionableContent/1###.png" ' . Did a catchup-scrape into a fresh directory, got a dozen or so before I jumped out. You may want to check the QuestionableContent data being used in my branch, or the newest main branch, if you are using the original plugins which came with 1.6.0

Revision history for this message

Stuart I. Lumgair (stuart-lumgair) wrote on 2010-09-28:

This problem was addressed with revision 591 in the trunk branch.

Changed in dosage:
status:	New → Fix Committed

Report a bug

This report contains Public information

Everyone can see this information.

You are

Subscribing...

Edit bug mail

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.