-c LFG crawls to first comic but retrieves no images

Bug #555282 reported by Stuart I. Lumgair
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Dosage
In Progress
Undecided
Stuart I. Lumgair

Bug Description

-c LookingForGroup is crawling back to first comic successfully, but retrieves no images. I will post a repair on my branch as soon as possible.

fix can be found at http://bazaar.launchpad.net/~stuart-lumgair/dosage/stuarts_plugins/revision/620

Related branches

Changed in dosage:
assignee: nobody → Stuart I. Lumgair (stuart-lumgair)
Changed in dosage:
status: New → In Progress
description: updated
Revision history for this message
Mr. Happy (elgerb) wrote :

tried the fix, but it doesn't work (anymore)
can you please fix it??

Revision history for this message
Stuart I. Lumgair (stuart-lumgair) wrote : Re: [Bug 555282] Re: -c LFG crawls to first comic but retrieves no images

On Wed, 08 Feb 2012 11:40:09 -0000
"Mr. Happy" <email address hidden> wrote:

> tried the fix, but it doesn't work (anymore)
> can you please fix it??

 looking at the page source, the images now have names like 91085a5287a84db324c0c03702aabb2b185e628d.gif ... I don't really want to use that as a file name. I'll keep looking at it, and hopefully figure out a way to deal with it.

Revision history for this message
Stuart I. Lumgair (stuart-lumgair) wrote :

ok, this works:

class LookingForGroup(BasicScraper):
    latestUrl = 'http://www.lfgcomic.com/'
    imageUrl = 'http://www.lfgcomic.com/page/%s'
    imageSearch = compile(r'(http://newcdn.lfgcomic.com/uploads/comics/.+?)"')
    prevSearch = compile(r'href="/page/(.+?)" id="nav-prev')
    starter = indirectStarter('http://www.lfgcomic.com/',
                              compile(r'href="/(page/.+?)" id="feature-preview"'))
    def namer(cls, imageUrl, pageUrl):
        return pageUrl.split('/')[-1].split('.')[0]

Revision history for this message
Mr. Happy (elgerb) wrote :

Thanks!
Had to add ', indirectStarter' to 'from dosage.helpers import BasicScraper', but now it works!

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.