Seshat Wiki Scraper - Parsing issue due to bullet lists
Bug #1541424 reported by
Odhran Gavin
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
dacura |
New
|
Undecided
|
Unassigned |
Bug Description
The scraper fails to parse a page when the last item on the page is a bullet list or indented and the page contains references.
Steps to duplicate:
1. Ensure that the last item on the page in question is a bullet list (preceded by an asterisk)
2. Ensure that the page contains a reference (between <ref></ref> tags)
3. Attempt to scrape the page using the Dacura Seshat Scraping Tool
Note: if the page has been recently scraped, caching will prevent these changes from appearing.
tags: | added: scraper |
To post a comment you must log in.