Parsing is losing content

Bug #1876833 reported by matt daniels
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
lxml
Invalid
Undecided
Unassigned

Bug Description

As an example: http://www.expatperu.com/expatforums/index.php

Once I run lxml.html.fromstring() on the html, I lose all content below the "skip to content" and Facebook link.

This is just an example - I've been running into this every now and then.

Revision history for this message
scoder (scoder) wrote :

The parser is implemented in libxml2 and not in lxml, so this probably isn't something that lxml can change.

Changed in lxml:
status: New → Invalid
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.