Cleaning html file cleans it wrong
Bug #671636 reported by Ravi
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
lxml | Invalid | Undecided | Unassigned |
Bug Description
I am using the default Cleaner, i.e. the one from lxml.html.
Attached is the tar of the original file and the cleaned version.
The same happens with the rest of the style tags too: the content between the start tag and the end tag is not removed.
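For anyone trying to reproduce this, here is a minimal sketch that runs the same markup through a default Cleaner and through one constructed with `style=True`. The sample HTML string and the variable names are made up for illustration and are not taken from the attached tar. It assumes lxml is installed; newer lxml releases ship the cleaner as the separate `lxml_html_clean` package, hence the fallback import.

```python
# Sketch only: the sample markup below is hypothetical, not the attached file.
try:
    from lxml.html.clean import Cleaner
except ImportError:
    # Newer lxml releases moved the cleaner into its own package.
    from lxml_html_clean import Cleaner

sample_html = (
    "<html><head><style>p {color: red}</style></head>"
    "<body><p>Hello</p></body></html>"
)

# Default settings, as described in the report.
default_out = Cleaner().clean_html(sample_html)

# Explicitly ask the cleaner to remove <style> elements and their content.
strict_out = Cleaner(style=True).clean_html(sample_html)

print("default:   ", default_out)
print("style=True:", strict_out)
```

Comparing the two printed results on the real file should show whether the leftover style content comes from the `style` option being off in the Cleaner that was used.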