sanitizer mostly broken when used with html5lib
Bug #292401 reported by
Håkan W
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Python HTML Sanitizer |
Confirmed
|
High
|
dan mackinlay |
Bug Description
Almost all doctests are failing when used with html5lib. See attached fail log for detailed info
The errors seem to be mostly these two:
1) Adding unecessary <p> elements
Expected:
u'<p>A B C</p>'
Got:
u'<p>A </p>B C<p></p>'
2) Outputting complete documents instead of the html fragment
Expected:
u'<p>A </p><p>B C</p><p>D</p>'
Got:
u'<
To post a comment you must log in.
*blush* - what a disaster. I don't know how I committed a version that broke 14 doctests. I can only suspect I pressed undo on that rather critical change without noticing before commit. Then I went on my annual holiday. Much shame on my part.
the fixed version (which still fails 4 doctests due to different handling of the body tag) is now pushed into the repository, but I'm leaving this one open until those are also resolved.