HTMLParser html entities preprocessing is undocumented

Bug #1592628 reported by Kees Cook
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
python3.5 (Ubuntu)
New
Undecided
Unassigned

Bug Description

The attached script does not emit HTML entities on Xenial, but works correctly on Trusty. I don't see any difference in the html2lib package, so I assume something must have changed in Python 3.

Trusty (correct):

$ ./test.py
Start tag: body
Named ent: uuml
End tag : body

Xenial (incorrect):

$ ./test.py
Start tag: body
Data : ü
End tag : body

Revision history for this message
Kees Cook (kees) wrote :
Revision history for this message
Kees Cook (kees) wrote :

Nevermind, it's not html5lib, it's libpython3.5-stdlib, and the issue is the new undocumented "convert_charrefs" argument to _init__.

affects: html5lib (Ubuntu) → python3.5 (Ubuntu)
summary: - html entities are incorrectly preprocessed
+ HTMLParser html entities preprocessing is undocumented
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Bug attachments

Remote bug watches

Bug watches keep track of this bug in other bug trackers.