lxml.html.clean.Cleaner crashes on some HTML documents
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
lxml | Confirmed | Undecided | Unassigned |
Bug Description
When I run the clean_reproduce.py script, an exception is raised:
python ~/tmp/clean_
Executing python /home/nikita/
Python : sys.version_
lxml.etree : (4, 4, 0, 0)
libxml used : (2, 9, 9)
libxml compiled : (2, 9, 9)
libxslt used : (1, 1, 33)
libxslt compiled : (1, 1, 33)
Traceback (most recent call last):
File "/home/
main()
File "/home/
cleaner.
File "src/lxml/
File "src/lxml/
File "/home/
assert parent is not None
I agree that it shouldn't run into that assertion. PR welcome.
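Until a fix lands, callers could guard the cleaner call themselves. The following is only a minimal sketch, assuming the failure surfaces as an AssertionError as in the traceback above. `safe_clean` is a hypothetical helper, not part of lxml, and it takes the cleaning function as a parameter so it does not depend on any particular lxml version.

```python
def safe_clean(clean, html, fallback=None):
    """Call clean(html) and return `fallback` if an assertion fails inside it.

    `clean` is any callable that takes an HTML string, for example the
    clean_html method of an lxml.html.clean.Cleaner instance.
    The default fallback is None rather than the original input, because
    handing back uncleaned HTML would defeat the purpose of sanitizing.
    """
    try:
        return clean(html)
    except AssertionError:
        # Assumption: the crash shows up as the "assert parent is not None"
        # failure from the report; other exception types still propagate.
        return fallback
```

With lxml installed, this could be called as `safe_clean(Cleaner().clean_html, html_text)`. Note that assertions are stripped when Python runs with `-O`, so under that flag the failure would likely show up as a different exception and this guard would not catch it.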