lxml.html.defs.html5_tags omits various HTML5 tags

Bug #1447450 reported by Darryl Dixon
12
This bug affects 2 people
Affects Status Importance Assigned to Milestone
lxml
Confirmed
Low
Unassigned

Bug Description

As per summary, lxml.html.defs.html5_tags omits several HTML5 elements, these are at least:

main
bdi
dialog
menuitem

Revision history for this message
scoder (scoder) wrote :

I take pull requests.

Changed in lxml:
importance: Undecided → Low
status: New → Confirmed
Revision history for this message
Stephan Sokolow (ssokolow) wrote :

These are also missing:

data
rtc

I may make a pull request if I can ever find the time to set up for it and familiarize myself with the LXML infrastructure. (I found them while I was already building a custom allow_tags for lxml.html.clean.Cleaner, so correcting it in my code is trivial and my time is limited.)

Revision history for this message
Stephan Sokolow (ssokolow) wrote :

Also, I should probably mention that <bdi> is a very important tag since it's basically the HTML's way of sandboxing against unpaired Unicode LTR/RTL text-directionality control codes.

Revision history for this message
scoder (scoder) wrote :

"lxml.html.defs" is a very simple Python module. Not much infrastructure involved.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.