Can't parse <area> tag with contents using any parser
Bug #1928742 reported by
Mikhail Yudin
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Beautiful Soup | Invalid | Undecided | Unassigned |
Bug Description
<area> tag closed early:
Python 3.9.4 (default, Apr 20 2021, 15:51:38)
[GCC 10.2.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from bs4 import BeautifulSoup
>>> soup = BeautifulSoup(
>>> soup.prettify()
'<html>\n <body>\n <area/>\n <test>\n 123\n </test>\n </body>\n</html>'
>>>
Same with html.parser and html5lib.
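For context: the HTML specification defines <area> as a void element, one that cannot have content, so HTML parsers treat it as closed immediately and place any following tags beside it. If the input is actually XML rather than HTML, an XML parser should keep the nesting. Below is a minimal sketch of that, using the standard library's xml.etree.ElementTree instead of Beautiful Soup. The markup string is an assumption: the original call is truncated in the report, so this input was chosen only to match the prettify() output shown above.

```python
import xml.etree.ElementTree as ET

# Hypothetical input: the original BeautifulSoup(...) call is truncated in the
# report, so this markup is an assumption that matches the output shown there.
markup = "<area><test>123</test></area>"

# An XML parser has no notion of HTML void elements, so <area> keeps its child.
root = ET.fromstring(markup)
child = root.find("test")

print(root.tag)    # area
print(child.tag)   # test
print(child.text)  # 123
```

With Beautiful Soup itself, the analogous choice would be the "xml" feature, i.e. BeautifulSoup(markup, "xml"), which requires lxml to be installed.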
Additional information:
$ pip install beautifulsoup4 --upgrade
Defaulting to user installation because normal site-packages is not writeable
Requirement already satisfied: beautifulsoup4 in /usr/lib/python3.9/site-packages (4.9.3)
Requirement already satisfied: soupsieve>1.2 in /usr/lib/python3.9/site-packages (from beautifulsoup4) (2.2.1)