lxml.etree.fromstring fails silently when the input includes U+1F4C2 'OPEN FILE FOLDER'
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
lxml |
New
|
Undecided
|
Unassigned |
Bug Description
Version information: Python 3.11 on macOS 13.1.
Python : sys.version_
lxml.etree : (4, 9, 2, 0)
libxml used : (2, 9, 13)
libxml compiled : (2, 9, 13)
libxslt used : (1, 1, 35)
libxslt compiled : (1, 1, 35)
In the following interaction in IPython, lxml.etree.
In [3]: import lxml.html
In [4]: import lxml.etree
In [5]: parser = lxml.html.
In [6]: lxml.etree.
Out[6]: <Element html at 0x1026c29e0>
In [7]: lxml.etree.
In [8]: lxml.etree.
Out[8]: <Element html at 0x1029e6c10>
In [9]: lxml.etree.
Out[9]: <Element html at 0x102aa6c10>
I am attaching a test script that runs essentially that code (except the failing line is last). For me, the script outputs
Traceback (most recent call last):
File "/private/
assert lxml.etree.
AssertionError
Probably due to the missing Py3.11 wheel (I guess you built your local installation yourself).