Verified with libxml2 2.7.8:
>>> print(et.tostring(et.fromstring(h, et.HTMLParser()))) <html><body><p>helloworld1<br/><br/>\n- <em>helloworld1</em></p></body></html>
Found to be fixed in libxml2 2.9.0:
>>> print(et.tostring(et.fromstring(h, et.HTMLParser()))) <html><body><p>helloworld1<br/>\n<br/>\n- <em>helloworld1</em>\n</p></body></html>
Verified with libxml2 2.7.8:
>>> print(et. tostring( et.fromstring( h, et.HTMLParser()))) body><p> helloworld1< br/><br/ >\n- <em>helloworld1 </em></ p></body> </html>
<html><
Found to be fixed in libxml2 2.9.0:
>>> print(et. tostring( et.fromstring( h, et.HTMLParser()))) body><p> helloworld1< br/>\n< br/>\n- <em>helloworld1 </em>\n< /p></body> </html>
<html><