BeautifulSoup Core Dump when reading
Bug #1026998 reported by Dame
This bug report is a duplicate of:
Bug #984936: Segfault when target object defines doctype() and document contains invalid doctype.
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Beautiful Soup | New | Undecided | Unassigned |
Bug Description
getSitecontent = BeautifulSoup("""
<head>
<
</head>
<body>
<a name=heading84>
<
</body>
</html>""")
I have tried this on Python 2.6.6 and I get a core dump.
This is a bug in lxml. You can work around it by parsing the markup with the html.parser or html5lib parser instead of lxml.
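
A minimal sketch of that workaround, assuming the bs4 package is installed. The markup here is a hypothetical stand-in, since the original document from the report did not survive intact. Passing the parser name as the second argument should keep Beautiful Soup from selecting lxml on its own:

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for the page from the report;
# the original content was not preserved.
markup = """<html>
<head>
<title>Example</title>
</head>
<body>
<a name="heading84">Heading</a>
</body>
</html>"""

# Name the parser explicitly so lxml is not chosen automatically.
# "html.parser" ships with Python; "html5lib" would need a separate install.
soup = BeautifulSoup(markup, "html.parser")

# "name" is also the first parameter of find(), so the HTML attribute
# called name has to be passed through attrs.
anchor = soup.find("a", attrs={"name": "heading84"})
print(anchor.get_text())
```

Swapping "html.parser" for "html5lib" gives the other workaround mentioned above, provided html5lib is installed.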