BeautifulSoap Core Dump when reading

Bug #1026998 reported by Dame
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Beautiful Soup
New
Undecided
Unassigned

Bug Description

getSitecontent = BeautifulSoup("""<html>
      <head>
      <title></title>
      </head>
      <body>
      <a name=heading84>
      <!doctype=book>
      </body>
      </html>""")

I have tried this on python 2.6.6 and i a get core dump.

Revision history for this message
Leonard Richardson (leonardr) wrote :

This is a bug in lxml. You can work around it by parsing the markup using html.parser or html5lib parser instead of lxml.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.