Binary package “libhtml-html5-parser-perl” in ubuntu jammy

parse HTML reliably

 HTML::HTML5::Parser is an HTML parser,
 similar to the non-CPAN module Whatpm::HTML
 with some changes including:
 .
  * Provides an XML::LibXML-like DOM interface.
    If you usually use XML::LibXML's DOM parser,
    this should be a drop-in solution for tag soup HTML.
 .
  * Constructs an XML::LibXML::Document as the result of parsing.
 .
  * Via bundling and modifications,
    removed external dependencies on non-CPAN packages.