Binary package “libextractor-plugin-html” in ubuntu noble

extracts meta-data from files of arbitrary type (html plugin)

 GNU libextractor provides developers of file-sharing networks, file managers,
 and WWW-indexing bots with a universal library to obtain meta-data about files.
 .
 This package contains the plugin supporting html files.