program mangles output when input contains Unicode characters

Bug #1930643 reported by catharsis
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
txt2html (Ubuntu)
New
Undecided
Unassigned

Bug Description

Example using non-ASCII apostrophe:

-----
$ echo 'This won’t work well' | txt2html
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
"http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<title></title>
<meta name="generator" content="HTML::TextToHTML v2.53"/>
</head>
<body>
<p>This won&acirc;<sup>TM</sup>t work well</p>

</body>
</html>
------

Which displays in web browser as "This wonâ�TMt work well"

ProblemType: Bug
DistroRelease: Ubuntu 20.04
Package: txt2html 1:2.53-2
ProcVersionSignature: Microsoft 4.4.0-18362.1049-Microsoft 4.4.35
Uname: Linux 4.4.0-18362-Microsoft x86_64
ApportVersion: 2.20.11-0ubuntu27.17
Architecture: amd64
CasperMD5CheckResult: skip
Date: Wed Jun 2 19:32:47 2021
PackageArchitecture: all
ProcEnviron:
 SHELL=/bin/bash
 LANG=C.UTF-8
 TERM=xterm-256color
 PATH=(custom, user)
SourcePackage: txt2html
UpgradeStatus: Upgraded to focal on 2021-04-17 (46 days ago)

Revision history for this message
catharsis (catharsis71) wrote :
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.