w3mman cannot handle some characters

Bug #283975 reported by Dustin Kirkland 
6
Affects Status Importance Assigned to Milestone
w3m (Ubuntu)
Fix Released
Medium
Unassigned

Bug Description

Binary package hint: w3m

w3mman cannot handle some characters, such as the German "Ü".

To reproduce:

 $ wget http://manpages.ubuntu.com/manpages.gz/intrepid/de/man1/chmod.1.gz
 $ w3mman -l ./chmod.1.gz

Notice that it screws up the following roff text:
.SH "ÜBERSICHT"

The Ü is replaced with whitespace.

This is visible in the Ubuntu Manpage Repository:
 * http://manpages.ubuntu.com/manpages.gz/intrepid/de/man1/chmod.1.gz
Notice the indented, and misspelled word, "BERSICHT".

:-Dustin

Related branches

Revision history for this message
Dustin Kirkland  (kirkland) wrote :

Was reported by henux, I'm filing and confirming.

:-Dustin

Changed in w3m:
status: New → Confirmed
Changed in w3m:
importance: Undecided → Medium
Revision history for this message
Colin Watson (cjwatson) wrote :

This is very likely to be simply a consequence of bug 320842.

Revision history for this message
Launchpad Janitor (janitor) wrote :

This bug was fixed in the package w3m - 0.5.2-2ubuntu1

---------------
w3m (0.5.2-2ubuntu1) karmic; urgency=low

  * debian/patches/10-w3mman-keep-formatting: Set MAN_KEEP_FORMATTING=1 to
    instruct man to preserve formatting characters in its output rather than
    filtering them through col (closes: #325699, #426362; LP: #283975,
    #353900).

 -- Colin Watson <email address hidden> Tue, 30 Jun 2009 11:49:15 +0100

Changed in w3m (Ubuntu):
status: Confirmed → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers