Some Japanese manpages are not displayed correctly
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Ubuntu Manpage Repository |
Expired
|
Low
|
Unassigned |
Bug Description
By several confusing problems, some Japanese manpages are not displayed correctly.
ex. http://
* Title has wrong character "‹".
* "引た瑤砲弔い董" should be displayed "引き数について、".
* Topics of "NAME" should be "名前" in Japanese, but it is "前".
* and so on...
This problems are caused by following reasons:
1. Encoding of almost Japanese manpages in /usr/share/man/ja/ is EUC-JP, is not UTF-8.
2. man's behavior depend on "locale". Following two commands output difference result:
* LANG=en_GB.utf8 w3mman -l usr/share/
* LANG=ja_JP.utf8 w3mman -l usr/share/
3. col command which seems to be called in w3mman is not support UTF-8.
ref. http://
4. CGI::escapeHTML set default charset to ISO-8859-1.
To use multibyte character, should set "charset".
ref. http://
To resolve problems, I create patch files for two scripts in lp:ubuntu-manpage-repository .
In main/bin/
* Use man command instead of w3mman. And set device option "-Tutf8".
* If convert manpages for "ja", set language option "-L ja_JP.utf8".
Latter is very "one-time solution". However I couldn't get more smart way.
In main/bin/
* Set charset for CGI::escapeHTML to "UTF-8".
* Handle backspace (\x08) for multibyte character with UTF-8.
For example, "NAME" is "N\x08NA\
In this script, delete \x08 and one character former. (not one "byte" former)
And modified pipe, left/right quote escaping.
* Modified h3 tagging for non ascii topic title (.SH).
There are not well testing patches.
Please check it, especially in other locales.
This problem was reported by Nazo.
https:/
description: | updated |
Changed in ubuntu-manpage-repository: | |
importance: | Undecided → Low |
status: | New → Triaged |
Hi there. Thanks for the patches. I'll apply them and regenerate a test repository. Since I don't speak Japanese, I'll need you to test them for me :-)
:-Dustin