inconsistent czech man pages - different encoding

Bug #130377 reported by Martin Jaburek
2
Affects Status Importance Assigned to Milestone
language-pack-cs (Ubuntu)
Fix Released
Undecided
Unassigned
man-db (Ubuntu)
Fix Released
Undecided
Colin Watson

Bug Description

Binary package hint: language-pack-cs

I found it by using man aptitude (Kubuntu Feisy Fawn 7.04 up to date 4.8.2007, locales are cs_CZ.UTF-8), man page is localized, but incorrect encoding (utf-8). So I try:

martin@kopretina:/usr/share/man/cs$ find /usr/share/man/cs -type f -print0 |xargs -0 -i bash -c "echo {};zcat {} | enca"

/usr/share/man/cs/man8/groupmod.8.gz
ISO 8859-2 standard; ISO Latin 2
/usr/share/man/cs/man8/aptitude.8.gz
Universal transformation format 8 bits; UTF-8
/usr/share/man/cs/man8/iwevent.8.gz
7bit ASCII characters
/usr/share/man/cs/man8/groupdel.8.gz
ISO 8859-2 standard; ISO Latin 2
/usr/share/man/cs/man8/grpck.8.gz
ISO 8859-2 standard; ISO Latin 2
/usr/share/man/cs/man8/iwgetid.8.gz
7bit ASCII characters
/usr/share/man/cs/man8/iwspy.8.gz
7bit ASCII characters
/usr/share/man/cs/man8/iwpriv.8.gz
7bit ASCII characters
/usr/share/man/cs/man8/faillog.8.gz
ISO 8859-2 standard; ISO Latin 2
/usr/share/man/cs/man8/groupadd.8.gz
ISO 8859-2 standard; ISO Latin 2
/usr/share/man/cs/man8/iwconfig.8.gz
ISO 8859-2 standard; ISO Latin 2
/usr/share/man/cs/man8/iwlist.8.gz
7bit ASCII characters
/usr/share/man/cs/man8/nologin.8.gz
ISO 8859-2 standard; ISO Latin 2
/usr/share/man/cs/man8/lastlog.8.gz
ISO 8859-2 standard; ISO Latin 2
/usr/share/man/cs/man8/vipw.8.gz
ISO 8859-2 standard; ISO Latin 2
/usr/share/man/cs/man1/su.1.gz
ISO 8859-2 standard; ISO Latin 2
/usr/share/man/cs/man1/expiry.1.gz
ISO 8859-2 standard; ISO Latin 2
/usr/share/man/cs/man1/gpasswd.1.gz
ISO 8859-2 standard; ISO Latin 2
/usr/share/man/cs/man7/wireless.7.gz
ISO 8859-2 standard; ISO Latin 2
/usr/share/man/cs/man5/gshadow.5.gz
ISO 8859-2 standard; ISO Latin 2
/usr/share/man/cs/man5/passwd.5.gz
ISO 8859-2 standard; ISO Latin 2
/usr/share/man/cs/man5/faillog.5.gz
ISO 8859-2 standard; ISO Latin 2
/usr/share/man/cs/man5/shadow.5.gz
ISO 8859-2 standard; ISO Latin 2

ASCII pages are not translated
aptitude is UTF-8 encoded
rest of them are latin2

Do you planning migrate to UTF-8?
Will be in the future included traslated pages man-pages-cs? (maintained at http://sweb.cz/tropikhajma/man-pages-cs/index.html)

Revision history for this message
Martin Böhm (martin.bohm) wrote :

Thank you for your bug report. I am afraid it is not the translation team's fault. I suspect it is an upstream (most likely Debian) bug or a bug related to migrating data from Debian to Ubuntu.

Do you know about the situation in Debian? Are their man pages all correctly set up?

Revision history for this message
Martin Jaburek (longmatys) wrote :

I don't know. I just installed kubuntu and found this problem. What about man-pages-cs? They are completely missing.

Revision history for this message
Colin Watson (cjwatson) wrote :

This was sort of an aptitude bug; at that point, it shouldn't have been installing UTF-8 manual pages. However, man-db 2.5.0 begins the migration to UTF-8 manual pages, and in Hardy the Czech versions of aptitude(8) and groupmod(8) both display more or less properly. (Some accented characters in aptitude(8) are displayed as their nearest ASCII equivalents; this is ultimately due to lack of Unicode input support in groff, which is actively being worked on.)

Once I release man-db 2.5.1 in a few weeks' time, we'll begin the migration to UTF-8 manual pages in Debian in earnest. I'm not sure if this will land in time for Hardy, but at this point that shouldn't matter too much; your basic problem is fixed.

I've filed http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=459776 asking for a Czech-speaking developer to package the set of Czech manual pages you mentioned.

Changed in man-db:
assignee: nobody → kamion
status: New → Fix Released
Revision history for this message
Colin Watson (cjwatson) wrote :

manpages-cs is now on its way into Hardy, which should clear up the rest of this bug. Thanks for your report.

Changed in language-pack-cs:
status: New → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.