iso-8859-1 and/or utf-8 character not decoded properly

Bug #321656 reported by Zooko Wilcox-O'Hearn
2
Affects Status Importance Assigned to Milestone
kdebase-kde4 (Ubuntu)
Fix Released
Undecided
Unassigned

Bug Description

Binary package hint: kdebase-kde4

When I view this page:

http://eprint.iacr.org/2008/527

The non-ascii char in the last name of the author "Michal Rjaško" appears as a black diamond with a question mark in it. If I do "View -> View Document Information" then the resulting dialog box says "Document encoding: UTF-8" *and* says "Content-Type: text/html; charset=ISO-8859-1". Inspecting the headers with 'wget --save-headers' shows that the server is indeed specifying "Content-type: text/html; charset=iso-8859-1". Even more interesting, the "View -> Set Encoding" option currently shows the radio button labelled "Western European -> Autodetect". If I change that radio button to "Western European -> ISO-8859-1" then the author's name renders correctly. Also if I change that radio button to "Unicode -> UTF-8" then it also renders correctly. I think the auto-detection algorithm could use some work. ;-)

$ apt-cache policy konqueror-kde4
konqueror-kde4:
  Installed: 4:4.0.3-0ubuntu2
  Candidate: 4:4.0.3-0ubuntu2
  Version table:
 *** 4:4.0.3-0ubuntu2 0
        500 http://us.archive.ubuntu.com hardy/universe Packages
        100 /var/lib/dpkg/status

Revision history for this message
Jonathan Thomas (echidnaman) wrote :

KDE 4.0.x is very old. I tested this with KDE 4.2 and I cannot reproduce the error.

Changed in kdebase-kde4:
status: New → Fix Released
Revision history for this message
Zooko Wilcox-O'Hearn (zooko) wrote :

Agreed -- this is fixed now that I upgraded to konqueror 4.2.0 (from Ubuntu Jaunty pre-release).

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.