display URLs in Unicode

Bug #42517 reported by Martin Pool
2
Affects Status Importance Assigned to Milestone
Bazaar
Fix Released
Medium
John A Meinel

Bug Description

(This is the complement of bug 42514)

When we have a URL representing an encoded Unicode string, we could try to translate it back to Unicode for display to the user. (For example this could be done in bzr info, when showing remembered urls, etc.)

The encoding of non-ascii characters in URLs is not defined. UTF-8 is somewhat standard, and if this was implemented we would need to assume that

Such a URL will contain urlescaped characters with the high bit set, which can be unescaped and then decoded as UTF-8. Note that we can't unescape all characters; those which are reserved in urls must be left as they are.

There is a problem with doing this though: URLs which are not actually UTF-8 but some other encoding may be displayed incorrectly as Unicode.

Revision history for this message
John A Meinel (jameinel) wrote :

This should be finished in my encoding branch.

Changed in bzr:
assignee: nobody → jameinel
status: Unconfirmed → Fix Committed
John A Meinel (jameinel)
Changed in bzr:
status: Fix Committed → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.