[SRU] unzip list utf-8 (non-ascii) filenames as ??
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
unzip (Debian) | Fix Released | Unknown | |
unzip (Ubuntu) | Fix Released | Undecided | Unassigned |
Precise | Fix Released | High | Brian Murray |
Quantal | Fix Released | High | Brian Murray |
Raring | Fix Released | High | Brian Murray |
Bug Description
[Impact]
Apart from the notorious LP #580961, the biggest issue with the current unzip is that it always lists non-ASCII file name characters as question marks (?).
Even though handling Zip archives with a non-standard encoding (neither CP437 nor UTF-8) can be tricky, we should support UTF-8 Zip archives out of the box, as UTF-8 is the standard and is becoming more and more popular.
Newer versions of unzip in Debian and Ubuntu (13.10+) have already fixed the issue by changing the build configuration. The fix is also visible in File Roller and Ark.
The issue currently remains in 12.04 through 13.04. We need to apply the same fix to those releases.
[Test Case]
touch naïve.txt
touch 天真.txt
zip test.zip naïve.txt 天真.txt
unzip -l test.zip
With unzip -l you should not see question marks in the filenames.
ark test.zip
file-roller test.zip # the result may differ; see the notes below
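The manual steps above can be wrapped in a small script for repeated verification. This is only a sketch: the function names `listing_has_placeholder` and `run_unzip_listing_check` are made up for illustration, and it assumes that `zip`, `unzip` and `mktemp` are installed and that a `?` appears in the `unzip -l` output only when a file name was mangled.

```shell
#!/bin/sh
# Sketch of the test case as a script. Function names are illustrative only.

# Succeeds (exit 0) when the listing on stdin contains a literal '?',
# the placeholder unzip prints for characters it does not display.
listing_has_placeholder() {
    grep -qF '?'
}

# Builds test.zip in a scratch directory and inspects `unzip -l`.
# Returns 0 on pass, 1 when '?' shows up, 2 on setup problems.
run_unzip_listing_check() {
    workdir=$(mktemp -d) || return 2
    (
        cd "$workdir" || exit 2
        touch 'naïve.txt' '天真.txt'
        zip -q test.zip 'naïve.txt' '天真.txt' || exit 2
        if unzip -l test.zip | listing_has_placeholder; then
            echo "FAIL: unzip -l shows '?' in file names"
            exit 1
        fi
        echo "PASS: file names listed without '?'"
    )
    status=$?
    rm -rf "$workdir"
    return "$status"
}
```

After sourcing the file, run `run_unzip_listing_check` on each release under test. The Ark and File Roller checks still need to be done by hand.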
[Regression Potential]
Little, if any, as the proposed fix is already used by Debian, Ubuntu 13.10+ and openSUSE (which uses a different approach to achieve exactly the same effect).
[Other Info]
The fix for this bug also fixes bug 580961. However, that bug had so much history that I (bdmurray) decided not to mark one as a duplicate of the other, and instead just to include that bug number in the changelog as well.
Note that Ark is the better frontend for testing, as it seems to use the unzip backend exclusively. File Roller, on the other hand, will use the 7z backend when 7z is installed, so the issue can seem to disappear at times (7z supports UTF-8 filenames correctly). 7z isn't included on the ISO, though.
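Before trusting a File Roller result, it can help to check whether 7z is present on the test machine. The snippet below is a rough sketch: `backend_available` is a hypothetical helper name, and it assumes the 7z command is installed under the name `7z`.

```shell
#!/bin/sh
# Sketch: check whether a 7z command is on PATH before testing with File Roller.
# backend_available is an illustrative helper, not part of any package.
backend_available() {
    command -v "$1" >/dev/null 2>&1
}

if backend_available 7z; then
    echo "7z found: File Roller may use it, so a clean listing there does not prove unzip is fixed"
else
    echo "7z not found: File Roller is expected to use the unzip backend"
fi
```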
description: | updated |
summary: |
- Do not call isprint when listing
+ unzip list utf-8 (non-ascii) filenames as ??, also affects file roller and ark |
summary: |
- unzip list utf-8 (non-ascii) filenames as ??, also affects file roller and ark
+ unzip list utf-8 (non-ascii) filenames as ?? |
summary: |
- unzip list utf-8 (non-ascii) filenames as ??
+ [SRU] unzip list utf-8 (non-ascii) filenames as ?? |
tags: | removed: patch |
Changed in unzip (Debian): | |
status: | Unknown → Fix Released |
tags: | added: verification-done-precise verification-done-raring |
tags: |
added: verification-done removed: verification-done-precise verification-done-raring verification-needed |
tags: | added: verification-done-precise verification-done-quantal verification-done-raring |
Status changed to 'Confirmed' because the bug affects multiple users.