unzip fails to deal correctly with filename encodings
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Ubuntu Japanese Kaizen Project |
Fix Released
|
High
|
Unassigned | ||
unzip (Ubuntu) |
Fix Released
|
Critical
|
Unassigned | ||
Precise |
Fix Released
|
High
|
Unassigned | ||
Quantal |
Fix Released
|
High
|
Unassigned | ||
Raring |
Fix Released
|
High
|
Unassigned | ||
unzip (openSUSE) |
Fix Released
|
Medium
|
Bug Description
Binary package hint: unzip
This is a fairly annoying bug that's been around and known at least since 2005. It's very visible as it will very often make exchange of zip files with Windows users impossible, for example. As such, it gathered it's fair share of "me too" and "how dare you haven't fixed this yet!!111!" comments.
Problem description:
zip/unzip and the specification fall short when dealing with non-ASCII filenames not encoded in UTF-8
test case:
do an "unzip -l" on the file http://
affected programs:
the problem is in unzip itself, but affects GUI like xarchiver, file-roller, etc. that rely on unzip for the decompression
suggested solutions (most are workarounds, not proper fixes):
a) reintroduce patch for codepage-based zip filenames: bug 477755, http://
b) unzip filename according to locale: bug 203609
c) Ubuntu JP has a patch, probably not generally applicable, bug 269482
d) Russian altlinux distro uses natspec lib and patched zip binary
natspec was mentioned in bug 477755 comment #2 and may indeed be a proper fix, needs closer inspection (I haven't really looked, yet. As discussed in https:/
It's unfortunate the FOSS community so far hasn't been able to fix this rather visible problem. I'm opening this ticket as a master bug and clean slate to document the issue and current status. Please don't ruin it by making above-mentioned unhelpful comments, they actually slow things down! Please don't nominate for a release.
Unless you're a dev and can provide a patch, you should think VERY carefully to do anything but
1) subscribe yourself to this ticket
2) mark this bug as affecting you
3) tell me via mail about other bugs you think are a duplicate of this one, discussing the same problem
1) to 3) will showcase to the devs how many people are affected and that is the only real chance we have for somebody to take a serious look. "Me too" comments do the opposite, so again, please don't do it.
Changed in unzip (Ubuntu): | |
importance: | Undecided → High |
status: | New → Triaged |
description: | updated |
description: | updated |
description: | updated |
description: | updated |
description: | updated |
Changed in unzip (Debian): | |
status: | Unknown → Confirmed |
Changed in unzip (Ubuntu): | |
assignee: | nobody → Seung Soo Ha (sungsuha) |
tags: | added: regression-update |
tags: | added: needs-reassignment |
Changed in gentoo: | |
status: | Unknown → Fix Released |
Changed in unzip (Mandriva): | |
status: | Unknown → Confirmed |
Changed in unzip: | |
status: | Unknown → Invalid |
Changed in unzip (Ubuntu): | |
status: | Triaged → Confirmed |
Changed in unzip (Ubuntu): | |
assignee: | nobody → Sergey Polushin (serbly) |
Changed in unzip: | |
importance: | Unknown → Medium |
status: | Invalid → Unknown |
Changed in ubuntu-jp-improvement: | |
assignee: | Rolf Leggewie (r0lf) → nobody |
Changed in gentoo: | |
status: | Fix Released → Won't Fix |
Changed in unzip (Ubuntu): | |
assignee: | nobody → Click (clicky-mail) |
summary: |
- unzip fails to deal correctly with filename encodings + распаковать не учитывает правильно с именами файлов |
summary: |
- распаковать не учитывает правильно с именами файлов + unzip fails to deal correctly with filename encodings |
Changed in unzip (Ubuntu): | |
assignee: | Click (clicky-mail) → Alex Smirnov (coder1993) |
tags: | added: patch |
Changed in ubuntu-jp-improvement: | |
status: | Fix Committed → Invalid |
Changed in ubuntu-jp-improvement: | |
importance: | Undecided → High |
status: | Invalid → Fix Committed |
Changed in unzip (Ubuntu Natty): | |
assignee: | Alex Smirnov (coder1993) → Pinkertonik (pinkertonik) |
Changed in file-roller: | |
importance: | Unknown → Medium |
status: | Unknown → Confirmed |
Changed in hundredpapercuts: | |
status: | New → Confirmed |
Changed in unzip (Ubuntu Natty): | |
assignee: | Pinkertonik (pinkertonik) → Canonical Desktop Team (canonical-desktop-team) |
Changed in unzip (Ubuntu Natty): | |
assignee: | Canonical Desktop Team (canonical-desktop-team) → Brian Thomason (brian-thomason) |
Changed in unzip (Ubuntu Natty): | |
status: | Confirmed → Fix Released |
Changed in gentoo: | |
importance: | Unknown → Medium |
Changed in unzip (Mandriva): | |
importance: | Unknown → High |
Changed in unzip (Ubuntu Natty): | |
status: | Fix Released → Triaged |
Changed in unzip (Ubuntu Natty): | |
status: | Triaged → Won't Fix |
Changed in hundredpapercuts: | |
status: | Confirmed → Invalid |
description: | updated |
Changed in linuxmint: | |
status: | New → Triaged |
Changed in unzip (Ubuntu Natty): | |
assignee: | Brian Thomason (brian-thomason) → Vova (vosha) |
Changed in unzip (Ubuntu): | |
status: | Triaged → Fix Released |
Changed in unzip (Ubuntu): | |
status: | Fix Released → Triaged |
Changed in unzip (Ubuntu): | |
status: | Triaged → New |
Changed in unzip (Ubuntu): | |
status: | New → Triaged |
Changed in unzip (openSUSE): | |
importance: | Unknown → Medium |
status: | Unknown → Fix Released |
tags: | added: rls-mgr-o-tracking |
tags: | removed: rls-mgr-o-tracking |
Changed in unzip (Ubuntu): | |
assignee: | Brian Thomason (brian-thomason) → Mad_Loki (madloki1) |
Changed in unzip (Ubuntu): | |
status: | Triaged → In Progress |
Changed in unzip (Ubuntu): | |
assignee: | Mad_Loki (madloki1) → nobody |
status: | In Progress → Triaged |
Changed in unzip (Ubuntu): | |
assignee: | nobody → wert (wert-dmitrii) |
Changed in unzip (Ubuntu): | |
assignee: | wert (wert-dmitrii) → nobody |
Changed in unzip (Mandriva): | |
status: | Confirmed → Unknown |
tags: | removed: regression-update |
no longer affects: | hundredpapercuts |
Changed in unzip (Ubuntu): | |
assignee: | nobody → Пётр (plmak) |
Changed in unzip (Ubuntu): | |
assignee: | Пётр (plmak) → nobody |
assignee: | nobody → Пётр (plmak) |
Changed in unzip (Ubuntu): | |
assignee: | Пётр (plmak) → nobody |
importance: | High → Critical |
no longer affects: | unzip (Ubuntu Natty) |
tags: | added: verification-needed |
Changed in unzip (Ubuntu Raring): | |
importance: | Undecided → High |
status: | New → Fix Committed |
Changed in unzip (Ubuntu Quantal): | |
status: | New → Fix Committed |
Changed in unzip (Ubuntu Precise): | |
status: | New → Fix Committed |
tags: | added: verification-done-precise |
Changed in unzip (Ubuntu Precise): | |
status: | Fix Committed → Fix Released |
tags: | added: verification-done-raring |
tags: | added: verification-done-quantal |
tags: | removed: verification-needed |
Changed in unzip (Ubuntu Raring): | |
status: | Fix Committed → Fix Released |
Changed in unzip (Ubuntu Quantal): | |
status: | Fix Committed → Fix Released |
Changed in unzip (Ubuntu): | |
status: | Triaged → Fix Released |
Changed in unzip (Debian): | |
status: | Confirmed → Unknown |
affects: | unzip → ubuntu-translations |
Changed in ubuntu-translations: | |
importance: | Medium → Undecided |
status: | Unknown → New |
no longer affects: | ubuntu-translations |
affects: | gentoo → ubuntu-translations |
Changed in ubuntu-translations: | |
importance: | Medium → Undecided |
status: | Won't Fix → New |
no longer affects: | ubuntu-translations |
affects: | unzip (Mandriva) → ubuntu-translations |
Changed in ubuntu-translations: | |
importance: | High → Undecided |
status: | Unknown → New |
no longer affects: | ubuntu-translations |
affects: | file-roller → ubuntu-translations |
Changed in ubuntu-translations: | |
importance: | Medium → Undecided |
status: | Confirmed → New |
no longer affects: | ubuntu-translations |
affects: | unzip (Debian) → ubuntu-translations |
Changed in ubuntu-translations: | |
importance: | Unknown → Undecided |
status: | Unknown → New |
no longer affects: | ubuntu-translations |
affects: | linuxmint → ubuntu-translations |
no longer affects: | ubuntu-translations |
Changed in unzip (Ubuntu Precise): | |
importance: | Undecided → High |
Changed in unzip (Ubuntu Quantal): | |
importance: | Undecided → High |
Changed in unzip (Ubuntu): | |
assignee: | nobody → f (andrewkuzbass) |
assignee: | f (andrewkuzbass) → nobody |
Created attachment 319015
An archive file with cyrillic file names included
User-Agent: Mozilla/5.0 (X11; U; Linux i686; ru; rv:1.9.0.13) Gecko/2009080200 SUSE/3.0.13-0.1.2 Firefox/3.0.13
There are several discussions about the problem concerning cyrillic filenames in zip archives and unzip package. Unzip out-of-the-box (compiled from sources) does not choose filenames encoding correctly.
Developers from Ark say me, that the error is completely from info-zip project (https:/ /bugs.kde. org/show_ bug.cgi? id=204984).
There are sime patches to info-zip's unzip package, that makes unzip extract filenames with correct encoding. But maintainers of info-zip project rejected these patches (http:// www.info- zip.org/ board/board. pl?m-1248086794).
It would be nice to include this package in main openSuSE distribution.
Reproducible: Always
Steps to Reproduce:
1. Create zip-archive, containing files with cyrillic names under Windows.
2. Try to open it with unzip under SuSE
Actual Results:
Filename encoding is incorrect. Example:
pavel@pavel:~/tmp> unzip ReportPacket_ DBV90821CJ. zip DBV90821CJ. zip
Archive: ReportPacket_
inflating: ???????? ????? (????????).pdf
inflating: ???????? ????? (??????????).pdf
Expected Results:
Results, produced with natspec patch from sisyphus
pavel@rzn- sepak-bpa: ~/backup> pavel@rzn- sepak-bpa: ~/temp> unzip ReportPacket_ DBV90821CJ. zip DBV90821CJ. zip
Archive: ReportPacket_
inflating: ????????? ????? (??????????).pdf
inflating: ????????? ????? (??????????).pdf