.zip files generated by google forms are processed incorrectly by unzip tool if non-lating characters are present in archived file name
Bug #1898407 reported by
Unxed
This bug affects 1 person
| Affects | Status | Importance | Assigned to | Milestone | |
|---|---|---|---|---|---|
| unzip (Ubuntu) |
Invalid
|
Undecided
|
Unassigned | ||
Bug Description
See sample attached.
Expected:
$ unzip -l csv.zip
Archive: csv.zip
Length Date Time Name
--------- ---------- ----- ----
102 2020-10-03 08:43 Новая форма.csv
--------- -------
102 1 file
Actual:
$ unzip -l csv.zip
Archive: csv.zip
Length Date Time Name
--------- ---------- ----- ----
102 2020-10-03 08:43 ╨Э╨╛╨▓╨░╤П ╤Д╨╛╤А╨╝╨░.csv
--------- -------
102 1 file
To post a comment you must log in.

Thank you for taking the time to report this bug and helping to make Ubuntu better.
Unfortunately it is not always possible to detect the correct encoding of the filenames. Because of this reason you can specify the encoding manually.
-O CHARSET specify a character encoding for DOS, Windows and OS/2 archives
-I CHARSET specify a character encoding for UNIX and other archives
For example:
$ unzip -O UTF8 -l csv.zip
Archive: csv.zip
Length Date Time Name
--------- ---------- ----- ----
102 2020-10-03 08:43 Новая форма.csv
--------- -------
102 1 file
To see available encodings use:
$ iconv --list
I am aware that is issue is quite old, but I hope that I could help you :)