.zip files generated by Google Forms are processed incorrectly by the unzip tool if non-Latin characters are present in an archived file name

Bug #1898407 reported by Unxed
This bug affects 1 person
Affects          Status    Importance   Assigned to   Milestone
unzip (Ubuntu)   Invalid   Undecided    Unassigned

Bug Description

See sample attached.

Expected:

$ unzip -l csv.zip
Archive: csv.zip
  Length Date Time Name
--------- ---------- ----- ----
      102 2020-10-03 08:43 Новая форма.csv
--------- -------
      102 1 file

Actual:

$ unzip -l csv.zip
Archive: csv.zip
  Length Date Time Name
--------- ---------- ----- ----
      102 2020-10-03 08:43 ╨Э╨╛╨▓╨░╤П ╤Д╨╛╤А╨╝╨░.csv
--------- -------
      102 1 file
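
The garbled name above is consistent with the UTF-8 bytes of the file name being decoded as CP866. That is an assumption inferred from the output, not something the report confirms. A minimal Python sketch that reproduces the effect under that assumption:

```python
# Assumption: the archive stores the name as UTF-8 bytes and the listing
# decoded those bytes as CP866 (a DOS Cyrillic code page).
expected = "Новая форма.csv"

# Mis-decode the UTF-8 bytes as CP866 to reproduce the garbled listing.
garbled = expected.encode("utf-8").decode("cp866")
print(garbled)

# Reversing the mis-decoding recovers the original name, which suggests
# the bytes in the archive are intact and only their interpretation is wrong.
recovered = garbled.encode("cp866").decode("utf-8")
print(recovered)
```

If the assumption holds, the first line printed matches the "Actual" listing and the second matches the "Expected" one.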

Unxed (unxed) wrote :
Dominik Viererbe (dviererbe) wrote :

Thank you for taking the time to report this bug and helping to make Ubuntu better.

Unfortunately, it is not always possible to detect the correct encoding of the file names stored in an archive. For this reason, you can specify the encoding manually:

  -O CHARSET specify a character encoding for DOS, Windows and OS/2 archives
  -I CHARSET specify a character encoding for UNIX and other archives

For example:
$ unzip -O UTF8 -l csv.zip
Archive: csv.zip
  Length Date Time Name
--------- ---------- ----- ----
      102 2020-10-03 08:43 Новая форма.csv
--------- -------
      102 1 file

To see the available encodings, use:
$ iconv --list
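
The same idea of overriding the file name encoding can be sketched in Python for cases where the patched unzip is not at hand. This is an illustration only, not how unzip implements `-O`. It assumes the archive does not set the ZIP "UTF-8 name" flag (general purpose bit 11), in which case Python's `zipfile` decodes names as CP437. The names `list_names`, `archive` and `charset` are hypothetical.

```python
import zipfile

UTF8_FLAG = 0x800  # general purpose bit 11: the name is declared as UTF-8


def list_names(archive, charset="utf-8"):
    """Return entry names, decoding unflagged names with `charset`.

    `archive` is a path or a binary file object. This helper is a
    hypothetical sketch and is not part of unzip or zipfile.
    """
    names = []
    with zipfile.ZipFile(archive) as zf:
        for info in zf.infolist():
            if info.flag_bits & UTF8_FLAG:
                # The archive declares UTF-8, so zipfile already decoded it.
                names.append(info.filename)
            else:
                # zipfile decoded the raw bytes as CP437; undo that and
                # decode them with the charset supplied by the caller.
                raw = info.filename.encode("cp437")
                names.append(raw.decode(charset))
    return names
```

For example, `list_names("csv.zip", "utf-8")` would play the role of `unzip -O UTF8 -l csv.zip` for the name column, provided the assumption about the missing flag holds for the attached archive.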

I am aware that this issue is quite old, but I hope that I could help you :)

Changed in unzip (Ubuntu):
status: New → Invalid