Ark : when creating a zip file Ark doesn't encode accents properly (i.e. "é" "è" etc.)

Bug #692651 reported by Mahendra Tallur
This bug affects 2 people
Affects          Status    Importance   Assigned to   Milestone
KDE Utilities    Unknown   Medium
zip (Ubuntu)     Triaged   Low          Unassigned

Bug Description

Binary package hint: kdeutils

Hi ! This bug occurs with a clean Kubuntu 10.10 install.

How to reproduce :

1) Create a file or a folder whose name contains accented characters, such as "é", "è", "ê", etc. (French accents in this case).
2) With Dolphin, right-click on it and create a zip file.
3) Open the zip file with Ark or from the command line.

What happens : each accented character is replaced by two question marks "??".
What should happen : the filenames should be the same as those of the source file / folder.

Please note :
1) this occurs when creating zip files, but not when creating rar files or tar.gz files
2) the fact that a single character is replaced by two question marks suggests that the filename was encoded in UTF-8 but that the zip file was not marked as using UTF-8
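
The "two question marks" observation can be checked with a minimal Python sketch. It assumes that the tool printing the listing shows "?" for every byte it cannot read as ASCII; that substitution rule is an assumption made for illustration, not behaviour confirmed anywhere in this report.

```python
# Sketch: how a UTF-8 filename looks if each non-ASCII byte is shown as "?".
# The "?" substitution rule is an assumption, not confirmed unzip/Ark behaviour.
name = "No\u00ebl"           # "Noël", with a precomposed "ë"
raw = name.encode("utf-8")   # "ë" becomes two bytes in UTF-8

# Keep ASCII bytes as they are, replace every other byte with "?".
shown = "".join(chr(b) if b < 0x80 else "?" for b in raw)

print(len(name), len(raw))   # character count vs. byte count
print(shown)
```

One accented character yields two bytes, hence two "?" per accent under that assumption.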

Please tell me if I can help ! (some other people on the French Ubuntu forums confirmed the issue)
Cheers !

ProblemType: Bug
DistroRelease: Ubuntu 10.10
Package: ark 4:4.5.1-0ubuntu2
ProcVersionSignature: Ubuntu 2.6.35-23.41-generic-pae 2.6.35.7
Uname: Linux 2.6.35-23-generic-pae i686
NonfreeKernelModules: nvidia
Architecture: i386
Date: Mon Dec 20 18:25:30 2010
InstallationMedia: Kubuntu 10.10 "Maverick Meerkat" - Release i386 (20101007)
ProcEnviron:
 LANGUAGE=
 LANG=fr_FR.UTF-8
 SHELL=/bin/bash
SourcePackage: kdeutils

Mahendra Tallur (mahen) wrote :
jajaX (jajaplanet) wrote :

Hi !

Same problem for me with the KDE SC 4.5.4 Kubuntu packages.

Rohan Garg (rohangarg) wrote :

Hi
This seems to be an upstream KDE issue, please report it at bugs.kde.org
Thanks!

Mahendra Tallur (mahen) wrote :

OK !
Upstream bug reported here : https://bugs.kde.org/show_bug.cgi?id=260855
(also occurs with KDE 4.5.4 from the Kubuntu PPA BTW)

Mahendra Tallur (mahen) wrote :

Another user told me he could not reproduce it under Arch Linux, though (with KDE 4.5.x as well).

Mahendra Tallur (mahen) wrote :

One more piece of information :

When listing the archive with unzip, I get question marks. When listing it within Ark, I get question marks too. BUT when *extracting* the archive from the command line, I get, for instance :

"extracting : No??l"

BUT the extracted file has the correct filename !

Mahendra Tallur (mahen) wrote :

It turns out this bug report should be closed... Or rather, filed differently :

When creating a zip file from the command line, with no specific option, the very same behaviour occurs. So Ark is not at fault; the cause is "zip" itself, or some locale settings / environment variables I'm not aware of...

I'm really sorry for not checking this first. As it doesn't happen under Ubuntu, I didn't think it was necessary.

Mahendra Tallur (mahen) wrote :

OKAY. I figured out that :

- Under Ubuntu, with both File Roller and zip from the command line, in both 10.10 and 11.04 alpha : accents are replaced by "??"
- HOWEVER, after installing p7zip-full, filenames are encoded properly, EVEN WHEN JUST calling "zip" from the command line

So it seems zip makes use of p7zip-full when available.

- NONETHELESS... this does not hold under Kubuntu. Even after installing p7zip-full and invoking zip from the command line, I still get "??" instead of accents.

--> I definitely have to file a new bug for this issue.

(BTW, people using openSUSE and Arch Linux don't have this issue)

Myriam Schweingruber (myriam) wrote :

Please make sure your system locale uses UTF-8 or UTF-16 only. I can't reproduce this here with Kubuntu 10.10 and KDE 4.5.4 or KDE 4.6 beta 2. All my locales use UTF-8, and I see no special character problems in German, French, Italian or English.

Mahendra Tallur (mahen) wrote :

Myriam : thanks for your reply. Well, my locale is indeed UTF-8. This is a clean Kubuntu 10.10 install.

LANG=fr_FR.UTF-8
LANGUAGE=
LC_CTYPE="fr_FR.UTF-8"
LC_NUMERIC="fr_FR.UTF-8"
LC_TIME="fr_FR.UTF-8"
LC_COLLATE="fr_FR.UTF-8"
LC_MONETARY="fr_FR.UTF-8"
LC_MESSAGES="fr_FR.UTF-8"
LC_PAPER="fr_FR.UTF-8"
LC_NAME="fr_FR.UTF-8"
LC_ADDRESS="fr_FR.UTF-8"
LC_TELEPHONE="fr_FR.UTF-8"
LC_MEASUREMENT="fr_FR.UTF-8"
LC_IDENTIFICATION="fr_FR.UTF-8"
LC_ALL=
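
To read a dump like the one above, it helps to recall the usual POSIX precedence for the character-encoding category: a non-empty LC_ALL wins, then LC_CTYPE, then LANG. A minimal Python sketch of that lookup follows; the helper names `effective_ctype` and `uses_utf8` are hypothetical and only mirror the documented precedence, they do not call into the C library.

```python
def effective_ctype(env):
    """Return the locale name that governs character encoding.

    Follows the POSIX precedence LC_ALL > LC_CTYPE > LANG; empty values
    count as unset. Hypothetical helper, written for illustration only.
    """
    for var in ("LC_ALL", "LC_CTYPE", "LANG"):
        value = env.get(var, "")
        if value:
            return value
    return "C"


def uses_utf8(env):
    """True if the effective locale name carries a UTF-8 codeset suffix."""
    # "fr_FR.UTF-8@euro" -> codeset "UTF-8"; a bare "C" has no codeset.
    codeset = effective_ctype(env).partition(".")[2].partition("@")[0]
    return codeset.lower().replace("-", "") == "utf8"


# Values taken from the locale output quoted in this comment.
reported = {"LANG": "fr_FR.UTF-8", "LC_CTYPE": "fr_FR.UTF-8", "LC_ALL": ""}
print(effective_ctype(reported), uses_utf8(reported))
```

Passing `os.environ` instead of the `reported` dictionary checks the current session in the same way.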

Mahendra Tallur (mahen) wrote :

Myriam : meanwhile, I triple-checked everything and am very surprised you don't encounter this issue. Here are some more tests I made :

[1] When using
- Ubuntu 10.10 liveCD (amd64)
- Ubuntu 11.04 alpha liveCD (i386)

By default : when archiving, for instance, a file called "exposé.txt" into "exposé.zip" using zip or file-roller and opening it again using unzip -t or file-roller --> "é" is displayed as "??". The file itself is extracted properly from the command line : although unzip says "extracting: expos??.txt", it does create "exposé.txt".

After installing p7zip-full : everything works properly with both zip and file-roller, as if they were making use of p7zip.

[2] When using
- Kubuntu 10.10 full up to date install with kubuntu-ppa (i386)
- Kubuntu 10.10 LiveCD in English mode (amd64)
- Kubuntu 10.10 LiveCD in French mode (i386 & amd64)

By default : when archiving "exposé.txt" into "exposé.zip" with Ark or zip and opening it again with Ark or unzip, I get "expos??.txt" instead of "exposé.txt" in the file listing. I can still unzip it from the command line and get the right file, as in [1]. Ark doesn't manage to extract it.

After installing p7zip-full : no difference, it still doesn't work.

Please see the attached zip file made in Kubuntu. Here the included file is listed as "expos??.txt".
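
One way to look at such an archive without relying on unzip or Ark is Python's standard zipfile module, which exposes each member's general purpose flags as `ZipInfo.flag_bits`. Bit 11 (0x800) is the ZIP format's marker for UTF-8 filenames. The sketch below is only an inspection aid; the helper name `name_flags` is hypothetical, and the demo archive is built by Python itself, so it says nothing about what Ubuntu's zip actually writes.

```python
import io
import zipfile

UTF8_FLAG = 0x800  # general purpose bit 11: member names are UTF-8


def name_flags(archive):
    """Return (decoded name, UTF-8 flag set?) for each archive member."""
    with zipfile.ZipFile(archive) as zf:
        return [(info.filename, bool(info.flag_bits & UTF8_FLAG))
                for info in zf.infolist()]


# Self-contained demo: build a small archive in memory with Python itself.
buf = io.BytesIO()
with zipfile.ZipFile(buf, "w") as zf:
    zf.writestr("expos\u00e9.txt", "demo")  # "exposé.txt"
buf.seek(0)

print(name_flags(buf))
```

Calling `name_flags` on the path of a real archive shows whether the flag is set there. When it is clear, Python decodes the stored name as CP437, so a UTF-8 name stored without the flag would appear as other characters rather than as "??".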

Cheers !

Mahendra Tallur (mahen) wrote :

Ah, someone figured out why you may not have noticed the problem :

If you configured Dolphin to browse archives instead of opening them with Ark, the filenames appear properly, which is not the case when using Ark or unzip.
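
A possible explanation for two readers disagreeing about the same archive is that they decode the same stored bytes with different character sets. This is an assumption, not something established in this report. The Python sketch below only illustrates the effect for a UTF-8 encoded name; the three codecs are chosen for illustration and do not describe what Dolphin, Ark or unzip actually do.

```python
# Bytes of "exposé.txt" encoded as UTF-8.
raw = "expos\u00e9.txt".encode("utf-8")

# The same bytes decoded under three different assumptions.
views = {
    "utf-8": raw.decode("utf-8"),
    "cp437": raw.decode("cp437"),
    "ascii": raw.decode("ascii", errors="replace"),
}

for codec, text in views.items():
    print(codec, repr(text))
```

Only the UTF-8 view gives back the original name. The other two each turn the single "é" into two characters.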

Mahendra Tallur (mahen) wrote :

An Arch Linux user confirmed that the sample archive I attached above is displayed properly on his system (with both command-line unzip and Ark), while it displays "????" under Kubuntu for many users on the French Ubuntu forum.

It may give an additional clue about the problem.

Changed in kdeutils:
importance: Unknown → Medium
Philip Muškovac (yofel) wrote :

Easy to reproduce here in natty with KDE 4.6.2 using LANG=en_US.UTF-8 and LC_ALL=en_US.UTF-8

Changed in kdeutils (Ubuntu):
status: New → Triaged
Philip Muškovac (yofel)
Changed in kdeutils (Ubuntu):
importance: Undecided → Low
Jonathan Thomas (echidnaman) wrote :

According to upstream, this was a problem with the zip utility in Ubuntu.

affects: kdeutils (Ubuntu) → zip (Ubuntu)