[upstream] .xls shows non-Russian character encoding for Russian characters

Bug #262777 reported by sp
16
This bug affects 3 people
Affects                  Status      Importance  Assigned to  Milestone
LibreOffice              Confirmed   Medium
OpenOffice               Confirmed   Unknown
gnumeric (Ubuntu)        Confirmed   Undecided   Unassigned
libreoffice (Ubuntu)     Invalid     Medium      Unassigned
openoffice.org (Ubuntu)  Won't Fix   Low         Unassigned

Bug Description

Binary package hint: openoffice.org

1) lsb_release -rd
Description: Ubuntu 11.04
Release: 11.04

2) apt-cache policy libreoffice-calc
libreoffice-calc:
  Installed: 1:3.3.2-1ubuntu5
  Candidate: 1:3.3.2-1ubuntu5
  Version table:
 *** 1:3.3.2-1ubuntu5 0
        500 http://us.archive.ubuntu.com/ubuntu/ natty-updates/main i386 Packages
        500 http://us.archive.ubuntu.com/ubuntu/ natty-proposed/main i386 Packages
        100 /var/lib/dpkg/status
     1:3.3.2-1ubuntu4 0
        500 http://us.archive.ubuntu.com/ubuntu/ natty/main i386 Packages

apt-cache policy gnumeric
gnumeric:
  Installed: 1.10.13-1ubuntu1
  Candidate: 1.10.13-1ubuntu1
  Version table:
 *** 1.10.13-1ubuntu1 0
        500 http://us.archive.ubuntu.com/ubuntu/ natty/universe i386 Packages
        100 /var/lib/dpkg/status

3) What is expected to happen:

using LibreOffice Calc via the Terminal:
cd ~/Desktop && wget https://bugs.launchpad.net/ubuntu/+source/openoffice.org/+bug/262777/+attachment/338558/+files/lessons.xls && localc -nologo lessons.xls

using Gnumeric via the Terminal:
cd ~/Desktop && wget https://bugs.launchpad.net/ubuntu/+source/openoffice.org/+bug/262777/+attachment/338558/+files/lessons.xls && gnumeric --no-splash lessons.xls

In both cases the file should open with the Russian characters displayed correctly.

4) What happens instead: the Russian text is garbled. For example, "Äèñöèïëèíà" is shown instead of "Дисциплина".
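
The garbled string above is consistent with text stored as Windows-1251 (Cyrillic) bytes being decoded as Windows-1252 (Latin I). The following is a minimal sketch of that assumption, not a statement about how any of the affected applications decode the file:

```python
# Illustration only: reproduce the garbling from the report, assuming the
# cell text is stored as Windows-1251 bytes but decoded as Windows-1252.
original = "Дисциплина"

stored_bytes = original.encode("cp1251")  # bytes as a Cyrillic Excel would write them
garbled = stored_bytes.decode("cp1252")   # what a reader assuming Latin I displays
# garbled == "Äèñöèïëèíà"

# Reversing the two steps recovers the text, so the bytes themselves are intact.
repaired = garbled.encode("cp1252").decode("cp1251")
# repaired == "Дисциплина"
```

Because the round trip is lossless for this string, the problem is one of choosing the right code page on import rather than of damaged data.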

WORKAROUND: Excel 2003 via WINE shows the correct characters.

Microsoft Office Excel 2003 (11.5612.6505)

apt-cache policy wine1.3
wine1.3:
  Installed: 1.3.19-0ubuntu1~maverick1~ppa1
  Candidate: 1.3.19-0ubuntu1~maverick1~ppa1
  Version table:
 *** 1.3.19-0ubuntu1~maverick1~ppa1 0
        100 /var/lib/dpkg/status
     1.3.15-0ubuntu5 0
        500 http://us.archive.ubuntu.com/ubuntu/ natty/universe i386 Packages

ProblemType: Bug
Architecture: amd64
Date: Fri Aug 29 22:24:57 2008
DistroRelease: Ubuntu 8.04
NonfreeKernelModules: nvidia
Package: openoffice.org-core 1:2.4.1-1ubuntu2
PackageArchitecture: amd64
ProcEnviron:
 PATH=/usr/lib/openoffice/program:/home/username/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games
 LANG=ru_RU.UTF-8
 SHELL=/bin/bash
SourcePackage: openoffice.org
Uname: Linux 2.6.24-21-generic x86_64

Revision history for this message
sp (sp89by-deactivatedaccount-deactivatedaccount) wrote :
Chris Cheney (ccheney)
Changed in openoffice.org:
importance: Undecided → Medium
status: New → Triaged
Revision history for this message
Chris Cheney (ccheney) wrote : Re: [upstream] incorrect character encoding (russian)

The OpenOffice.org developers would like to know which application this document was originally created in.

Revision history for this message
sp (sp89by-deactivatedaccount-deactivatedaccount) wrote :

I don't know for sure. Most likely it was created in MS Word 2003.

Changed in openoffice:
status: Unknown → Confirmed
Revision history for this message
sp (sp89by-deactivatedaccount-deactivatedaccount) wrote :

This bug is still present in OpenOffice.org 3.1.1:

ProblemType: Bug
Architecture: i386
Date: Mon Jan 11 23:14:46 EET 2010
DistroRelease: Ubuntu 9.10
Package: openoffice.org-core 1:3.1.1-5ubuntu1
PackageArchitecture: i386
ProcEnviron:
 PATH=/usr/lib/openoffice/program:/home/username/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games
 LANG=ru_RU.UTF-8
 SHELL=/bin/bash
SourcePackage: openoffice.org
Uname: Linux 2.6.31-18-generic i686

Revision history for this message
sp (sp89by-deactivatedaccount-deactivatedaccount) wrote :

One of the files that was opened with the wrong character encoding.

Revision history for this message
sp (sp89by-deactivatedaccount-deactivatedaccount) wrote :

Screenshot 1

Revision history for this message
sp (sp89by-deactivatedaccount-deactivatedaccount) wrote :

Screenshot 2

Changed in openoffice.org (Ubuntu):
assignee: nobody → sp (sp89by)
assignee: sp (sp89by) → nobody
Chris Cheney (ccheney)
tags: added: hardy
Revision history for this message
In , penalvch (penalvch) wrote :

Created attachment 46955
lessons.xls

Downstream bug may be found at:
https://bugs.launchpad.net/ubuntu/+source/libreoffice/+bug/262777

OOo bug may be found at:
http://openoffice.org/bugzilla/show_bug.cgi?id=95572

1) lsb_release -rd
Description: Ubuntu 11.04
Release: 11.04

2) apt-cache policy libreoffice-calc
libreoffice-calc:
  Installed: 1:3.3.2-1ubuntu5
  Candidate: 1:3.3.2-1ubuntu5
  Version table:
 *** 1:3.3.2-1ubuntu5 0
        500 http://us.archive.ubuntu.com/ubuntu/ natty-updates/main i386 Packages
        500 http://us.archive.ubuntu.com/ubuntu/ natty-proposed/main i386 Packages
        100 /var/lib/dpkg/status
     1:3.3.2-1ubuntu4 0
        500 http://us.archive.ubuntu.com/ubuntu/ natty/main i386 Packages

3) What is expected to happen using LibreOffice Calc via the Terminal:
cd ~/Desktop && wget https://bugs.launchpad.net/ubuntu/+source/openoffice.org/+bug/262777/+attachment/338558/+files/lessons.xls && localc -nologo lessons.xls

The file should open with the Russian characters displayed correctly.

4) What happens instead: the Russian text is garbled. For example, "Äèñöèïëèíà" is shown instead of "Дисциплина".

Revision history for this message
In , timar (timar74) wrote :

This is an old known bug, see http://openoffice.org/bugzilla/show_bug.cgi?id=32785
Your file is in Excel 5.0 (BIFF5) format which is not fully supported. Nobody has implemented the CODEPAGE record in BIFF5.

penalvch (penalvch)
description: updated
Changed in libreoffice (Ubuntu):
importance: Undecided → Medium
status: New → Triaged
tags: added: lo33
summary: - [upstream] incorrect character encoding (russian)
+ [upstream] .xls shows incorrect Russian character encoding
summary: - [upstream] .xls shows incorrect Russian character encoding
+ [upstream] .xls shows non-Russian character encoding for Russian
+ characters
Changed in df-libreoffice:
importance: Unknown → Medium
status: Unknown → Confirmed
penalvch (penalvch)
Changed in openoffice.org (Ubuntu):
importance: Medium → Low
Revision history for this message
Andreas J Guelzow (aguelzow) wrote :

Please note that this will be fixed in Gnumeric 1.11.0 and later.

Changed in openoffice.org (Ubuntu):
status: Triaged → Won't Fix
Revision history for this message
Björn Michaelsen (bjoern-michaelsen) wrote : migrating packaging from OpenOffice.org to Libreoffice

[This is an automated message.]
There are no new official OpenOffice.org releases in Ubuntu packaging anymore => Won't Fix

If the problem persists, please mark this bug as "also affects project Libreoffice" or "also affects distribution Libreoffice (Ubuntu)" if that has not happened already.

Please leave references to upstream OpenOffice.org bugs in place to allow cross pollination.

Revision history for this message
In , Björn Michaelsen (bjoern-michaelsen) wrote :

[This is an automated message.]
This bug was filed before the changes to Bugzilla on 2011-10-16. Thus it
started right out as NEW without ever being explicitly confirmed. The bug is
changed to state NEEDINFO for this reason. To move this bug from NEEDINFO back
to NEW please check if the bug still persists with the 3.5.0 beta1 or beta2 prereleases.
Details on how to test the 3.5.0 beta1 can be found at:
http://wiki.documentfoundation.org/QA/BugHunting_Session_3.5.0.-1

more detail on this bulk operation: http://nabble.documentfoundation.org/RFC-Operation-Spamzilla-tp3607474p3607474.html

Changed in df-libreoffice:
status: Confirmed → Incomplete
Changed in df-libreoffice:
status: Incomplete → Confirmed
Revision history for this message
Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in gnumeric (Ubuntu):
status: New → Confirmed
Revision history for this message
In , Qa-admin-q (qa-admin-q) wrote :

Please read this message in its entirety before responding.

Your bug was confirmed at least 1 year ago and has not had any activity on it for over a year. Your bug is still set to NEW which means that it is open and confirmed. It would be nice to have the bug confirmed on a newer version than the version reported in the original report to know that the bug is still present -- sometimes a bug is inadvertently fixed over time and just never closed.

If you have time please do the following:
1) Test to see if the bug is still present on a currently supported version of LibreOffice (preferably 4.2 or newer).
2) If it is present please leave a comment telling us what version of LibreOffice and your operating system.
3) If it is NOT present please set the bug to RESOLVED-WORKSFORME and leave a short comment telling us your version and Operating System

Please DO NOT
1) Update the version field
2) Reply via email (please reply directly on the bug tracker)
3) Set the bug to RESOLVED - FIXED (this status has a particular meaning that is not appropriate in this case)

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
LibreOffice is powered by a team of volunteers, every bug is confirmed (triaged) by human beings who mostly give their time for free. We invite you to join our triaging by checking out this link:
https://wiki.documentfoundation.org/QA/BugTriage

There are also other ways to get involved including with marketing, UX, documentation, and of course developing - http://www.libreoffice.org/get-help/mailing-lists/.

Lastly, good bug reports help tremendously in making the process go smoother. Please always provide reproducible steps (even if it seems easy) and attach any and all relevant material.

Revision history for this message
In , Qa-admin-q (qa-admin-q) wrote :

** Please read this message in its entirety before responding **

To make sure we're focusing on the bugs that affect our users today, LibreOffice QA is asking bug reporters and confirmers to retest open, confirmed bugs which have not been touched for over a year.

There have been thousands of bug fixes and commits since anyone checked on this bug report. During that time, it's possible that the bug has been fixed, or the details of the problem have changed. We'd really appreciate your help in getting confirmation that the bug is still present.

If you have time, please do the following:

Test to see if the bug is still present on a currently supported version of LibreOffice (4.4.1 or later): https://www.libreoffice.org/download/

If the bug is present, please leave a comment that includes the version of LibreOffice and your operating system, and any changes you see in the bug behavior

If the bug is NOT present, please set the bug's Status field to RESOLVED-WORKSFORME and leave a short comment that includes your version of LibreOffice and Operating System

Please DO NOT

Update the version field
Reply via email (please reply directly on the bug tracker)
Set the bug's Status field to RESOLVED - FIXED (this status has a particular meaning that is not appropriate in this case)

If you want to do more to help, you can test to see if your issue is a REGRESSION. To do so:

1. Download and install the oldest version of LibreOffice (usually 3.3 unless your bug pertains to a feature added after 3.3): http://downloadarchive.documentfoundation.org/libreoffice/old/
2. Test your bug.
3. Leave a comment with your results.
4a. If the bug was present with 3.3, set version to "inherited from OOo".
4b. If the bug was not present in 3.3, add "regression" to keyword.

Feel free to come ask questions or to say hello in our QA chat: http://webchat.freenode.net/?channels=libreoffice-qa

Thank you for your help!

-- The LibreOffice QA Team
This NEW Message was generated on: 2015-07-18

Revision history for this message
In , Beluga (beluga) wrote :

Still confirmed.

Win 7 Pro 64-bit Version: 5.1.0.0.alpha1+
Build ID: 186f32f63434e16ff5776251657f902d5808ed3d
TinderBox: Win-x86@39, Branch:master, Time: 2015-10-16_09:42:47
Locale: en-US (fi_FI)

Revision history for this message
In , Momonasmon (momonasmon) wrote :

(In reply to Andras Timar from comment #1)
> This is an old known bug, see
> http://openoffice.org/bugzilla/show_bug.cgi?id=32785
> Your file is in Excel 5.0 (BIFF5) format which is not fully supported.
> Nobody has implemented the CODEPAGE record in BIFF5.
i#32785 is about BIFF5 *export*, while this bug is about BIFF5 *import*. And LO actually imports and honors the CODEPAGE record. The problem is that for this bugdoc it equals 1252 (Latin I).
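
As a rough illustration of what reading the CODEPAGE record involves, the sketch below walks a raw BIFF record stream and returns the first code page it finds. It assumes the commonly documented record layout (a 2-byte little-endian identifier, a 2-byte length, then the payload) and the CODEPAGE identifier 0x0042. The function name and the sample bytes are hypothetical, the sample is synthetic rather than taken from lessons.xls, and extracting the workbook stream from the OLE container is not shown.

```python
import struct

CODEPAGE_RECORD = 0x0042  # BIFF record identifier for CODEPAGE


def find_codepage(stream: bytes):
    """Return the code page from the first CODEPAGE record, or None."""
    pos = 0
    while pos + 4 <= len(stream):
        # Each record starts with a little-endian (identifier, size) header.
        rec_id, size = struct.unpack_from("<HH", stream, pos)
        body = stream[pos + 4 : pos + 4 + size]
        if rec_id == CODEPAGE_RECORD and len(body) >= 2:
            return struct.unpack_from("<H", body)[0]
        pos += 4 + size  # skip to the next record header
    return None


# Synthetic stream: a placeholder BOF record with a zeroed 8-byte payload,
# followed by a CODEPAGE record declaring 1252.
sample = struct.pack("<HH", 0x0809, 8) + bytes(8)
sample += struct.pack("<HHH", CODEPAGE_RECORD, 2, 1252)

declared = find_codepage(sample)  # 1252 for this synthetic sample
```

A file that declares 1252 while actually holding Windows-1251 bytes would be decoded as Latin I by any importer that trusts the record.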

There is also an additional problem with the FONT record, which in BIFF5 stores the charset as well. Here it equals 1, which corresponds to DEFAULT_CHARSET as used in the LOGFONT WinAPI structure. According to MSDN, "DEFAULT_CHARSET is set to a value based on the current system locale. For example, when the system locale is English (United States), it is set as ANSI_CHARSET." This means there is no way to know the charset actually used from it. (And the charset stored in the FONT record is preferred by LO over the CODEPAGE one.)

Revision history for this message
In , Momonasmon (momonasmon) wrote :

NOTOURBUG as per comment 6 (i.e. the file has the wrong codepage inside). The 2nd paragraph of that comment is completely wrong and should be ignored.

Revision history for this message
In , Momonasmon (momonasmon) wrote :

*** Bug 90210 has been marked as a duplicate of this bug. ***

Revision history for this message
Marcus Tomlinson (marcustomlinson) wrote :

This release of Ubuntu is no longer receiving maintenance updates. If this is still an issue on a maintained version of Ubuntu please let us know.

Changed in libreoffice (Ubuntu):
status: Triaged → Incomplete
Changed in df-libreoffice:
importance: Medium → Unknown
status: Confirmed → Unknown
Changed in df-libreoffice:
importance: Unknown → Medium
status: Unknown → Won't Fix
Revision history for this message
Marcus Tomlinson (marcustomlinson) wrote :

This bug has sat incomplete for 6 months. Marking invalid.

Changed in libreoffice (Ubuntu):
status: Incomplete → Invalid
Revision history for this message
In , Mikekaganski (mikekaganski) wrote :

*** This bug has been marked as a duplicate of bug 132796 ***

Revision history for this message
In , Mikekaganski (mikekaganski) wrote :

I was wrong: this bug requires considering font *names* (... Cyr) to decide the encoding, whereas bug 132796 is different.

Also, it is not "NOTOURBUG", we can do better.
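
One way to picture the font-name idea is the hypothetical heuristic below. It is not LibreOffice's import code: the function name, the mapping, and the font names are all assumptions made for illustration, and only the " Cyr" suffix comes from the comment above.

```python
# Hypothetical heuristic, not LibreOffice's actual import code.
# Only the " Cyr" suffix comes from the comment above; the others are assumed.
SUFFIX_TO_CODEC = {
    " cyr": "cp1251",    # Cyrillic
    " ce": "cp1250",     # Central European (assumed)
    " greek": "cp1253",  # Greek (assumed)
}


def guess_codec(font_name: str, declared: str = "cp1252") -> str:
    """Prefer a script hint in the font name over the declared code page."""
    lowered = font_name.lower()
    for suffix, codec in SUFFIX_TO_CODEC.items():
        if lowered.endswith(suffix):
            return codec
    return declared


raw = "Дисциплина".encode("cp1251")  # bytes as assumed to be stored in the file

with_hint = raw.decode(guess_codec("Arial Cyr"))  # font name selects cp1251
without_hint = raw.decode(guess_codec("Arial"))   # falls back to declared cp1252
```

With the hint the text decodes as "Дисциплина"; without it, the declared code page wins and the same bytes come out garbled.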

Changed in df-libreoffice:
status: Won't Fix → Confirmed