Umlaut-errors with copy&paste copying from Konqueror to OpenOffice

Bug #44013 reported by carsten
16
Affects Status Importance Assigned to Milestone
KDE Base
Invalid
Medium
OpenOffice
Confirmed
Unknown
openoffice.org (Ubuntu)
Invalid
Undecided
Unassigned

Bug Description

Binary package hint: openoffice.org-core

Hi

Go to a page which contains german umlauts. Take for example

http://www.hamm-chemie.de/j11/j11ab/vers_aromastoffe.htm

Copy (ctrl-c) a text with an umlaut and paste (ctrl-v) in oowriter2. You will get broken umlauts. xcopy (middle mouse button) pasts fine.

Example (from the mentioned webpage):

    A: 2 ml Ameisensäure mit 2 ml Ethanol

becomes in oowriter2

    A: 2 ml Ameisensäure mit 2 ml

Revision history for this message
Jason Ribeiro (jrib) wrote :

Hi, thanks for the bug report. It seems to copy for me here without breaking (both with ctrl-c and just using xcopy).

Maybe you can provide more info about your setup so we can narrow down what's different on your system and mine. Locale you are using, language settings in Ooo2, character encoding being used on the page and what browser, and what openoffice packages you have installed may be helpful. Although I don't know if any of those could actually be related.

Here is what my system looks like:
LANG=en_US.UTF-8
language in Ooo2 is set to English(USA)
character encoding on firefox: windows-1252

(%:~)- dpkg -l '*openoffice*' | grep ^ii | awk '{ print $2 }'
openoffice.org
openoffice.org-base
openoffice.org-calc
openoffice.org-common
openoffice.org-core
openoffice.org-draw
openoffice.org-evolution
openoffice.org-gnome
openoffice.org-gtk
openoffice.org-help-pt-br
openoffice.org-impress
openoffice.org-java-common
openoffice.org-l10n-common
openoffice.org-l10n-en-us
openoffice.org-l10n-pt-br
openoffice.org-math
openoffice.org-thesaurus-en-us
openoffice.org-writer

Changed in openoffice.org:
status: Unconfirmed → Needs Info
Revision history for this message
Jason Ribeiro (jrib) wrote :

Also, what font are you using in openoffice.org? I was using Nimbus Roman No9 L. And check all the options > language settings for Ooo2, mine are all at default with English(USA) like I said above for the language.

Revision history for this message
carsten (cniehaus-kde) wrote :

- LANG: de_DE.UTF-8
- Font: Bitstream Vera Sans
- OO": 2.0.2-2ubuntu5
- Desktop: KDE
- Last system update: yesterday
- No extra repositories beside universe and multiverse
- kde-i18n-de active (german KDE)
- OO2-language-settings: all on "Standard" beside Western/German. OO2's GUI is also german

Now the most important thing: This only happens when copy happend in Konq! With Firefox it works... So maybe this is a Konqueror bug.

Revision history for this message
carsten (cniehaus-kde) wrote :

dpkg -l '*openoffice*' | grep ^ii | awk '{ print $2 }'
openoffice.org
openoffice.org-base
openoffice.org-calc
openoffice.org-common
openoffice.org-core
openoffice.org-draw
openoffice.org-help-de
openoffice.org-help-en-us
openoffice.org-hyphenation-de
openoffice.org-impress
openoffice.org-java-common
openoffice.org-kde
openoffice.org-l10n-common
openoffice.org-l10n-de
openoffice.org-l10n-en-gb
openoffice.org-l10n-en-us
openoffice.org-l10n-en-za
openoffice.org-math
openoffice.org-thesaurus-de
openoffice.org-thesaurus-de-ch
openoffice.org-thesaurus-en-us
openoffice.org-writer

Revision history for this message
gilles gregoire (schaouette) wrote : Re: Encoding-errors with copy&paste

Hello,
I have the same bug: Copy and paste from konqueror to openoffice writer doesn't seem to preserve encoding. Anyway, it works fine from the other apps I've tested so far, i.e. kate, firefox, kontact, konsole.
Copy and paste works fine from konqueror to these programs.

Otherwise, copy and paste work well from konqueror to openoffice calc and also to openoffice impress. The bug appears only for openoffice writer.

I'm using kubuntu dapper (intel x86).

Revision history for this message
gilles gregoire (schaouette) wrote : Re: Umlaut-errors with copy&paste

I found a stange behaviour with klipper:
If I copy and paste directly from konqueror to openoffice, the sentence "Ça va? Est-ce que la cédille est bien passée?" becomes:
"Ça va? Est-ce que la cédille est bien passée?"
but if I select the entry to paste in klipper before pasting in openoffice, it works fine:
"Ça va? Est-ce que la cédille est bien passée?"

Note: I did reset klipper prior to each tests.

Another strange behaviour is: if I select "collage special" (i.e. in english something like "special paste") in the edit menu, if I select: "paste as text", the pasted text is fine (If I select paste as html, it's not) even if I don't select explicitly the entry to paste in klipper.

Any idea?

PS: The document used for testing is this one:
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML>
<HEAD>
 <META HTTP-EQUIV="CONTENT-TYPE" CONTENT="text/html; charset=utf-8">
</HEAD>
<BODY>
Ça va? Est-ce que la cédille est bien passée?
</BODY>
</HTML>

Revision history for this message
carsten (cniehaus-kde) wrote :

Well, another observation:

If I paste with ctrl-v I get the reported behaviour.

If I use x-copy (pasting with middle mouse button) it works fine, also when using konqueror...

Revision history for this message
carsten (cniehaus-kde) wrote :

I upgraded to Edgy Eft and I can still confirm this bug. You set this bug to "need info". What kind of info do you need? I am willing to help you, but I need more hints *what* to test or to do :-)

Revision history for this message
carsten (cniehaus-kde) wrote :

Just some more information about my system (as I did a full dist-upgrade 2 days ago)

carsten@helium:~$ echo $LANG
de_DE.UTF-8

carsten@helium:~$ dpkg -l '*openoffice*' | grep ^ii | awk '{ print $2 }'
openoffice.org
openoffice.org-base
openoffice.org-calc
openoffice.org-common
openoffice.org-core
openoffice.org-draw
openoffice.org-help-de
openoffice.org-help-en-us
openoffice.org-hyphenation-de
openoffice.org-impress
openoffice.org-java-common
openoffice.org-kde
openoffice.org-l10n-common
openoffice.org-l10n-de
openoffice.org-l10n-en-gb
openoffice.org-l10n-en-us
openoffice.org-l10n-en-za
openoffice.org-math
openoffice.org-style-crystal
openoffice.org-style-default
openoffice.org-thesaurus-de
openoffice.org-thesaurus-de-ch
openoffice.org-thesaurus-en-us
openoffice.org-writer

Revision history for this message
Jason Ribeiro (jrib) wrote :

Carsten, sorry about the "needs info". I set it originally and never removed it after you answered my question. It would have been okay for you to change it as well since I never responded.

Changed in openoffice.org:
status: Needs Info → Unconfirmed
Revision history for this message
Le Torbi (torben-letorbi) wrote :

Same problems here in Kubuntu 7.10.

echo $LANG
en_US.UTF-8

dpkg -l '*openoffice*' | grep ^ii | awk '{ print $2 }'
openoffice.org-calc
openoffice.org-common
openoffice.org-core
openoffice.org-draw
openoffice.org-help-en-us
openoffice.org-hyphenation
openoffice.org-impress
openoffice.org-java-common
openoffice.org-kde
openoffice.org-l10n-common
openoffice.org-l10n-en-gb
openoffice.org-l10n-en-za
openoffice.org-math
openoffice.org-style-crystal
openoffice.org-style-human
openoffice.org-thesaurus-en-us
openoffice.org-writer

As far as I've figured out, the problem is, that OpenOffice pastes the HTML-formatted content of the clipboard, while all other programs if tested (KWord, Abiword, Kate) are pasting the unformatted content into the document, when you're pressing CTRL+V. Klipper also uses only the unformatted content, which explains, why there aren't any problems when using xcopy (middle mouse button) or after selecting the appropriate entry in the klipper-history (this overwrites the current clipboard content (html/unformatted) with the content of the selected history entry (only unformatted)).
I don't know whether Konqueror kills the umlauts when copying the HTML-content to the clipboard or OpenOffice mixes them up when it pastes the HTML-clipboard-content, but the problem lies somewhere there. A workaround could be to prevent either Konqueror from copy HTML-content to the clipboard or OpenOffice from pasting from it. However, I have no idea how to do that...

Revision history for this message
Le Torbi (torben-letorbi) wrote :

Just some additional info:

I've just tested cut&paste from Firefox to OpenOffice and everthings works fine. It makes no difference which option I choose at "Preferences -> Content -> Fonts & Colors -> Advanced -> Default Character Encoding", it works with "Western (ISO 8859-15)" as well as with "Unicode (UTF-8)". So the problem seems to be Konqueror...

Daniel Hahler (blueyed)
Changed in kdebase:
status: New → Triaged
Revision history for this message
Daniel Hahler (blueyed) wrote :

From the upstream bug report(s) it appears to be a bug in OOo, not Konqueror.

Changed in kdebase:
status: Unknown → Confirmed
Changed in openoffice:
status: Unknown → Confirmed
Chris Cheney (ccheney)
Changed in openoffice.org:
status: Triaged → Confirmed
Changed in kdebase:
status: Confirmed → Invalid
Chris Cheney (ccheney)
Changed in openoffice.org:
status: Confirmed → Triaged
Revision history for this message
Chris Cheney (ccheney) wrote : Re: [Upstream] [hardy] Umlaut-errors with copy&paste copying from Konqueror to OpenOffice

Can you still reproduce this bug on Ubuntu 9.04 (Jaunty)? You can download a test (desktop) live cd from http://cdimage.ubuntu.com/releases/jaunty/

Changed in openoffice.org:
importance: Medium → Undecided
status: Triaged → Incomplete
Revision history for this message
Chris Cheney (ccheney) wrote :

We are closing this bug report because it lacks the information we need to investigate the problem, as described in the previous comments. Please reopen it if you can give us the missing information, and don't hesitate to submit bug reports in the future. To reopen the bug report you can click on the current status, under the Status column, and change the Status back to "New". Thanks again!

Changed in openoffice.org:
status: Incomplete → Invalid
Changed in kdebase:
importance: Unknown → Medium
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.