Zim

Problem with opening non-Latin internet links

Bug #1605473 reported by Justinthere
This bug affects 1 person
Affects   Status      Importance   Assigned to   Milestone
Zim       Confirmed   High         Unassigned

Bug Description

Hello.

First, thank you for your work, and thank you for the very nice Zim application!

My problem:
I have a link
https://ru.wikipedia.org/wiki/%D0%A3%D1%8D%D0%BB%D0%BB%D1%81,_%D0%93%D0%B5%D1%80%D0%B1%D0%B5%D1%80%D1%82_%D0%94%D0%B6%D0%BE%D1%80%D0%B4%D0%B6
or
https://ru.wikipedia.org/wiki/Уэллс,_Герберт_Джордж
Both are the same address in different notations.

In Zim I can see the link in readable form, like
"https://ru.wikipedia.org/wiki/Уэллс,_Герберт_Джордж" - and that is cool!

But when I click on this link in Zim, the program starts my browser (Maxthon 4.9), and the browser opens an error page at the address
https://ru.wikipedia.org/wiki/%D0%A0%D0%88%D0%A1%D0%8C%D0%A0%C2%BB%D0%A0%C2%BB%D0%A1%D0%83,_%D0%A0%E2%80%9C%D0%A0%C2%B5%D0%A1%D0%82%D0%A0%C2%B1%D0%A0%C2%B5%D0%A1%D0%82%D0%A1%E2%80%9A_%D0%A0%E2%80%9D%D0%A0%C2%B6%D0%A0%D1%95%D0%A1%D0%82%D0%A0%D2%91%D0%A0%C2%B6

As we can see, this is an encoding problem. Decoding this address gives broken symbols instead of the Russian title.

The correct variant, which we should get, is https://ru.wikipedia.org/wiki/Уэллс,_Герберт_Джордж

If I use a Latin-only link, like
https://cse.google.com/cse?cx=partner-pub-2698861478625135:3033704849&ie=UTF-8&q=zim#gsc.tab=0&gsc.q=zim&gsc.page=1
 - everything works fine, and the page opens correctly in the browser.

But if I use non-Latin symbols in the address, the link opens incorrectly in the browser.

Another example. The link
https://cse.google.com/cse?cx=partner-pub-2698861478625135:3033704849&ie=UTF-8&q=zim#gsc.tab=0&gsc.q=%D0%B7%D0%B8%D0%BC

is opened incorrectly as
https://cse.google.com/cse?cx=partner-pub-2698861478625135:3033704849&ie=UTF-8&q=zim#gsc.tab=0&gsc.q=Р·РёРј
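Both broken addresses above look consistent with the UTF-8 bytes of the URL being misread as Windows-1251 (cp1251) somewhere between Zim and the browser, and then encoded again. The following Python 3 sketch reproduces that effect. It is only an illustration of the suspected mechanism, not Zim's code; the function names `garble` and `broken_url` and the choice of cp1251 are assumptions made for this example.

```python
from urllib.parse import quote, unquote

def garble(text, wrong_codec="cp1251"):
    """Encode text as UTF-8, then misread those bytes in the wrong codec.

    Note: cp1251 has no character for byte 0x98, so some letters
    (for example uppercase Cyrillic "И") would raise UnicodeDecodeError.
    """
    return text.encode("utf-8").decode(wrong_codec)

def broken_url(url):
    """Simulate the double encoding: unescape, garble, percent-encode again."""
    readable = unquote(url)  # percent-escapes to text, UTF-8 by default
    # Keep URL delimiters unescaped so only the non-ASCII text is re-encoded.
    return quote(garble(readable), safe=":/?#&=,")

if __name__ == "__main__":
    print(garble("зим"))
    print(broken_url("https://ru.wikipedia.org/wiki/Уэллс,_Герберт_Джордж"))
```

Running it on the Wikipedia link should print an address that begins like the broken one reported above, which supports the double-encoding explanation without proving where in the chain it happens.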

Please, help me!

Zim 0.65, Windows 8.1, if that helps.

Thank you!
Justin.

Jaap Karssenberg (jaap.karssenberg) wrote :

This is a known issue - I believe there is another report open as well.

The problem is that I don't know up front how the browser will handle UTF-8, so we assume an encoding standard, but that assumption can still be wrong.

This probably needs a patch that differs per browser used.
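One browser-independent option that might be worth trying is to hand the browser a pure-ASCII, percent-encoded URL, so that no guess about the encoding is needed on either side. The Python 3 sketch below shows the idea with the standard library only. It is not Zim's implementation, the helper name `to_ascii_url` is hypothetical, and whether this avoids the problem with Maxthon on Windows is untested.

```python
from urllib.parse import quote, urlsplit, urlunsplit

# Characters left unescaped. '%' is included so that sequences which are
# already percent-encoded are not escaped a second time.
_SAFE_PATH = "/%:@,;=+$!*'()"
_SAFE_QUERY = "=&%:/?@,;+$!*'()"

def to_ascii_url(url):
    """Percent-encode non-ASCII characters in path, query and fragment as UTF-8.

    Assumption: the host name is already ASCII. Internationalized host
    names would need separate IDNA handling, which is not covered here.
    """
    parts = urlsplit(url)
    return urlunsplit((
        parts.scheme,
        parts.netloc,
        quote(parts.path, safe=_SAFE_PATH),
        quote(parts.query, safe=_SAFE_QUERY),
        quote(parts.fragment, safe=_SAFE_QUERY),
    ))

if __name__ == "__main__":
    print(to_ascii_url("https://ru.wikipedia.org/wiki/Уэллс,_Герберт_Джордж"))
```

The result could then be passed to something like `webbrowser.open`. Because the output contains only ASCII, it should survive whatever code page the operating system or browser assumes for command-line arguments.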

Changed in zim:
status: New → Confirmed
importance: Undecided → High