Zim

Comment 14 for bug 518323

Revision history for this message
Jaap Karssenberg (jaap.karssenberg) wrote : Re: [Bug 518323] Re: Automatic link creation and CamelCase don't work with non-latin characters

2011/5/31 Jiří Janoušek <email address hidden>

> I have been doing some experiments and Python regex engine seems to
> support unicode if unicode arguments and re.U flag are provided (example
> 3).
>

Yes it does for \w, however there is no way to match uppercase versus lower
case (unlike e.g. the perl regex engine which supports matching unicode
classes).

I have recently been thinking that it can work if we use the string methods
to determine which characters are uppercase and which are not and find
camelcase that way looking for an pattern of "upper lower upper" by
searching character by character.

-- Jaap