On Wed, Dec 07, 2005 at 02:03:07AM -0000, Stuart Bishop wrote:
> We already have code to do the deaccentification -
> canonical.encoding.ascii_smash() handles the European latin based character
> sets. Your still stuffed with character sets that don't have an ASCII
> equivalent, such as Coptic, Greek or most of the Asian languages.
ascii_smash() doesn't do exactly what I would expect, though. For
example, it transforms my name, 'Björn', into 'Bjoern' instead of
'Bjorn'. If people would try to find me, they would most likely search
for either 'Björn' or 'Bjorn'.
On Wed, Dec 07, 2005 at 02:03:07AM -0000, Stuart Bishop wrote: encoding. ascii_smash( ) handles the European latin based character
> We already have code to do the deaccentification -
> canonical.
> sets. Your still stuffed with character sets that don't have an ASCII
> equivalent, such as Coptic, Greek or most of the Asian languages.
ascii_smash() doesn't do exactly what I would expect, though. For
example, it transforms my name, 'Björn', into 'Bjoern' instead of
'Bjorn'. If people would try to find me, they would most likely search
for either 'Björn' or 'Bjorn'.