Canonically equivalent strings are treated differently

Bug #671829 reported by Max Rabkin
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Ibid
Triaged
Low
Max Rabkin

Bug Description

<Taejo> Ibido: á is a-acute
<Ibido> Taejo: One learns a new thing every day
<Taejo> Ibido: á
<Ibido> Taejo: Excuse me?

The first is U+00E1 Latin Small Letter A With Acute; the second is U+0061 Latin Small Letter A, U+0301 Combining Acute Accent.

These are canonically equivalent and we should treat them the same. We should possibly convert all output to NFC, too.

Tags: unicode
Max Rabkin (max-rabkin)
Changed in ibid:
milestone: none → 0.2.0
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.