Canonically equivalent strings are treated differently
Bug #671829 reported by
Max Rabkin
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Ibid |
Triaged
|
Low
|
Max Rabkin |
Bug Description
<Taejo> Ibido: á is a-acute
<Ibido> Taejo: One learns a new thing every day
<Taejo> Ibido: á
<Ibido> Taejo: Excuse me?
The first is U+00E1 Latin Small Letter A With Acute; the second is U+0061 Latin Small Letter A, U+0301 Combining Acute Accent.
These are canonically equivalent and we should treat them the same. We should possibly convert all output to NFC, too.
Changed in ibid: | |
milestone: | none → 0.2.0 |
To post a comment you must log in.