sort order incorrect
Bug #75705 reported by Kevin Scannell
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
glibc (Ubuntu) | Confirmed | Medium | Unassigned |
Bug Description
Binary package hint: coreutils
sort is not behaving as expected under ga_IE.utf8
To reproduce: create a UTF-8 file "test" with "a" on one line, and "á" on another. Then run:
$ LC_ALL=ga_IE.utf8 sort test
a
á
This is correct - the accented character collates after the unaccented one.
But now change the two lines to "aá" and "áa":
$ LC_ALL=ga_IE.utf8 sort test
áa
aá
Now the string that starts with the accented character collates first, which contradicts the first result. The second command gives the correct order ("aá" followed by "áa") on all other distros I've used (Gentoo for example).
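For anyone trying to reproduce this without creating a file by hand, here is a minimal sketch of the same test. The helper name `sort_pair` is made up for illustration. It feeds the two strings to sort under whatever locale is passed in, so the result under ga_IE.utf8 can be compared with plain byte order (LC_ALL=C), where "aá" always comes first because the byte for "a" is smaller than the first UTF-8 byte of "á". It assumes the script itself is saved as UTF-8.

```shell
#!/bin/sh
# Hypothetical helper (not part of coreutils): sort the two test strings
# from this report under the locale given as the first argument.
sort_pair() {
    # env sets LC_ALL only for the sort process, not for the shell itself.
    printf 'aá\náa\n' | env LC_ALL="$1" sort
}

echo "Byte order (LC_ALL=C):"
sort_pair C

echo "Locale under test (ga_IE.utf8):"
sort_pair ga_IE.utf8
```

If the second listing shows "áa" before "aá", the system is showing the behavior described above.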
More information. The same error occurs with the other locales I've tried, including en_US.UTF-8, fr_FR.UTF-8, etc.
If this is a locale definition problem, it wouldn't be surprising to see the same thing in all locales, since LC_COLLATE is defined via a common file: /usr/share/i18n/locales/iso14651_t1
To see if the locale files were at fault, I copied over the "iso14651_t1" file from my Gentoo machine (where sort works correctly), ran "locale-gen", and rebooted to be safe. But sort is still broken. Could there be some Ubuntu-specific glibc patch that is causing the different behavior?
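To check several locales in one go, for example after regenerating them, something like the following sketch could be used. The helper name `first_sorted` is hypothetical. It prints, for each locale given, the string that sort places first, so "aá" means the expected order and "áa" means the order reported in this bug. One caveat, stated as an assumption: a locale that is not actually installed may silently fall back to byte order and so look correct, so it is worth confirming with `locale -a` that each name exists.

```shell
#!/bin/sh
# Hypothetical helper: for each locale named on the command line, print the
# locale and the first line sort emits for the two test strings.
first_sorted() {
    for loc in "$@"; do
        first=$(printf 'aá\náa\n' | env LC_ALL="$loc" sort | head -n 1)
        printf '%s\t%s\n' "$loc" "$first"
    done
}

# The locales mentioned in this report.
first_sorted ga_IE.utf8 en_US.UTF-8 fr_FR.UTF-8
```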