This is the mail archive of the glibc-bugs@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

[Bug libc/2648] localedata/locales/es_ES has incorrect LC_COLLATE <space> handling


------- Additional Comments From mfabian at suse dot de  2006-05-09 16:00 -------
Original comment in the Novell bugzilla:  When LC_COLLATE=es_ES, the sort command ignores spaces in its sorting algorithm, so it sorts         MAS PUJADAS, FRANCESC after         MASOLIVER GARCIA, JAIME  instead of before, even though the comments in /usr/share/i18n/locales/es_ES indicate that the sorting algorithm for this locales should take spaces into account (and sort them before punctuation characters, numbers and letters).  This spanish customer is not using LC_COLLATE="POSIX" because the sort  command gives incorrect results when dealing with characters with spanish  accents so he has to use LC_COLLATE="es_ES.UTF-8" which is ignoring spaces.  Even /usr/share/i18n/locales/es_ES states:  LC_COLLATE  % Base collation scheme: 1994-03-22  % Ordering algorithm: %  1. Spaces and hyphen (but not soft %     hyphen) before punctuation %     characters, punctuation characters %     before numbers, %     numbers before letters.  I also tested it with every other language setting and the results are always the same:  mortlach:~ # export LC_COLLATE="POSIX" mortlach:~ # sort demo  AB CDESY ABC DETZ ABCD ETX  mortlach:~ # export LC_COLLATE="en_GB.UTF-8" mortlach:~ # sort demo  AB CDESY ABCD ETX ABC DETZ  mortlach:~ # export LC_COLLATE="de_DE.UTF-8" mortlach:~ # sort demo  AB CDESY ABCD ETX ABC DETZ  So the question is why LC_COLLATE="POSIX" behaves differently to any other language setting, if this is a feature where is it documented and why is it so? It doesn't make sence that LC_COLLATE="POSIX" behaves different to the English settings (UK & US) which on the other hand behave exactly the same way as any other language setting so there must be a reason why this is so

-- 


http://sourceware.org/bugzilla/show_bug.cgi?id=2648

------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]