This is the mail archive of the
glibc-bugs@sourceware.org
mailing list for the glibc project.
[Bug libc/2648] localedata/locales/es_ES has incorrect LC_COLLATE <space> handling
- From: "mfabian at suse dot de" <sourceware-bugzilla at sourceware dot org>
- To: glibc-bugs at sources dot redhat dot com
- Date: 9 May 2006 16:00:00 -0000
- Subject: [Bug libc/2648] localedata/locales/es_ES has incorrect LC_COLLATE <space> handling
- References: <20060509154921.2648.mfabian@suse.de>
- Reply-to: sourceware-bugzilla at sourceware dot org
------- Additional Comments From mfabian at suse dot de 2006-05-09 16:00 -------
Original comment in the Novell bugzilla: When LC_COLLATE=es_ES, the sort command ignores spaces in its sorting algorithm, so it sorts MAS PUJADAS, FRANCESC after MASOLIVER GARCIA, JAIME instead of before, even though the comments in /usr/share/i18n/locales/es_ES indicate that the sorting algorithm for this locales should take spaces into account (and sort them before punctuation characters, numbers and letters). This spanish customer is not using LC_COLLATE="POSIX" because the sort command gives incorrect results when dealing with characters with spanish accents so he has to use LC_COLLATE="es_ES.UTF-8" which is ignoring spaces. Even /usr/share/i18n/locales/es_ES states: LC_COLLATE % Base collation scheme: 1994-03-22 % Ordering algorithm: % 1. Spaces and hyphen (but not soft % hyphen) before punctuation % characters, punctuation characters % before numbers, % numbers before letters. I also tested it with every other language setting and the results are always the same: mortlach:~ # export LC_COLLATE="POSIX" mortlach:~ # sort demo AB CDESY ABC DETZ ABCD ETX mortlach:~ # export LC_COLLATE="en_GB.UTF-8" mortlach:~ # sort demo AB CDESY ABCD ETX ABC DETZ mortlach:~ # export LC_COLLATE="de_DE.UTF-8" mortlach:~ # sort demo AB CDESY ABCD ETX ABC DETZ So the question is why LC_COLLATE="POSIX" behaves differently to any other language setting, if this is a feature where is it documented and why is it so? It doesn't make sence that LC_COLLATE="POSIX" behaves different to the English settings (UK & US) which on the other hand behave exactly the same way as any other language setting so there must be a reason why this is so
--
http://sourceware.org/bugzilla/show_bug.cgi?id=2648
------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.