This is the mail archive of the
libc-locales@sourceware.org
mailing list for the GNU libc locales project.
Re: Character classifications and language-dependence
On Sat, Sep 16, 2006 at 04:20:58PM +0200, Ludovic Courtès wrote:
> Hi,
>
> Keld Jørn Simonsen <keld@dkuug.dk> writes:
>
> >> Initially, I was just wondering whether this broad and (to some extent)
> >> language-independent character classification is glibc-specific, or
> >> whether it is following some standard or recommendation.
> >
> > AFAIK glibc follows the ISO 14652 recommendations, which are essentially
> > the same as what Unicode advocates: that all the letters of the different
> > scripts, and also the ideographs, are considered to belong to class
> > alpha.
>
> So perhaps the ISO 14652 paragraph about the "i18n" FDCC-set that I
> quoted in my first message should be interpreted as a recommendation to
> include "i18n" in all locales? Is that what you meant?
Yes, that is a recommendation.
> If this is the case, the language-independent character classification
> found in glibc is not glibc-specific but standard-conforming.
Yes, kind of. ISO/IEC TR 14652 is a Technical Report rather than a standard, so there is no formal conformance to claim.
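
To see the language-independent classification in practice, something like the following rough sketch can be used. It only relies on the standard iswalpha() from <wctype.h>; the helper names in_alpha_class() and print_alpha_report() and the choice of sample code points are made up for illustration, and it assumes wchar_t holds Unicode code point values, as it does on glibc. The results depend on which locale is active, so no output is shown here.

```c
#include <locale.h>
#include <stdio.h>
#include <wchar.h>
#include <wctype.h>

/* Hypothetical helper: 1 if wc is in the current locale's "alpha" class,
   0 otherwise. */
int in_alpha_class(wint_t wc)
{
    return iswalpha(wc) != 0;
}

/* Hypothetical helper: print the alpha classification of a few sample
   characters from different scripts. Assumes wchar_t values are Unicode
   code points (true on glibc). */
void print_alpha_report(FILE *out)
{
    static const struct { wint_t wc; const char *name; } samples[] = {
        { 0x0061, "LATIN SMALL LETTER A" },
        { 0x00E9, "LATIN SMALL LETTER E WITH ACUTE" },
        { 0x03B1, "GREEK SMALL LETTER ALPHA" },
        { 0x0416, "CYRILLIC CAPITAL LETTER ZHE" },
        { 0x4E2D, "CJK UNIFIED IDEOGRAPH-4E2D" },
        { 0x0031, "DIGIT ONE" },
    };
    size_t i;

    /* Use the LC_CTYPE category from the environment (LANG, LC_ALL, ...). */
    if (setlocale(LC_CTYPE, "") == NULL)
        fprintf(stderr, "warning: could not set locale from environment\n");

    for (i = 0; i < sizeof samples / sizeof samples[0]; i++)
        fprintf(out, "U+%04lX %-32s alpha=%d\n",
                (unsigned long)samples[i].wc, samples[i].name,
                in_alpha_class(samples[i].wc));
}
```

Calling print_alpha_report(stdout) under two different UTF-8 locales (for instance by changing LC_ALL before running) is one way to check whether the alpha class differs between them on a given system.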
Best regards,
keld