This is the mail archive of the libc-locales@sourceware.org mailing list for the GNU libc locales project.



Re: Character classifications and language-dependence


On Sat, Sep 16, 2006 at 04:20:58PM +0200, Ludovic Courtès wrote:
> Hi,
> 
> Keld Jørn Simonsen <keld@dkuug.dk> writes:
> 
> >> Initially, I was just wondering whether this broad and (to some extent)
> >> language-independent character classification is glibc-specific, or
> >> whether it is following some standard or recommendation.
> >
> > AFAIK glibc follows the ISO 14652 recommendations, which are essentially
> > the same as what Unicode advocates: that all the letters of the different
> > scripts, and also the ideographs, are considered to belong to class
> > alpha.
> 
> So perhaps the ISO 14652 paragraph about the "i18n" FDCC-set that I
> quoted in my first message should be interpreted as a recommendation to
> include "i18n" in all locales?  Is that what you meant?

Yes, that is a recommendation.

> If this is the case, the language-independent character classification
> found in glibc is not glibc-specific but standard-conforming.

Yes, kind of. 14652 is a TR (Technical Report), not a standard, so there
is no formal conformance to claim.

best regards
keld

