This is the mail archive of the libc-alpha@sources.redhat.com mailing list for the glibc project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: Updating/adding locale for Ethiopia and Eritrea in GNU libc.

From: Petter Reinholdtsen <pere at hungry dot com>
To: Daniel Yacob <yacob at geez dot org>
Cc: libc-alpha at sources dot redhat dot com
Date: 01 May 2003 14:18:59 +0200
Subject: Re: Updating/adding locale for Ethiopia and Eritrea in GNU libc.
References: <E198pJW-0004pT-00@geez.org>
Reply-by: Tue, 1 Jan 1801 04:37:40 +1000

[Daniel Yacob]
> Traditionally there is no AM or PM.  A full sentence expression is
> used to specify the time, I've used an abbreviated form.  The word
> used for AM has 4 spellings, I thought I had the canonical form but
> did not.  Most people will confuse this.

Are you aware that you can skip the AM/PM value, and that this is
taken as an indication that the locale should use 24-hour time value?
At least I believe this is how it work.

> I dug up my a draft spec I have for the locales and looked into the
> sorting issue also.  This is a good reference if you don't have a
> copy:
> 
>   http://yacob.org/ISO-14652.pdf

Thanks.  I printed it.  Are you aware of
<URL:http://std.dkuug.dk/jtc1/sc22/wg20/>?

> I did get the multichar sequence to sort right for Oromo, also the
> Amharic problem is 95% fixed:
> 
>   http://yacob.org/am-et-update.tar.tgz

I tested this, and it seem to work.

> The only oddity is that english softs with "aAbBcC" but I think that
> is what is being set in file iso14651_t1 file.  I get the same order
> when I set LC_COLLATION=en_US.

OK.  Are you sure lower case should come before capital letters?  In
Norwegian it is the other way around.

> I'll play with the collation updates more on the weekend, set the
> symbols for punctuation and numbers, perhaps add one of the academic
> collation orders also.  I try to get new locales submitted for glibc
> by the end of next week.  Let me know if you have any comments.

Sounds good.  I did not understand why you added whitespace (<U0020>)
to the entries.  Are you sure this is a good idea?

And you should add charset information to the locale to match the
current version in glibc CVS.

--- am_ET.orig  Thu May  1 14:07:57 2003
+++ am_ET       Thu May  1 13:43:39 2003
@@ -2,6 +2,7 @@
 escape_char     /
 % Amharic language locale for Ethiopia.
 % Contributed by Daniel Yacob <yacob@geez.org>
+% Charset: UTF-8

 LC_IDENTIFICATION
 title      "Amharic language locale for Ethiopia"


Did you check the LC_NUMERIC and LC_MONETARY sections?  How are
numbers and currency values written in Amharic?  I do not think there
should be space at the end of int_curr_symbol.  At the very least, it
should use NO-BREAK SPACE (<U00A0>).

How are the following values written as numbers and currency (using
both $ and ETB) in Amharic?

  1
  -1
  1234.56
  -1234.56
  1234567.89
  -1234567.89
  12.3456789
  -12.3456789

In Norwegian, this would look like this:

  1
  kr 1,00
  NOK 1,00

  -1
  kr -1,00
  NOK -1,00

  1 234,56
  kr 1 234,56
  NOK 1 234,56

And so on...  Do you have a similar list for the languages you are
working on?  I want to add it to the test scripts in glibc.

(I think we should move this discussion to libc-alpha, to get input
from the glibc maintainers.  I copy this reply there.)

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]