This is the mail archive of the cygwin-developers mailing list for the Cygwin project.
Re: utf-8 and cygwin
- From: Corinna Vinschen <corinna-cygwin at cygwin dot com>
- To: cygwin-developers at cygwin dot com
- Date: Mon, 7 Jan 2008 16:14:27 +0100
- Subject: Re: utf-8 and cygwin
- References: <259271.87255.qm@web45404.mail.sp1.yahoo.com> <477687FA.5DDD38AF@dessent.net>
- Reply-to: cygwin-developers at cygwin dot com
On Dec 29 09:46, Brian Dessent wrote:
>
> > #1 is that the NT/XP limit is 32000 UTF-16 wide characters. Expanded to UTF-8, that makes the longest name 128k, so if you really want this to work for 32K-character names, PATH_MAX is going to have to be 128K.
>
> It doesn't make any sense to use UTF-8 in Cygwin. Nowhere in the Win32
> API or the Native API does any function take or output UTF-8, so there
> would be a useless conversion before calling *any* system function. The
> whole point of this painful conversion is to use the same encoding
> throughout in Cygwin as the operating system, namely UCS-2.
MSDN claims that Windows uses UTF-16.
Famous last words alarm: Most of the time this shouldn't matter, as
long as nobody uses musical notation or characters from dead languages
like Lycian or Pyu.
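
To make the UCS-2 vs. UTF-16 distinction concrete, here is a minimal
Python sketch (not Cygwin code; the helper names utf16_units, fits_ucs2
and worst_case_utf8_bytes are made up for illustration). It shows that a
character outside the Basic Multilingual Plane, such as a musical symbol
or a Lycian letter, takes two UTF-16 code units (a surrogate pair) and
so cannot be represented in UCS-2 at all. It also computes the tight
worst case for the sizing question quoted above: a single UTF-16 code
unit expands to at most 3 UTF-8 bytes, because a 4-byte UTF-8 sequence
always corresponds to two code units. The quoted 128K figure is thus a
safe but loose bound.

```python
def utf16_units(text):
    """Return the UTF-16 code units of text as a list of integers."""
    data = text.encode("utf-16-le")
    return [int.from_bytes(data[i:i + 2], "little")
            for i in range(0, len(data), 2)]


def fits_ucs2(text):
    """True if every character is in the BMP, i.e. needs one 16-bit unit."""
    return all(ord(ch) <= 0xFFFF for ch in text)


def worst_case_utf8_bytes(utf16_unit_count):
    """Upper bound on the UTF-8 size of a string of that many UTF-16 units.

    One BMP unit needs at most 3 bytes; a surrogate pair (2 units)
    needs 4 bytes, which is only 2 bytes per unit.
    """
    return 3 * utf16_unit_count


samples = {
    "e with acute (U+00E9)": "\u00e9",
    "euro sign (U+20AC)": "\u20ac",
    "musical G clef (U+1D11E)": "\U0001D11E",
    "Lycian letter A (U+10280)": "\U00010280",
}

for name, ch in samples.items():
    units = [hex(u) for u in utf16_units(ch)]
    print(name, "UTF-16 units:", units,
          "UTF-8 bytes:", len(ch.encode("utf-8")),
          "fits UCS-2:", fits_ucs2(ch))

print("worst case for 32000 units:", worst_case_utf8_bytes(32000), "bytes")
```

Code that treats each 16-bit unit as one character will work for the
first two samples and miscount the last two, which is exactly the case
described above.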
Corinna
--
Corinna Vinschen Please, send mails regarding Cygwin to
Cygwin Project Co-Leader cygwin AT cygwin DOT com
Red Hat