This is the mail archive of the
xsl-list@mulberrytech.com
mailing list .
Re: XSL and international characters
- From: Jirka Kosek <jirka at kosek dot cz>
- To: xsl-list at lists dot mulberrytech dot com
- Date: Wed, 05 Dec 2001 09:19:56 +0100
- Subject: Re: [xsl] XSL and international characters
- References: <00a301c17d1a$df003900$2100a8c0@swiftnet.tec>
- Reply-to: xsl-list at lists dot mulberrytech dot com
Chris Bayes wrote:
>
> And! What does it say?
> If you want me to stare at another w3 spec for hours you will have to
> wait until tomorrow evening
It states that international characters should be encoded in UTF-8 in
URLs. Byte sequences of UTF-8 are writen as %xx in URLs. This means that
%C5%82 is character which is in UTF-8 represented by sequence of two
bytes C5 82. Hower this doesn't represents Unicode character U+C582, but
U+0142 (latin small letter l with stroke -- ł).
Only problem is that many applications treat URL not as UTF-8 encoded,
but ISO-8859-1 encoded.
Jirka
--
-----------------------------------------------------------------
Jirka Kosek
e-mail: jirka@kosek.cz
http://www.kosek.cz
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list