This is the mail archive of the
glibc-bugs@sourceware.org
mailing list for the glibc project.
[Bug libc/11418] New: iconv/gconv: "illegal input sequence at position"/incomplete implementation
- From: "svenboden at hotmail dot com" <sourceware-bugzilla at sourceware dot org>
- To: glibc-bugs at sources dot redhat dot com
- Date: 22 Mar 2010 21:14:40 -0000
- Subject: [Bug libc/11418] New: iconv/gconv: "illegal input sequence at position"/incomplete implementation
- Reply-to: sourceware-bugzilla at sourceware dot org
Doing a conversion from HP to Linux (Red Hat/Ubuntu) shows some encodings on
Linux are incomplete. Or at at least they show unexpected behaviour.
Using "\xc3\xbc\x73", which is a UTF-8 string of "u umlaut" followed by an "s".
Doing the following works
printf "\xc3\xbc\x73" | iconv -f utf8 -t ISO-8859-15
Doing the following doesn't work
printf "\xc3\xbc\x73" | iconv -f utf8 -t EUC-KR
and output "iconv: illegal input sequence at position 0"
While following works:
printf "\xc3\xbc\x73" | iconv -f utf8 -t EUC-CN
On HP-UX all of the above generate proper output. Since UTF-8 is used as input
in all cases it seems strange iconv/gconv thinks the input is wrong (errno 84)
in the EUC-KR case. Converting to US-ASCII has the same problem as converting to
EUC-KR.
--
Summary: iconv/gconv: "illegal input sequence at
position"/incomplete implementation
Product: glibc
Version: unspecified
Status: NEW
Severity: normal
Priority: P2
Component: libc
AssignedTo: drepper at redhat dot com
ReportedBy: svenboden at hotmail dot com
CC: glibc-bugs at sources dot redhat dot com
http://sourceware.org/bugzilla/show_bug.cgi?id=11418
------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.