This is the mail archive of the glibc-bugs@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

[Bug libc/11418] New: iconv/gconv: "illegal input sequence at position"/incomplete implementation


Doing a conversion from HP to Linux (Red Hat/Ubuntu) shows some encodings on
Linux are incomplete. Or at at least they show unexpected behaviour.

Using "\xc3\xbc\x73", which is a UTF-8 string of "u umlaut" followed by an "s".

Doing the following works
printf "\xc3\xbc\x73" | iconv -f utf8 -t ISO-8859-15

Doing the following doesn't work 
printf "\xc3\xbc\x73" | iconv -f utf8 -t EUC-KR
and output "iconv: illegal input sequence at position 0"

While following works:
printf "\xc3\xbc\x73" | iconv -f utf8 -t EUC-CN

On HP-UX all of the above generate proper output. Since UTF-8 is used as input
in all cases it seems strange iconv/gconv thinks the input is wrong (errno 84)
in the EUC-KR case. Converting to US-ASCII has the same problem as converting to
EUC-KR.

-- 
           Summary: iconv/gconv: "illegal input sequence at
                    position"/incomplete implementation
           Product: glibc
           Version: unspecified
            Status: NEW
          Severity: normal
          Priority: P2
         Component: libc
        AssignedTo: drepper at redhat dot com
        ReportedBy: svenboden at hotmail dot com
                CC: glibc-bugs at sources dot redhat dot com


http://sourceware.org/bugzilla/show_bug.cgi?id=11418

------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]