bug-gnu-libiconv
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[bug-gnu-libiconv] Invalid characters when converting from utf8 to iso-8


From: Tom Sorensen
Subject: [bug-gnu-libiconv] Invalid characters when converting from utf8 to iso-8859-15
Date: Wed, 17 Mar 2021 11:30:59 -0400

Note -- this isn't just -15, but -1 as well, and possibly others.

I have a utf8 text file that contains <c2 98> and <c2 80>. When converted to iso-8859-15 via:
iconv -c -f utf8 -t iso_8859-15//IGNORE input > output

The resulting file contains characters x98 and x80. These are considered invalid by some programs that expect iso8859-15 encoding -- including iconv itself.

Running the file through iconv a second time results in an error, or in the characters being removed if the -c or //IGNORE options are used.

As such, shouldn't the characters be dropped in the original conversion?

--
Tom Sorensen


reply via email to

[Prev in Thread] Current Thread [Next in Thread]