[bug-gnu-libiconv] Converting ks_c_5601-1987 as CP949
From: Anders Thøgersen
Subject: [bug-gnu-libiconv] Converting ks_c_5601-1987 as CP949
Date: Tue, 26 Feb 2013 11:02:39 GMT
User-agent: One.com webmail 2.1.17
Hi,
I am converting emails to UTF-8 and ran into a Korean email that failed to convert.
I ran the command:
cat korean-text.txt | iconv -f 'ks_c_5601-1987' -t 'UTF-8'
but the output was not correct and contained a number of ▣ and ♥ symbols which were
not there in the original text. After some googling I discovered that I can use
cat korean-text.txt | iconv -f 'CP949' -t 'UTF-8'
instead, and this works well: my text is now correct.
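The workaround above could be wrapped in a small helper so that the label from the mail header is rewritten before iconv is called. This is only a sketch: the function names map_charset and to_utf8 are made up for illustration, and it assumes that mail labelled ks_c_5601-1987 is really CP949, which is true for my sample but not something I have verified in general.

```shell
#!/bin/sh
# Hypothetical helpers, not part of iconv.

# map_charset LABEL
# Print the charset name to hand to iconv. The label is matched
# case-insensitively; anything not listed is passed through unchanged.
map_charset() {
    case $(printf '%s' "$1" | tr '[:upper:]' '[:lower:]') in
        # Assumption: text carrying this label is actually CP949.
        ks_c_5601-1987) printf '%s\n' 'CP949' ;;
        *)              printf '%s\n' "$1" ;;
    esac
}

# to_utf8 LABEL
# Convert standard input to UTF-8 using the mapped charset name.
to_utf8() {
    iconv -f "$(map_charset "$1")" -t 'UTF-8'
}
```

With these defined, the conversion would be run as: to_utf8 'ks_c_5601-1987' < korean-text.txt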
However, I do not get a warning such as "Conversion from ks_c_5601-1987 not supported", which
suggests that this character set is supported, at least partially, although it is not mentioned in the
output of iconv -l. I am a bit confused. Maybe there are characters of ks_c_5601-1987 that
are not covered by CP949?
Also, I am wondering if I should be aware of other aliases when decoding. Should I, in general,
refer to character sets by their code page name when converting to UTF-8?
Thanks for any help.
Best Regards
Anders