Re: [bug-gnu-libiconv] 'ASMO-708' and 'ISO-8859-6' is same in libiconv
From: Bruno Haible
Date: Sun, 27 Aug 2023 05:17:22 +0200

> I found 'ASMO-708' and 'ISO-8859-6' is same in libiconv.
>
> But according to the description in Microsoft Learn
>
>
> https://learn.microsoft.com/en-US/windows/win32/intl/code-page-identifiers
>
> ASMO-708 corresponds to CP708 "Arabic (ASMO 708)"
> ISO-8859-6 corresponds to CP28596 "ISO 8859-6 Arabic"
>
> According to the description in wutils.com
>
> https://wutils.com/encodings/asmo-708
> https://wutils.com/encodings/iso-8859-6
>
> It can be seen that the encoding tables of the two are somewhat different.

Microsoft is not the main source of truth in the area of character
sets/encodings. The IANA registry [1] is the source of truth here: it is
normative.

> However, according to the definition in "lib/encodings.def" in libiconv,
> the two are the same encoding.

They are the same in GNU libiconv (and in GNU libc) because IANA [1]
says that ASMO-708 is an alias of ISO-8859-6.
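
The alias relationship can be checked empirically. The following is only a minimal sketch, and it rests on an assumption: it queries Python's built-in codec registry, which is a separate implementation from GNU libiconv and GNU libc, so it illustrates the aliasing in another implementation rather than libiconv's own behaviour. The variable names are illustrative.

```python
import codecs

# Assumption: this inspects Python's built-in codec registry, not GNU libiconv.
names = ["ASMO-708", "ISO-8859-6"]

# Resolve each name to the canonical codec name the registry maps it to.
resolved = {name: codecs.lookup(name).name for name in names}

# Decode every byte from 0x80 to 0xFF under both names.
# Bytes that are undefined in the table are replaced by U+FFFD.
high_bytes = bytes(range(0x80, 0x100))
decoded = {name: high_bytes.decode(name, errors="replace") for name in names}

same_codec = resolved["ASMO-708"] == resolved["ISO-8859-6"]
same_table = decoded["ASMO-708"] == decoded["ISO-8859-6"]
print(resolved, same_codec, same_table)
```

If both names resolve to the same codec, the two decoded strings are necessarily identical, which is what "alias" means in practice.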

I know that Windows has a CP28596 code page [2]. It is simply irrelevant.

Bruno

[1] https://www.iana.org/assignments/character-sets/character-sets.xhtml
[2] https://haible.de/bruno/charsets/conversion-tables/ISO-8859-6.html