[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: UCS-2BE
From: |
Kenichi Handa |
Subject: |
Re: UCS-2BE |
Date: |
Fri, 01 Sep 2006 20:28:02 +0900 |
User-agent: |
SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.2 Emacs/22.0.50 (i686-pc-linux-gnu) MULE/5.0 (SAKAKI) |
In article <address@hidden>, Andreas Schwab <address@hidden> writes:
> The above quote is talking about "coded in N octets". If that's not about
> serialisation, what else is it?
To my understanding, it means 8*N bits here, and the wording
"UCS-4 (Universal Character Set coded in 4 octets)" is just
for explaining from where the the literal "UCS-4" comes.
See this description in C.2.
"As a consequence, UCS-4 can now be taken effectively as an
alias for the Unicode encoding form UTF-32, ..."
So, apparently UCS-4 is CEF here.
By the way, Unicode itself is confusing in names. For
instance, UTF-32 means both "UTF-32 encoding form (CEF)" and
"UTF-32 encoding scheme (CES)". Unicode 4.1 says:
"For historical reasons, the Unicode encoding schemes are
also referred to as Unicode (or UCS) transformation formats
(UTF). That term is, however, ambiguous between its usage
for encoding forms and encoding schemes."
---
Kenichi Handa
address@hidden
- Re: UCS-2BE, Andreas Schwab, 2006/09/01
- Re: UCS-2BE,
Kenichi Handa <=