[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: charsets and character sets (was: Re: 21.1: list-charset-chars)
From: |
Richard Stallman |
Subject: |
Re: charsets and character sets (was: Re: 21.1: list-charset-chars) |
Date: |
Wed, 20 Feb 2002 15:13:54 -0700 (MST) |
I added this section to mule.texi. Thanks. Any comments?
@node Charsets
@section Charsets
@cindex charsets
Emacs groups all supported characters into disjoint @dfn{charsets}.
Each character code belongs to one and only one charset. For
historical reasons, Emacs typically divides an 8-bit character code
for an extended version of ASCII into two charsets: ASCII, which
covers the codes 0 through 127, plus another charset which covers the
``right-hand part'' (the codes 128 and up). For instance, the
characters of Latin-1 include the Emacs charset @code{ascii} plus the
Emacs charset @code{latin-iso8859-1}.
Emacs characters belonging to different charsets may look the same,
but they are still different characters. For example, the letter
@samp{o} with acute accent in charset @code{latin-iso8859-1}, used for
Latin-1, is different from the letter @samp{o} with acute accent in
charset @code{latin-iso8859-2}, used for Latin-2.
@findex list-charset-chars
@cindex characters in a certain charset
@findex describe-character-set
There are two commands for obtaining information about Emacs
charsets. The command @kbd{M-x list-charset-chars} prompts for a name
of a character set, and displays all the characters in that character
set. The command @kbd{M-x describe-character-set} prompts for a
charset name and displays information about that charset, including
its internal representation within Emacs.
To find out which charset a character in the buffer belongs to,
put point before it and type @kbd{C-u C-x =}.