bug-gnu-emacs
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: charsets and character sets (was: Re: 21.1: list-charset-chars)


From: Richard Stallman
Subject: Re: charsets and character sets (was: Re: 21.1: list-charset-chars)
Date: Wed, 20 Feb 2002 15:13:54 -0700 (MST)

I added this section to mule.texi.  Thanks.  Any comments?

@node Charsets
@section Charsets
@cindex charsets

  Emacs groups all supported characters into disjoint @dfn{charsets}.
Each character code belongs to one and only one charset.  For
historical reasons, Emacs typically divides an 8-bit character code
for an extended version of ASCII into two charsets: ASCII, which
covers the codes 0 through 127, plus another charset which covers the
``right-hand part'' (the codes 128 and up).  For instance, the
characters of Latin-1 include the Emacs charset @code{ascii} plus the
Emacs charset @code{latin-iso8859-1}.

  Emacs characters belonging to different charsets may look the same,
but they are still different characters.  For example, the letter
@samp{o} with acute accent in charset @code{latin-iso8859-1}, used for
Latin-1, is different from the letter @samp{o} with acute accent in
charset @code{latin-iso8859-2}, used for Latin-2.

@findex list-charset-chars
@cindex characters in a certain charset
@findex describe-character-set
  There are two commands for obtaining information about Emacs
charsets.  The command @kbd{M-x list-charset-chars} prompts for a name
of a character set, and displays all the characters in that character
set.  The command @kbd{M-x describe-character-set} prompts for a
charset name and displays information about that charset, including
its internal representation within Emacs.

  To find out which charset a character in the buffer belongs to,
put point before it and type @kbd{C-u C-x =}.



reply via email to

[Prev in Thread] Current Thread [Next in Thread]