help-gnu-emacs
Character sets and encodings confusion


From: Otto Maddox
Subject: Character sets and encodings confusion
Date: Fri, 11 Jan 2008 14:26:29 +0000

When I type `C-u C-x =' on the character `£', I get
something like this:

  character: £ (2211, #o4243, #x8a3, U+00A3)
    charset: latin-iso8859-1
             (Right-Hand Part of Latin Alphabet 1 (ISO/IEC 8859-1): ISO-IR-100.)
 code point: #x23
     syntax: w  which means: word
   category: l:Latin
buffer code: #x81 #xA3
  file code: #xA3 (encoded by coding system iso-latin-1)
    display: by this font (glyph code)
     -apple-monaco-medium-r-normal--13-130-72-72-m-130-iso10646-1 (#xA3)

Why is the code point #x23?  Should it not be #xA3 in Latin Alphabet 1?
Clicking on the #x23 brings up a character list that shows the code
point as #xA3, which is confusing.
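
One possible reading, offered as an assumption rather than a confirmed
answer: the charset `latin-iso8859-1' seems to cover only the right-hand
half of Latin Alphabet 1 and to number its positions with 7 bits, so the
high bit of #xA3 is dropped.  The arithmetic can be checked with a small
Python sketch (the function name is hypothetical, not anything from Emacs):

```python
def right_half_position(byte: int) -> int:
    """Clear the high bit of an 8-bit Latin-1 byte, leaving a 7-bit position."""
    return byte & 0x7F


# #xA3 is the 8-bit file code reported for the pound sign.
print(hex(right_half_position(0xA3)))
```

If that reading is right, #x23 and #xA3 name the same character, once
as a 7-bit position within the charset and once as the 8-bit byte.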

Also, what are the first three numbers in parentheses on the
`character:' line?  Are they code points of some charset?  (I
understand that the fourth number is a Unicode code point.)
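
A quick check suggests that the first three numbers are a single value
written in decimal, octal, and hexadecimal.  That the value is Emacs's
internal character code is an assumption here, not something the output
states.  A minimal Python sketch of the conversion:

```python
# The decimal number from the `character:' line.
internal_code = 2211

# Render the same integer in octal and hexadecimal, without prefixes.
as_octal = format(internal_code, "o")
as_hex = format(internal_code, "x")

print(internal_code, as_octal, as_hex)
```

The octal and hexadecimal strings can be compared directly with the
#o4243 and #x8a3 shown by `C-u C-x ='.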

-- 
  Otto Maddox
  ottomaddox@fastmail.fm
