Re: Character sets and encodings confusion

help-gnu-emacs

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Character sets and encodings confusion

From:	Jason Rumney
Subject:	Re: Character sets and encodings confusion
Date:	Fri, 11 Jan 2008 08:28:34 -0800 (PST)
User-agent:	G2/1.0

On 11 Jan, 14:26, "Otto Maddox" <ottomad...@fastmail.fm> wrote:
> When I type `C-u C-x =' on the character `£', ...

> Why is the code point #x23?  Should it not be #xA3 in Latin Alphabet 1?

The clue is in the following:

>     charset: latin-iso8859-1
>              (Right-Hand Part of Latin Alphabet 1 (ISO/IEC 8859-1): 
> ISO-IR-100.)

Note that the latin-iso8859-1 charset only includes the Right-Hand
part (0x80-0xff).

> Because when you click on the #x23, the character list you get shows
> the code point as being #xA3, which is confusing.

It is confusing, but the table displayed is listed as the *coded*
charset, so it has the +0x80 transformation applied.

> Also, what are the first three numbers in parenthesis on the
> `character:' line?

They are the code-point in the internal encoding (emacs-mule in the
current version) in decimal, octal and hexadecimal.

[Prev in Thread]

Current Thread

[Next in Thread]

Character sets and encodings confusion, Otto Maddox, 2008/01/11
- Re: Character sets and encodings confusion, Eli Zaretskii, 2008/01/11
- Re: Character sets and encodings confusion, Jason Rumney <=

Prev by Date: Re: Character sets and encodings confusion
Next by Date: Anything like this exist already? (buffer name intelligence)
Previous by thread: Re: Character sets and encodings confusion
Next by thread: Anything like this exist already? (buffer name intelligence)
Index(es):
- Date
- Thread