Re: bites assumed set mid UTF-8

bug-gnu-emacs

From:	Kenichi Handa
Subject:	Re: bites assumed set mid UTF-8
Date:	Tue, 07 Mar 2006 13:52:33 +0900
User-agent:	SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.2 Emacs/22.0.50 (i686-pc-linux-gnu) MULE/5.0 (SAKAKI)

In article <87r75fignl.fsf@jidanni.org>, Dan Jacobson <jidanni@jidanni.org> 
writes:

> Bad bad bad. Emacs 21.4.1 shows the same Chinese character ("nuclear")
> even though the second bit string is not valid UTF-8. Cc'd Handa.
> 11100110 10100000 10111000
> 11100110 00100000 10111000

Thank you for the report.  This is already fixed in the latest
CVS code, which decodes the second byte sequence (invalid
UTF-8) into "\346 \270" (i.e. 8-bit-char #xe6, SPC,
8-bit-char #xb8).
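
To illustrate why the second sequence is invalid, here is a minimal
sketch in Python (not Emacs code).  It checks the two byte sequences
from the report and decodes the invalid one while keeping the stray
bytes, which is roughly analogous to the 8-bit-char result above.
The helper names `is_continuation` and `decode_keeping_raw_bytes` are
hypothetical, and Python's "surrogateescape" handler is only a
stand-in for Emacs's raw-byte representation.

```python
# The two byte sequences from the report, written as in the original bits.
VALID = bytes([0b11100110, 0b10100000, 0b10111000])    # E6 A0 B8
INVALID = bytes([0b11100110, 0b00100000, 0b10111000])  # E6 20 B8


def is_continuation(byte: int) -> bool:
    """Return True if the byte has the 10xxxxxx form of a UTF-8 continuation byte."""
    return byte & 0b11000000 == 0b10000000


def decode_keeping_raw_bytes(data: bytes) -> str:
    """Decode UTF-8, mapping each undecodable byte to a lone surrogate.

    This is Python's "surrogateescape" error handler, used here as an
    assumption-laden analogue of Emacs's 8-bit characters.
    """
    return data.decode("utf-8", errors="surrogateescape")


if __name__ == "__main__":
    # 0xE6 is a three-byte lead, so the next two bytes must be continuations.
    print([is_continuation(b) for b in VALID[1:]])    # both continuation bytes
    print([is_continuation(b) for b in INVALID[1:]])  # 0x20 is not one

    print(hex(ord(VALID.decode("utf-8"))))            # code point of the character

    try:
        INVALID.decode("utf-8")                       # strict decoding rejects it
    except UnicodeDecodeError as err:
        print("invalid UTF-8:", err.reason)

    # The stray bytes survive and can be written back unchanged.
    text = decode_keeping_raw_bytes(INVALID)
    print(text.encode("utf-8", errors="surrogateescape") == INVALID)
```

The point of the check is that 00100000 (SPC) does not start with the
bits 10, so it cannot continue the sequence begun by 11100110.  A
decoder that ignores those top two bits will wrongly produce the same
character for both sequences.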

---
Kenichi Handa
handa@m17n.org
