[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: bites assumed set mid UTF-8
From: |
Kenichi Handa |
Subject: |
Re: bites assumed set mid UTF-8 |
Date: |
Tue, 07 Mar 2006 13:52:33 +0900 |
User-agent: |
SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.2 Emacs/22.0.50 (i686-pc-linux-gnu) MULE/5.0 (SAKAKI) |
In article <87r75fignl.fsf@jidanni.org>, Dan Jacobson <jidanni@jidanni.org>
writes:
> Bad bad bad. Emacs 21.4.1 shows the same Chinese character ("nuclear")
> even though the second bit string is not valid UTF-8. Cc'd Handa.
> 11100110 10100000 10111000
> 11100110 00100000 10111000
Thank you for the report. It is already fixed in the latest
CVS code. With it, the second byte sequence (invalid utf-8)
is decoded into "\346 \270" (i.e. 8-bit-char #xe6, SPC,
8-bit-char #xb8).
---
Kenichi Handa
handa@m17n.org