[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: address@hidden: [patch] url-hexify-string does not follow W3C spec]
From: |
Kenichi Handa |
Subject: |
Re: address@hidden: [patch] url-hexify-string does not follow W3C spec] |
Date: |
Tue, 01 Aug 2006 16:14:30 +0900 |
User-agent: |
SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.2 Emacs/22.0.50 (i686-pc-linux-gnu) MULE/5.0 (SAKAKI) |
In article <address@hidden>, Stefan Monnier <address@hidden> writes:
>>> What incompatibility? If the string only contains ASCII and
>>> eight-bit-*, then encoding it with utf-8 will return the same string
>>> of bytes (except in a unibyte string rather than multibyte string).
>> Here's an example:
>> (encode-coding-string "\x80" 'utf-8)
>> => "\302\200"
> Duh! Looks like a serious bug to me.
> Handa-san, what's up with that?
??? \x80 == U+0080 is a valid Unicode character in "C1
Controls" block.
However, I agree that the following is very questionable
behaviour:
>> (encode-coding-string (string-as-unibyte "\x80") 'utf-8)
>> => "\302\200"
But, that is a long standing problem, and should be fixed
(if necessary) after the release.
---
Kenichi Handa
address@hidden
Re: address@hidden: [patch] url-hexify-string does not follow W3C spec], YAMAMOTO Mitsuharu, 2006/08/01