Re: [Qemu-devel] KVM call minutes for Feb 15

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-devel] KVM call minutes for Feb 15

From:	Avi Kivity
Subject:	Re: [Qemu-devel] KVM call minutes for Feb 15
Date:	Thu, 17 Feb 2011 15:25:21 +0200
User-agent:	Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.2.13) Gecko/20101209 Fedora/3.1.7-0.35.b3pre.fc14 Lightning/1.0b3pre Thunderbird/3.1.7

On 02/17/2011 03:10 PM, Anthony Liguori wrote:

On 02/17/2011 06:23 AM, Avi Kivity wrote:
On 02/17/2011 02:12 PM, Anthony Liguori wrote:
(btw what happens in a non-UTF-8 locale? I guess we should justreject unencodable strings).
While QEMU is mostly ASCII internally, for the purposes of the JSONparser, we always encode and decode UTF-8. We reject invalid UTF-8sequences. But since JSON is string-encoded unicode, we can alwaysdecode a JSON string to valid UTF-8 as long as the string is wellformed.
That is wrong. If the user passes a Unicode filename it is expectedto be translated to the current locale encoding for the purpose of,say, filename lookup.
QEMU does not support anything but UTF-8.


Since when?

AFAICT, JSON string conversion is the only place where there is anydependency on UTF-8. Anything else should just work.

That's pretty common with Unix software. I don't think any modernUnix platform actually uses UCS2 or UTF-16. It's either ascii or UTF-8.

Most/all Linux distributions support UTF-8 as well as a zillion otherencodings (single-byte ASCII + another charset, or multi-byte charsetsfor languages with many characters.

The only place it even matters is Windows and Windows has ASCII andUTF-16 versions of their APIs. So on Windows, non-ASCII characterswon't be handled correctly (yet another one of the many issues withWindows support in QEMU). UTF-8 is self-recovering though so itdegrades gracefully.

It matters on Linux with el_GR.iso88597, for example. If you feed aJSON string and translate it blindly to UTF-8, you'll get garbage whenyou feed it to system calls.

Practically everyone uses UTF-8 these days, so the impact is minimal,but it is more correct (as well as simpler) to ask the system librariesto encode using the current locale.


--
error compiling committee.c: too many arguments to function

[Prev in Thread]

Current Thread

[Next in Thread]

[Qemu-devel] KVM call minutes for Feb 15, Chris Wright, 2011/02/15
- Re: [Qemu-devel] KVM call minutes for Feb 15, Anthony Liguori, 2011/02/15
  - Re: [Qemu-devel] KVM call minutes for Feb 15, Avi Kivity, 2011/02/16
    - Re: [Qemu-devel] KVM call minutes for Feb 15, Anthony Liguori, 2011/02/16
    - Re: [Qemu-devel] KVM call minutes for Feb 15, Avi Kivity, 2011/02/17
    - Re: [Qemu-devel] KVM call minutes for Feb 15, Anthony Liguori, 2011/02/17
    - Re: [Qemu-devel] KVM call minutes for Feb 15, Avi Kivity, 2011/02/17
    - Re: [Qemu-devel] KVM call minutes for Feb 15, Anthony Liguori, 2011/02/17
    - Re: [Qemu-devel] KVM call minutes for Feb 15, Avi Kivity <=
    - Re: [Qemu-devel] KVM call minutes for Feb 15, Anthony Liguori, 2011/02/17
    - Re: [Qemu-devel] KVM call minutes for Feb 15, Peter Maydell, 2011/02/17
    - Re: [Qemu-devel] KVM call minutes for Feb 15, Anthony Liguori, 2011/02/17
    - Re: [Qemu-devel] KVM call minutes for Feb 15, Avi Kivity, 2011/02/17
    - Re: [Qemu-devel] KVM call minutes for Feb 15, Anthony Liguori, 2011/02/17
  - Re: [Qemu-devel] KVM call minutes for Feb 15, Amit Shah, 2011/02/16
    - Re: [Qemu-devel] KVM call minutes for Feb 15, Anthony Liguori, 2011/02/16
    - Re: [Qemu-devel] KVM call minutes for Feb 15, Amit Shah, 2011/02/17

Prev by Date: Re: [Qemu-devel] [PATCH REBASE/RESEND 1/4] qdev: Add a description field for qdev properties for documentation
Next by Date: Re: [Qemu-devel] KVM call minutes for Feb 15
Previous by thread: Re: [Qemu-devel] KVM call minutes for Feb 15
Next by thread: Re: [Qemu-devel] KVM call minutes for Feb 15
Index(es):
- Date
- Thread