|
From: | Alexander Malmberg |
Subject: | Re: Getting UTF-8 value of string occasionally fails |
Date: | Wed, 13 Oct 2004 14:27:24 +0200 |
User-agent: | Mozilla Thunderbird 0.8 (X11/20040918) |
Christopher Culver wrote: [snip]
An example is U+D800 <Non Private Use High Surrogate, First>.
This is expected. A surrogate code point (0xd800-0xdfff, iirc) is not a valid character. Surrogates are used in pairs to encode characters above 0xffff in utf-16.
- Alexander Malmberg
[Prev in Thread] | Current Thread | [Next in Thread] |