[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: u32_normalize UNINORM_NFKC on 0xD800
From: |
Bruno Haible |
Subject: |
Re: u32_normalize UNINORM_NFKC on 0xD800 |
Date: |
Fri, 27 May 2011 19:42:42 +0200 |
User-agent: |
KMail/1.9.9 |
Simon Josefsson wrote:
> I'm calculating this IDNA2008 property
>
> toNFKC(toCaseFold(toNFKC(cp))) != cp
>
> for all code points.
It makes no sense to consider non-character code points here. Citing again
the Unicode standard, chapter 3 [1], section 3.8:
"High-surrogate and low-surrogate code units are used only in the context
of the UTF-16 character encoding form."
> Is this impossible to do with the u32_normalize interface?
Yes, you are passing invalid input to the u32_normalize function.
Bruno
[1] http://www.unicode.org/versions/Unicode6.0.0/ch03.pdf