Re: Use the Unicode replacement character for replacing unencodable char
From: Mattias Engdegård
Subject: Re: Use the Unicode replacement character for replacing unencodable characters into UTF-16
Date: Tue, 18 Aug 2020 21:43:45 +0200
On 18 Aug 2020, at 20:13, Eli Zaretskii <eliz@gnu.org> wrote:
> My reading is that this happens only for codepoints beyond 0x10ffff.
> Raw bytes end up there, but I'm not sure they always end up there.
> Characters that aren't unified also end up there.
As far as I can tell, raw bytes (in the [128,255] range) are always replaced,
even when the input is a unibyte string.
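
To make the rule under discussion concrete, here is a rough Python model of it, not the Emacs encoder itself. It assumes the behaviour described in the quoted text: any code above 0x10FFFF is written as U+FFFD. It also relies on Emacs representing raw bytes 0x80..0xFF internally as characters 0x3FFF80..0x3FFFFF, which puts them above that limit. The names `raw_byte_char` and `encode_utf16le` are made up for this sketch.

```python
REPLACEMENT = 0xFFFD      # U+FFFD REPLACEMENT CHARACTER
MAX_UNICODE = 0x10FFFF    # highest valid Unicode code point
RAW_BYTE_BASE = 0x3FFF00  # Emacs keeps raw byte B (0x80..0xFF) as 0x3FFF00 + B


def raw_byte_char(byte):
    """Return the internal character code modelling a raw byte (hypothetical helper)."""
    if not 0x80 <= byte <= 0xFF:
        raise ValueError("raw bytes are in the range [128, 255]")
    return RAW_BYTE_BASE + byte


def encode_utf16le(codepoints):
    """Encode character codes as UTF-16LE, replacing codes above 0x10FFFF.

    Simplified model: lone surrogate codes are not treated specially here.
    """
    out = bytearray()
    for cp in codepoints:
        if cp > MAX_UNICODE:
            # Assumed rule: unencodable codes become U+FFFD.
            cp = REPLACEMENT
        if cp >= 0x10000:
            # Supplementary planes need a surrogate pair.
            v = cp - 0x10000
            units = [0xD800 | (v >> 10), 0xDC00 | (v & 0x3FF)]
        else:
            units = [cp]
        for unit in units:
            out += unit.to_bytes(2, "little")
    return bytes(out)


# 'A' followed by raw byte 0x80: the raw byte is replaced by U+FFFD.
print(encode_utf16le([0x41, raw_byte_char(0x80)]).hex())
```

In this model a raw byte can never survive the encoding, because its internal code is always above 0x10FFFF; whether the real encoder behaves this way in every case is exactly what the thread is trying to establish.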