nmh-workers
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Bug reported regarding Unicode handling in email address


From: Steffen Nurpmeso
Subject: Re: Bug reported regarding Unicode handling in email address
Date: Mon, 14 Jun 2021 18:33:42 +0200
User-agent: s-nail v14.9.22-153-g062ff456ec

Steffen Nurpmeso wrote in
 <20210614162626.VfjxT%steffen@sdaoden.eu>:
 ...
 | <20210614121214.84C16210F4@orac.inputplus.co.uk>:
 ...
 ||Why not iconv(3) the input from the user's locale, the MIME part's
 ||charset, etc., to UTF-8, work internally, and then iconv() again on the
 ...
 |functions do not support graphemes, and __STDC_ISO_10646__ is an
 |option, so that you cannot simply code some tables on your own to
 |fill the gaps, because looking at the wchar_t codepoints may not
 |give you a Unicode "codepoint" (though maybe all do it like that
 |so in practice you could make this a precondition).  I had to

To add that if i recall correctly citrus for example does this,
using the upper bits of wchar_t for state info, but i have
forgotten whether that was done in an UTF-8 locale, or rather in
CJK or SHIFT-JS or whatever (my gut says the latter).

--steffen
|
|Der Kragenbaer,                The moon bear,
|der holt sich munter           he cheerfully and one by one
|einen nach dem anderen runter  wa.ks himself off
|(By Robert Gernhardt)



reply via email to

[Prev in Thread] Current Thread [Next in Thread]