Re: Bug reported regarding Unicode handling in email address

nmh-workers

From:	Ken Hornstein
Subject:	Re: Bug reported regarding Unicode handling in email address
Date:	Mon, 07 Jun 2021 11:08:59 -0400

>> The address parser code is used for a lot of things.  The specific bug
>> report was about a draft message that contained Cyrillic characters.
>> We know what that character set was in THAT case, because it's a draft
>> message and we can derive the locale from the environment or the nmh
>> locale setting.  But if we are processing an email message then we
>> don't easily know the character set.  In theory it should either be
>> us-ascii or utf-8, but reality sometimes intrudes and it could be
>> anything.
>
>If it's an email then won't it be ASCII?

Boy, you're out of the loop!  Check out RFC 6532, which allows UTF-8 in
message headers.

>> I think really instead of using ctype macros, we should be using a
>> specific set of macros tailored for email addresses.
>
>Isn't the problem that one routine is being used to parse emails which
>should comply with the RFCs and also draft emails where it's up to nmh
>to decide the allowable format?  We should be parsing ASCII-encoded
>fields for display in the user's locale with one routine and
>locale-encoded fields for transmission as ASCII with a second routine.

I mean ... yes?  Like many things, there's a lot of overloading here (see:
using email header parsing routines for config files).  But I think that
in practice, as long as we don't interpret non-ASCII bytes as "spaces",
we'll be fine.  Like I said, for parsing an email header we shouldn't be
using ctype macros AT ALL, but email-specific macros.
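
To make "email-specific macros" concrete, here is a rough sketch of what
locale-independent classifiers might look like.  The em_* names are made up
for illustration and are not taken from the nmh source; the character
classes follow the RFC 5322 grammar, with bytes >= 0x80 treated as ordinary
text in the spirit of RFC 6532.  Treat it as a sketch under those
assumptions, not as the actual fix.

```c
#include <stdbool.h>

/*
 * Hypothetical email-specific character classifiers.  Unlike the
 * <ctype.h> macros, these never consult the locale, so a byte such as
 * 0xA0 (NBSP in ISO-8859-1) or 0xD0 (lead byte of many Cyrillic UTF-8
 * sequences) can never be mistaken for white space.
 */

/* RFC 5322 WSP: only space and horizontal tab. */
static inline bool em_iswsp(unsigned char c)
{
    return c == ' ' || c == '\t';
}

/* WSP plus CR and LF, as found in folded header fields. */
static inline bool em_isspace(unsigned char c)
{
    return em_iswsp(c) || c == '\r' || c == '\n';
}

/* RFC 5322 "specials": characters with syntactic meaning in addresses. */
static inline bool em_isspecial(unsigned char c)
{
    switch (c) {
    case '(': case ')': case '<': case '>': case '[': case ']':
    case ':': case ';': case '@': case '\\': case ',': case '.':
    case '"':
        return true;
    default:
        return false;
    }
}

/*
 * RFC 5322 "atext", extended so that every byte >= 0x80 counts as
 * ordinary text (an assumption: the bytes are not validated as UTF-8).
 */
static inline bool em_isatext(unsigned char c)
{
    if (c >= 0x80)
        return true;            /* non-ASCII: never space, never special */
    if (c <= 0x20 || c == 0x7f)
        return false;           /* controls, space, DEL */
    return !em_isspecial(c);
}

/* Skip leading header white space; returns a pointer to the first
 * byte that is not white space. */
static const char *em_skipspace(const char *s)
{
    while (em_isspace((unsigned char)*s))
        s++;
    return s;
}
```

The point of the design is only that every classifier is a pure function of
the byte value, so the result is the same whatever LC_CTYPE is set to.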

--Ken
