Re: ucs-normalize and diacritics
From: Eli Zaretskii
Subject: Re: ucs-normalize and diacritics
Date: Fri, 03 Aug 2018 20:57:21 +0300

> From: address@hidden (K. Handa)
> Cc: address@hidden, address@hidden
> Date: Fri, 03 Aug 2018 23:21:54 +0900
>
> In article <address@hidden>, Eli Zaretskii <address@hidden> writes:
>
> > I meanwhile removed tis620-2533 on master and made it an alias to
> > thai-iso8859-11. It looked to me that a difference of one character
> > doesn't justify to have another charset with peculiar traits which in
> > addition causes unintuitive behavior. Do you think that change is OK?
>
> I think it is not the right way. Currently,
>
> (encode-char #xa0 'tis620-2533) => nil
> (encode-char #xa0 'thai-iso8859-11) => 32
>
> But, your change results in:
>
> (encode-char #xa0 'tis620-2533) => 32
>
> which is simply wrong.

How important is that discrepancy? After all, tis620-2533 doesn't
have the #xa0 character, so the situation above should never happen,
right? OTOH, it is strange to have two charsets that are identical
except for one character.

> That unintuitive behavior you mentioned is just Emacs' fault, and can be
> fixed, for instance, by my previous patch.

Yes, it could be fixed, but why do we need to jump through hoops to
keep one more charset, when we already have a charset that is
identical to it except for one character?
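
For anyone who wants to check the "identical except for one character"
claim on their own build, the following is a rough sketch, not part of
any patch discussed here. The function name my-charset-diff is made up
for illustration, and the scan assumes both charsets fit in the code
range 0..255. It relies only on decode-char, which returns nil for a
code point that is not valid in the given charset. What it reports
depends on the Emacs version: where tis620-2533 is an alias of
thai-iso8859-11, no differences should show up.

```elisp
;; Hypothetical helper (the name is an assumption, not an Emacs function).
(defun my-charset-diff (cs1 cs2)
  "Return a list of (CODE CHAR1 CHAR2) where CS1 and CS2 decode CODE differently.
Only code points 0..255 are scanned; CHAR1 or CHAR2 is nil when CODE
is not valid in the corresponding charset."
  (let (diffs)
    (dotimes (code 256)
      (let ((c1 (decode-char cs1 code))
            (c2 (decode-char cs2 code)))
        ;; Record the code point only when the two charsets disagree.
        (unless (eql c1 c2)
          (push (list code c1 c2) diffs))))
    (nreverse diffs)))

;; Compare the two Thai charsets under discussion.
(my-charset-diff 'tis620-2533 'thai-iso8859-11)
```

Evaluating the last form in *scratch* lists the code points, if any,
at which the two charsets disagree.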