[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [bug #36567] grep -i (case-insensitive) is broken with UTF8
From: |
Paolo Bonzini |
Subject: |
Re: [bug #36567] grep -i (case-insensitive) is broken with UTF8 |
Date: |
Fri, 15 Jun 2012 20:15:04 +0200 |
User-agent: |
Mozilla/5.0 (X11; Linux x86_64; rv:13.0) Gecko/20120605 Thunderbird/13.0 |
Il 12/06/2012 18:48, Paul Eggert ha scritto:
> On 06/12/2012 09:27 AM, Jim Meyering wrote:
>
>> But please move the declaration of ombclen down to its initialization.
>> The only part of grep for which we avoid decl-after-stmt is dfa.c.
>
> Ah, OK, thanks, I did that and pushed. It will be nice
> when we can assume statement-before-declaration everywhere.
>
>> Did you try to create a test case that triggers this bug?
>> It would be nice to do that, or if that is not possible,
>> to say that no such case exists (at least with UTF-8).
>
> Yes, a patch would be nice, but offhand I don't know of
> a test case for it (i.e., a locale where changing case
> shortens some characters and lengthens others).
Turkish lowercase i-with-dot is shorter than the uppercase, and
uppercase I-without-dot is shorter than the lowercase.
Paolo