[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: fixed bug: grep '.' didn't match some Hangul Syllables etc.
From: |
Jim Meyering |
Subject: |
Re: fixed bug: grep '.' didn't match some Hangul Syllables etc. |
Date: |
Sat, 21 May 2022 11:47:15 -0700 |
On Sat, May 14, 2022 at 9:22 AM Jim Meyering <jim@meyering.net> wrote:
>
> On Sat, May 14, 2022 at 12:15 AM Paul Eggert <eggert@cs.ucla.edu> wrote:
> > While looking into a TZDB problem I noticed that GNU grep mistakenly
> > reported that some files were not UTF-8 even though they were. One can
> > reproduce the problem in Gnulib by running the command:
> >
> > grep -v '^.*$' gnulib/tests/uninorm/NormalizationTest.txt
> >
> > This should output nothing, but outputs 943 lines containing Hangul
> > syllables.
> >
> > I tracked this down to a Gnulib bug I introduced in 2019, and fixed it
> > in Gnulib here:
> >
> > https://git.savannah.gnu.org/cgit/gnulib.git/commit/?id=b19a10775e54f8ed17e3a8c08a72d261d8c26244
> >
> > and in Grep by installing the attached patches.
>
> Nice! Thank you.
> Great timing!
> I am glad you found and fixed that **before** the release.
I've just noticed that some of those new tests fail on Fedora 35, but
probably won't have time to investigate today:
hangul-syllable.log
Description: Binary data