bug-grep
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#24975: Matching issues with characters whose encoding ends in some o


From: Jim Meyering
Subject: bug#24975: Matching issues with characters whose encoding ends in some other character
Date: Mon, 28 Nov 2016 09:11:55 -0800

On Mon, Nov 28, 2016 at 5:49 AM, Norihiro Tanaka <address@hidden> wrote:
> Jim Meyering <address@hidden> wrote:
>
>> I suspect this won't be the last word in this area, because it feels
>> like we should be able to adjust DFA's tables so that people using
>> such locales can retain DFA's efficiency without the bug in the
>> current implementation.
>
> Hi Jim,
>
> It is a bug in dfa for period expression in non-UTF8 locales.  dfa
> calculates transition for single byte characters and a multibyte
> character separately and merge both results.  However, if backs to
> an initial state in transition for single byte characters, we should
> stop matching single byte characters.

Nice work. Thank you.





reply via email to

[Prev in Thread] Current Thread [Next in Thread]