[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[bug #13859] matching '([A]|[B]){2}' in different locales

From: anonymous
Subject: [bug #13859] matching '([A]|[B]){2}' in different locales
Date: Wed, 20 Jul 2005 12:52:50 +0000
User-agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8b3) Gecko/20050714 Firefox/1.0+


                 Summary: matching '\([A]\|[B]\)\{2\}' in different locales
                 Project: grep
            Submitted by: None
            Submitted on: Wed 07/20/2005 at 08:52
                Category: None
                Severity: 3 - Normal
              Item Group: None
                  Status: None
                 Privacy: Public
             Assigned to: None
             Open/Closed: Open




Firstly, I had originally e-mailed this to address@hidden, as described
in the info page for grep, but since I didn't get any response, and I then
found the link here on gnu.org/software/grep/, I thought it'd be best to
report this again. My apologies if I should've just waited longer.

When I try to grep '\([A]\|[B]\)\{2\}', and as input I pass AA / AB / BA / BB
(yes, on separate lines), in the C locale, grep prints all four lines.
However, in UTF-8 locales (specifically en_GB.UTF-8, but I got the same result
with some others as well), it only prints the second and fourth line. If I
change [B] to B, or [A] to A, or both, grep prints all four lines regardless
of locale. If I remove \{2\} and try '\([A]\|[B]\)\([A]\|[B]\)' instead, grep
again prints all four lines regardless of locale.

I've tested this with grep 2.5.1 on Linux, configured with nothing more than
--prefix=$HOME/grep, compiled with gcc 4.0.1, on a system with glibc 2.3.5
(20050421 snapshot). I also tried --with-included-regex just in case, but this
didn't change anything.

Am I correct in assuming this is a bug?


Reply to this item at:


  Message sent via/by Savannah

reply via email to

[Prev in Thread] Current Thread [Next in Thread]