[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Changes to grep/TODO
From: |
Charles Levert |
Subject: |
Changes to grep/TODO |
Date: |
Thu, 05 May 2005 21:22:46 -0400 |
Index: grep/TODO
diff -u grep/TODO:1.4 grep/TODO:1.5
--- grep/TODO:1.4 Thu Feb 8 05:33:57 2001
+++ grep/TODO Fri May 6 01:22:46 2005
@@ -1,3 +1,16 @@
+Get sane performance with UTF-8 locales.
+
+1) rewrite the configure.in script, perhaps also Makefile.am
+2) set up for gnulib-tool --import
+3) improve the test infrastructure
+4) check in the patches for the sync of dfa.c with GNU awk
+5) other small patches which wait for a test case
+6) process the Fedora/Red Hat patches
+7) some _minimal_ cleanup of the grep(), grepdir(), recursion
+ (the "main loop") and fix --directories=read
+
+##
+
Write Texinfo documentation for grep. The manual page would be a good
place to start, but Info documents are also supposed to contain a
tutorial and examples.
@@ -30,9 +43,15 @@
claim that his algo. is faster then Boyer-More ????
Worth Checking.
-Take a look at cgrep (Context grep) seems like nice work.
-Take a look at sgrep (Struct grep).
-Take a look at agrep (Approximate grep), from glimpse.
+Check <http://flame.cs.dal.ca/~taa/greps.html>.
+Take a look at:
+ -- cgrep (Context grep) seems like nice work;
+ -- sgrep (Struct grep);
+ -- agrep (Approximate grep), from glimpse;
+ -- nr-grep (Nondeterministic reverse grep);
+ -- ggrep (Grouse grep);
+ -- grep.py (Python grep);
+ -- pcregrep (from Perl-Compatible Regular Expressions library).
Can we merge ?
POSIX Compliance see p10003.x
@@ -40,3 +59,35 @@
Moving away from GNU regex API for POSIX regex API.
Better and faster !!
+
+##
+
+Check POSIX:
+ -- Volume "Base Definitions (XBD)",
+ Chapter "Regular Expressions"
+ and in particular
+ Section "Regular Expression General Requirements"
+ and its paragraph about caseless matching.
+
+Check the Unicode Standard:
+ -- Chapter 3 ("Conformance"),
+ Section 3.13 ("Default Case Operations")
+ and the toCasefold() case conversion operation;
+ -- Chapter 4 ("Character Properties"),
+ Section 4.2 ("Case -- Normative")
+ and the SpecialCasing.txt and CaseFolding.txt
+ files from the Unicode Database;
+ -- Chapter 5 ("Implementation Guidelines"),
+ Section 5.18 ("Case Mappings"),
+ Subsection "Caseless Matching".
+
+Check Unicode Technical Standard #18 ("Unicode Regular Expressions").
+Check Unicode Standard Annex #15 ("Unicode Normalization Forms").
+
+##
+
+Before every release:
+ -- drop dfa.[ch] into a copy of gawk and run "make check";
+ -- send pot file to the Translation Project to get fresh po files;
+ -- get up-to-date version of ABOUT-NLS;
+ -- update NEWS.
- Changes to grep/TODO,
Charles Levert <=