bug-coreutils
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

uniq/sort documentation flaw


From: Andries E. Brouwer
Subject: uniq/sort documentation flaw
Date: Mon, 4 May 2009 18:07:50 +0200
User-agent: Mutt/1.5.18 (2008-05-17)

uniq(1) says

       Discard all but one of successive identical lines from INPUT

However, this is very misleading. "Identical" does not mean identical
but "equal if one ignores differences that LC_COLLATE says should be ignored".

This man page line should be changed, adding a reference to the locale.
As it is now, the words locale and LC_COLLATE do not occur on the man page.

The info file is better and mentions LC_COLLATE.
But also there the fact that the meanings of "repeated" and "duplicate"
are modified by LC_COLLATE is not mentioned explicitly.

Andries


(Sorting is an operation done on all kinds of data, not only lines of text.
I would not mind an option that tells sort to ignore the locale rules for
sorting because what is sorted is not text. That feels cleaner than
preceding each invocation with LC_COLLATE=C. And locale-free sort also
is much faster.)





reply via email to

[Prev in Thread] Current Thread [Next in Thread]