bug-coreutils
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#19329: Uniq Bug


From: throwaway1024
Subject: bug#19329: Uniq Bug
Date: Tue, 9 Dec 2014 13:47:01 +0100

Dear developers,
The following command results in a document where valbug.txt contains multiple duplicate lines (e.g. "(TDCYXT)" without quotes)
--------
cat validation.csv | sed 's!\r\n!\n!g' | sed 's! !\n!g' | LC_ALL=C sort | uniq > valbug.txt
--------
 
It is definitely a problem of uniq, since the second command line of:
--------
cat validation.csv | sed 's!\r\n!\n!g' | sed 's! !\n!g' | LC_ALL=C sort > valo.txt
cat valo.txt | uniq > valbug.txt
--------
would result in the same bug and the valo.txt file is sorted correctly. In addition, the lines mentioned are equal (in hexadecimal representation) among each other in both valo.txt and valbug.txt
 
I also could not reduce the example since using only a few lines around the buggy lines would make it work again correctly.
 
You can find all files in attached zip or under https://mega.co.nz/#!PUZVgbgZ!R74BmxKcYzT_AEMwRHsMD-8CCgeRIvSnhPT-hFMihDA .

Kind regards,
Serge
 
 
 
 

Attachment: Uniq Bug.zip
Description: Zip archive


reply via email to

[Prev in Thread] Current Thread [Next in Thread]