[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
bug#22155: Wrong char count with UTF8 in sort -k
From: |
Pádraig Brady |
Subject: |
bug#22155: Wrong char count with UTF8 in sort -k |
Date: |
Sun, 13 Dec 2015 02:32:51 +0000 |
User-agent: |
Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Thunderbird/38.3.0 |
On 13/12/15 01:32, Pádraig Brady wrote:
> On 12/12/15 22:53, Holger Klene wrote:
>>> sort sort.bug.txt -u -s -k 1.20 -b --debug
>> sort: es werden die Sortierregeln für »de_DE.UTF-8“ verwendet
>> 05. Mär 2015 13:30 ./mess.jpg
>> __________
>> 07. Feb 2015 15:57 ./mess.jpg
>> __________
>>
>> In fact, it does correct the underlines, but still -u gives both lines,
>> though I want it to discard the second line. You can add more lines for the
>> same file, but sort insists on keeping exactly two: one with Umlaut and the
>> other without.
>
> That's a bug in --debug because the implementation was split
> from the actual processing done during the sort (for performance reasons).
> Therefore we'll need to fix --debug to show what's being actually done
Patch attached.
thanks,
Pádraig.
sort-debug-b.patch
Description: Text Data