bug#7489: [coreutils] over aggressive threads in sort

bug-coreutils

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#7489: [coreutils] over aggressive threads in sort

From:	Chen Guo
Subject:	bug#7489: [coreutils] over aggressive threads in sort
Date:	Sun, 5 Dec 2010 21:16:20 -0800

Hi Professor Eggert,

On Fri, Dec 3, 2010 at 1:10 PM, Paul Eggert <address@hidden> wrote:
> On 12/03/10 12:18, Chen Guo wrote:
> Either option (either switch to mutexes everywhere, or have the top-level
> merge go to memory) should work.  Perhaps we should try both and benchmark
> them.

    Test machine is 4 core i7. The numbers I'm giving are averaged
over 20 runs, given in seconds, and are of the form elapsed / user +
system.

spinlock:
1 thread: 3.354 / 3.349
2 threads: 1.960 / 3.812
4 threads: 1.366 / 5.085

mutex:
1 thread: 3.354 / 3.350
2 threads: 2.062 / 3.628
4 threads: 1.497 / 4.172

spin/ output after
1 thread: 3.519 / 3.517
2 threads: 2.098 / 3.996
4 threads: 1.488 / 5.347

It seems if we have to choose between mutex and output post-sort,
mutex is the way to go. Mutex is faster in the single threaded case,
while in multithreaded the elapsed time is negligibly different, the
user time is much greater. With spinlocks only, the greater system
time was justified (though some might disagree) by the lower elapsed
time. With spinlock outputting post-sort, there is no more
justification for the higher user time.

Before saying anything else, I should note that for mutexes, on 4
threads 20% of the time there's a segfault on a seemingly innocuous
line in queue_insert ():
  node->queued = true

GDB shows that pointers all look normal, and I could not trigger this
over 10 runs with valgrind (it seems valgrind is singlethreaded). If
we do decide to go back to mutexes, I'll look into this issue more.

[Prev in Thread]

Current Thread

[Next in Thread]

bug#7489: [coreutils] over aggressive threads in sort, Chen Guo, 2010/12/01
- bug#7489: [coreutils] over aggressive threads in sort, Paul Eggert, 2010/12/01
  - bug#7489: [coreutils] over aggressive threads in sort, Jim Meyering, 2010/12/01
- bug#7489: [coreutils] over aggressive threads in sort, Paul Eggert, 2010/12/01
  - bug#7489: [PATCH] sort: fix bug on 64-bit hosts with at least 32768 processors, Paul Eggert, 2010/12/02
- bug#7489: [coreutils] over aggressive threads in sort, Chen Guo, 2010/12/02
  - bug#7489: [coreutils] over aggressive threads in sort, Paul Eggert, 2010/12/02
  - bug#7489: [coreutils] over aggressive threads in sort, Jim Meyering, 2010/12/02
    - bug#7489: [coreutils] over aggressive threads in sort, Chen Guo, 2010/12/03
    - bug#7489: [coreutils] over aggressive threads in sort, Paul Eggert, 2010/12/03
    - bug#7489: [coreutils] over aggressive threads in sort, Chen Guo <=
    - bug#7489: [coreutils] over aggressive threads in sort, Paul Eggert, 2010/12/06
    - bug#7489: [coreutils] over aggressive threads in sort, Chen Guo, 2010/12/06
    - bug#7489: [coreutils] over aggressive threads in sort, Jim Meyering, 2010/12/07
    - bug#7489: [coreutils] over aggressive threads in sort, Jim Meyering, 2010/12/07
    - bug#7489: [coreutils] over aggressive threads in sort, Chen Guo, 2010/12/07
    - bug#7597: multi-threaded sort can segfault (unrelated to the sort -u segfault), Jim Meyering, 2010/12/09
    - bug#7597: [coreutils] multi-threaded sort can segfault (unrelated to the sort -u segfault), Jim Meyering, 2010/12/09
    - bug#7597: multi-threaded sort can segfault (unrelated to the sort -u segfault), Paul Eggert, 2010/12/09
    - bug#7597: multi-threaded sort can segfault (unrelated to the sort -u segfault), Chen Guo, 2010/12/10
    - bug#7597: multi-threaded sort can segfault (unrelated to the sort -u segfault), Chen Guo, 2010/12/10

Prev by Date: bug#7568: stat 'i\i' shows 'i\\i'
Next by Date: bug#7489: [coreutils] over aggressive threads in sort
Previous by thread: bug#7489: [coreutils] over aggressive threads in sort
Next by thread: bug#7489: [coreutils] over aggressive threads in sort
Index(es):
- Date
- Thread