bug-coreutils

Re: better buffer size for copy


From: Phillip Susi
Subject: Re: better buffer size for copy
Date: Mon, 21 Nov 2005 00:45:40 -0500
User-agent: Mozilla Thunderbird 1.0.7 (X11/20051010)

What would such network filesystems report as their blocksize?  I have a feeling it isn't going to be on the order of a MB.  At least for local filesystems, the ideal transfer block size is going to be quite a bit larger than the filesystem block size (if the filesystem is even block oriented... think reiser4, or cramfs).  In the case of network filesystems, they should be performing readahead in the background between small block copies to keep the pipeline full.  As long as the copy program isn't blocked elsewhere for long periods, say in the write to the destination, the readahead mechanism should keep the pipeline full.  Up to a point, using larger block sizes saves some CPU by lowering the number of system calls; after a certain point, the copy program can start to waste enough time in the write that the readahead stops and stalls the pipeline.

If you want really fast copies of large files, then you want to send down multiple overlapped aio (real aio, not the glibc threaded implementation) O_DIRECT reads and writes, but that gets quite complicated.  Simply using blocking O_DIRECT reads into a memory-mapped destination file buffer performs nearly as well, provided you use a decent block size.  On my system I have found that 128 KB+ buffers are needed to keep the pipeline full, because I'm using a 2-disk raid0 with a 64 KB stripe factor; blocks smaller than 128 KB only keep one disk going at a time.  That's probably getting a bit too complicated for this conversation, though.

If we are talking about the conventional blocking cached read followed by a blocking cached write, then I think you will find that a buffer size of several pages (say 32 or 64 KB) will be MUCH more efficient than 1024 bytes (the typical local filesystem block size), so using st_blksize for the size of the read/write buffer is not good.  I think you may be ascribing meaning to st_blksize that is not there.
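To make the O_DIRECT-into-mmap idea above concrete, here is a rough sketch, not coreutils code: the 128 KB block size, the temporary round-up of the destination length, and the bare-bones error handling are all illustrative assumptions.

/* Sketch only: blocking O_DIRECT reads from the source straight into an
 * mmap()ed window of the destination file.  The 128 KB block size and the
 * round-up/trim of the destination are illustrative choices, not anything
 * cp actually does. */
#define _GNU_SOURCE             /* for O_DIRECT */
#include <fcntl.h>
#include <stdio.h>
#include <stdlib.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <unistd.h>

#define BLKSZ (128 * 1024)      /* big enough to keep both disks of a
                                   64 KB-stripe raid0 busy */

static void die(const char *msg) { perror(msg); exit(EXIT_FAILURE); }

int main(int argc, char **argv)
{
    if (argc != 3) {
        fprintf(stderr, "usage: %s SRC DST\n", argv[0]);
        return EXIT_FAILURE;
    }

    /* O_DIRECT bypasses the page cache for the source; the open fails on
       filesystems that do not support it. */
    int in = open(argv[1], O_RDONLY | O_DIRECT);
    if (in < 0)
        die("open source");

    struct stat st;
    if (fstat(in, &st) < 0)
        die("fstat");

    int out = open(argv[2], O_RDWR | O_CREAT | O_TRUNC, 0666);
    if (out < 0)
        die("open destination");
    if (st.st_size == 0) {      /* destination already created empty */
        close(in);
        close(out);
        return EXIT_SUCCESS;
    }

    /* Round the destination up to a BLKSZ multiple so every O_DIRECT read
       has an aligned length and mapped memory behind it; trim at the end. */
    off_t mapped = ((st.st_size + BLKSZ - 1) / BLKSZ) * (off_t) BLKSZ;
    if (ftruncate(out, mapped) < 0)
        die("ftruncate");

    char *dst = mmap(NULL, mapped, PROT_READ | PROT_WRITE, MAP_SHARED, out, 0);
    if (dst == MAP_FAILED)
        die("mmap");

    /* Each read transfers up to BLKSZ bytes from the source device directly
       into the destination's page cache via the mapping.  A partial read in
       the middle of the file (rare with O_DIRECT) would break the alignment;
       a robust version would have to handle that case. */
    off_t off = 0;
    for (;;) {
        ssize_t n = read(in, dst + off, BLKSZ);
        if (n < 0)
            die("read");
        if (n == 0)
            break;              /* end of source file */
        off += n;
    }

    if (munmap(dst, mapped) < 0)
        die("munmap");
    if (ftruncate(out, st.st_size) < 0)   /* drop the round-up padding */
        die("trim destination");
    close(in);
    close(out);
    return EXIT_SUCCESS;
}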

Robert Latham wrote:

In local file systems, I'm sure you are correct.  If you are working
with a remote file system, however, the optimal size is on the order
of megabytes, not kilobytes.  For a specific example, consider the
PVFS2 file system, where the "blocksize vs. bandwidth" curve does not
plateau until block sizes two orders of magnitude larger than 64 KB.
PVFS2 is a parallel file system for Linux clusters.  I am not nearly as
familiar with Lustre, GPFS, or GFS, but I suspect those filesystems
would benefit from block sizes larger than 64 KB as well.

Are you taking umbrage at the idea of using st_blksize to direct how
large the transfer size should be for I/O?  I don't know what other
purpose st_blksize would serve, nor is there any other field in struct
stat that is remotely valid for that purpose.  Thanks for your feedback.

==rob
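For reference, the hint under discussion is just this one field from fstat(2); a tiny sketch of the compromise being debated follows, where the 64 KB floor is an arbitrary illustration and not what coreutils actually does.

/* Sketch only: st_blksize is the lone "preferred I/O size" hint in
 * struct stat.  The 64 KB floor below is an illustrative assumption. */
#include <stddef.h>
#include <sys/stat.h>

static size_t pick_io_size(int fd)
{
    struct stat st;
    size_t size = 64 * 1024;                 /* assumed floor for local disks */

    if (fstat(fd, &st) == 0 && (size_t) st.st_blksize > size)
        size = st.st_blksize;                /* honor larger hints, e.g. PVFS2 */
    return size;
}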

