From: Paul Sander
Subject: Re: Idea for reducing disk IO on tagging operations
Date: Sun, 20 Mar 2005 17:00:54 -0800
On Mar 20, 2005, at 3:54 PM, address@hidden wrote:
> * Mark D. Baushke (address@hidden) wrote:
>> Dr. David Alan Gilbert <address@hidden> writes:
>>> OK, if I create a dummy ",foo.c," before modifying (or create a
>>> hardlink with that name to foo.c,v ?) would that be sufficient?
>>
>> I would say that it is likely necessary, but may not be sufficient.
>
> Hmm ok.
>
>>> Or perhaps create the ,foo.c, as I normally would - but if I can use
>>> this overwrite trick on the original then I just delete the ,foo.c,
>>> file.
>>
>> I am unclear how this lets you perform a speedup.
>
> I only create the ,foo.c, file - I don't write anything into it; the
> existence of the file is enough to act as the RCS lock. If I can do my
> in-place modification then I delete this file after doing it; if not,
> then I proceed as normal and just write the ,foo.c, file and do the
> rename as you normally would.
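For readers following along, the existing RCS update protocol under discussion can be sketched roughly as below. This is a simplified illustration, not RCS's actual code; the function name and one-hour details are mine, but the comma-file naming (",foo.c," for "foo.c,v"), the exclusive-create lock, and the final rename are what the thread is describing.

```python
import os

def rcs_update(rcs_path, new_contents):
    """Sketch of the standard RCS update protocol: the comma file
    (",foo.c," for "foo.c,v") doubles as the lock and as the staging
    area for the rewritten archive."""
    d, base = os.path.split(rcs_path)
    comma = os.path.join(d, "," + base.removesuffix(",v") + ",")
    # O_EXCL makes creation atomic: if another process already holds
    # the lock, this raises FileExistsError instead of clobbering it.
    fd = os.open(comma, os.O_WRONLY | os.O_CREAT | os.O_EXCL, 0o644)
    try:
        os.write(fd, new_contents)
        os.fsync(fd)  # flush the complete new archive to disk first
    finally:
        os.close(fd)
    # rename() is atomic on POSIX filesystems, so readers see either
    # the old RCS file or the new one, never a half-written mixture.
    os.rename(comma, rcs_path)
```

David's proposed speedup is to create the comma file only as a lock (empty), modify foo.c,v in place, then delete the comma file; the rest of the thread is about why that loses the atomicity the rename provides.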
You're forgetting something: The RCS commands will complete read-only operations on RCS files even in the presence of the comma files owned by other processes. Your update protocol introduces race conditions in which the RCS file is not self-consistent at all times.
There's also the interrupt issue: Killing an update before it completes leaves the RCS file corrupt. You'd have to build in some kind of crash recovery. But RCS already has that by way of the comma file, which can simply be deleted. Other crash recovery algorithms usually involve transaction logs that can be reversed and replayed, or the creation of backup copies. None of these are more efficient than the existing RCS update protocol.
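The crash-recovery property Paul describes can be sketched as follows. The staleness threshold and function name are my own illustrative assumptions (RCS itself leaves lock-breaking to the user); the point is only that recovery is a single unlink, because the real RCS file was never touched mid-update.

```python
import os, time

def recover_stale_lock(rcs_path, max_age_seconds=3600):
    """Sketch of comma-file crash recovery: an interrupted update
    leaves only a partial staging copy, so recovery is deleting it.
    The one-hour staleness threshold is an assumption, not RCS's rule."""
    d, base = os.path.split(rcs_path)
    comma = os.path.join(d, "," + base.removesuffix(",v") + ",")
    try:
        age = time.time() - os.stat(comma).st_mtime
    except FileNotFoundError:
        return False                  # no interrupted update to clean up
    if age > max_age_seconds:
        os.remove(comma)              # discard the partial rewrite
        return True
    return False                      # a live writer may still own it
```

Contrast this with in-place rewriting, where an interrupt leaves the one and only copy of the RCS file corrupt, and recovery needs a transaction log or a backup copy.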
> So the issue is what happens if the interrupt occurs as I'm
> overwriting the white space to add a tag; hmm yes;

Correct. Depending on the filesystem kind and the level of I/O, your rewrite could impact up to three file blocks plus the directory data.

> is it possible to guard against this by using a single call to
> write(2) for that?

Not for all possible filesystem types.
You'd have to guarantee that the write is atomic and flushes results completely to disk, even in the presence of things like power failures. It's hard to make this guarantee given all the buffering that goes on below the write(2) API.
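A sketch of the in-place trick being proposed, with names of my own invention: overwrite pre-reserved whitespace in the RCS symbols section with one positioned write. The code works as a demonstration, but as Paul says, a single syscall is not a single atomic disk operation.

```python
import os

def overwrite_tag_padding(rcs_path, offset, tag_bytes):
    """Sketch of the proposed in-place tag write. One pwrite() keeps
    it to a single syscall, but that is NOT atomic on disk: if
    tag_bytes spans a filesystem block boundary, a power failure can
    leave one block updated and the other not."""
    fd = os.open(rcs_path, os.O_WRONLY)
    try:
        n = os.pwrite(fd, tag_bytes, offset)  # one syscall, not one disk op
        os.fsync(fd)  # forces the data out, but gives no ordering
                      # guarantee *between* the blocks it dirtied
        return n
    finally:
        os.close(fd)
```

This is why the guarantee has to come from the filesystem (journaling, copy-on-write), not from the write(2) API, and why it cannot be promised "for all possible filesystem types."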
>> Optimizing for tagging does not seem very useful to me, as we
>> typically do not drop that many tags on our repository.
>
> In the company I work for we are very tag-heavy, but more importantly
> it is the tagging that gets in people's way and places the strain on
> the write bandwidth of the discs/RAID.
I once built a successful system that tracked desirable configurations by building lists of file/version pairs, then committing and tagging the lists. The lists were built by polling the Entries files in workspaces (after making sure there were no uncommitted changes). This was fast and efficient, and it opens you up to the optimization I mentioned earlier. And if you rely on floating tags, such lists can track the history of the tags as well.
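A minimal sketch of harvesting those file/version pairs from a CVS/Entries file, whose entry lines have the shape "/name/revision/timestamp/options/tagdate" (directory lines begin with "D"). The function name is mine, and the uncommitted-changes check Paul mentions (comparing the recorded timestamp against the working file) is deliberately left out.

```python
def parse_entries(text):
    """Sketch: extract (file, revision) pairs from CVS/Entries text.
    Entry lines look like "/name/revision/timestamp/options/tagdate";
    lines starting with "D" describe subdirectories and are skipped.
    Verifying there are no uncommitted changes is omitted here."""
    pairs = []
    for line in text.splitlines():
        if not line.startswith("/"):
            continue                      # e.g. "D/subdir////"
        fields = line.split("/")
        name, revision = fields[1], fields[2]
        if revision.startswith("-"):      # locally removed file
            continue
        pairs.append((name, revision))
    return pairs
```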
In addition, an algebra can be easily written to manipulate such lists. Combine this with a way to link these lists with your defect tracking system, and you have the tools to build a very good change control system.
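One piece of such an algebra might look like the sketch below, assuming each configuration is represented as a path-to-revision mapping (my representation, not Paul's): set-style operations over two configurations yield exactly the change report a change control system needs.

```python
def diff_configs(old, new):
    """Sketch of one operation in the 'algebra' on file/version
    lists: given two configurations as {path: revision} dicts,
    report what was added, removed, and changed between them."""
    added   = {f: v for f, v in new.items() if f not in old}
    removed = {f: v for f, v in old.items() if f not in new}
    changed = {f: (old[f], new[f])
               for f in old.keys() & new.keys() if old[f] != new[f]}
    return added, removed, changed
```

Linking each such diff to a defect-tracking ticket then gives a record of which file revisions implemented which change.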
--
Paul Sander     | "Lets stick to the new mistakes and get rid of the old
address@hidden  |  ones" -- William Brown