[Monotone-devel] Re: Support for binary files, scalability andWindows po

monotone-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Monotone-devel] Re: Support for binary files, scalability andWindows po

From:	graydon hoare
Subject:	[Monotone-devel] Re: Support for binary files, scalability andWindows port
Date:	Wed, 21 Jan 2004 15:44:41 -0500
User-agent:	Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.6b) Gecko/20031205 Thunderbird/0.4

Asger Kunuk Ottar Alstrup wrote:

Yes, that is of course perceivable, but that would defeat the advantage
of a truly distributed system, where a user does not have to be online
to work. I'd prefer another approach. Maybe we will set up separate
databases for different purposes - we can divide the work into sealed
compartments.

yes, I'd recommend the approach -- if possible -- of creating multiple"collections" of files. I'll briefly describe what I have in mind withthe hashtrees:

each database will have a set of "collections", each of which has ahashtree associated with it. collections can share members -- they'reworking on the same underlying store of blocks and keys and whatnot --but the hashtree picks out a subset and names it. each collection willalso map to a set of peers which the database syncs that collection withnormally (eg. when you just type "monotone sync <collection>"). one ofthe states in a hashtree is called "tombstone", which marks a block youdo not have and do not want, but hashes to the same value as if you hadthe block. this makes it possible to expire blocks from your copy of acollection. of course if you un-expire the block (set from tombstone ->empty) you will fetch it on the next sync.

in answer to the previous question about distribution costs: obviouslyyou have to send, at least once, every block you expect to exist at theother end of a connection. the hashtree is an auxiliary structure usedto discover which blocks are missing from either end of a connection. asof 2 nights ago, I have a prototype working which synchronizes blockcollections in a pair of sqlite databases, over a TCP connection. itshows pretty good promise; the interactive protocol isn't *quite*pipelined enough, plus the encodings could use some tuning; it's atleast twice as bulky as it needs to be. even now it finds and sends the50 missing random 512-byte blocks amongst a collection of 10,000 such,with only 16k written and 130k received (including the 34.7k worth ofbase64-encoded data blocks). it will add a little overhead in the caseof small collections, but the cost curve is a very flat logarithm of thecollection size.

I'm not certain a 256-ary tree is the best fan-out though; I only choseit because it's easy to prototype in ASCII. a 16-ary tree is equallyeasy, so maybe I'll run through that too.. it is a subtle issue: largernodes mean fewer round trips and less protocol chatter, but also moreretransmission of hash values for unchanged portions of the tree and abit more asymmetry in the transmit/receive load (as in this instance).I'll see if it's sufficiently easy to make the whole protocol andhashtree calculator depend on a couple template constants, and try towrite it that way so we can wiggle it around and find a sweet spot.

OK. I tried with VS.NET, but did not have time to complete it. It seems
there are a couple of places where the code uses some non-standard C++
features not supported by VS.NET. Also, it uses a bunch of Unix-only
#includes.

But AFAIK it's nothing that can not be handled with a few days of work -
it should also be possible to use VS6.

I think you might find some of the fancy-pants stuff spirit does withtemplates will make VS6 upset. I think that compiler series really onlyapproximates the ISO standard around version 7 (around the ".net" edition)


-graydon

[Prev in Thread]

Current Thread

[Next in Thread]

[Monotone-devel] RE: Support for binary files, scalability and Windows port, (continued)

Prev by Date: RE: [Monotone-devel] RE: Support for binary files, scalability andWindows port
Next by Date: [Monotone-devel] BitTorrent
Previous by thread: RE: [Monotone-devel] RE: Support for binary files, scalability andWindows port
Next by thread: Re: [Monotone-devel] Re: Support for binary files, scalability andWindows port
Index(es):
- Date
- Thread