[rdiff-backup-users] more experiments, + apology

rdiff-backup-users

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[rdiff-backup-users] more experiments, + apology

From:	Marcel (Felix) Giannelia
Subject:	[rdiff-backup-users] more experiments, + apology
Date:	Sat, 14 Mar 2009 14:53:51 -0700
User-agent:	Thunderbird 2.0.0.16 (X11/20080726)

Hello again,

First of all, I just re-read the changelog and checked which version ofrdiff-backup was installed on the server I've been playing with -- and Iowe the developers an apology. The current version does diffmirror_metadata files (and has done for quite a while), so that I'veindirectly done the same thing in my "rdiff-backup-rollup" experimentsis no great achievement. It turns out that our server has (yikes)version 1.0.4 -- a consequence of Gentoo Linux's package repositoriesbeing dreadfully far behind, and my unfamiliarity with Gentoo.

Regardless, parts of what I said are still true (very small patchesbetween increments), but it's looking now more like something'sconfusing the rsync algorithm itself, rather than rdiff-backup. Thereare files in the backup set that seem to change daily (zipped backupsthat Moodle produces), but only by a few bytes. Most of the file staysexactly the same, but trying to use rdiff by itself on any pair of themcauses a patch that's just as big as the file. Somehow, putting severalof them together in a tar file (even gzipped) clues rdiff/rsync in tothe similarities, and then it can make a decent patch.

Trying my procedure on a different machine's backup sets still makespatches that are smaller than the increment files, but by a much lesseramount that can be totally explained by the mirror_metadata andfile_statistics files. I'm a little disappointed that it's allexplicable, but nonetheless, space savings are fun :) For instance,this second machine's backup set includes rotated log files, so I wasstill able to compress its archive of old increments by 76% usingrdiff-backup-rollup followed by rdiff'ing all but a few files (I kept awhole file every 15 increments or so as a basis). (I cheated a bit toget that much compression -- I wrote another script that removesfile_statistics files from the increments entirely [they look optional],then gunzips all the individually gzipped files in the increment andre-tars it. That makes the tar file much bigger, but rdiff has an easiertime with the uncompressed data. After I've generated increments andre-compressed everything, it's smaller than it was originally.)

Now that rdiff-backup increments mirror_metadata, though, theseexperiments are of limited interest except in dealing with old backupsets retroactively (and for the side-effect of file move detection).

Is this of interest to anybody? If not, I'll relegate myself to a quietcorner of the wiki and stop cluttering up the mailing list with it :)


~Felix.

[Prev in Thread]

Current Thread

[Next in Thread]

[rdiff-backup-users] more experiments, + apology, Marcel (Felix) Giannelia <=
- Re: [rdiff-backup-users] more experiments, + apology, David Kempe, 2009/03/14

Prev by Date: Re: [rdiff-backup-users] centos 5 rpm's for 1.2.7
Next by Date: Re: [rdiff-backup-users] centos 5 rpm's for 1.2.7
Previous by thread: [rdiff-backup-users] centos 5 rpm's for 1.2.7
Next by thread: Re: [rdiff-backup-users] more experiments, + apology
Index(es):
- Date
- Thread