Re: The best approach for moving files and retaining history

info-cvs

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: The best approach for moving files and retaining history

From:	Paul Sander
Subject:	Re: The best approach for moving files and retaining history
Date:	Tue, 30 Jun 2009 23:12:34 -0700


On Jun 30, 2009, at 4:19 PM, Todd Denniston wrote:

Arthur Barrett wrote, On 06/30/2009 04:34 PM:
Ultimately, I would like to know if my approach is good or bad.
The two techniques are neither good nor bad just different each with
it's own advantages and disadvantages.
Alternatively if you are using CVSNT (on unix/linux/win/mac) youcan use
the rename command.
Curiosity has struck me...

Plagiarizing some from Rez P
method A:
copy the files to a newly created folder, add, and commit, with acvs remove the originals and commit.
method B:
go to the server side and copy all the 'v' files, to the new folder,then cvs remove originals and commit.
I know:
Method A makes it look like the files, in the new directory, have nohistory.
        if comments are done properly cvs2cl shows the change over.
Method B has all the history, but can be a bit confusing if youcheckout an old tag.Likely there will not be a comment in the new files to show thatthey moved, so cvs2cl will be less than useful at detecting it.
what is the behavior of the CVSNT rename command?
what does it's repository markings look like in cvs2cl? (cvs2cl.plor cvs2cl.py)
If I were trying to mimic it by hand what would it most look like?

There's a bit more to the problem, because you don't necessarily wanta rename to take effect on every branch of the file. And then there'sthe problem of replicating the rename on some (but possibly not all)branches of the file. And then there the problem where you might wantthe result of a rename to originate from different locations in therepository depending on the branch. These are very, very stickyproblems.

If you go back to early discussions on this list, Brian Berlinerrecommended method B, then deleting version tags. But that predatesbranches. You should consider deleting the uninteresting branch tags,and possibly even the data for the versions on the unwanted branches.But then subsequent renames on different branches of the same file mayadd some of them back.

But this still is an incomplete solution because there may be a needto merge from branches that are used exclusively in the pre-moveorganization to the branch where the rename occurs and its newerchildren.

I believe that renaming or copying RCS files ultimately is not the wayto implement renaming a file. I believe that a versioned mapping offiles in the workspace to RCS files in the repository is necessary.This mapping is updated whenever an added, removed, or renamed file iscommitted, and the current version of the mapping is somehowrepresented in the user's workspace whenever it's updated. Thismethod has its drawbacks, particularly the evil twin condition inwhich two essentially different files occupy the same place in thesandbox filesystem at different times. Another big one is that itsimplementation requires a redesign of CVS at a fundamental level. Butit also enables possible solutions for other problems in a contextwhere renaming is done, problems that relate to merging betweenbranches where the file on each branch contains a different type ofdata.

Others in this forum have recommended creating new RCS files and usingthreaded data structures within RCS files, using specialized commentsor perhaps other metadata stored in RCS newphrase phrases, to connectthe fragments of histories to simulate renames. Features such as "cvslog" would be revised to understand the threaded structures andpresent the history appropriately, and some other operations such asmerging would have to traverse the links to locate the proper tags.This appears to be simple to implement at the outset but in practicethe number of special cases is daunting and even then there arepeculiar side-effects or limitations.

Bottom line: Don't expect a complete solution until CVS is redesignedfrom the bottom up. The various manuals suggest several methodsbesides the two mentioned in this thread, all of which solve somesubset of problems. You might find one that you can live with.

All that said, here is my recommendation using what's available today,for renaming a single file:

1. Identify all branches that will survive the rename to the newlocation, and get their owners to agree on a cutover time.2. Audit all sandboxes for uncommitted changes to branches that willsurvive the rename, and commit them.3. Apply a pre-rename version tag to the top of all branches thatsurvive the rename.4. Copy (or hard-link) the RCS file in the new location, which mightbe the Attic.

5.  Move the original RCS file to the Attic, if it's not already there.

6. For all branches surviving the rename, do the following in theoriginal RCS file: Mark them "dead", remove applicable floating tags.7. For all other branches (old maintenance or frozen branches), dothe following in the new RCS file: Mark them "dead", removeapplicable floating tags.


This method has the following problems:

- Branch owners must agree on a cutover time.
- The sandbox audit is a big pain.
- Dead branches can be revived in the wrong locations.
- Undoing it is a pain.
- Reversing it is a royal pain, and it leaves side-effects.

- It doesn't support renaming file foo to bar on branch A, and filebaz to bar on branch B, at least not without much care and not withoutintroducing annoying side-effects.- You lose the completeness of version history in both locations ifdevelopment continues on the old maintenance branches.- Merging from maintenance branches to new development requirescomputing and applying patches.

- Renaming the file repeatedly increases cruft and fragmentation.

- It's very time-consuming when done to every file in a large tree(i.e. when renaming a directory).

- It's not represented well in "cvs history".

- Operations involving tags and timestamps are best avoided while therename procedure is running.- Non-floating version tags remain on both branches after the rename.You want them for diff, merge, and log but not for checkout and update.

Using this method, it's possible to migrate branches individually, butextra care is required to verify that two existing RCS files arereally related in this way. Also, individual versions must be copied,with all of their metadata, from one RCS file to the other onapplicable branches. This is a major pain.


Good luck!

[Prev in Thread]

Current Thread

[Next in Thread]

Re: The best approach for moving files and retaining history, Paul Sander <=
- RE: The best approach for moving files and retaining history, Arthur Barrett, 2009/07/01

Next by Date: RE: The best approach for moving files and retaining history
Next by thread: RE: The best approach for moving files and retaining history
Index(es):
- Date
- Thread