I am looking for existing ideas / solutions for the problem of finding the difference between the two directories. Specifically how to identify files that have been changed, their names can be changed and moved.
A small list of the things I have considered:
- With 75% of the matches in the content such as some estimates, with new files in DI B It does not seem sufficiently adequate (Problems with the problem include: Potential changes in content, compression or encryption, possible multi-match)
- Use alternative data streams to add an ID to each file Will work only on NTFS only.
- Add header / footer to each file and there is no way to guarantee adding header / footer, will not corrupt the file.
- To determine whether the file has been actually moved or simply moved, ask user input for each change, it is very difficult on the user.
- Using a special order requires a user to simply rename / move files, which will keep track of such changes, it is very difficult on the user.
- Setting up a file system watchcher to capture changes to the flight
Any ideas are welcome ...
I have some kind of distributed version control system such as git
, It can recognize all files very much like Operations such as copies, moves, rename, ...
No comments:
Post a Comment