Wednesday 15 April 2015

file - Directory contents diff -


I am looking for existing ideas / solutions for the problem of finding the difference between the two directories. Specifically how to identify files that have been changed, their names can be changed and moved.

A small list of the things I have considered:

  • With 75% of the matches in the content such as some estimates, with new files in DI B It does not seem sufficiently adequate (Problems with the problem include: Potential changes in content, compression or encryption, possible multi-match)
  • Use alternative data streams to add an ID to each file Will work only on NTFS only.
  • Add header / footer to each file and there is no way to guarantee adding header / footer, will not corrupt the file.
  • To determine whether the file has been actually moved or simply moved, ask user input for each change, it is very difficult on the user.
  • Using a special order requires a user to simply rename / move files, which will keep track of such changes, it is very difficult on the user.
  • Setting up a file system watchcher to capture changes to the flight

Any ideas are welcome ...

< P>

I have some kind of distributed version control system such as git

, It can recognize all files very much like Operations such as copies, moves, rename, ...


No comments:

Post a Comment