Duplicate file finder

This utility recursively searches directories for duplicate files (files with exactly matching content). With the --symlink option, each duplicate file is replaced by a relative symlink to a matching file. Alternatively, the --remove option deletes the duplicates.

Note: this utility is relatively untested and should be considered experimental.
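To illustrate what replacing a duplicate with a relative symlink involves, here is a minimal Python sketch. It is not the tool's own code: the function name `replace_with_relative_symlink` and the `.dedup-tmp` suffix are hypothetical, and the create-then-rename step is an assumption about how one might avoid a window in which the duplicate path does not exist.

```python
import os


def replace_with_relative_symlink(duplicate, original):
    """Replace `duplicate` with a symlink to `original`.

    The link target is expressed relative to the directory that holds
    the duplicate, so the link keeps working if the whole tree is moved.
    Hypothetical helper for illustration only.
    """
    link_dir = os.path.dirname(os.path.abspath(duplicate))
    target = os.path.relpath(os.path.abspath(original), start=link_dir)

    # Assumption: create the link under a temporary name, then rename it
    # over the duplicate, so the path is never missing in between.
    tmp = duplicate + ".dedup-tmp"
    os.symlink(target, tmp)
    os.replace(tmp, duplicate)
    return target
```

For a duplicate at a/b/dup.txt and a matching file at a/orig.txt, the link target computed this way is ../orig.txt.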

Usage

Find duplicate files in a directory structure

Usage: dedup [OPTIONS] <PATHS>...

Arguments:
  <PATHS>...  Directories to search

Options:
  -m, --min-size <MIN_SIZE>    Minimum size (in bytes) of files to search [default: 0]
  -v, --verbose                Print file names and sizes of the found duplicates
  -d, --max-depth <MAX_DEPTH>  Do not search files beyond this depth. Files in the specified paths are considered depth 1.
  -s, --symlink                Replace duplicate files by symlinks
      --remove                 Remove duplicate files
  -h, --help                   Print help information
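The depth rule for --max-depth (files directly inside a specified path count as depth 1) can be sketched in Python as follows. This is only an illustration of that counting convention: the function name `files_within_depth` is hypothetical, and skipping symlinks is an assumption, since the help text does not say how the tool treats them.

```python
import os


def files_within_depth(root, max_depth=None):
    """Yield regular files under `root`, where files directly inside
    `root` are at depth 1. `max_depth=None` means no limit."""
    stack = [(root, 1)]
    while stack:
        directory, depth = stack.pop()
        with os.scandir(directory) as entries:
            for entry in entries:
                # Assumption: symlinks are neither reported nor followed.
                if entry.is_file(follow_symlinks=False):
                    yield entry.path
                elif entry.is_dir(follow_symlinks=False):
                    # Descend only if files one level down stay in range.
                    if max_depth is None or depth < max_depth:
                        stack.append((entry.path, depth + 1))
```

Under this convention a maximum depth of 1 restricts the search to files directly inside the given directories, and a maximum depth of 2 also includes files in their immediate subdirectories.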

Algorithm

The tool tries to be reasonably efficient. It first builds an index that maps file sizes to paths. When a second file with the same size is found, the first 64 KiB of each of those files is hashed using SHA-256 and stored in a second index of files with that size. Only when two files of the same size also have matching hashes for their first 64 KiB are the full contents of the files hashed and compared.
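The three stages described above can be sketched in Python as follows. This is an illustrative approximation, not the tool's implementation: the names `sha256_of` and `find_duplicates` are hypothetical, the sketch groups files in batches rather than hashing lazily as each same-size file is discovered, it skips symlinks by assumption, and it treats equal full-file SHA-256 digests as identical content.

```python
import hashlib
import os
from collections import defaultdict

PREFIX_BYTES = 64 * 1024  # 64 KiB prefix, as described above


def sha256_of(path, limit=None):
    """Return the SHA-256 hex digest of a file, or of only its first
    `limit` bytes when `limit` is given."""
    digest = hashlib.sha256()
    remaining = limit
    with open(path, "rb") as f:
        while remaining is None or remaining > 0:
            want = 1 << 20 if remaining is None else min(1 << 20, remaining)
            chunk = f.read(want)
            if not chunk:
                break
            digest.update(chunk)
            if remaining is not None:
                remaining -= len(chunk)
    return digest.hexdigest()


def find_duplicates(paths, min_size=0):
    """Return sorted lists of paths whose contents hash identically."""
    # Stage 1: index by size. A file with a unique size has no duplicate.
    by_size = defaultdict(list)
    for root in paths:
        for dirpath, _dirs, files in os.walk(root):
            for name in files:
                path = os.path.join(dirpath, name)
                if os.path.islink(path) or not os.path.isfile(path):
                    continue  # assumption: only regular files are compared
                size = os.path.getsize(path)
                if size >= min_size:
                    by_size[size].append(path)

    groups = []
    for same_size in by_size.values():
        if len(same_size) < 2:
            continue
        # Stage 2: among same-size files, index by hash of the prefix.
        by_prefix = defaultdict(list)
        for path in same_size:
            by_prefix[sha256_of(path, PREFIX_BYTES)].append(path)
        for same_prefix in by_prefix.values():
            if len(same_prefix) < 2:
                continue
            # Stage 3: only now pay for hashing the full contents.
            by_full = defaultdict(list)
            for path in same_prefix:
                by_full[sha256_of(path)].append(path)
            groups.extend(sorted(g) for g in by_full.values() if len(g) > 1)
    return groups
```

The point of the staging is that most files are ruled out by size alone without being read, and large files that differ early are ruled out after reading at most 64 KiB each.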

License

Licensed under the Apache License, Version 2.0.

abspoel/dedup
