dedupe

Deduplicate files within a given list of directories by keeping one copy and making the rest hard-links. (by Gumnos)

Dedupe Alternatives

Similar projects and alternatives to dedupe

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better dedupe alternative or higher similarity.

dedupe reviews and mentions

Posts with mentions or reviews of dedupe. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-03-17.
  • fdupes alternatives?
    1 project | /r/commandline | 18 Jun 2022
    I wrote https://github.com/Gumnos/dedupe which sounds like it might be useful to you. It's faster than several of the alternatives I've found (many run the checksum across the whole of every file, this uses the file-size as a first-line discriminator, and only if the files are the same size does it go to the trouble of checking the checksum of the files). I designed it for creating hard-links in my media collection, but in the --dry-run mode, it should emit the file-names allowing you to pass it to xargs to remove them if it looks copacetic.
  • File Management via CLI
    7 projects | /r/commandline | 17 Mar 2022
    You can use my dedupe.py script with the dry-run flag (-n) to find all the duplicates on your drive. If you run it without the dry-run flag, it will attempt to make hard-links so that each file exists only once on the drive with multiple hard-links to the underlying file. It should be pretty fast, only needing to checksum file-content in the event that files have the same size (several other such deduplication methods work by checksumming every file on the drive which can be slow).
  • What tools / utilities have you written that you use regularly?
    42 projects | /r/commandline | 17 Sep 2021
    a file-deduplication utility that hard-links duplicate files to save space (our family photo gallery gets pics put in multiple albums for various audiences, so I can cut down on a lot of duplication with this)
  • A note from our sponsor - SaaSHub
    www.saashub.com | 25 Apr 2024
    SaaSHub helps you find the best software and product alternatives Learn more →

Stats

Basic dedupe repo stats
3
3
0.0
about 6 years ago

Gumnos/dedupe is an open source project licensed under BSD 2-clause "Simplified" License which is an OSI approved license.

The primary programming language of dedupe is Python.


Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com