| | dduper | duperemove |
|---|---|---|
| Mentions | 6 | 16 |
| Stars | 162 | 662 |
| Growth | - | - |
| Activity | 5.4 | 9.2 |
| Last commit | 6 months ago | about 1 month ago |
| Language | Python | C |
| License | GNU General Public License v3.0 only | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
dduper
-
NIST Retires SHA-1 Cryptographic Algorithm
In some cases deduplication happens at the file system layer transparently without you even realizing it. E.g. there are tools like https://github.com/lakshmipathi/dduper
I agree that image editing workflows are a different use case more suited to perceptual hashes than cryptographic hashes.
-
Can I view the internal hash values for files?
dduper uses a patched btrfs command to read file hashes from the raw disk. It requires root access and is kind of a hack.
- Ask HN: Who Wants to Collaborate?
-
Deduplication experiences with various tools?
There are various tools to use for COW deduplication, such as bees, duperemove, rmlint, jdupes, and dduper
-
DSM 7: Release Candidate released!
Hmm, could we use something like dduper to achieve this, if they don't have it included?
duperemove
-
fdupes: Identify or Delete Duplicate Files
Very useful for identifying files that may need to get deduplicated or that can be removed entirely. Unfortunately, I don't think this will also find identical directories.
If deleting files isn't what you want, I'd suggest looking into deduplicating tools.
ZFS has its own deduplicator built in, which is nice. It should just deduplicate files and individual extents of files by itself once you enable it. Probably not a good idea on very write-heavy disks, but it's an option.
Other file systems with extent-level deduplication can use https://github.com/markfasheh/duperemove to not only deduplicate files, but also deduplicate individual extents. This can be very useful for file systems that store a lot of duplicate content, like different WINE prefixes. For filesystems without extent deduplication, duperemove should try hard linking files to make them take up practically no disk space.
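To make the difference between file-level and extent-level deduplication concrete, here is a minimal Python sketch that hashes files in fixed-size blocks and reports blocks that occur more than once. It is only an illustration of the general idea, not how duperemove or any other tool listed here is implemented; the function name `find_duplicate_blocks` and the 128 KiB block size are assumptions made for this example, and the sketch only finds candidates without asking the filesystem to share them.

```python
import hashlib
from collections import defaultdict

# Assumed block size for illustration only; real tools choose their own.
BLOCK_SIZE = 128 * 1024


def find_duplicate_blocks(paths, block_size=BLOCK_SIZE):
    """Return {digest: [(path, offset), ...]} for blocks seen more than once.

    Hypothetical helper: it only identifies candidate duplicate blocks and
    never modifies any file.
    """
    locations = defaultdict(list)
    for path in paths:
        with open(path, "rb") as handle:
            offset = 0
            while True:
                block = handle.read(block_size)
                if not block:
                    break
                digest = hashlib.sha256(block).hexdigest()
                locations[digest].append((str(path), offset))
                offset += len(block)
    # Keep only digests that occur at two or more locations.
    return {d: locs for d, locs in locations.items() if len(locs) > 1}
```

Two files that differ in their first block but share a later one would show up here with a single shared digest, which is the kind of partial overlap that whole-file duplicate finders miss.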
-
Bcachefs Merged into the Linux 6.7 Kernel
ZFS now has reflink support, which doesn't require lots of RAM, but isn't done automatically while writing. You need to run something like https://github.com/markfasheh/duperemove
- ZFS 2.2.0 (RC): Block Cloning merged
- Craziest thing I ever used SQLite for: partial file deduplication
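As a rough illustration of what "reflink" means in the Bcachefs thread above, the following Python sketch asks the Linux kernel to clone one file into another with the `FICLONE` ioctl and falls back to an ordinary copy when the filesystem refuses. It is a sketch under stated assumptions, not what duperemove does internally: the helper name `clone_or_copy` is made up for this example, the numeric `FICLONE` value is the one used on common Linux ABIs such as x86-64 and may differ elsewhere, and the code is Linux-only.

```python
import fcntl
import shutil

# FICLONE from <linux/fs.h>; this value is assumed for common Linux ABIs
# (for example x86-64) and is not guaranteed on every architecture.
FICLONE = 0x40049409


def clone_or_copy(src_path, dst_path):
    """Reflink src to dst if the filesystem allows it, else copy the data.

    Hypothetical helper. Returns True when the clone ioctl succeeded and
    False when it fell back to a regular copy.
    """
    with open(src_path, "rb") as src, open(dst_path, "wb") as dst:
        try:
            # On success both files share the same extents until one changes.
            fcntl.ioctl(dst.fileno(), FICLONE, src.fileno())
            return True
        except OSError:
            # Filesystem without reflink support, cross-device clone, etc.
            pass
    shutil.copyfile(src_path, dst_path)
    return False
```

Either way the destination ends up with the same contents as the source; the return value only tells you whether the space is shared.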
-
Deduplication on EXT4
Then duperemove
-
What is a wineprefix and should I make a new one every time I add a new game to lutris?
Filesystems like Btrfs and XFS have support for deduplication; you can use a program like duperemove to save space.
-
File deduplication report?
Maybe you could use file-based deduplication instead of block-based? https://github.com/markfasheh/duperemove
-
Anyone running Bees? Or deduping data some other way?
If not bees, do you run other programs for deduping? I see jdupes has support for BTRFS, https://github.com/jbruchon/jdupes, and also duperemove, https://github.com/markfasheh/duperemove.
-
Is switching to BTRFS useful for my use-cases?
It's a good filesystem, I use it with a special setup that needs a filesystem with snapshots. It's been stable for me, I run duperemove (https://github.com/markfasheh/duperemove) occasionally and that's about all the maintenance it needs.
-
With Proton being as good as it is now, do we still need separate prefixes for every game?
With Btrfs or XFS you can easily deduplicate the data with tools like duperemove, potentially saving a lot of space if you've installed many small games.
What are some alternatives?
bees - Best-Effort Extent-Same, a btrfs dedupe agent
dupeguru - Find duplicate files
jdupes - A powerful duplicate file finder and an enhanced fork of 'fdupes'.
r8152 - Synology DSM driver for Realtek RTL8152/RTL8153/RTL8156 based adapters
btrfs - WinBtrfs - an open-source btrfs driver for Windows
rmlint - Extremely fast tool to remove duplicates and other lint from your filesystem
Deduper - The goal of this project is to make a deduper program that anybody can run on their computer to save storage space.
compsize - btrfs: find compression type/ratio on a file or set of files
Typesense - Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
bees-docker - docker container for zygo/bees