| | lrzip | Duplicacy |
|---|---|---|
| Mentions | 7 | 136 |
| Stars | 595 | 5,025 |
| Growth | - | - |
| Activity | 3.7 | 5.6 |
| Latest commit | 24 days ago | about 1 month ago |
| Language | C | Go |
| License | GNU General Public License v3.0 only | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
lrzip
- How to Get Your Backup to Half of Its Size – ZSTD Support in XtraBackup
lrzip
Long Range ZIP or LZMA RZIP
https://github.com/ckolivas/lrzip
"A compression utility that excels at compressing large files (usually > 10-50 MB). Larger files and/or more free RAM means that the utility will be able to more effectively compress your files (ie: faster / smaller size), especially if the filesize(s) exceed 100 MB. You can either choose to optimise for speed (fast compression / decompression) or size, but not both."
- File compression
7zip and XZ are almost always the best in any comparison. (They use the same algorithm.) Occasionally something new comes along that may be better, but it fades away... Like lrzip. https://lkml.org/lkml/2011/6/4/23 https://github.com/ckolivas/lrzip
- If we found a way to reverse a hashing function, would that make them ultra-compression algorithms?
For example, lrzip has an intense "dupe hunting" mode that takes days for large content, but it compresses very well once it's done (and expansion is fast). I use it on long-term storage backups, disk images and junk. Completely incompatible with streaming, unlike chunk-based formats like gzip or deflate, although unpacking can stream, such as when searching or verifying a tar archive. But the original source has to be file-based so that seeking for the dupe hunting can work across the entire file as a single block.
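As a toy illustration of that whole-file "dupe hunting" idea (this is not lrzip's actual rzip pre-processing; the block size and file name are arbitrary placeholders), a scan like the following can spot identical blocks that are much too far apart for a small-window compressor such as gzip to ever match:

```python
# Toy illustration (not lrzip's actual algorithm) of the "long range" idea:
# treat a whole file as one block and find chunks that repeat far apart,
# beyond the reach of a small-window compressor like gzip (32 KB window).
import hashlib

BLOCK = 4096  # granularity of the toy match finder

def long_range_matches(data: bytes):
    """Yield (earlier_offset, later_offset) pairs of identical blocks."""
    seen = {}  # block hash -> first offset where it appeared
    for off in range(0, len(data) - BLOCK + 1, BLOCK):
        digest = hashlib.blake2b(data[off:off + BLOCK]).digest()
        if digest in seen:
            yield seen[digest], off
        else:
            seen[digest] = off

if __name__ == "__main__":
    with open("disk-image.img", "rb") as f:   # placeholder large file
        payload = f.read()                    # whole file in memory, as in the quote
    for src, dst in long_range_matches(payload):
        print(f"block at {dst} repeats block at {src} ({dst - src} bytes apart)")
```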
- Lrzip – Long Range Zip or LZMA RZIP
- Ask HN: How would you store 10PB of data for your startup today?
Best I know of for that is something like lrzip still, but even then it's probably not state of the art. https://github.com/ckolivas/lrzip
It'll also take a hell of a long time to do the compression and decompression. It'd probably be better to do some kind of chunking and deduplication instead of compression itself, simply because I don't think you're ever going to have enough RAM to store any kind of dictionary that would effectively handle so much data. You'd also not want to have to re-read and reconstruct that dictionary just to get at some random image.
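A minimal sketch of the chunk-and-deduplicate approach suggested here (the rolling-sum boundary rule and the size parameters are arbitrary choices for illustration, not any particular tool's algorithm):

```python
# Minimal sketch of content-defined chunking plus deduplication, the approach
# suggested above for data sets too large for a single compression dictionary.
# Boundary rule and parameters are arbitrary choices for illustration.
import hashlib

MIN_CHUNK, MASK = 2048, 0x1FFF  # roughly 8 KB average chunk size

def chunks(data: bytes):
    """Split data at content-defined boundaries using a simple rolling sum."""
    start, rolling = 0, 0
    for i, byte in enumerate(data):
        rolling = ((rolling << 1) + byte) & 0xFFFFFFFF
        if i - start >= MIN_CHUNK and (rolling & MASK) == 0:
            yield data[start:i + 1]
            start, rolling = i + 1, 0
    if start < len(data):
        yield data[start:]

def dedup_store(data: bytes, store: dict) -> list:
    """Store unique chunks keyed by hash; return the recipe to rebuild data."""
    recipe = []
    for chunk in chunks(data):
        key = hashlib.sha256(chunk).hexdigest()
        store.setdefault(key, chunk)   # identical chunks are stored only once
        recipe.append(key)
    return recipe

if __name__ == "__main__":
    store = {}
    blob = b"hello world " * 100_000
    recipe = dedup_store(blob, store)
    assert b"".join(store[k] for k in recipe) == blob
    print(f"{len(recipe)} chunks referenced, {len(store)} stored uniquely")
```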
- Encrypted Backup Shootout
There's also lrzip for large files: https://github.com/ckolivas/lrzip
Duplicacy
- Rclone syncs your files to cloud storage
- Duplicity
I have been having great luck doing incremental backups with the very similarly named Duplicacy: https://duplicacy.com/
- Restic – Simple Backups
- A new generation cross-platform cloud backup tool
- Researching what to use for purely local Linux home server backup (no cloud backups)
Pro: No need for a special index database. The chunks are placed in the file system. This explains it in greater detail. Seems to place great emphasis on reliability, which is important for me. Versioning is also supported.
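A hedged sketch of what "no index database, chunks placed in the file system" can look like in practice: each chunk is written under a path derived from its hash, so a simple existence check doubles as the deduplication lookup. The directory layout below is illustrative only, not Duplicacy's actual storage format.

```python
# Hedged sketch of hash-addressed chunk storage: the file system doubles as the
# dedup index, so no separate database is needed. The directory layout here is
# illustrative only, not Duplicacy's actual on-disk format.
import hashlib
from pathlib import Path

STORE = Path("backup-store/chunks")  # placeholder storage root

def save_chunk(chunk: bytes) -> str:
    """Write a chunk under a path derived from its hash; skip if it exists."""
    digest = hashlib.sha256(chunk).hexdigest()
    path = STORE / digest[:2] / digest[2:]   # fan out by hash prefix
    if not path.exists():                    # existence check == dedup lookup
        path.parent.mkdir(parents=True, exist_ok=True)
        path.write_bytes(chunk)
    return digest

def load_chunk(digest: str) -> bytes:
    return (STORE / digest[:2] / digest[2:]).read_bytes()

if __name__ == "__main__":
    ref = save_chunk(b"example chunk data")
    assert load_chunk(ref) == b"example chunk data"
    print("stored chunk", ref)
```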
- Your privacy is optional
Having all your data in one place isn't wise though, so I am planning on storing encrypted backups on Dropbox and Backblaze B2 using Duplicity so that I am following the 3-2-1 backup rule.
- Kopia: Open-Source, Fast and Secure Open-Source Backup Software
- Ask HN: How do you do backups for personal/home server?
I tried a bunch of different ways but ultimately settled on Duplicacy [0].
It runs inside a Docker container and backs up both my data and configurations like my docker compose file and smb.conf.
Off site storage was Backblaze B2, but I moved to Hetzner. Likely will move back just because B2 is cheaper and a bit faster for my region.
Another layer of backup I do is use Duplicacy to backup to a portable hard drive occasionally that I keep off site.
[0] https://duplicacy.com/
- Before I deploy to several computers: UrBackup, Bacula, Duplicati or Syncovery (paid)?
Duplicacy
- Kopia VS duplicati for homeserver backups
I use Kopia and it works well. I have also used this: https://duplicacy.com
What are some alternatives?
bupstash - Easy and efficient encrypted backups.
restic - Fast, secure, efficient backup program
rdedup - Data deduplication engine, supporting optional compression and public key encryption.
Duplicati - Store securely encrypted backups in the cloud!
duplicity - mirror of duplicity: https://code.launchpad.net/duplicity
rclone - "rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files
LeoFS - The LeoFS Storage System
BorgBackup - Deduplicating archiver with compression and authenticated encryption.
kopia - Cross-platform backup tool for Windows, macOS & Linux with fast, incremental backups, client-side end-to-end encryption, compression and data deduplication. CLI and GUI included.
ParlAI - A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
borg - Search and save shell snippets without leaving your terminal