pghoard VS backy2

Compare pghoard and backy2 and see how they differ.

backy2

backy2: Deduplicating block based backup software for ceph/rbd, image files and devices (by wamdam)
              pghoard              backy2
Mentions      2                    4
Stars         1,278                189
Growth        0.9%                 -
Activity      6.2                  0.0
Last commit   9 days ago           8 months ago
Language      Python               Python
License       Apache License 2.0   GNU General Public License v3.0 or later
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

pghoard

Posts with mentions or reviews of pghoard. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-11-22.
  • Future PostgreSQL: improvement to the replication protocol
    1 project | dev.to | 13 Jan 2022
    This story starts with our own PgHoard, a PITR backup tool for PostgreSQL. PgHoard offers several methods to archive the WAL (Write Ahead Log), including pg_receivewal, a small application shipping with PostgreSQL which connects to a PostgreSQL cluster using the physical replication protocol to stream WAL as they are produced, optionally keeping track of the position on the server using a replication slot.
  • Backup PostgreSQL
    2 projects | /r/PostgreSQL | 22 Nov 2021
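
The first post above describes archiving WAL with pg_receivewal and a replication slot. A minimal sketch of what such an invocation could look like is given here. It only builds and prints the command line and does not start streaming. The helper name `build_receivewal_command`, the directory, the slot name, and the connection defaults are all hypothetical, and this is not taken from PgHoard's own code. The slot is assumed to exist already (pg_receivewal can create one with `--create-slot`).

```python
import shlex


def build_receivewal_command(wal_dir, slot_name, host="localhost", port=5432, user="postgres"):
    """Build an argument list for pg_receivewal (hypothetical helper).

    wal_dir and slot_name are illustrative values chosen by the caller.
    """
    return [
        "pg_receivewal",
        f"--directory={wal_dir}",  # where received WAL segments are written
        f"--slot={slot_name}",     # replication slot that tracks the stream position on the server
        f"--host={host}",
        f"--port={port}",
        f"--username={user}",
    ]


if __name__ == "__main__":
    # Example values only; adjust to the actual cluster.
    cmd = build_receivewal_command("/var/backups/wal", "backup_slot")
    print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run` on a host where pg_receivewal is installed and the user has replication privileges.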

backy2

Posts with mentions or reviews of backy2. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-11.
  • Are small ceph clusters viable?
    3 projects | /r/ceph | 11 Jun 2023
    Overbuilt and OTT? Sure... but this works fantastically for my use case. I have current backups of everything except my media library because of the size of it; my VMs are all backed up to my Synology nightly using Backy2, my application data gets dumped to that same Synology NAS nightly as well, and all of that also gets synced to Glacier deep storage once a week using Duplicity. I'm going to be adding a new ZFS array later in the year to replace my Synology and hopefully I'll build it out with enough storage to take my media library as well.
  • PVE-based Ceph cluster build (II): Ceph storage pool build and basic performance testing
    1 project | /r/homelab | 13 Jan 2023
    It's also been fun for discovery of new things... new tools and use cases. Being able to use CephFS is great but also being able to leverage it as native S3 buckets is awesome. Learning how to manage snapshots both in RBD images (for my VMs) and CephFS is cool, and developing my own scripts to snapshot and replicate critical data to my Synology has been rewarding. There are also some pretty cool tools out there, even if they are not as well supported as ZFS, like backy2 for backing up RBD images... again to my Synology with a fun little script.
  • Advice on backing up a Ceph cluster
    1 project | /r/DataHoarder | 17 Aug 2021
    I've been a DataHoarder for a while, but only a modest ~10TB or so. I finally had the space to set up a rack and some servers, and am setting up a Ceph cluster with a ton of old disks I've accumulated over the years, totaling upwards of 20TB. I would like to still have an offsite and preferably offline backup for this data though, but backing up 20+ TB of data to a single drive is obviously off the table. Is there any other alternative to just deploying another Ceph cluster offsite? I don't want to use cloud storage due to the costs, and I also very much prefer to keep all my data under my own physical control. I was looking at Backy2 for the actual extraction of data and writing it to a destination, but that doesn't seem to support idempotent writes (i.e. take one full object and place it on a single drive). I could theoretically combine drives via LVM, but without additional redundancy (I would probably use raid 1 for that) losing one drive would be disastrous, and I am trying to avoid having to add additional redundancy for backups, considering the main ceph cluster will already have 3 copies of the data on it. I also am wondering if I should avoid using Ceph for the backups, since then all my eggs would be in the Ceph basket so to speak. I would love some advice from some of the folks with larger hoards and how you make backups. Thank you!
  • Backups for virtual servers on Ceph
    1 project | /r/ceph | 28 Jul 2021
    Have you tried reaching out to the dev on GitHub? I have before and was able to get some bugs ironed out. https://github.com/wamdam/backy2

What are some alternatives?

When comparing pghoard and backy2 you can also consider the following projects:

pgBackRest - Reliable PostgreSQL Backup & Restore

Back In Time - An easy-to-use backup tool for GNU Linux using rsync in the back

Butterfly-Backup - Butterfly Backup is a simple command-line wrapper around rsync for complex tasks, written in Python.

Barman - Backup and Recovery Manager for PostgreSQL

postgres-gcs-backup - Simple Docker image to backup a Postgres db, to a GCS bucket

RedditDownloader - Scrapes Reddit to download media of your choice.

wal-e - Continuous Archiving for Postgres

benji - Benji Backup: A block based deduplicating backup software for Ceph RBD images, iSCSI targets, image files and block devices

gmvault - Gmail backup software

XGP-save-extractor - Python script to extract savefiles out of Xbox Game Pass for PC games

microceph - Ceph for a one-rack cluster and appliances