Versioning data in Postgres? Testing a Git like approach

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • simonwillisonblog-backup

    Backups of the database for simonwillison.net

  • In case it's useful to anyone, I've been running the cheapest possible implementation of Git-based revision history for my blog's PostgreSQL database for a few years now.

    I run a GitHub Actions workflow every two hours which grabs the latest snapshot of the database, writes the key tables out as newline-delimited JSON and commits them to a Git repository.

    https://github.com/simonw/simonwillisonblog-backup

    This gives me a full revision history (1,500+ commits at this point) for all of my content and I didn't have to do anything extra in my PostgreSQL or Django app to get it.

    If you need version tracking for audit purposes or to give you the ability to manually revert a mistake, and you're dealing with tens-of-thousands of rows, I think this is actually a pretty solid simple way to get that.

  • neon

    Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, branching, and bottomless storage.

  • Neon does something like this I believe. They allow you to "branch" your databases: https://neon.tech

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • aquameta

    Web development platform built entirely in PostgreSQL

  • Aquameta has an interesting extension that something like this in a different way: https://github.com/aquametalabs/aquameta/tree/master/extensi....

  • Logidze

    Database changes log for Rails

  • There's an interesting approach to it that works with Rails and PostgreSQL using triggers.

    https://github.com/palkan/logidze

  • temporal_tables

    Postgresql temporal_tables extension in PL/pgSQL, without the need for external c extension. (by nearform)

  • It was reimplemented in pure SQL here https://github.com/nearform/temporal_tables for this purpose

  • pgreplay

    pgreplay reads a PostgreSQL log file (*not* a WAL file), extracts the SQL statements and executes them in the same order and relative time against a PostgreSQL database cluster.

  • pgreplay parses not the WAL Write Ahead Log but the log file: https://github.com/laurenz/pgreplay

    From "A PostgreSQL Docker container that automatically upgrades your database" (2023) https://news.ycombinator.com/item?id=36748041 :

    pgkit wraps Postgres PITR backup and recovery:

  • pgkit

    Pgkit - Backup, PITR and recovery management made easy

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • blog

    Subscribe to this repository to receive notifications whenever a new article is published at https://vb-consulting.github.io/ (by vb-consulting)

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts