Go Cdc

Open-source Go projects categorized as Cdc

Top 9 Go Cdc Projects

  1. peerdb

    Fast, Simple and a cost effective tool to replicate data from Postgres to Data Warehouses, Queues and Storage

    Project mention: Go channels are bad and you should feel bad (2016) | news.ycombinator.com | 2025-04-13

    I'm guilty of this too https://github.com/PeerDB-io/peerdb/blob/d36da8bb2f4f6c1c821...

    The inner channel is a poor man's future. Came up with this to have lua runtimes be able to process in parallel while maintaining ordering (A B C in, results of A B C out)

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. olake

    Fastest open-source tool for replicating Databases to Data Lake in Open Table Formats like Apache Iceberg. ⚡ Efficient, quick and scalable data ingestion for real-time analytics. Supporting Postgres, MongoDB and MySQL

    Project mention: Debezium to olake.io – PhysicsWallah switch for CDC | news.ycombinator.com | 2025-04-30

    -> Automatic schema evolution: new MongoDB fields appear as nullable columns; complex sub-docs land as JSON strings you can parse later

    -> Resumable, chunked full loads: a pod crash resumes instead of restarting

    -> Runs as either a Kubernetes CronJob or an Airflow task; config is one YAML/JSON file.

    Their stack in one line: MongoDB → OLake writer → Iceberg on S3 → Spark jobs → Trino / occasional Redshift, all orchestrated by Airflow and/or K8s.

    Posting here because many of us still bolt Kafka onto CDC just to land files. If you only need Iceberg tables, a simpler path might exist now. Curious to hear others’ experiences with broker-less CDC tools.

    (Disclaimer: I work on OLake and hosted the meetup, but the talk is purely technical.)

    Check out github repo - https://github.com/datazip-inc/olake

  4. transfer

    Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake, BigQuery, Redshift, Databricks) in real-time.

  5. tiflow

    This repo maintains DM (a data migration platform) and TiCDC (change data capture for TiDB)

  6. pgcapture

    A scalable Netflix DBLog implementation for PostgreSQL

    Project mention: Pgcapture – CDC Framework for PostgreSQL in Golang | news.ycombinator.com | 2024-09-02
  7. kuvasz-streamer

    Kuvasz-Streamer is a Postgres-to-Postgres data consolidation and change data capture project.

    Project mention: Kuvasz-streamer: open-source CDC for Postgres for low latency replication | news.ycombinator.com | 2025-01-03

    One will want to be cognizant of its AGPLv3 license https://github.com/kuvasz-io/kuvasz-streamer/blob/v1.19.2/LI...

  8. go-cdc-chunkers

    A Golang package that implements CDC chunkers with a generic interface

    Project mention: Ask HN: What Are You Working On? (June 2025) | news.ycombinator.com | 2025-06-29

    Oh !

    An ISC-licensed implementation of several Content-Defined Chunking algorithms in Golang at https://github.com/PlakarKorp/go-cdc-chunkers

    Whenever you have redundant data you want to store / transfer, this library lets you perform fast content defined chunking

  9. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  10. go-benchmarks

    Comprehensive and reproducible benchmarks for Go developers and architects. (by skerkour)

  11. imsy

    simple incremental pull of immutable large files

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Go Cdc discussion

Log in or Post with

Go Cdc related posts

Index

What are some of the best open-source Cdc projects in Go? This list will help you:

# Project Stars
1 peerdb 2,579
2 olake 925
3 transfer 658
4 tiflow 438
5 pgcapture 269
6 kuvasz-streamer 133
7 go-cdc-chunkers 21
8 go-benchmarks 10
9 imsy 10

Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com