Top 17 Go data-engineering Projects
-
Project mention: Data on Kubernetes: Part 4 - Argo Workflows: Simplify parallel jobs : Container-native workflow engine for Kubernetes 🔮 | dev.to | 2024-07-28
Remember to meet the prerequisites, including the AWS CLI, kubectl, Terraform, and the Argo Workflows CLI.
-
incubator-devlake
Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.
-
bacalhau
Community-driven, simple, yet powerful framework for fast, cost-effective distributed Compute over Data.
Absolutely no business model behind this - just Apache2/MIT. If you like it, just use it! If you don't, happy to tweak it!
[1] https://github.com/bacalhau-project/bacalhau
[2] https://github.com/bacalhau-project/examples/tree/main/utili...
[3] https://github.com/orgs/bacalhau-project/packages/container/...
-
conduit
Conduit streams data between data stores. Kafka Connect replacement. No JVM required. (by ConduitIO)
-
Dataplane
Dataplane is a data platform that makes it easy to construct a data mesh with automated data pipelines and workflows.
-
bulker
Service for bulk-loading data to databases with automatic schema management (Redshift, Snowflake, BigQuery, ClickHouse, Postgres, MySQL) (by jitsucom)
Project mention: Bulker: Streaming and batching large amounts of data into data warehouses | news.ycombinator.com | 2025-02-14
-
amplify
Bacalhau Amplify: automatic enrichment, enhancement, and explanation of your data (by bacalhau-project)
Go data-engineering related posts
-
Fivetran to Acquire Census
-
Apache DevLake
-
Code Quality at Scale with AST Grep and LLMs
-
Databrew Blink: Open-Source Database CDC Tool
-
connect VS goka - a user suggested alternative
2 projects | 23 Jul 2024
-
Engineering Metrics Are Overrated
-
Go concurrency simplified. Part 1: Channels and goroutines
-
Index
What are some of the best open-source data-engineering projects in Go? This list will help you:
# | Project | Stars |
---|---|---|
1 | argo | 15,713 |
2 | connect | 8,375 |
3 | cloudquery | 6,120 |
4 | lakeFS | 4,716 |
5 | Rudderstack | 4,207 |
6 | memphis | 3,303 |
7 | incubator-devlake | 2,748 |
8 | bacalhau | 806 |
9 | conduit | 528 |
10 | Dataplane | 226 |
11 | dud | 210 |
12 | bulker | 178 |
13 | rtdl | 45 |
14 | pippin | 14 |
15 | csv2opensearch | 12 |
16 | amplify | 12 |
17 | Gear5 | 3 |