Top 23 Go Data Projects
-
flyte
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
Project mention: Boost your ML pipeline performance with efficient parallelism | dev.to | 2025-04-09
Flyte is a distributed computation framework that uses a Kubernetes Pod as the fundamental execution environment for each task in a pipeline. When you use MapTasks, Flyte automatically distributes the load among multiple Pods that run in parallel and limits each Pod to downloading and processing only a specific index from the inputs list, preventing inefficient duplicate data movement.
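The index-per-worker idea described in that mention can be sketched in plain Go. This is only an analogy under stated assumptions, not Flyte's API: goroutines stand in for Pods, and the names `processIndex` and `mapOverIndices` are hypothetical helpers invented for illustration. Each worker touches exactly one index of the input list, and a semaphore caps how many run at once.

```go
package main

import (
	"fmt"
	"sync"
)

// processIndex is a hypothetical stand-in for the work one worker
// (a Pod, in the Flyte description) does on a single input element.
// Here it simply squares the value at index i.
func processIndex(inputs []int, i int) int {
	return inputs[i] * inputs[i]
}

// mapOverIndices starts one goroutine per input index and allows at most
// `concurrency` of them to run at the same time. Each goroutine reads and
// writes only its own index, so no input is processed twice.
func mapOverIndices(inputs []int, concurrency int) []int {
	if concurrency < 1 {
		concurrency = 1 // avoid blocking forever on an unbuffered semaphore
	}
	results := make([]int, len(inputs))
	sem := make(chan struct{}, concurrency) // counting semaphore
	var wg sync.WaitGroup

	for i := range inputs {
		wg.Add(1)
		sem <- struct{}{} // acquire a slot before starting the worker
		go func(i int) {
			defer wg.Done()
			defer func() { <-sem }() // release the slot when done
			results[i] = processIndex(inputs, i)
		}(i)
	}

	wg.Wait()
	return results
}

func main() {
	fmt.Println(mapOverIndices([]int{1, 2, 3, 4}, 2)) // prints [1 4 9 16]
}
```

Writing each result into its own slot of a pre-sized slice keeps the output in input order without extra locking, which mirrors the one-index-per-worker split described above.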
-
Stats
A well-tested, comprehensive Golang statistics library with no dependencies. (by montanaflynn)
-
incubator-devlake
Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.
-
rill
Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.
Rill founder here, I have no comment on the UI similarity :) but I would emphasize our vision is building DuckDB-powered metrics layers and exploratory dashboards -- which we presented at DuckCon #6 last month, PDF below [1] -- and less on notebook style UIs like Hex and Jupyter.
Rill is fully open-source under the Apache license. [2]
[1] https://blobs.duckdb.org/events/duckcon6/mike-driscoll-rill-...
[2] https://github.com/rilldata/rill
-
tigris
I should mention Tigris[0] here. They're also a new Object Storage service, but they have this two-way replication facility with another S3-compatible service. The primary purpose they built it for is to mirror files from your existing S3 to Tigris as files are requested.
However, they also have an option to automatically copy files that are added to Tigris to S3 [1] (`--shadow-write-through`). I asked their founder whether it's okay to use it continuously for extra redundancy instead of as a one-time migration, and they said they have no issues with it.
[0] https://www.tigrisdata.com
-
pg_flo
Project mention: Kuvasz-streamer: open-source CDC for Postgres for low latency replication | news.ycombinator.com | 2025-01-03
* pg_flo: https://github.com/pgflo/pg_flo
Are there others? Each of them has slightly different angles and messaging, but it is interesting to see.
-
aqueduct
Aqueduct is no longer being maintained. Aqueduct allows you to run LLM and ML workloads on any cloud infrastructure. (by RunLLM)
-
Dataplane
Dataplane is a data platform that makes it easy to construct a data mesh with automated data pipelines and workflows.
-
guardian
Guardian is a universal data access management tool with automated access workflows and security controls across data stores, analytical systems, and cloud products. (by raystack)
-
steampipe-postgres-fdw
The Steampipe foreign data wrapper (FDW) is a zero-ETL product that provides Postgres foreign tables which translate queries into API calls to cloud services and APIs. It's bundled with Steampipe and also available as a set of standalone extensions for use in your own Postgres database.
-
steampipe-sqlite
Steampipe SQLite is a zero-ETL engine for SQLite. Virtual tables translate queries into live API calls for cloud services and APIs. Hundreds of plugins with thousands of documented examples.
Go Data related posts
-
cue VS rcl - a user suggested alternative
2 projects | 15 Mar 2025
-
Apache DevLake
-
Show HN: Holos - Configure Kubernetes with CUE data structures instead of YAML
-
Ask HN: Happy Thanksgiving What technology are you thankful for?
-
Stream, transform, and route PostgreSQL data in real-time
-
Stream, transform, and route PostgreSQL data in real-time (early build)
-
Pg_flo: Move and transform data between PostgreSQL databases
-
Index
What are some of the best open-source Data projects in Go? This list will help you:
# | Project | Stars |
---|---|---|
1 | flyte | 6,182 |
2 | cloudquery | 6,076 |
3 | cue | 5,399 |
4 | gofakeit | 4,873 |
5 | memphis | 3,294 |
6 | Stats | 2,966 |
7 | incubator-devlake | 2,703 |
8 | rill | 2,010 |
9 | tigris | 934 |
10 | pg_flo | 773 |
11 | finance-go | 735 |
12 | tyson | 553 |
13 | aqueduct | 520 |
14 | webpalm | 366 |
15 | ArtiVC | 298 |
16 | Dataplane | 226 |
17 | guardian | 136 |
18 | pgsink | 89 |
19 | steampipe-postgres-fdw | 78 |
20 | steampipe-sqlite | 55 |
21 | rtdl | 45 |
22 | go-notebook | 38 |
23 | pippin | 14 |