InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →
Top 12 Go Data Analysis Projects
-
gonum
Gonum is a set of numeric libraries for the Go programming language. It contains libraries for matrices, statistics, optimization, and more
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
-
flyte
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
Project mention: Boost your ML pipeline performance with efficient parallelism | dev.to | 2025-04-09Flyte is a distributed computation framework that uses a Kubernetes Pod as the fundamental execution environment for each task in a pipeline. When you use MapTasks, Flyte automatically distributes the load among multiple Pods that run in parallel and limits each Pod to downloading and processing only a specific index from the inputs list, preventing inefficient duplicate data movement.
-
-
octosql
OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.
-
incubator-devlake
Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.
-
rill
Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.
Rill founder here, I have no comment on the UI similarity :) but I would emphasize our vision is building DuckDB-powered metrics layers and exploratory dashboards -- which we presented at DuckCon #6 last month, PDF below [1] -- and less on notebook style UIs like Hex and Jupyter.
Rill is fully open-source under the Apache license. [2]
[1] https://blobs.duckdb.org/events/duckcon6/mike-driscoll-rill-...
[2] https://github.com/rilldata/rill
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
bruin
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
Bruin CLI is an end-to-end data pipeline tool that brings together data ingestion, data transformation with SQL and Python, and data quality in a single framework.
-
bacalhau
Community-driven, simple, yet powerful framework for fast, cost-effective distributed Compute over Data.
Absolutely no business model behind this - just Apache2/MIT. If you like it, just use it! If you don't, happy to tweak it!
[1] https://github.com/bacalhau-project/bacalhau
[2] https://github.com/bacalhau-project/examples/tree/main/utili...
[3] https://github.com/orgs/bacalhau-project/packages/container/...
-
Dataplane
Dataplane is a data platform that makes it easy to construct a data mesh with automated data pipelines and workflows.
-
go-dataframe
A simple package to abstract away the process of creating usable DataFrames for data analytics. This package is heavily inspired by the amazing Python library, Pandas.
-
Go Data Analysis discussion
Go Data Analysis related posts
-
Atuin Desktop: Runbooks That Run
-
Show HN: Sample NCSA Log Generator
-
Apache DevLake
-
Code Quality at Scale with AST Grep and LLMs
-
Engineering Metrics Are Overrated
-
How to set up interface to accept multi-dimension array?
-
Ask HN: Who is hiring? (December 2022)
-
A note from our sponsor - InfluxDB
www.influxdata.com | 19 May 2025
Index
What are some of the best open-source Data Analysis projects in Go? This list will help you:
# | Project | Stars |
---|---|---|
1 | gonum | 8,005 |
2 | pachyderm | 6,227 |
3 | flyte | 6,233 |
4 | cloudquery | 6,087 |
5 | octosql | 5,035 |
6 | incubator-devlake | 2,723 |
7 | rill | 2,049 |
8 | bruin | 937 |
9 | bacalhau | 806 |
10 | Dataplane | 226 |
11 | go-dataframe | 125 |
12 | rtdl | 45 |