Go Data Analysis

Open-source Go projects categorized as Data Analysis

Top 12 Go Data Analysis Projects

Data Analysis
  1. gonum

    Gonum is a set of numeric libraries for the Go programming language. It contains libraries for matrices, statistics, optimization, and more

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. pachyderm

    Data-Centric Pipelines and Data Versioning

  4. flyte

    Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

    Project mention: Boost your ML pipeline performance with efficient parallelism | dev.to | 2025-04-09

    Flyte is a distributed computation framework that uses a Kubernetes Pod as the fundamental execution environment for each task in a pipeline. When you use MapTasks, Flyte automatically distributes the load among multiple Pods that run in parallel and limits each Pod to downloading and processing only a specific index from the inputs list, preventing inefficient duplicate data movement.

  5. cloudquery

    The developer first cloud governance platform

  6. octosql

    OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.

    Project mention: Feldera Incremental Compute Engine | news.ycombinator.com | 2024-09-29
  7. incubator-devlake

    Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.

    Project mention: Apache DevLake | news.ycombinator.com | 2025-01-19
  8. rill

    Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.

    Project mention: The DuckDB Local UI | news.ycombinator.com | 2025-03-12

    Rill founder here, I have no comment on the UI similarity :) but I would emphasize our vision is building DuckDB-powered metrics layers and exploratory dashboards -- which we presented at DuckCon #6 last month, PDF below [1] -- and less on notebook style UIs like Hex and Jupyter.

    Rill is fully open-source under the Apache license. [2]

    [1] https://blobs.duckdb.org/events/duckcon6/mike-driscoll-rill-...

    [2] https://github.com/rilldata/rill

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  10. bruin

    Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.

    Project mention: I built a data pipeline tool in Go | dev.to | 2024-12-23

    Bruin CLI is an end-to-end data pipeline tool that brings together data ingestion, data transformation with SQL and Python, and data quality in a single framework.

  11. bacalhau

    Community-driven, simple, yet powerful framework for fast, cost-effective distributed Compute over Data.

    Project mention: Show HN: Sample NCSA Log Generator | news.ycombinator.com | 2025-03-15

    Absolutely no business model behind this - just Apache2/MIT. If you like it, just use it! If you don't, happy to tweak it!

    [1] https://github.com/bacalhau-project/bacalhau

    [2] https://github.com/bacalhau-project/examples/tree/main/utili...

    [3] https://github.com/orgs/bacalhau-project/packages/container/...

  12. Dataplane

    Dataplane is a data platform that makes it easy to construct a data mesh with automated data pipelines and workflows.

  13. go-dataframe

    A simple package to abstract away the process of creating usable DataFrames for data analytics. This package is heavily inspired by the amazing Python library, Pandas.

  14. rtdl

    rtdl makes it easy to build and maintain a real-time data lake (by realtimedatalake)

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Go Data Analysis discussion

Log in or Post with

Go Data Analysis related posts

  • Atuin Desktop: Runbooks That Run

    5 projects | news.ycombinator.com | 22 Apr 2025
  • Show HN: Sample NCSA Log Generator

    2 projects | news.ycombinator.com | 15 Mar 2025
  • Apache DevLake

    1 project | news.ycombinator.com | 19 Jan 2025
  • Code Quality at Scale with AST Grep and LLMs

    2 projects | news.ycombinator.com | 17 Oct 2024
  • Engineering Metrics Are Overrated

    1 project | dev.to | 3 Jul 2024
  • How to set up interface to accept multi-dimension array?

    2 projects | /r/golang | 13 Jul 2023
  • Ask HN: Who is hiring? (December 2022)

    14 projects | news.ycombinator.com | 1 Dec 2022
  • A note from our sponsor - InfluxDB
    www.influxdata.com | 19 May 2025
    InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →

Index

What are some of the best open-source Data Analysis projects in Go? This list will help you:

# Project Stars
1 gonum 8,005
2 pachyderm 6,227
3 flyte 6,233
4 cloudquery 6,087
5 octosql 5,035
6 incubator-devlake 2,723
7 rill 2,049
8 bruin 937
9 bacalhau 806
10 Dataplane 226
11 go-dataframe 125
12 rtdl 45

Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com

Did you know that Go is
the 4th most popular programming language
based on number of references?