Go Data

Open-source Go projects categorized as Data

Top 23 Go Data Projects

  • cloudquery

    The open source high performance ELT framework powered by Apache Arrow

  • Project mention: We might want to regularly keep track of how important each server is | news.ycombinator.com | 2024-02-06

    Check out CloudQuery - https://github.com/cloudquery/cloudquery for an easy cloud asset inventory.

  • cue

    The home of the CUE language! Validate and define text-based and dynamic configuration

  • Project mention: Show HN: Workout Tracker – self-hosted, single binary web application | news.ycombinator.com | 2024-02-29

    Where `kube.cue` sets reasonable defaults (e.g. image is /). The "cluster" runs on a mini PC in my basement, and I have a small Digital Ocean VM with a static IP acting as an ingress (networking via Tailscale). Backups to cloud storage with restic, alerting/monitoring with Prometheus/Grafana, Caddy/Tailscale for local ingress.

    [1] https://www.talos.dev/

    [2] https://cuelang.org/

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • flyte

    Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

  • Project mention: First 15 Open Source Advent projects | dev.to | 2023-12-15

    9. Flyte by Union AI | Github | tutorial

  • gofakeit

    Random fake data generator written in go

  • Project mention: I've made my first PR. | /r/cscareerquestions | 2023-11-03
  • memphis

    Memphis.dev is a highly scalable and effortless data streaming platform

  • Project mention: Memphis | /r/devopspro | 2023-05-11
  • Stats

    A well tested and comprehensive Golang statistics library package with no dependencies. (by montanaflynn)

  • incubator-devlake

    Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • rill

    Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code. (by rilldata)

  • Project mention: Governments on GitHub | news.ycombinator.com | 2023-06-09
  • tigris

    Tigris is an Open Source Serverless NoSQL Database and Search Platform.

  • Project mention: How to use fly.io and Tigris to deploy a Next.js app | dev.to | 2024-04-02

    You can learn more about fly.io and tigris, we will need to create an account on both platforms for this project regardless. Anyway with the theory out of the way let's get started in the next section as we create our accounts and start building the app.

  • finance-go

    :bar_chart: Financial markets data library implemented in go.

  • Project mention: finance-go: NEW Data - star count:602.0 | /r/algoprojects | 2023-05-13
  • tyson

    🥊 TypeScript as a Configuration Language. TySON stands for TypeScript Object Notation

  • Project mention: TySON: TypeScript Object Notation | news.ycombinator.com | 2024-02-04
  • aqueduct

    Aqueduct is no longer being maintained. Aqueduct allows you to run LLM and ML workloads on any cloud infrastructure. (by RunLLM)

  • ArtiVC

    A version control system to manage large files.

  • Dataplane

    Dataplane is a data platform that makes it easy to construct a data mesh with automated data pipelines and workflows.

  • guardian

    Guardian is universal data access management tool with automated access workflows and security controls across data stores, analytical systems, and cloud products. (by raystack)

  • pgsink

    Logically replicate data out of Postgres into sinks (files, Google BigQuery, etc)

  • steampipe-postgres-fdw

    The Steampipe foreign data wrapper (FDW) is a zero-ETL product that provides Postgres foreign tables which translate queries into API calls to cloud services and APIs. It's bundled with Steampipe and also available as a set of standalone extensions for use in your own Postgres database.

  • rtdl

    rtdl makes it easy to build and maintain a real-time data lake (by realtimedatalake)

  • steampipe-sqlite

    Steampipe SQLite is a zero-ETL engine for SQLite. Virtual tables translate queries into live API calls for cloud services and APIs. Hundreds of plugins with thousands of documented examples.

  • Project mention: Steampipe SQLite – Virtual tables translated for common APIs | news.ycombinator.com | 2023-12-20
  • go-notebook

    Go-Notebook is inspired by Jupyter Project (link) in order to document Golang code.

  • turbine-go

    Turbine Library for Go

  • pippin

    Go library to create and manage data pipelines on your machine

  • Project mention: Go concurrency simplified. Part 4: Post office as a data pipeline | dev.to | 2023-12-21

    take a look at the concurrent code written by other devs out there: for example, feel free to check the internals of my library Pippin, but I bet there are many better projects out there to learn from - Google/Bing/DuckDuckGo/Kagi and ChatGPT can help to find the right one

  • amplify

    Bacalhau Amplify: automatic enrichment, enhancement, and explanation of your data (by bacalhau-project)

  • Project mention: Jupyter Lab Extension to run your GPU-heavy stuff (for free for now) on somebody's else server without blocking yours | /r/datascience | 2023-09-22

    When using Jupyter Lab and running GPU-heavy notebooks are you annoyed that your computer is not usable for anything else? I made an extension which allows you to run complex AI inference, training,... remotely on decentralized servers [see bacalhau.org]. This allows you to work on multiple GPU-heavy notebooks in parallel. For now Bacalhau is free, so this is a really cool way to run GPU stuff.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2024-04-02.

Go Data related posts


What are some of the best open-source Data projects in Go? This list will help you:

Project Stars
1 cloudquery 5,581
2 cue 4,737
3 flyte 4,727
4 gofakeit 4,195
5 memphis 3,145
6 Stats 2,877
7 incubator-devlake 2,420
8 rill 1,338
9 tigris 885
10 finance-go 683
11 tyson 530
12 aqueduct 521
13 ArtiVC 281
14 Dataplane 182
15 guardian 134
16 pgsink 76
17 steampipe-postgres-fdw 61
18 rtdl 43
19 steampipe-sqlite 43
20 go-notebook 38
21 turbine-go 16
22 pippin 14
23 amplify 10

SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives