Go Data Science

Open-source Go projects categorized as Data Science | Edit details

Top 13 Go Data Science Projects

  • GitHub repo excelize

    Golang library for reading and writing Microsoft Excel™ (XLSX) files.

    Project mention: Excelize 2.5.0 is Released – Go language API for spreadsheet (Excel) document | reddit.com/r/golang | 2022-01-02

    Documentation website with multilingual: Arabic, German, Spanish, English, French, Russian, Chinese, Japanese, and Korean, which has been updated

  • GitHub repo gop

    GoPlus - The Go+ language for engineering, STEM education, and data science

    Project mention: Hacking Go compiler to add a new keyword | news.ycombinator.com | 2021-12-08

    I wonder if the author has heard of https://github.com/goplus/gop

    He'd have fun reverse engineering it.

  • Scout APM

    Less time debugging, more time building. Scout APM allows you to find and fix performance issues with no hassle. Now with error monitoring and external services monitoring, Scout is a developer's best friend when it comes to application development.

  • GitHub repo pachyderm

    Reproducible Data Science at Scale!

    Project mention: Dud: a tool for versioning data alongside source code, written in Go | reddit.com/r/golang | 2021-06-21
  • GitHub repo gophernotes

    The Go kernel for Jupyter notebooks and nteract.

    Project mention: Is there a program or plugin in that's similar to jupyter notebooks or google collab for Go lang? | reddit.com/r/golang | 2021-12-16
  • GitHub repo lgo

    Interactive Go programming with Jupyter

  • GitHub repo reflow

    A language and runtime for distributed, incremental data processing in the cloud

    Project mention: reflow - A language and runtime for distributed, incremental data processing in the cloud | reddit.com/r/DistributedComputing | 2021-05-20
  • GitHub repo dataframe-go

    DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration

  • SonarQube

    Static code analysis for 29 languages.. Your projects are multi-language. So is SonarQube analysis. Find Bugs, Vulnerabilities, Security Hotspots, and Code Smells so you can release quality code every time. Get started analyzing your projects today for free.

  • GitHub repo decimal

    A high-performance, arbitrary-precision, floating-point decimal library. (by ericlagergren)

    Project mention: Companies that use server-side Kotlin | reddit.com/r/Kotlin | 2021-12-27
  • GitHub repo qframe

    Immutable data frame for Go

    Project mention: Roapi: An API Server for Static Datasets | news.ycombinator.com | 2021-10-08


    Yes, in the abstract sense, which I guess you mean. QFrame (https://github.com/tobgu/qframe), the underlying dataframe used, is column oriented.

  • GitHub repo goro

    A High-level Machine Learning Library for Go

    Project mention: I'm looking for a Go computer vision package that isn't GoCV. | reddit.com/r/golang | 2021-10-16

    I've been meaning to try GoML and Goro, the latter being based on Gorgonia. No idea how relevant either are to your needs.

  • GitHub repo dud

    A lightweight CLI tool for versioning data alongside source code and building data pipelines.

    Project mention: Git-annex – Managing large files with Git | news.ycombinator.com | 2022-01-15

    Thanks for sharing your experience. It's non-trivial and surprising behavior like this that drove me to build a custom system[0] myself. When I started researching version control tools for large files, I remember feeling like git-annex and Git LFS were awkwardly bolted onto Git; Git simply wasn't designed for large files. Then I found DVC[1], and its approach rang true for me. However, after using DVC for a year or so, I grew tired of DVC's many puzzling behaviors (most of which are outlined in the README at [0]). In the end, I built the tool I wanted for the job -- one that is exceptionally simple and fast.

    [0]: https://github.com/kevin-hanselman/dud

  • GitHub repo beneath

    Beneath is a serverless real-time data platform ⚡️

    Project mention: Analyzing the r/wallstreetbets hivemind — August 2021 | dev.to | 2021-09-08

    If you’re interested, here’s the raw Reddit data, my data pipeline, the derived data, and my Jupyter notebook. I’m using Beneath, an open data platform I’m building, to stream and save the data.

  • GitHub repo mab

    Library for multi-armed bandit selection strategies, including efficient deterministic implementations of Thompson sampling and epsilon-greedy.

    Project mention: Can you share some Go package that you think has high quality clean code? | reddit.com/r/golang | 2021-04-13

    MAB library for Multi-Armed Bandits

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2022-01-15.

Go Data Science related posts


What are some of the best open-source Data Science projects in Go? This list will help you:

Project Stars
1 excelize 10,676
2 gop 7,896
3 pachyderm 5,326
4 gophernotes 3,139
5 lgo 2,222
6 reflow 888
7 dataframe-go 730
8 decimal 397
9 qframe 321
10 goro 292
11 dud 83
12 beneath 63
13 mab 27
Find remote jobs at our new job board 99remotejobs.com. There are 29 new remote jobs listed recently.
Are you hiring? Post a new remote job listing for free.
OPS - Build and Run Open Source Unikernels
Quickly and easily build and deploy open source unikernels in tens of seconds. Deploy in any language to any cloud.