Top 23 Go llm Projects

Milvus

104 26,857 10.0 Go

A cloud-native vector database, storage for next generation AI applications

Project mention: Ask HN: Who is hiring? (April 2024) | news.ycombinator.com | 2024-04-01

Zilliz (zilliz.com) | Hybrid/ONSITE (SF, NYC) | Full-time
I am part of the hiring team for DevRel
NYC - https://boards.greenhouse.io/zilliz/jobs/4307910005
SF - https://boards.greenhouse.io/zilliz/jobs/4317590005
Zilliz is the company behind Milvus (https://github.com/milvus-io/milvus), the most starred vector database on GitHub. Milvus is a distributed vector database that shines in 1B+ vector use cases. Examples include autonomous driving, e-commerce, and drug discovery. (and, of course, RAG)
We are also hiring for other roles that I am not personally involved in the hiring process for such as product managers, software engineers, and recruiters.

plandex

11 8,740 9.7 Go

An AI coding engine for building complex, real-world software with LLMs

Project mention: Meta Llama 3 | news.ycombinator.com | 2024-04-18

I'm building Plandex (https://github.com/plandex-ai/plandex), which currently uses the OpenAI api--I'm working on support for Anthropic and OSS models right now and hoping I can ship it later today.
You can self-host it so that data is only going to the model provider (i.e. OpenAI) and nowhere else, and it gives you fine-grained control of context, so you can pick and choose exactly which files you want to load in. It's not going to pull in anything in the background that you don't want uploaded.
There's a contributor working on integration with local models and making some progress, so that will likely be an option the future as well, but for now it should at least be a pretty big improvement for you compared to the copy-paste heavy ChatGPT workflow.

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
flyte

31 4,761 9.8 Go

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

Project mention: First 15 Open Source Advent projects | dev.to | 2023-12-15

9. Flyte by Union AI | Github | tutorial

casibase

6 2,075 9.7 Go

⚡️Open-source AI LangChain-like RAG (Retrieval-Augmented Generation) knowledge database with web UI and Enterprise SSO⚡️, supports OpenAI, Azure, LLaMA, Google Gemini, HuggingFace, Claude, Grok, etc., chat bot demo: https://demo.casibase.com, admin UI demo: https://demo-admin.casibase.com

Project mention: Open-source AI knowledge database with web UI and Enterprise SSO | news.ycombinator.com | 2023-12-21

zep

15 1,978 9.0 Go

Zep: Long-Term Memory for ‍AI Assistants.

Project mention: Zep: Fast, scalable building blocks for production LLM apps | news.ycombinator.com | 2023-09-22

llama.go

12 1,160 8.2 Go

llama.go is like llama.cpp in pure Golang!

Project mention: Understanding GPT Tokenizers | news.ycombinator.com | 2023-06-08

You might reuse simple LLaMA tokenizer right in your Go code, look there:
https://github.com/gotzmann/llama.go/blob/8cc54ca81e6bfbce25...

tlm

2 1,023 9.3 Go

Local CLI Copilot, powered by CodeLLaMa. 💻🦙 (by yusufcanb)

Project mention: What AI assistants are already bundled for Linux? | news.ycombinator.com | 2024-03-01

Perhaps this: https://github.com/yusufcanb/tlm?
it is not distro bundled (yet), but I have it running on my Fedora Linux 39 running on a NUC with 16GB of RAM. Performance is good enough for me.

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
BricksLLM

5 730 9.8 Go

🔒 Enterprise-grade API gateway that helps you monitor and impose cost or rate limits per API key. Get fine-grained access control and monitoring per user, application, or environment. Supports OpenAI, Azure OpenAI, Anthropic, vLLM, and open-source LLMs.

Project mention: We built a highly scalable LLM gateway with go | news.ycombinator.com | 2024-04-01

aqueduct

2 521 8.7 Go

Aqueduct is no longer being maintained. Aqueduct allows you to run LLM and ML workloads on any cloud infrastructure. (by RunLLM)
hof

33 475 8.9 Go

Framework that joins data models, schemas, code generation, and a task engine. Language and technology agnostic.

Project mention: Ask HN: Are SQL developers generally familiar with JSON, VSCode and Docker? | news.ycombinator.com | 2024-03-14

Many business analysts use SQL, have for a long time. They are probably not your target audience. With the problem being JAVA specific, you'd likely want to start there
This sounds similar to the goals of my hof tool (https://github.com/hofstadter-io/hof), lift type definitions out of code so they can be defined in one place, then generate the code for all the places. Is that sounding like what you are after?

lingoose

3 461 8.2 Go

🪿 LinGoose is a Go framework for building awesome AI/LLM applications.

Project mention: LLM in Go | /r/golang | 2023-11-10

agency

3 375 8.2 Go

🕵️‍♂️ Library designed for developers eager to explore the potential of Large Language Models (LLMs) and other generative AI through a clean, effective, and Go-idiomatic approach. (by neurocult)

Project mention: Agency: Pure Go LangChain Alternative | news.ycombinator.com | 2023-11-27

I would, at the very least, wrap the errors being returned inside the process function https://github.com/neurocult/agency/blob/14b14e50a7570189388...
Or, I suppose the user must handle exception behavior in their custom `OperationHandler`

HuggingFaceModelDownloader

6 364 6.9 Go

Simple go utility to download HuggingFace Models and Datasets

Project mention: HuggingFace Is Down | news.ycombinator.com | 2024-02-28

Model downloaders of the world unite!
Here’s my PR ;,) https://github.com/bodaay/HuggingFaceModelDownloader/pull/25

galah

2 261 5.2 Go

Galah: an LLM-powered web honeypot using the OpenAI API.

Project mention: Galah: An LLM-powered web honeypot using the OpenAI API | news.ycombinator.com | 2024-02-02

evalgpt

1 242 6.7 Go

EvalGPT is an code interpreter framework that utilizes large language models to automate the process of code-writing and execution, delivering precise results for user-defined tasks. (by index-labs)

Project mention: Show HN: EvalGPT – Code interpreter and agent framework inspired by Google Borg | news.ycombinator.com | 2023-09-05

helix

1 201 9.9 Go

Multi-node production AI stack. Run the best of open source AI easily on your own servers. Create your own AI by fine-tuning open source models (by helixml)

Project mention: Show HN: We got fine-tuning Mistral-7B to not suck | news.ycombinator.com | 2024-02-07

If you look at the source [1] you can see how they solved their what are the doctors going to do problem. It is literally included in one of the prompts now:
Users tend to ask broad, vague questions of the document in order to test that the system is working. We want those queries to work well. For example, a user would ask "what are the doctors going to do?" of a document that is about a junior doctors' strike. Take this into account when generating the questions - in particular, refer to noun phrases by less specific descriptions, so for example instead of "junior doctors", say "doctors" in your questions.
[1]: https://github.com/helixml/helix/blob/main/api/pkg/dataprep/...

llama2.go

1 177 8.0 Go

LLaMA-2 in native Go

Project mention: LLAMA2.go | /r/golang | 2023-07-29

aikit

2 173 9.4 Go

🏗️ Fine-tune, build, and deploy open-source LLMs easily!

Project mention: AIKit: Build and deploy LLMs easily with only Docker | news.ycombinator.com | 2023-12-12

speakeasy

7 139 9.8 Go

Speakeasy CLI - Enterprise developer experience for your API (by speakeasy-api)

Project mention: Generating Code Without Generating Technical Debt? | news.ycombinator.com | 2023-11-10

I’ve built conviction that code generation only gets useful in the long term when it is entirely deterministic, or filtered through humans. Otherwise it is almost always technical debt. Hence LLM code generation products are a cool toy, but no sensible teams will use them without an amazing “Day 2” workflow.
As an example, in my day job (https://speakeasyapi.dev), we sell code generation products using the OpenAPI specification to generate downstream artefacts (language SDKs, terraform providers, markdown documentation). The determinism makes it useful — API updates propagate continuously from server code, to specifications, then to the SDKs / providers / docs site. There are no breaking changes because the pipeline is deterministic and humans are in control of the API at the start. The code generation itself is just a means to an end : removing boilerplate effort and language differences by driving it from a source of truth (server api routes/types). Continuously generated, it is not debt.
We’ve put a lot of effort into trying to make an LLM agent useful in this context. However giving them control of generated code directly means it’s hard to keep the “no breaking changes”, and “consistency” restrictions that’s needed to make code generation useful.
The trick we’ve landed on to get utility out of an LLM in a code generation task, is to restrict it to manipulating a strictly typed interface document, such that it can only do non-breaking things to code (e.g. adjust comments / descriptions / examples) by making changes through this interface.

llama-nuts-and-bolts

1 103 8.6 Go

A holistic way of understanding how LLaMA and its components run in practice, with code and detailed documentation.

Project mention: Show HN: LLaMA Nuts and Bolts, A holistic way of understanding how LLMs run | news.ycombinator.com | 2024-03-20

glide

4 82 9.3 Go

🐦 A open blazing-fast simple model gateway for rapid development of production GenAI apps (by EinStack)

Project mention: Glide, an open fast model gateway for production-ready GenAI apps | news.ycombinator.com | 2024-02-13

infra

3 80 9.9 Go

Infrastructure powering E2B - Cloud Runtime for AI Agents (by e2b-dev)

Project mention: Show HN: Open-source SDK for creating custom code interpreters with any LLM | news.ycombinator.com | 2024-04-19

We'll have nice and easy support for self-hosting soon-ish.
In the meantime, everything is open-source and the infra is codified with Terraform. GCP should have the best support now. If you want to dig into it, we'd love to give you support along the road so we can improve the process.
Our infra repo [0] is a good place to start. Once you have E2B deployed, you can just change E2B_DOMAIN env var and use our SDK.
Feel free to email me, join our Discord, or open an issue if you have any questions
[0] https://github.com/e2b-dev/infra

chatgptui

3 76 2.2 Go

ChatGPT 🤖 with Textual User Interface (TUI) mode written in Go.
SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Go llm related posts

Show HN: Plandex – an AI coding engine for complex tasks
9 projects | news.ycombinator.com | 3 Apr 2024
Discovering Devin, Devika, and OpenDevin
1 project | news.ycombinator.com | 2 Apr 2024
We built a highly scalable LLM gateway with go
1 project | news.ycombinator.com | 1 Apr 2024
I built an open-source tool that helps add usage-based billing for your LLM projects
1 project | dev.to | 1 Apr 2024
How well can LLMs write COBOL?
1 project | news.ycombinator.com | 31 Mar 2024
Milvus VS pgvecto.rs - a user suggested alternative
2 projects | 13 Mar 2024
HuggingFace Is Down
1 project | news.ycombinator.com | 28 Feb 2024
A note from our sponsor - WorkOS
workos.com | 27 Apr 2024

The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →

Index

What are some of the best open-source llm projects in Go? This list will help you:

	Project	Stars
1	Milvus	26,857
2	plandex	8,740
3	flyte	4,761
4	casibase	2,075
5	zep	1,978
6	llama.go	1,160
7	tlm	1,023
8	BricksLLM	730
9	aqueduct	521
10	hof	475
11	lingoose	461
12	agency	375
13	HuggingFaceModelDownloader	364
14	galah	261
15	evalgpt	242
16	helix	201
17	llama2.go	177
18	aikit	173
19	speakeasy	139
20	llama-nuts-and-bolts	103
21	glide	82
22	infra	80
23	chatgptui	76