Go LLM

Open-source Go projects categorized as LLM

Large Language Models

Top 23 Go LLM Projects

  1. ollama

    Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

    Project mention: No More API Bills: The Economics of Running LLMs Locally on Your Mac with ServBay | dev.to | 2025-05-22

    While tools like Ollama are fantastic, managing different services and ensuring everything plays nicely together in your development environment can still involve some setup. This is where ServBay steps in to make local AI development on macOS not just cost-effective, but also incredibly convenient.

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. Milvus

    Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

    Project mention: Vector Database Showdown: Architectural Insights for AI Developers | dev.to | 2025-05-19

    A Technical Evaluation of Milvus and Chroma DB

  4. LocalAI

    :robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

    Project mention: Nvidia on NixOS WSL – Ollama up 24/7 on your gaming PC | news.ycombinator.com | 2025-04-10

    If you're going to run Ollama in Windows anyway, why not use the native build? And if you want to use WSL, then I'd sugggest using something like LocalAI which gives you a lot more control and support for additional formats (GGML, GGUF, GPTQ, ONNX, etc).

    https://github.com/mudler/LocalAI

  5. gin-vue-admin

    🚀Vite+Vue3+Gin拥有AI辅助的基础开发平台,支持TS和JS混用。它集成了JWT鉴权、权限管理、动态路由、显隐可控组件、分页封装、多点登录拦截、资源权限、上传下载、代码生成器、表单生成器和可配置的导入导出等开发必备功能。

  6. kubesphere

    The container platform tailored for Kubernetes multi-cloud, datacenter, and edge management ⎈ 🖥 ☁️

    Project mention: KubeSphere v4.1.3 Open Source Edition Released | dev.to | 2025-03-27

    Submit Issues: GitHub Issues

  7. plandex

    Open source AI coding agent. Designed for large projects and real world tasks.

    Project mention: GitHub Copilot Coding Agent | news.ycombinator.com | 2025-05-19

    My tool Plandex[1] allows you to switch between automatic and manual context management. It can be useful to begin a task with automatic context loading while scoping it out and making the high level plan, then switch to the more 'aider-style' manual context once the relevant files are clearly established.

    1 - https://github.com/plandex-ai/plandex

    Also, a bit more on auto vs. manual context management in the docs: https://docs.plandex.ai/core-concepts/context-management

  8. flyte

    Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

    Project mention: Boost your ML pipeline performance with efficient parallelism | dev.to | 2025-04-09

    Flyte is a distributed computation framework that uses a Kubernetes Pod as the fundamental execution environment for each task in a pipeline. When you use MapTasks, Flyte automatically distributes the load among multiple Pods that run in parallel and limits each Pod to downloading and processing only a specific index from the inputs list, preventing inefficient duplicate data movement.

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  10. LLocalSearch

    LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.

  11. casibase

    ⚡️AI Cloud OS: Open-source enterprise-level AI knowledge base and MCP (model-context-protocol)/A2A (agent-to-agent) management platform with admin UI, user management and Single-Sign-On⚡️, supports ChatGPT, Claude, Llama, Ollama, HuggingFace, etc., chat bot demo: https://ai.casibase.com, admin UI demo: https://ai-admin.casibase.com

  12. deepflow

    eBPF Observability - Distributed Tracing and Profiling

    Project mention: Sentry Alternatives for Better Error Tracking | dev.to | 2024-09-24

    Deepflow is a technological tool that specializes in providing advanced analytics and insights for businesses. It features a robust platform for data analysis, visualization, and decision-making. Deepflow aims to assist organizations in effectively utilizing their data to make strategic decisions and enhance performance.

  13. algernon

    Small self-contained pure-Go web server with Lua, Teal, Markdown, Ollama, HTTP/2, QUIC, Redis, SQLite and PostgreSQL support ++

    Project mention: Ask HN: What Are You Working On? (February 2025) | news.ycombinator.com | 2025-02-23

    A self-contained little web server that supports Lua, HTTP/3, PostgreSQL and all sorts of different things:

    https://github.com/xyproto/algernon

    A snappy and configuration-free little editor/IDE for the terminal:

    https://github.com/xyproto/orbiton

    Also a game, a book and an album, but those are unreleased long-term projects.

  14. opencode

    Project mention: Claude Code SDK – Anthropic | news.ycombinator.com | 2025-05-19

    You might be interested in the OpenCode project: https://github.com/opencode-ai/opencode

    It's still under development but looks promising.

  15. tlm

    Local CLI Copilot, powered by Ollama. 💻🦙 (by yusufcanb)

  16. llama.go

    llama.go is like llama.cpp in pure Golang!

  17. kubectl-ai

    ✨ Kubectl plugin to create manifests with LLMs

    Project mention: Kubernetes and AI: 3 Open Source Tools Powered by OpenAI | dev.to | 2024-05-25

    yum install wget wget https://github.com/sozercan/kubectl-ai/releases/download/v0.0.10/kubectl-ai_linux_amd64.tar.gz tar xvf kubectl-ai_linux_amd64.tar.gz mv kubectl-ai /usr/local/bin/kubectl-ai

  18. gollama

    Go manage your Ollama models

    Project mention: Show HN: GoManageDocker – TUI tool to manage your Docker objects | news.ycombinator.com | 2024-07-17
  19. beelzebub

    A secure low code honeypot framework, leveraging LLM for System Virtualization.

    Project mention: Securing Kubernetes Using Honeypots to Detect and Prevent Lateral Movement Attacks | dev.to | 2025-04-04

    Clone Beelzebub repository: $ git clone https://github.com/mariocandela/beelzebub.git

  20. BricksLLM

    🔒 Enterprise-grade API gateway that helps you monitor and impose cost or rate limits per API key. Get fine-grained access control and monitoring per user, application, or environment. Supports OpenAI, Azure OpenAI, Anthropic, vLLM, and open-source LLMs.

  21. kubeai

    AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.

    Project mention: Implémenter le Retrieval Augmented Generation (RAG) de manière privée sur Kubernetes avec KubeAI et… | dev.to | 2024-12-30

    KubeAI

  22. beta9

    Run serverless GPU workloads with fast cold starts on bare-metal servers, anywhere in the world

    Project mention: Show HN: Run GPU Workloads Across Multiple Clouds | news.ycombinator.com | 2024-09-17
  23. lingoose

    🪿 LinGoose is a Go framework for building awesome AI/LLM applications.

  24. HuggingFaceModelDownloader

    Simple go utility to download HuggingFace Models and Datasets

  25. aperture

    Rate limiting, caching, and request prioritization for modern workloads

  26. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Go LLM discussion

Log in or Post with

Go LLM related posts

  • No More API Bills: The Economics of Running LLMs Locally on Your Mac with ServBay

    1 project | dev.to | 22 May 2025
  • De documentación y capturas al .feature: IA local con Llama 3.2 + Llava

    1 project | dev.to | 19 May 2025
  • The most powerful MCP Slack Server with no permission requirements on Workspaces

    1 project | news.ycombinator.com | 18 May 2025
  • Deploying LLMs on Amazon EKS using NVIDIA GPUs

    3 projects | dev.to | 16 May 2025
  • Flask API com DeepSeek-R1 via Ollama with Python

    2 projects | dev.to | 13 May 2025
  • Build an MCP Client in Minutes: Local AI Agents Just Got Real

    3 projects | dev.to | 9 May 2025
  • Understanding MCP Servers: The Model Context Protocol Explained

    2 projects | dev.to | 9 May 2025
  • A note from our sponsor - InfluxDB
    www.influxdata.com | 23 May 2025
    InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →

Index

What are some of the best open-source LLM projects in Go? This list will help you:

# Project Stars
1 ollama 141,141
2 Milvus 34,846
3 LocalAI 32,732
4 gin-vue-admin 22,850
5 kubesphere 15,950
6 plandex 13,489
7 flyte 6,245
8 LLocalSearch 5,913
9 casibase 3,627
10 deepflow 3,289
11 algernon 2,906
12 opencode 2,556
13 tlm 1,422
14 llama.go 1,356
15 kubectl-ai 1,173
16 gollama 1,086
17 beelzebub 1,082
18 BricksLLM 1,049
19 kubeai 951
20 beta9 782
21 lingoose 777
22 HuggingFaceModelDownloader 668
23 aperture 656

Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com

Did you know that Go is
the 4th most popular programming language
based on number of references?