Go llm

Open-source Go projects categorized as llm
Topics: Golang AI openai Go llama

Top 23 Go llm Projects

  • Milvus

    A cloud-native vector database, storage for next generation AI applications

  • Project mention: Ask HN: Who is hiring? (April 2024) | news.ycombinator.com | 2024-04-01

    Zilliz (zilliz.com) | Hybrid/ONSITE (SF, NYC) | Full-time

    I am part of the hiring team for DevRel

    NYC - https://boards.greenhouse.io/zilliz/jobs/4307910005

    SF - https://boards.greenhouse.io/zilliz/jobs/4317590005

    Zilliz is the company behind Milvus (https://github.com/milvus-io/milvus), the most starred vector database on GitHub. Milvus is a distributed vector database that shines in 1B+ vector use cases. Examples include autonomous driving, e-commerce, and drug discovery. (and, of course, RAG)

    We are also hiring for other roles that I am not personally involved in the hiring process for such as product managers, software engineers, and recruiters.

  • plandex

    An AI coding engine for building complex, real-world software with LLMs

  • Project mention: Meta Llama 3 | news.ycombinator.com | 2024-04-18

    I'm building Plandex (https://github.com/plandex-ai/plandex), which currently uses the OpenAI api--I'm working on support for Anthropic and OSS models right now and hoping I can ship it later today.

    You can self-host it so that data is only going to the model provider (i.e. OpenAI) and nowhere else, and it gives you fine-grained control of context, so you can pick and choose exactly which files you want to load in. It's not going to pull in anything in the background that you don't want uploaded.

    There's a contributor working on integration with local models and making some progress, so that will likely be an option the future as well, but for now it should at least be a pretty big improvement for you compared to the copy-paste heavy ChatGPT workflow.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • flyte

    Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

  • Project mention: First 15 Open Source Advent projects | dev.to | 2023-12-15

    9. Flyte by Union AI | Github | tutorial

  • casibase

    ⚡️Open-source AI LangChain-like RAG (Retrieval-Augmented Generation) knowledge database with web UI and Enterprise SSO⚡️, supports OpenAI, Azure, LLaMA, Google Gemini, HuggingFace, Claude, Grok, etc., chat bot demo: https://demo.casibase.com, admin UI demo: https://demo-admin.casibase.com

  • Project mention: Open-source AI knowledge database with web UI and Enterprise SSO | news.ycombinator.com | 2023-12-21
  • zep

    Zep: Long-Term Memory for ‍AI Assistants.

  • Project mention: Zep: Fast, scalable building blocks for production LLM apps | news.ycombinator.com | 2023-09-22
  • llama.go

    llama.go is like llama.cpp in pure Golang!

  • Project mention: Understanding GPT Tokenizers | news.ycombinator.com | 2023-06-08

    You might reuse simple LLaMA tokenizer right in your Go code, look there:

    https://github.com/gotzmann/llama.go/blob/8cc54ca81e6bfbce25...

  • tlm

    Local CLI Copilot, powered by CodeLLaMa. 💻🦙 (by yusufcanb)

  • Project mention: What AI assistants are already bundled for Linux? | news.ycombinator.com | 2024-03-01

    Perhaps this: https://github.com/yusufcanb/tlm?

    it is not distro bundled (yet), but I have it running on my Fedora Linux 39 running on a NUC with 16GB of RAM. Performance is good enough for me.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • BricksLLM

    🔒 Enterprise-grade API gateway that helps you monitor and impose cost or rate limits per API key. Get fine-grained access control and monitoring per user, application, or environment. Supports OpenAI, Azure OpenAI, Anthropic, vLLM, and open-source LLMs.

  • Project mention: We built a highly scalable LLM gateway with go | news.ycombinator.com | 2024-04-01
  • aqueduct

    Aqueduct is no longer being maintained. Aqueduct allows you to run LLM and ML workloads on any cloud infrastructure. (by RunLLM)

  • hof

    Framework that joins data models, schemas, code generation, and a task engine. Language and technology agnostic.

  • Project mention: Ask HN: Are SQL developers generally familiar with JSON, VSCode and Docker? | news.ycombinator.com | 2024-03-14

    Many business analysts use SQL, have for a long time. They are probably not your target audience. With the problem being JAVA specific, you'd likely want to start there

    This sounds similar to the goals of my hof tool (https://github.com/hofstadter-io/hof), lift type definitions out of code so they can be defined in one place, then generate the code for all the places. Is that sounding like what you are after?

  • lingoose

    🪿 LinGoose is a Go framework for building awesome AI/LLM applications.

  • Project mention: LLM in Go | /r/golang | 2023-11-10
  • agency

    🕵️‍♂️ Library designed for developers eager to explore the potential of Large Language Models (LLMs) and other generative AI through a clean, effective, and Go-idiomatic approach. (by neurocult)

  • Project mention: Agency: Pure Go LangChain Alternative | news.ycombinator.com | 2023-11-27

    I would, at the very least, wrap the errors being returned inside the process function https://github.com/neurocult/agency/blob/14b14e50a7570189388...

    Or, I suppose the user must handle exception behavior in their custom `OperationHandler`

  • HuggingFaceModelDownloader

    Simple go utility to download HuggingFace Models and Datasets

  • Project mention: HuggingFace Is Down | news.ycombinator.com | 2024-02-28

    Model downloaders of the world unite!

    Here’s my PR ;,) https://github.com/bodaay/HuggingFaceModelDownloader/pull/25

  • galah

    Galah: an LLM-powered web honeypot using the OpenAI API.

  • Project mention: Galah: An LLM-powered web honeypot using the OpenAI API | news.ycombinator.com | 2024-02-02
  • evalgpt

    EvalGPT is an code interpreter framework that utilizes large language models to automate the process of code-writing and execution, delivering precise results for user-defined tasks. (by index-labs)

  • Project mention: Show HN: EvalGPT – Code interpreter and agent framework inspired by Google Borg | news.ycombinator.com | 2023-09-05
  • helix

    Multi-node production AI stack. Run the best of open source AI easily on your own servers. Create your own AI by fine-tuning open source models (by helixml)

  • Project mention: Show HN: We got fine-tuning Mistral-7B to not suck | news.ycombinator.com | 2024-02-07

    If you look at the source [1] you can see how they solved their what are the doctors going to do problem. It is literally included in one of the prompts now:

    Users tend to ask broad, vague questions of the document in order to test that the system is working. We want those queries to work well. For example, a user would ask "what are the doctors going to do?" of a document that is about a junior doctors' strike. Take this into account when generating the questions - in particular, refer to noun phrases by less specific descriptions, so for example instead of "junior doctors", say "doctors" in your questions.

    [1]: https://github.com/helixml/helix/blob/main/api/pkg/dataprep/...

  • llama2.go

    LLaMA-2 in native Go

  • Project mention: LLAMA2.go | /r/golang | 2023-07-29
  • aikit

    🏗️ Fine-tune, build, and deploy open-source LLMs easily!

  • Project mention: AIKit: Build and deploy LLMs easily with only Docker | news.ycombinator.com | 2023-12-12
  • speakeasy

    Speakeasy CLI - Enterprise developer experience for your API (by speakeasy-api)

  • Project mention: Generating Code Without Generating Technical Debt? | news.ycombinator.com | 2023-11-10

    I’ve built conviction that code generation only gets useful in the long term when it is entirely deterministic, or filtered through humans. Otherwise it is almost always technical debt. Hence LLM code generation products are a cool toy, but no sensible teams will use them without an amazing “Day 2” workflow.

    As an example, in my day job (https://speakeasyapi.dev), we sell code generation products using the OpenAPI specification to generate downstream artefacts (language SDKs, terraform providers, markdown documentation). The determinism makes it useful — API updates propagate continuously from server code, to specifications, then to the SDKs / providers / docs site. There are no breaking changes because the pipeline is deterministic and humans are in control of the API at the start. The code generation itself is just a means to an end : removing boilerplate effort and language differences by driving it from a source of truth (server api routes/types). Continuously generated, it is not debt.

    We’ve put a lot of effort into trying to make an LLM agent useful in this context. However giving them control of generated code directly means it’s hard to keep the “no breaking changes”, and “consistency” restrictions that’s needed to make code generation useful.

    The trick we’ve landed on to get utility out of an LLM in a code generation task, is to restrict it to manipulating a strictly typed interface document, such that it can only do non-breaking things to code (e.g. adjust comments / descriptions / examples) by making changes through this interface.

  • llama-nuts-and-bolts

    A holistic way of understanding how LLaMA and its components run in practice, with code and detailed documentation.

  • Project mention: Show HN: LLaMA Nuts and Bolts, A holistic way of understanding how LLMs run | news.ycombinator.com | 2024-03-20
  • glide

    🐦 A open blazing-fast simple model gateway for rapid development of production GenAI apps (by EinStack)

  • Project mention: Glide, an open fast model gateway for production-ready GenAI apps | news.ycombinator.com | 2024-02-13
  • infra

    Infrastructure powering E2B - Cloud Runtime for AI Agents (by e2b-dev)

  • Project mention: Show HN: Open-source SDK for creating custom code interpreters with any LLM | news.ycombinator.com | 2024-04-19

    We'll have nice and easy support for self-hosting soon-ish.

    In the meantime, everything is open-source and the infra is codified with Terraform. GCP should have the best support now. If you want to dig into it, we'd love to give you support along the road so we can improve the process.

    Our infra repo [0] is a good place to start. Once you have E2B deployed, you can just change E2B_DOMAIN env var and use our SDK.

    Feel free to email me, join our Discord, or open an issue if you have any questions

    [0] https://github.com/e2b-dev/infra

  • chatgptui

    ChatGPT 🤖 with Textual User Interface (TUI) mode written in Go.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Go llm related posts

Index

What are some of the best open-source llm projects in Go? This list will help you:

Project Stars
1 Milvus 26,857
2 plandex 8,740
3 flyte 4,761
4 casibase 2,075
5 zep 1,978
6 llama.go 1,160
7 tlm 1,023
8 BricksLLM 730
9 aqueduct 521
10 hof 475
11 lingoose 461
12 agency 375
13 HuggingFaceModelDownloader 364
14 galah 261
15 evalgpt 242
16 helix 201
17 llama2.go 177
18 aikit 173
19 speakeasy 139
20 llama-nuts-and-bolts 103
21 glide 82
22 infra 80
23 chatgptui 76

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com