Top 23 Go llm Projects
-
flyte
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
-
casibase
⚡️Open-source AI LangChain-like RAG (Retrieval-Augmented Generation) knowledge database with web UI and Enterprise SSO⚡️, supports OpenAI, Azure, LLaMA, Google Gemini, HuggingFace, Claude, Grok, etc., chat bot demo: https://demo.casibase.com, admin UI demo: https://demo-admin.casibase.com
-
BricksLLM
🔒 Enterprise-grade API gateway that helps you monitor and impose cost or rate limits per API key. Get fine-grained access control and monitoring per user, application, or environment. Supports OpenAI, Azure OpenAI, Anthropic, vLLM, and open-source LLMs.
-
aqueduct
Aqueduct is no longer being maintained. Aqueduct allows you to run LLM and ML workloads on any cloud infrastructure. (by RunLLM)
-
hof
Framework that joins data models, schemas, code generation, and a task engine. Language and technology agnostic.
-
agency
🕵️♂️ Library designed for developers eager to explore the potential of Large Language Models (LLMs) and other generative AI through a clean, effective, and Go-idiomatic approach. (by neurocult)
-
evalgpt
EvalGPT is a code interpreter framework that uses large language models to automate writing and executing code, delivering precise results for user-defined tasks. (by index-labs)
-
helix
Multi-node production AI stack. Run the best of open source AI easily on your own servers. Create your own AI by fine-tuning open source models (by helixml)
-
llama-nuts-and-bolts
A holistic way of understanding how LLaMA and its components run in practice, with code and detailed documentation.
-
glide
🐦 An open, blazing-fast, simple model gateway for rapid development of production GenAI apps (by EinStack)
Zilliz (zilliz.com) | Hybrid/ONSITE (SF, NYC) | Full-time
I am part of the hiring team for DevRel
NYC - https://boards.greenhouse.io/zilliz/jobs/4307910005
SF - https://boards.greenhouse.io/zilliz/jobs/4317590005
Zilliz is the company behind Milvus (https://github.com/milvus-io/milvus), the most starred vector database on GitHub. Milvus is a distributed vector database that shines in 1B+ vector use cases. Examples include autonomous driving, e-commerce, and drug discovery. (and, of course, RAG)
We are also hiring for other roles, such as product managers, software engineers, and recruiters, though I am not personally involved in hiring for those.
I'm building Plandex (https://github.com/plandex-ai/plandex), which currently uses the OpenAI API. I'm working on support for Anthropic and OSS models right now and hoping I can ship it later today.
You can self-host it so that data is only going to the model provider (i.e. OpenAI) and nowhere else, and it gives you fine-grained control of context, so you can pick and choose exactly which files you want to load in. It's not going to pull in anything in the background that you don't want uploaded.
There's a contributor working on integration with local models and making some progress, so that will likely be an option in the future as well, but for now it should at least be a pretty big improvement for you compared to the copy-paste-heavy ChatGPT workflow.
9. Flyte by Union AI | GitHub | tutorial
Project mention: Open-source AI knowledge database with web UI and Enterprise SSO | news.ycombinator.com | 2023-12-21
Project mention: Zep: Fast, scalable building blocks for production LLM apps | news.ycombinator.com | 2023-09-22
You can reuse a simple LLaMA tokenizer right in your Go code; look here:
https://github.com/gotzmann/llama.go/blob/8cc54ca81e6bfbce25...
Project mention: What AI assistants are already bundled for Linux? | news.ycombinator.com | 2024-03-01
Perhaps this: https://github.com/yusufcanb/tlm?
It is not distro-bundled (yet), but I have it running on Fedora Linux 39 on a NUC with 16GB of RAM. Performance is good enough for me.
Project mention: Ask HN: Are SQL developers generally familiar with JSON, VSCode and Docker? | news.ycombinator.com | 2024-03-14
Many business analysts use SQL and have for a long time. They are probably not your target audience. With the problem being Java-specific, you'd likely want to start there.
This sounds similar to the goals of my hof tool (https://github.com/hofstadter-io/hof): lift type definitions out of code so they can be defined in one place, then generate the code for all the places. Does that sound like what you are after?
I would, at the very least, wrap the errors being returned inside the process function https://github.com/neurocult/agency/blob/14b14e50a7570189388...
Or, I suppose the user must handle exception behavior in their custom `OperationHandler`
Model downloaders of the world unite!
Here’s my PR ;,) https://github.com/bodaay/HuggingFaceModelDownloader/pull/25
Project mention: Galah: An LLM-powered web honeypot using the OpenAI API | news.ycombinator.com | 2024-02-02
Project mention: Show HN: EvalGPT – Code interpreter and agent framework inspired by Google Borg | news.ycombinator.com | 2023-09-05
Project mention: Show HN: We got fine-tuning Mistral-7B to not suck | news.ycombinator.com | 2024-02-07
If you look at the source [1] you can see how they solved their "what are the doctors going to do" problem. It is literally included in one of the prompts now:
Users tend to ask broad, vague questions of the document in order to test that the system is working. We want those queries to work well. For example, a user would ask "what are the doctors going to do?" of a document that is about a junior doctors' strike. Take this into account when generating the questions - in particular, refer to noun phrases by less specific descriptions, so for example instead of "junior doctors", say "doctors" in your questions.
[1]: https://github.com/helixml/helix/blob/main/api/pkg/dataprep/...
Project mention: AIKit: Build and deploy LLMs easily with only Docker | news.ycombinator.com | 2023-12-12
Project mention: Generating Code Without Generating Technical Debt? | news.ycombinator.com | 2023-11-10
I’ve built conviction that code generation only gets useful in the long term when it is entirely deterministic, or filtered through humans. Otherwise it is almost always technical debt. Hence LLM code generation products are a cool toy, but no sensible teams will use them without an amazing “Day 2” workflow.
As an example, in my day job (https://speakeasyapi.dev), we sell code generation products using the OpenAPI specification to generate downstream artefacts (language SDKs, terraform providers, markdown documentation). The determinism makes it useful: API updates propagate continuously from server code, to specifications, then to the SDKs / providers / docs site. There are no breaking changes because the pipeline is deterministic and humans are in control of the API at the start. The code generation itself is just a means to an end: removing boilerplate effort and language differences by driving it from a source of truth (server API routes/types). Continuously generated, it is not debt.
We’ve put a lot of effort into trying to make an LLM agent useful in this context. However, giving it control of generated code directly makes it hard to keep the “no breaking changes” and “consistency” restrictions that are needed to make code generation useful.
The trick we’ve landed on to get utility out of an LLM in a code generation task is to restrict it to manipulating a strictly typed interface document, such that it can only do non-breaking things to code (e.g. adjust comments / descriptions / examples) by making changes through this interface.
Project mention: Show HN: LLaMA Nuts and Bolts, A holistic way of understanding how LLMs run | news.ycombinator.com | 2024-03-20
Project mention: Glide, an open fast model gateway for production-ready GenAI apps | news.ycombinator.com | 2024-02-13
Project mention: Show HN: Open-source SDK for creating custom code interpreters with any LLM | news.ycombinator.com | 2024-04-19
We'll have nice and easy support for self-hosting soon-ish.
In the meantime, everything is open-source and the infra is codified with Terraform. GCP should have the best support now. If you want to dig into it, we'd love to give you support along the road so we can improve the process.
Our infra repo [0] is a good place to start. Once you have E2B deployed, you can just change E2B_DOMAIN env var and use our SDK.
Feel free to email me, join our Discord, or open an issue if you have any questions
[0] https://github.com/e2b-dev/infra
Go llm related posts
- Show HN: Plandex – an AI coding engine for complex tasks
- Discovering Devin, Devika, and OpenDevin
- We built a highly scalable LLM gateway with go
- I built an open-source tool that helps add usage-based billing for your LLM projects
- How well can LLMs write COBOL?
- Milvus VS pgvecto.rs - a user suggested alternative (2 projects | 13 Mar 2024)
- HuggingFace Is Down
Index
What are some of the best open-source llm projects in Go? This list will help you:
# | Project | Stars |
---|---|---|
1 | Milvus | 26,857 |
2 | plandex | 8,740 |
3 | flyte | 4,761 |
4 | casibase | 2,075 |
5 | zep | 1,978 |
6 | llama.go | 1,160 |
7 | tlm | 1,023 |
8 | BricksLLM | 730 |
9 | aqueduct | 521 |
10 | hof | 475 |
11 | lingoose | 461 |
12 | agency | 375 |
13 | HuggingFaceModelDownloader | 364 |
14 | galah | 261 |
15 | evalgpt | 242 |
16 | helix | 201 |
17 | llama2.go | 177 |
18 | aikit | 173 |
19 | speakeasy | 139 |
20 | llama-nuts-and-bolts | 103 |
21 | glide | 82 |
22 | infra | 80 |
23 | chatgptui | 76 |