SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 large-language-model Open-Source Projects
-
Project mention: CVE-2026-33017: How I Found an Unauthenticated RCE in Langflow by Reading the Code They Already Fixed | dev.to | 2026-03-19
I reported this through Langflow's GitHub Security Advisory on February 25, 2026. The initial response took about two weeks and a couple of follow-up pings from my end. Once the team engaged, things moved quickly. They merged a fix in PR #12160, and the advisory was published on March 16, 2026.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
-
A hands-on, end-to-end course on building, evaluating, and deploying LLM applications. Ideal when you want a clear path from spark of an idea to deployment. Link: https://github.com/mlabonne/llm-course
-
Project mention: Llama-Factory: Unified, Efficient Fine-Tuning for 100 Open LLMs | news.ycombinator.com | 2025-09-18
-
gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
-
Project mention: I Tested Flowise, Dify, and n8n Across 30+ Client Deployments. Here Is My Verdict. | dev.to | 2026-04-07
Citation Capsule: n8n's GitHub community reached 182,000+ stars across a 7-year development history, with 70+ AI-specific nodes added in 2024 to 2025. Source: n8n GitHub. Dify crossed 106,000 stars on GitHub with an Apache 2.0 license. Source: Dify GitHub. Flowise reached 51,000+ stars with MIT license. Source: Flowise GitHub. Dify's minimum recommended RAM is 4 GB versus Flowise's 1 GB and n8n's 300 MB. Source: Dify Docs.
-
Ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Project mention: GSoC 2026 Predictions: 30 NEW AI/ML/Security Organizations You Should Start Contributing to NOW! | dev.to | 2026-02-06Main: https://github.com/ray-project/ray ⭐ 34k+
-
system_prompts_leaks
Extracted system prompts from Anthropic - Claude Code, Claude Design, Opus 4.8, Sonnet 4.6. OpenAI - ChatGPT 5.5 Thinking, GPT 5.5 Instant, Codex, Google - Gemini - 3.5 Flash, 3.1 Pro, Antigravity, xAI - Grok, Cursor, Copilot, VS Code, Perplexity, and more. Updated regularly.
Project mention: System_prompts_leaks: Anthropic/Claude-Opus-4.6.md | news.ycombinator.com | 2026-04-06 -
langextract
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
langextract: A tool for extracting language information. View on GitHub
-
Project mention: Show HN: Query years of Ask HN and Show HN discussions as local knowledge graph | news.ycombinator.com | 2026-05-10
I built lightrag-snkv, Basically it uses lightRAG https://github.com/HKUDS/LightRAG ,this requires various storage databases like key value store, graph database, vector database, I built single embedded file based database which covers all these requirements: https://github.com/hash-anu/snkv.
-
langfuse
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Project mention: Three Budget-Guardrail Failure Modes That Matter More Than Model Quality (May 2026) | dev.to | 2026-05-19Source: https://github.com/langfuse/langfuse/issues/12614 (open, updated 2026-05-14)
-
storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
-
awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
A one-stop repo for GenAI research updates, notebooks, interview prep, and more. Great for staying current while practicing with solid reference materials you can trust. Link: https://github.com/aishwaryanr/awesome-generative-ai-guide
-
Hands-On-Large-Language-Models
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
It’s the full code from the book, with notebooks covering LLM basics, training, and fine-tuning. If you like a guided, notebook-first path from foundations to customization, this feels like a friendly trail map. Link: https://github.com/HandsOnLLM/Hands-On-Large-Language-Models
-
GitHub Link: https://github.com/agentscope-ai/agentscope Summary: Agentscope is an agent-oriented programming library that makes it easier to build LLM applications. It's designed to be "developer-centric" with features like asynchronous execution, parallel tool calls, and real-time steering. It offers a transparent approach where prompt engineering and API invocation are fully visible and controllable. Why it's important: Agentscope, along with its related libraries like agentscope-runtime and agentscope-studio, provides a comprehensive toolkit for not only developing but also deploying and visualizing agent-based applications.
-
haystack
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.
Project mention: Show HN: Haystack – Review pull requests like you wrote them yourself | news.ycombinator.com | 2025-09-11I immediately thought this was an update by Deepset and their Haystack framework. https://haystack.deepset.ai/
Just FYI.
-
Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
-
FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
-
-
For kernel-level performance tuning you can use the occupancy calculator as pointed out by jplusqualt or you can profile your kernel with Nsight compute which will give you a ton of info.
But for model-wide performance, you basically have to come up with your own calculation to estimate the FLOPs required by your model and based on that figure out how well your model is maxing out the GPU capabilities (MFU/HFU).
Here is a more in-depth example on how you might do this: https://github.com/stas00/ml-engineering/tree/master/trainin...
-
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
-
camel
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
Project mention: Eigent: An open source Claude cowork alternative | news.ycombinator.com | 2026-01-14You can have a try; almost all sota models are supported all powered thanks to https://github.com/camel-ai/camel
-
generative-ai
Sample code and notebooks for Generative AI on Google Cloud, with Gemini Enterprise Agent Platform
Project mention: Gemini Embedding 2: Our first natively multimodal embedding model | dev.to | 2026-03-10Learn how to use the model in our interactive Gemini API and Vertex AI Colab notebooks. You can also use it through LangChain, LlamaIndex, Haystack, Weaviate, QDrant, ChromaDB, and Vector Search.
large-language-models discussion
large-language-models related posts
-
Gemma 4 12B: A unified, encoder-free multimodal model
-
How to track LLM costs per customer in production
-
Train LLMs from Scratch, Hermes Agent WebUI, & Efficient OlmoEarth v1.1 for Local AI
-
I scanned Langfuse. It observes its own LLM calls through its own platform.
-
DeepSeek Sparse Attention
-
Per-user cost attribution for your AI APP
-
Orthrus-Qwen3: up to 7.8×tokens/forward on Qwen3, identical output distribution
-
A note from our sponsor - SaaSHub
www.saashub.com | 7 Jun 2026
Index
What are some of the best open-source large-language-model projects? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | langflow | 149,263 |
| 2 | LLMs-from-scratch | 96,593 |
| 3 | llm-course | 79,907 |
| 4 | LlamaFactory | 71,870 |
| 5 | gpt_academic | 70,836 |
| 6 | Flowise | 53,317 |
| 7 | Ray | 42,791 |
| 8 | system_prompts_leaks | 41,269 |
| 9 | langextract | 36,808 |
| 10 | LightRAG | 36,193 |
| 11 | langfuse | 28,520 |
| 12 | storm | 28,323 |
| 13 | awesome-generative-ai-guide | 26,976 |
| 14 | Hands-On-Large-Language-Models | 26,813 |
| 15 | agentscope | 26,238 |
| 16 | haystack | 25,466 |
| 17 | Qwen | 21,244 |
| 18 | FinGPT | 20,392 |
| 19 | Chinese-LLaMA-Alpaca | 18,949 |
| 20 | ml-engineering | 18,044 |
| 21 | Awesome-Multimodal-Large-Language-Models | 17,850 |
| 22 | camel | 17,122 |
| 23 | generative-ai | 16,986 |