SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 language-model Open-Source Projects
-
View on GitHub
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
-
Prompt-Engineering-Guide
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
Prompt Engineering Guide
-
gpt4free
The official gpt4free repository | various collection of powerful language models | opus 4.6 gpt 5.3 kimi 2.5 deepseek v3.2 gemini 3
Project mention: GPT4Free: "educational project" for free LLM inference from various services | news.ycombinator.com | 2025-06-30 -
Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
-
-
repomix
📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.
Most people paste files one at a time. Install Repomix and feed your entire project to the AI in one command.
-
ai
The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered applications and agents
Project mention: Vercel AI Gateway Appears to Block BYOK Requests When Account Balance Reaches $0 | news.ycombinator.com | 2026-06-01 -
-
Project mention: Stop Sending Medical Data to the Cloud: Build a 100% Private Health AI with WebLLM and Transformers.js | dev.to | 2026-05-03
Tech Stack: React (Vite), WebLLM, Transformers.js.
-
DocsGPT
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
-
-
RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.
Project mention: RWKV-7 beats Llama 3.2 with 3x fewer training tokens and formally exceeds TC^0 | news.ycombinator.com | 2026-02-23 -
OpenCLIP: Open Source Implementation
-
Project mention: LLM Fine-Tuning vs RAG: A Production Decision Framework for Engineering Teams | dev.to | 2026-06-04
LoRA (Hu et al., 2021) freezes the base model weights and injects trainable low-rank decomposition matrices into the attention layers. Instead of updating all 7 billion parameters of a 7B model, LoRA trains ~1–5% of equivalent parameters. Results:
-
Project mention: What is an LLM evaluation harness? A deep dive into lm-eval-harness | dev.to | 2026-06-03
EleutherAI started the project in 2020 as a unified way to reproduce published LLM benchmark numbers. It's now at v0.4.12 (May 2026), ships with 200+ tasks spanning reasoning, knowledge, coding, math, multilingual, and long-context benchmarks, and supports a long list of model backends: Hugging Face transformers, vLLM, SGLang, GPT-NeoX, Megatron-DeepSpeed, plus API endpoints for OpenAI, Anthropic, and a few others.
-
-
Star the Speech Brain repository ⭐
-
Project mention: Building Sentence Transformers in Rust: A Practical Guide with Burn, ONNX Runtime, and Candle | dev.to | 2025-10-30
HuggingFace Tokenizers: https://huggingface.co/docs/tokenizers
-
https://github.com/LostRuins/koboldcpp Download models at HuggingFace and run them locally. No logins, no spying, no hidden data harvesting.
-
-
MemOS
Self-evolving memory OS for LLM & AI Agents: ultra-persistent memory, hybrid-retrieval, and cross-task skill reuse, with 35.24% token savings (by MemTensor)
Project mention: Top 10 OpenClaw Development Patterns and Architecture Best Practices | dev.to | 2026-02-18Repository: https://github.com/MemTensor/MemOS Architecture Pattern: Layered Memory System
-
LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
language-model discussion
language-model related posts
-
LLM Fine-Tuning vs RAG: A Production Decision Framework for Engineering Teams
-
Operator: cuando responder no basta
-
What is an LLM evaluation harness? A deep dive into lm-eval-harness
-
Vercel AI Gateway Appears to Block BYOK Requests When Account Balance Reaches $0
-
Agentic engineering patterns that survive contact with production
-
Frontier AI in 2026, what actually changed and what did not
-
Liquid AI reveals 8B-A1B MoE trained on 38T
-
A note from our sponsor - SaaSHub
www.saashub.com | 7 Jun 2026
Index
What are some of the best open-source language-model projects? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | generative-ai-for-beginners | 111,683 |
| 2 | LLMs-from-scratch | 96,593 |
| 3 | Prompt-Engineering-Guide | 75,276 |
| 4 | gpt4free | 66,281 |
| 5 | Open-Assistant | 37,413 |
| 6 | stanford_alpaca | 30,261 |
| 7 | repomix | 25,919 |
| 8 | ai | 24,695 |
| 9 | mlc-llm | 22,749 |
| 10 | web-llm | 18,110 |
| 11 | DocsGPT | 17,921 |
| 12 | StableLM | 15,740 |
| 13 | RWKV-LM | 14,548 |
| 14 | open_clip | 13,882 |
| 15 | LoRA | 13,316 |
| 16 | lm-evaluation-harness | 12,818 |
| 17 | txtai | 12,627 |
| 18 | speechbrain | 11,592 |
| 19 | tokenizers | 10,795 |
| 20 | koboldcpp | 10,708 |
| 21 | ChatRWKV | 9,491 |
| 22 | MemOS | 9,615 |
| 23 | LMFlow | 8,488 |