SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 llama Open-Source Projects
-
ollama
Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
This guide shows how to swap out every cloud API with a local Ollama server running qwen3-coder:30b. Same tools, same workflows, no data leaving your network.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
Project mention: Speculative decoding: when and why it actually speeds up inference | dev.to | 2026-06-04
Here's a real, runnable config that uses EAGLE for offline batched generation. It's straight from the vLLM repo's eagle.md example:
-
Project mention: Llama-Factory: Unified, Efficient Fine-Tuning for 100 Open LLMs | news.ycombinator.com | 2025-09-18
-
unsloth
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
Project mention: I Trained an LLM on 75K of My Own Messages So It Would Stop Writing Like a Chatbot | dev.to | 2026-05-08Training: unsloth + trl (SFTTrainer). Unsloth handles the 4-bit quantization and gradient checkpointing; trl handles the training loop.
-
LocalAI
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
LocalAI: https://github.com/mudler/LocalAI
-
Project mention: Microsoft pulled internal Claude Code licenses — what their engineers' preference tells you about picking AI tools | dev.to | 2026-06-06
Aider with a bring-your-own API key has no subscription floor. Pay per token. For 2-3 focused hours per week, costs less than $20/mo. For full-time use, costs more than Max. Best for terminal natives who already live in tmux and do not want a separate IDE.
-
AstrBot
AI Agent Assistant & development framework that integrates lots of IM platforms, LLMs, plugins and AI feature, and can be your openclaw alternative. ✨
-
git clone https://github.com/fishaudio/fish-speech.git cd fish-speech pip install uv uv sync
-
sglang
SGLang is a high-performance serving framework for large language models and multimodal models.
Project mention: DeepSeek makes the V4 Pro price discount permanent | news.ycombinator.com | 2026-05-22There are several things at play:
Inference stack efficiency: Many of these providers take off the shelf sglang / vllm / trtllm and hope for the best. Meanwhile DeepSeek team is known for pushing the boundary of optimizations.
Now, sglang and vllm are great pieces of software, but take DeepSeek's Sparse Attention (DSA). Introduced 1.5 years ago (https://arxiv.org/abs/2512.02556), used by DeepSeek 3.2, GLM 5, DeepSeek V4. Only now is it slowly strating to get optimized in the major inference engines: (https://github.com/sgl-project/sglang/issues/19380 https://github.com/sgl-project/sglang/pull/22851 etc.). Of course, DS V4 adds extra optimizations into the model architecture on top of DSA, and those will take more time to be taken full advantage of by the open source inference engines.
Privacy: Betting that people will pay extra for inference hosted outside China. This is especially true with DeepSeek, because DeepSeek is transparent about using API data for model improvements.
And few other things (scale (matters a lot for MoEs), reliability, soft enterprise lock in, etc.)
---
There is also, likely, tacit collusion at play here. Look at GLM 5 and GLM 5.1 prices. GLM 5 and 5.1 cost the same to run, but providers decided to charge much more for 5.1 because it is much better model, and because Z.AI raised their price as well.
-
repomix
📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.
Most people paste files one at a time. Install Repomix and feed your entire project to the AI in one command.
-
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
-
Project mention: 7 GitHub Repos I'd Actually Use to Find Free AI APIs and No-Cost AI Tools | dev.to | 2026-04-23
View on GitHub
-
-
Project mention: AirLLM Shrinks 70B LLMs to 4GB VRAM; DPO & Supermemory Boost Open Models | dev.to | 2026-06-03
-
llama-cookbook
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services
While people often focus on the big models, smaller and more specialised models are increasingly important. Codex for code generation, Stable Diffusion for images, and small LLaMA-class models for focused tasks are all good examples.
-
ChuanhuChatGPT
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
-
-
-
OpenLLM
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
Project mention: Your 2025 Roadmap to Becoming an AI Engineer for Free for Vue.js Developers | dev.to | 2025-08-06REST APIs to connect AI models to Vue.js apps (example 1, example 2).
-
langchain4j
LangChain4j is an idiomatic, open-source Java library for building LLM-powered applications on the JVM. It offers a unified API over popular LLM providers and vector stores, and makes implementing tool calling (including MCP support), agents and RAG easy. It integrates seamlessly with enterprise Java frameworks like Quarkus and Spring Boot.
In this article, we'll build a small, memory-backed assistant with LangChain4j and Oracle AI Database. The assistant can search prior incidents, runbooks, decisions, and shift handoffs to answer questions. It can write new memories back to the database so they become searchable in any session. Additionally, all user, agent, and tool messages are logged to database table for observability and auditing.
-
shell_gpt
A command-line productivity tool powered by AI large language models like GPT-5, will help you accomplish your tasks faster and more efficiently.
Project mention: Supercharge Your Terminal: ShellGPT + ChromaDB + LangChain for Context-Aware Automation | dev.to | 2025-09-01🗃 To explore ShellGPT in depth, including installation instructions, usage examples, and advanced configuration options, head over to the official ShellGPT GitHub repository.
-
-
tensorzero
TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.
Project mention: Supervised Fine Tuning on Curated Data Is Reinforcement Learning | news.ycombinator.com | 2025-07-29[I'm his coworker.] We ran Unsloth ourselves on a GPU-by-the-hour server. We have a notebook in the repository showing how to query historical data and use it with Unsloth.
It's a WIP PR that we plan to merge soon: https://github.com/tensorzero/tensorzero/pull/2273
llama discussion
llama related posts
-
Run Coding Agents on Local AI — Zero Cloud, Full Control
-
Run Gemma-4 12B on WSL2 with llama.cpp
-
Running Brand-New Gemma 4 12B on an 8-Year-Old GTX 1080 Ti: Speed, 3 Gotchas, and Why Q8 Beat Q4 on My Own Field
-
Agent Skills in Microsoft Agent Framework
-
5 Best Local LLM Tools and Models You Should Run in 2026
-
Run Cursor with a Local Model: Privacy-First AI Coding Without a Subscription
-
How fast is LlamaStash? Overhead, throughput, and a fair comparison with Ollama and LM Studio
-
A note from our sponsor - SaaSHub
www.saashub.com | 7 Jun 2026
Index
What are some of the best open-source llama projects? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | ollama | 173,220 |
| 2 | vllm | 81,898 |
| 3 | LlamaFactory | 71,870 |
| 4 | unsloth | 65,904 |
| 5 | LocalAI | 46,698 |
| 6 | aider | 45,826 |
| 7 | AstrBot | 33,820 |
| 8 | fish-speech | 30,666 |
| 9 | sglang | 28,872 |
| 10 | repomix | 25,919 |
| 11 | LLaVA | 24,756 |
| 12 | free-llm-api-resources | 22,947 |
| 13 | Chinese-LLaMA-Alpaca | 18,949 |
| 14 | airllm | 19,431 |
| 15 | llama-cookbook | 18,341 |
| 16 | ChuanhuChatGPT | 15,318 |
| 17 | dalai | 12,980 |
| 18 | PaddleNLP | 12,950 |
| 19 | OpenLLM | 12,346 |
| 20 | langchain4j | 12,221 |
| 21 | shell_gpt | 12,110 |
| 22 | ludwig | 11,710 |
| 23 | tensorzero | 11,435 |