Top 23 Python llama Projects
-
Project mention: Speculative decoding: when and why it actually speeds up inference | dev.to | 2026-06-04
Here's a real, runnable config that uses EAGLE for offline batched generation. It's straight from the vLLM repo's eagle.md example:
-
Project mention: Llama-Factory: Unified, Efficient Fine-Tuning for 100 Open LLMs | news.ycombinator.com | 2025-09-18
-
unsloth
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
Project mention: I Trained an LLM on 75K of My Own Messages So It Would Stop Writing Like a Chatbot | dev.to | 2026-05-08
Training: unsloth + trl (SFTTrainer). Unsloth handles the 4-bit quantization and gradient checkpointing; trl handles the training loop.
-
Project mention: Microsoft pulled internal Claude Code licenses — what their engineers' preference tells you about picking AI tools | dev.to | 2026-06-06
Aider with a bring-your-own API key has no subscription floor. Pay per token. For 2-3 focused hours per week, costs less than $20/mo. For full-time use, costs more than Max. Best for terminal natives who already live in tmux and do not want a separate IDE.
-
AstrBot
AI Agent Assistant & development framework that integrates lots of IM platforms, LLMs, plugins, and AI features, and can be your openclaw alternative. ✨
-
Project mention: Fish Audio S2-Pro: A TTS Model That Controls Speech Emotion with Natural Language | dev.to | 2026-03-11
git clone https://github.com/fishaudio/fish-speech.git
cd fish-speech
pip install uv
uv sync
-
sglang
SGLang is a high-performance serving framework for large language models and multimodal models.
Project mention: DeepSeek makes the V4 Pro price discount permanent | news.ycombinator.com | 2026-05-22
There are several things at play:
Inference stack efficiency: Many of these providers take off-the-shelf sglang / vllm / trtllm and hope for the best. Meanwhile, the DeepSeek team is known for pushing the boundary of optimizations.
Now, sglang and vllm are great pieces of software, but take DeepSeek's Sparse Attention (DSA). Introduced 1.5 years ago (https://arxiv.org/abs/2512.02556), used by DeepSeek 3.2, GLM 5, DeepSeek V4. Only now is it slowly starting to get optimized in the major inference engines (https://github.com/sgl-project/sglang/issues/19380 https://github.com/sgl-project/sglang/pull/22851 etc.). Of course, DS V4 adds extra optimizations into the model architecture on top of DSA, and those will take more time to be taken full advantage of by the open source inference engines.
Privacy: Betting that people will pay extra for inference hosted outside China. This is especially true with DeepSeek, because DeepSeek is transparent about using API data for model improvements.
And a few other things (scale (matters a lot for MoEs), reliability, soft enterprise lock-in, etc.)
---
There is also, likely, tacit collusion at play here. Look at GLM 5 and GLM 5.1 prices. GLM 5 and 5.1 cost the same to run, but providers decided to charge much more for 5.1 because it is a much better model, and because Z.AI raised their price as well.
-
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
-
Project mention: 7 GitHub Repos I'd Actually Use to Find Free AI APIs and No-Cost AI Tools | dev.to | 2026-04-23
-
Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models + local CPU/GPU training and deployment
-
ChuanhuChatGPT
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
-
OpenLLM
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
Project mention: Your 2025 Roadmap to Becoming an AI Engineer for Free for Vue.js Developers | dev.to | 2025-08-06
REST APIs to connect AI models to Vue.js apps (example 1, example 2).
-
shell_gpt
A command-line productivity tool powered by AI large language models like GPT-5 that will help you accomplish your tasks faster and more efficiently.
Project mention: Supercharge Your Terminal: ShellGPT + ChromaDB + LangChain for Context-Aware Automation | dev.to | 2025-09-01
🗃 To explore ShellGPT in depth, including installation instructions, usage examples, and advanced configuration options, head over to the official ShellGPT GitHub repository.
-
petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
There are already projects like Petals: https://github.com/bigscience-workshop/petals
-
inference
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.
-
oumi
Easily fine-tune, evaluate and deploy Gemma 4, Qwen3.5, Qwen3.6, gpt-oss, DeepSeek-R1, or any open source LLM / VLM!
Project mention: Open source x 3: GRPO training with OpenEnv, vLLM, and Oumi | news.ycombinator.com | 2025-11-12
-
GPTCache — an open source library built specifically for caching LLM responses. Pretty flexible and worth looking at if you're rolling your own pipeline.
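To make the idea of caching LLM responses concrete, here is a minimal sketch of an exact-match cache in plain Python. It is not GPTCache's API: the `ExactMatchCache` class, its `complete` method, and the `fake_llm` stand-in are hypothetical names chosen for illustration, and GPTCache itself goes further (for example, similarity-based matching). The sketch only shows the basic pattern of keying on a normalized prompt and skipping the model call on a hit.

```python
import hashlib
from typing import Callable, Dict


class ExactMatchCache:
    """Hypothetical exact-match response cache (illustration only, not GPTCache's API)."""

    def __init__(self, llm_call: Callable[[str], str]) -> None:
        self._llm_call = llm_call          # the expensive function being wrapped
        self._store: Dict[str, str] = {}   # cache key -> cached response
        self.hits = 0
        self.misses = 0

    @staticmethod
    def _key(prompt: str) -> str:
        # Collapse whitespace and lowercase so trivially different prompts share a key.
        normalized = " ".join(prompt.lower().split())
        return hashlib.sha256(normalized.encode("utf-8")).hexdigest()

    def complete(self, prompt: str) -> str:
        key = self._key(prompt)
        if key in self._store:
            self.hits += 1
            return self._store[key]
        # Cache miss: call the model once and remember the answer.
        self.misses += 1
        response = self._llm_call(prompt)
        self._store[key] = response
        return response


def fake_llm(prompt: str) -> str:
    # Stand-in for a real (slow, paid) model call; assumption for this sketch.
    return f"echo: {prompt.strip()}"


cache = ExactMatchCache(fake_llm)
first = cache.complete("What is vLLM?")
second = cache.complete("  what is  vllm? ")  # same prompt after normalization
```

In this run the second call is served from the cache, so `fake_llm` is invoked only once. A real pipeline would also need eviction and some policy for prompts whose answers go stale.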
-
YuE
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open (by multimodal-art-projection)
Python llama related posts
-
The Chomsky Objection the AI Industry Has Been Quietly Working Around
-
5 Best Local LLM Tools and Models You Should Run in 2026
-
DeepSeek-V3: The 671B MoE Model You Can Run Locally in 2026
-
Agents 101: Reasoning, Actions & Autonomy
-
I Trained an LLM on 75K of My Own Messages So It Would Stop Writing Like a Chatbot
-
Meltdown: LLM Client Made in Python and Tk
-
Accelerating Gemma 4: faster inference with multi-token prediction drafters
-
A note from our sponsor - SaaSHub
www.saashub.com | 13 Jun 2026
Index
What are some of the best open-source llama projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | vllm | 82,489 |
| 2 | LlamaFactory | 72,081 |
| 3 | unsloth | 66,397 |
| 4 | aider | 46,112 |
| 5 | AstrBot | 34,454 |
| 6 | fish-speech | 30,752 |
| 7 | sglang | 28,913 |
| 8 | LLaVA | 24,756 |
| 9 | free-llm-api-resources | 23,375 |
| 10 | Chinese-LLaMA-Alpaca | 18,946 |
| 11 | ChuanhuChatGPT | 15,313 |
| 12 | PaddleNLP | 12,950 |
| 13 | OpenLLM | 12,352 |
| 14 | shell_gpt | 12,122 |
| 15 | ludwig | 11,716 |
| 16 | petals | 10,171 |
| 17 | inference | 9,348 |
| 18 | oumi | 9,315 |
| 19 | GPTCache | 8,066 |
| 20 | lmdeploy | 7,895 |
| 21 | mergekit | 7,137 |
| 22 | Liger-Kernel | 6,417 |
| 23 | YuE | 6,249 |