Python llama

Open-source Python projects categorized as llama
LLM openai Python Gpt AI

Top 23 Python llama Projects

  1. vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Project mention: Speculative decoding: when and why it actually speeds up inference | dev.to | 2026-06-04

    Here's a real, runnable config that uses EAGLE for offline batched generation. It's straight from the vLLM repo's eagle.md example:

  2. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  3. LlamaFactory

    Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

    Project mention: Llama-Factory: Unified, Efficient Fine-Tuning for 100 Open LLMs | news.ycombinator.com | 2025-09-18
  4. unsloth

    Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

    Project mention: I Trained an LLM on 75K of My Own Messages So It Would Stop Writing Like a Chatbot | dev.to | 2026-05-08

    Training: unsloth + trl (SFTTrainer). Unsloth handles the 4-bit quantization and gradient checkpointing; trl handles the training loop.

  5. aider

    aider is AI pair programming in your terminal

    Project mention: Microsoft pulled internal Claude Code licenses — what their engineers' preference tells you about picking AI tools | dev.to | 2026-06-06

    Aider with a bring-your-own API key has no subscription floor. Pay per token. For 2-3 focused hours per week, costs less than $20/mo. For full-time use, costs more than Max. Best for terminal natives who already live in tmux and do not want a separate IDE.

  6. AstrBot

    AI Agent Assistant & development framework that integrates lots of IM platforms, LLMs, plugins and AI feature, and can be your openclaw alternative. ✨

  7. fish-speech

    SOTA Open Source TTS

    Project mention: Fish Audio S2-Pro:用自然语言控制语音情感的 TTS 模型 | dev.to | 2026-03-11

    git clone https://github.com/fishaudio/fish-speech.git cd fish-speech pip install uv uv sync

  8. sglang

    SGLang is a high-performance serving framework for large language models and multimodal models.

    Project mention: DeepSeek makes the V4 Pro price discount permanent | news.ycombinator.com | 2026-05-22

    There are several things at play:

    Inference stack efficiency: Many of these providers take off the shelf sglang / vllm / trtllm and hope for the best. Meanwhile DeepSeek team is known for pushing the boundary of optimizations.

    Now, sglang and vllm are great pieces of software, but take DeepSeek's Sparse Attention (DSA). Introduced 1.5 years ago (https://arxiv.org/abs/2512.02556), used by DeepSeek 3.2, GLM 5, DeepSeek V4. Only now is it slowly strating to get optimized in the major inference engines: (https://github.com/sgl-project/sglang/issues/19380 https://github.com/sgl-project/sglang/pull/22851 etc.). Of course, DS V4 adds extra optimizations into the model architecture on top of DSA, and those will take more time to be taken full advantage of by the open source inference engines.

    Privacy: Betting that people will pay extra for inference hosted outside China. This is especially true with DeepSeek, because DeepSeek is transparent about using API data for model improvements.

    And few other things (scale (matters a lot for MoEs), reliability, soft enterprise lock in, etc.)

    ---

    There is also, likely, tacit collusion at play here. Look at GLM 5 and GLM 5.1 prices. GLM 5 and 5.1 cost the same to run, but providers decided to charge much more for 5.1 because it is much better model, and because Z.AI raised their price as well.

  9. LLaVA

    [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

    Project mention: OpenAI Open Models | news.ycombinator.com | 2025-08-05
  10. free-llm-api-resources

    A list of free LLM inference resources accessible via API.

    Project mention: 7 GitHub Repos I'd Actually Use to Find Free AI APIs and No-Cost AI Tools | dev.to | 2026-04-23

    View on GitHub

  11. Chinese-LLaMA-Alpaca

    中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

  12. ChuanhuChatGPT

    GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

  13. PaddleNLP

    Easy-to-use and powerful LLM and SLM library with awesome model zoo.

  14. OpenLLM

    Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

    Project mention: Your 2025 Roadmap to Becoming an AI Engineer for Free for Vue.js Developers | dev.to | 2025-08-06

    REST APIs to connect AI models to Vue.js apps (example 1, example 2).

  15. shell_gpt

    A command-line productivity tool powered by AI large language models like GPT-5, will help you accomplish your tasks faster and more efficiently.

    Project mention: Supercharge Your Terminal: ShellGPT + ChromaDB + LangChain for Context-Aware Automation | dev.to | 2025-09-01

    🗃 To explore ShellGPT in depth, including installation instructions, usage examples, and advanced configuration options, head over to the official ShellGPT GitHub repository.

  16. ludwig

    Low-code framework for building custom LLMs, neural networks, and other AI models

  17. petals

    🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

    Project mention: Open Source AI Must Win | news.ycombinator.com | 2026-06-12

    there are already projects like Petals https://github.com/bigscience-workshop/petals

  18. inference

    Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

  19. oumi

    Easily fine-tune, evaluate and deploy Gemma 4, Qwen3.5, Qwen3.6, gpt-oss, DeepSeek-R1, or any open source LLM / VLM!

    Project mention: Open source x 3: GRPO training with OpenEnv, vLLM, and Oumi | news.ycombinator.com | 2025-11-12
  20. GPTCache

    Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

    Project mention: What's semantic caching? | dev.to | 2026-03-16

    GPTCache — an open source library built specifically for caching LLM responses. Pretty flexible and worth looking at if you're rolling your own pipeline.

  21. lmdeploy

    LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

  22. mergekit

    Tools for merging pretrained large language models.

  23. Liger-Kernel

    Efficient Triton Kernels for LLM Training

    Project mention: Why ML Needs a New Programming Language | news.ycombinator.com | 2025-09-05
  24. YuE

    YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open (by multimodal-art-projection)

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python llama discussion

Log in or Post with

Python llama related posts

  • The Chomsky Objection the AI Industry Has Been Quietly Working Around

    4 projects | dev.to | 9 Jun 2026
  • 5 Best Local LLM Tools and Models You Should Run in 2026

    5 projects | dev.to | 2 Jun 2026
  • DeepSeek-V3: The 671B MoE Model You Can Run Locally in 2026

    1 project | dev.to | 20 May 2026
  • Agents 101: Reasoning, Actions & Autonomy

    4 projects | dev.to | 11 May 2026
  • I Trained an LLM on 75K of My Own Messages So It Would Stop Writing Like a Chatbot

    1 project | dev.to | 8 May 2026
  • Meltdown: LLM Client Made in Python and Tk

    1 project | news.ycombinator.com | 8 May 2026
  • Accelerating Gemma 4: faster inference with multi-token prediction drafters

    7 projects | news.ycombinator.com | 5 May 2026
  • A note from our sponsor - SaaSHub
    www.saashub.com | 13 Jun 2026
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source llama projects in Python? This list will help you:

# Project Stars
1 vllm 82,489
2 LlamaFactory 72,081
3 unsloth 66,397
4 aider 46,112
5 AstrBot 34,454
6 fish-speech 30,752
7 sglang 28,913
8 LLaVA 24,756
9 free-llm-api-resources 23,375
10 Chinese-LLaMA-Alpaca 18,946
11 ChuanhuChatGPT 15,313
12 PaddleNLP 12,950
13 OpenLLM 12,352
14 shell_gpt 12,122
15 ludwig 11,716
16 petals 10,171
17 inference 9,348
18 oumi 9,315
19 GPTCache 8,066
20 lmdeploy 7,895
21 mergekit 7,137
22 Liger-Kernel 6,417
23 YuE 6,249

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com

Did you know that Python is
the 1st most popular programming language
based on number of references?