llama

Open-source projects categorized as llama
LLM AI openai chatgpt Gpt

Top 23 llama Open-Source Projects

  1. ollama

    Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

    Project mention: Run Coding Agents on Local AI — Zero Cloud, Full Control | dev.to | 2026-06-06

    This guide shows how to swap out every cloud API with a local Ollama server running qwen3-coder:30b. Same tools, same workflows, no data leaving your network.

  2. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  3. vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Project mention: Speculative decoding: when and why it actually speeds up inference | dev.to | 2026-06-04

    Here's a real, runnable config that uses EAGLE for offline batched generation. It's straight from the vLLM repo's eagle.md example:

  4. LlamaFactory

    Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

    Project mention: Llama-Factory: Unified, Efficient Fine-Tuning for 100 Open LLMs | news.ycombinator.com | 2025-09-18
  5. unsloth

    Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

    Project mention: I Trained an LLM on 75K of My Own Messages So It Would Stop Writing Like a Chatbot | dev.to | 2026-05-08

    Training: unsloth + trl (SFTTrainer). Unsloth handles the 4-bit quantization and gradient checkpointing; trl handles the training loop.

  6. LocalAI

    LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

    Project mention: 5 Best Local LLM Tools and Models You Should Run in 2026 | dev.to | 2026-06-02

    LocalAI: https://github.com/mudler/LocalAI

  7. aider

    aider is AI pair programming in your terminal

    Project mention: Microsoft pulled internal Claude Code licenses — what their engineers' preference tells you about picking AI tools | dev.to | 2026-06-06

    Aider with a bring-your-own API key has no subscription floor. Pay per token. For 2-3 focused hours per week, costs less than $20/mo. For full-time use, costs more than Max. Best for terminal natives who already live in tmux and do not want a separate IDE.

  8. AstrBot

    AI Agent Assistant & development framework that integrates lots of IM platforms, LLMs, plugins and AI feature, and can be your openclaw alternative. ✨

  9. fish-speech

    SOTA Open Source TTS

    Project mention: Fish Audio S2-Pro:用自然语言控制语音情感的 TTS 模型 | dev.to | 2026-03-11

    git clone https://github.com/fishaudio/fish-speech.git cd fish-speech pip install uv uv sync

  10. sglang

    SGLang is a high-performance serving framework for large language models and multimodal models.

    Project mention: DeepSeek makes the V4 Pro price discount permanent | news.ycombinator.com | 2026-05-22

    There are several things at play:

    Inference stack efficiency: Many of these providers take off the shelf sglang / vllm / trtllm and hope for the best. Meanwhile DeepSeek team is known for pushing the boundary of optimizations.

    Now, sglang and vllm are great pieces of software, but take DeepSeek's Sparse Attention (DSA). Introduced 1.5 years ago (https://arxiv.org/abs/2512.02556), used by DeepSeek 3.2, GLM 5, DeepSeek V4. Only now is it slowly strating to get optimized in the major inference engines: (https://github.com/sgl-project/sglang/issues/19380 https://github.com/sgl-project/sglang/pull/22851 etc.). Of course, DS V4 adds extra optimizations into the model architecture on top of DSA, and those will take more time to be taken full advantage of by the open source inference engines.

    Privacy: Betting that people will pay extra for inference hosted outside China. This is especially true with DeepSeek, because DeepSeek is transparent about using API data for model improvements.

    And few other things (scale (matters a lot for MoEs), reliability, soft enterprise lock in, etc.)

    ---

    There is also, likely, tacit collusion at play here. Look at GLM 5 and GLM 5.1 prices. GLM 5 and 5.1 cost the same to run, but providers decided to charge much more for 5.1 because it is much better model, and because Z.AI raised their price as well.

  11. repomix

    📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.

    Project mention: 15 AI Coding Hacks Nobody Talks About (2026) | dev.to | 2026-05-28

    Most people paste files one at a time. Install Repomix and feed your entire project to the AI in one command.

  12. LLaVA

    [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

    Project mention: OpenAI Open Models | news.ycombinator.com | 2025-08-05
  13. free-llm-api-resources

    A list of free LLM inference resources accessible via API.

    Project mention: 7 GitHub Repos I'd Actually Use to Find Free AI APIs and No-Cost AI Tools | dev.to | 2026-04-23

    View on GitHub

  14. Chinese-LLaMA-Alpaca

    中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

  15. airllm

    AirLLM 70B inference with single 4GB GPU

    Project mention: AirLLM Shrinks 70B LLMs to 4GB VRAM; DPO & Supermemory Boost Open Models | dev.to | 2026-06-03
  16. llama-cookbook

    Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services

    Project mention: Using AI to prototype games in the browser | dev.to | 2025-12-01

    While people often focus on the big models, smaller and more specialised models are increasingly important. Codex for code generation, Stable Diffusion for images, and small LLaMA-class models for focused tasks are all good examples.

  17. ChuanhuChatGPT

    GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

  18. dalai

    The simplest way to run LLaMA on your local machine

  19. PaddleNLP

    Easy-to-use and powerful LLM and SLM library with awesome model zoo.

  20. OpenLLM

    Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

    Project mention: Your 2025 Roadmap to Becoming an AI Engineer for Free for Vue.js Developers | dev.to | 2025-08-06

    REST APIs to connect AI models to Vue.js apps (example 1, example 2).

  21. langchain4j

    LangChain4j is an idiomatic, open-source Java library for building LLM-powered applications on the JVM. It offers a unified API over popular LLM providers and vector stores, and makes implementing tool calling (including MCP support), agents and RAG easy. It integrates seamlessly with enterprise Java frameworks like Quarkus and Spring Boot.

    Project mention: Agent Memory with LangChain4j and Oracle AI Database | dev.to | 2026-04-22

    In this article, we'll build a small, memory-backed assistant with LangChain4j and Oracle AI Database. The assistant can search prior incidents, runbooks, decisions, and shift handoffs to answer questions. It can write new memories back to the database so they become searchable in any session. Additionally, all user, agent, and tool messages are logged to database table for observability and auditing.

  22. shell_gpt

    A command-line productivity tool powered by AI large language models like GPT-5, will help you accomplish your tasks faster and more efficiently.

    Project mention: Supercharge Your Terminal: ShellGPT + ChromaDB + LangChain for Context-Aware Automation | dev.to | 2025-09-01

    🗃 To explore ShellGPT in depth, including installation instructions, usage examples, and advanced configuration options, head over to the official ShellGPT GitHub repository.

  23. ludwig

    Low-code framework for building custom LLMs, neural networks, and other AI models

  24. tensorzero

    TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.

    Project mention: Supervised Fine Tuning on Curated Data Is Reinforcement Learning | news.ycombinator.com | 2025-07-29

    [I'm his coworker.] We ran Unsloth ourselves on a GPU-by-the-hour server. We have a notebook in the repository showing how to query historical data and use it with Unsloth.

    It's a WIP PR that we plan to merge soon: https://github.com/tensorzero/tensorzero/pull/2273

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

llama discussion

Log in or Post with

llama related posts

  • Run Coding Agents on Local AI — Zero Cloud, Full Control

    1 project | dev.to | 6 Jun 2026
  • Run Gemma-4 12B on WSL2 with llama.cpp

    1 project | dev.to | 5 Jun 2026
  • Running Brand-New Gemma 4 12B on an 8-Year-Old GTX 1080 Ti: Speed, 3 Gotchas, and Why Q8 Beat Q4 on My Own Field

    1 project | dev.to | 4 Jun 2026
  • Agent Skills in Microsoft Agent Framework

    2 projects | dev.to | 3 Jun 2026
  • 5 Best Local LLM Tools and Models You Should Run in 2026

    5 projects | dev.to | 2 Jun 2026
  • Run Cursor with a Local Model: Privacy-First AI Coding Without a Subscription

    1 project | dev.to | 2 Jun 2026
  • How fast is LlamaStash? Overhead, throughput, and a fair comparison with Ollama and LM Studio

    3 projects | dev.to | 2 Jun 2026
  • A note from our sponsor - SaaSHub
    www.saashub.com | 7 Jun 2026
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source llama projects? This list will help you:

# Project Stars
1 ollama 173,220
2 vllm 81,898
3 LlamaFactory 71,870
4 unsloth 65,904
5 LocalAI 46,698
6 aider 45,826
7 AstrBot 33,820
8 fish-speech 30,666
9 sglang 28,872
10 repomix 25,919
11 LLaVA 24,756
12 free-llm-api-resources 22,947
13 Chinese-LLaMA-Alpaca 18,949
14 airllm 19,431
15 llama-cookbook 18,341
16 ChuanhuChatGPT 15,318
17 dalai 12,980
18 PaddleNLP 12,950
19 OpenLLM 12,346
20 langchain4j 12,221
21 shell_gpt 12,110
22 ludwig 11,710
23 tensorzero 11,435

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com

Did you know that Python is
the 1st most popular programming language
based on number of references?