language-model

Open-source projects categorized as language-model

Top 23 language-model Open-Source Projects

language-model
  1. generative-ai-for-beginners

    21 Lessons, Get Started Building with Generative AI

    Project mention: 10 GitHub Repos Every Serious Prompt Writer Should Be Using | dev.to | 2025-11-22

    View on GitHub

  2. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  3. LLMs-from-scratch

    Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

    Project mention: DeepSeek Sparse Attention | news.ycombinator.com | 2026-05-24
  4. Prompt-Engineering-Guide

    🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

    Project mention: Your AI is not bad, your instructions are | dev.to | 2026-05-22

    Prompt Engineering Guide

  5. gpt4free

    The official gpt4free repository | various collection of powerful language models | opus 4.6 gpt 5.3 kimi 2.5 deepseek v3.2 gemini 3

    Project mention: GPT4Free: "educational project" for free LLM inference from various services | news.ycombinator.com | 2025-06-30
  6. Open-Assistant

    OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

  7. stanford_alpaca

    Code and documentation to train Stanford's Alpaca models, and generate the data.

  8. repomix

    📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.

    Project mention: 15 AI Coding Hacks Nobody Talks About (2026) | dev.to | 2026-05-28

    Most people paste files one at a time. Install Repomix and feed your entire project to the AI in one command.

  9. ai

    The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered applications and agents

    Project mention: Vercel AI Gateway Appears to Block BYOK Requests When Account Balance Reaches $0 | news.ycombinator.com | 2026-06-01
  10. mlc-llm

    Universal LLM Deployment Engine with ML Compilation

  11. web-llm

    High-performance In-browser LLM Inference Engine

    Project mention: Stop Sending Medical Data to the Cloud: Build a 100% Private Health AI with WebLLM and Transformers.js | dev.to | 2026-05-03

    Tech Stack: React (Vite), WebLLM, Transformers.js.

  12. DocsGPT

    Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.

  13. StableLM

    StableLM: Stability AI Language Models

  14. RWKV-LM

    RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

    Project mention: RWKV-7 beats Llama 3.2 with 3x fewer training tokens and formally exceeds TC^0 | news.ycombinator.com | 2026-02-23
  15. open_clip

    An open source implementation of CLIP.

    Project mention: Cross-Modal Embeddings: Bridging AI Modalities | dev.to | 2025-11-21

    OpenCLIP: Open Source Implementation

  16. LoRA

    Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

    Project mention: LLM Fine-Tuning vs RAG: A Production Decision Framework for Engineering Teams | dev.to | 2026-06-04

    LoRA (Hu et al., 2021) freezes the base model weights and injects trainable low-rank decomposition matrices into the attention layers. Instead of updating all 7 billion parameters of a 7B model, LoRA trains ~1–5% of equivalent parameters. Results:

  17. lm-evaluation-harness

    A framework for few-shot evaluation of language models.

    Project mention: What is an LLM evaluation harness? A deep dive into lm-eval-harness | dev.to | 2026-06-03

    EleutherAI started the project in 2020 as a unified way to reproduce published LLM benchmark numbers. It's now at v0.4.12 (May 2026), ships with 200+ tasks spanning reasoning, knowledge, coding, math, multilingual, and long-context benchmarks, and supports a long list of model backends: Hugging Face transformers, vLLM, SGLang, GPT-NeoX, Megatron-DeepSpeed, plus API endpoints for OpenAI, Anthropic, and a few others.

  18. txtai

    💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

    Project mention: Agent Tools | dev.to | 2026-03-16
  19. speechbrain

    A PyTorch-based Speech Toolkit

    Project mention: 5 must know open-source repositories to build cool AI apps | dev.to | 2025-10-29

    Star the Speech Brain repository ⭐

  20. tokenizers

    💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

    Project mention: Building Sentence Transformers in Rust: A Practical Guide with Burn, ONNX Runtime, and Candle | dev.to | 2025-10-30

    HuggingFace Tokenizers: https://huggingface.co/docs/tokenizers

  21. koboldcpp

    Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

    Project mention: Best Free AI Chatbots Without Login (over TOR and Anonymous) | dev.to | 2025-10-07

    https://github.com/LostRuins/koboldcpp Download models at HuggingFace and run them locally. No logins, no spying, no hidden data harvesting.

  22. ChatRWKV

    ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

  23. MemOS

    Self-evolving memory OS for LLM & AI Agents: ultra-persistent memory, hybrid-retrieval, and cross-task skill reuse, with 35.24% token savings (by MemTensor)

    Project mention: Top 10 OpenClaw Development Patterns and Architecture Best Practices | dev.to | 2026-02-18

    Repository: https://github.com/MemTensor/MemOS Architecture Pattern: Layered Memory System

  24. LMFlow

    An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

language-model discussion

Log in or Post with

language-model related posts

  • LLM Fine-Tuning vs RAG: A Production Decision Framework for Engineering Teams

    2 projects | dev.to | 4 Jun 2026
  • Operator: cuando responder no basta

    3 projects | dev.to | 3 Jun 2026
  • What is an LLM evaluation harness? A deep dive into lm-eval-harness

    3 projects | dev.to | 3 Jun 2026
  • Vercel AI Gateway Appears to Block BYOK Requests When Account Balance Reaches $0

    1 project | news.ycombinator.com | 1 Jun 2026
  • Agentic engineering patterns that survive contact with production

    1 project | dev.to | 1 Jun 2026
  • Frontier AI in 2026, what actually changed and what did not

    1 project | dev.to | 1 Jun 2026
  • Liquid AI reveals 8B-A1B MoE trained on 38T

    1 project | news.ycombinator.com | 29 May 2026
  • A note from our sponsor - SaaSHub
    www.saashub.com | 7 Jun 2026
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source language-model projects? This list will help you:

# Project Stars
1 generative-ai-for-beginners 111,683
2 LLMs-from-scratch 96,593
3 Prompt-Engineering-Guide 75,276
4 gpt4free 66,281
5 Open-Assistant 37,413
6 stanford_alpaca 30,261
7 repomix 25,919
8 ai 24,695
9 mlc-llm 22,749
10 web-llm 18,110
11 DocsGPT 17,921
12 StableLM 15,740
13 RWKV-LM 14,548
14 open_clip 13,882
15 LoRA 13,316
16 lm-evaluation-harness 12,818
17 txtai 12,627
18 speechbrain 11,592
19 tokenizers 10,795
20 koboldcpp 10,708
21 ChatRWKV 9,491
22 MemOS 9,615
23 LMFlow 8,488

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com

Did you know that Python is
the 1st most popular programming language
based on number of references?