llm

Top 23 LLM Open-Source Projects

  • llama.cpp

    LLM inference in C/C++

  • Project mention: Ask HN: Affordable hardware for running local large language models? | news.ycombinator.com | 2024-05-05

    Yes, Metal seems to allow a maximum of 1/2 of the RAM for one process, and 3/4 of the RAM allocated to the GPU overall. There’s a kernel hack to fix it, but that comes with the usual system integrity caveats. https://github.com/ggerganov/llama.cpp/discussions/2182
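
    As a concrete starting point for local inference, here is a minimal sketch using the llama-cpp-python bindings for llama.cpp; the model path and offload settings are placeholder assumptions, and on Apple Silicon the Metal backend is only used when the package is built with Metal support.

```python
# Minimal llama.cpp sketch via the llama-cpp-python bindings (pip install llama-cpp-python).
# The GGUF path is a hypothetical local file; n_gpu_layers=-1 offloads all layers
# to the GPU backend (Metal on macOS), which is where the RAM limits above matter.
from llama_cpp import Llama

llm = Llama(
    model_path="./models/llama-2-7b.Q4_K_M.gguf",  # placeholder model file
    n_gpu_layers=-1,  # offload every layer to the GPU backend
    n_ctx=4096,       # context window size
)

out = llm("Q: Name the planets in the solar system. A:", max_tokens=64, stop=["\n"])
print(out["choices"][0]["text"])
```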

  • MetaGPT

    🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

  • Project mention: Can AI replace a co-founder? | news.ycombinator.com | 2024-01-07

    https://github.com/geekan/MetaGPT :

    > MetaGPT takes a one line requirement as input and outputs user stories / competitive analysis / requirements / data structures / APIs / documents, etc.

    https://news.ycombinator.com/item?id=29141796 ; "Co-Founder Equity Calculator"

    "Ask HN: What are your go to SaaS products for startups/MVPs?" (2020) https://news.ycombinator.com/item?id=23535828 ; FounderKit, StackShare

    > USA Small Business Administration: "10 steps to start your business." https://www.sba.gov/starting-business/how-start-business/10-...

    >> "Startup Incorporation Checklist: How to bootstrap a Delaware C-corp (or S-corp) with employee(s) in California" https://github.com/leonar15/startup-checklist

  • llama_index

    LlamaIndex is a data framework for your LLM applications

  • Project mention: LlamaIndex: A data framework for your LLM applications | news.ycombinator.com | 2024-04-07
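
    For a sense of what the framework looks like in practice, here is a minimal retrieval-augmented query sketch; it assumes an OpenAI API key in the environment, a local ./data folder of documents, and the 0.10+ import layout (older releases import from llama_index directly).

```python
# Minimal LlamaIndex RAG sketch (pip install llama-index); assumes an OpenAI API key
# in the environment and a ./data directory containing documents to index.
from llama_index.core import SimpleDirectoryReader, VectorStoreIndex

documents = SimpleDirectoryReader("data").load_data()  # read local files
index = VectorStoreIndex.from_documents(documents)     # embed and index them
query_engine = index.as_query_engine()

print(query_engine.query("What do these documents cover?"))
```
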
  • llm-course

    Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

  • Project mention: Ask HN: People who switched from GPT to their own models. How was it? | news.ycombinator.com | 2024-02-26

    This is a very nice resource: https://github.com/mlabonne/llm-course

  • dify

    Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

  • Project mention: Ask HN: LLM workflows to avoid copying and pasting from the web interfaces? | news.ycombinator.com | 2024-05-03

    This visual IDE for LLM pipelines was posted recently: https://github.com/langgenius/dify

    See if it helps.

  • Milvus

    A cloud-native vector database, storage for next generation AI applications

  • Project mention: Computer Vision Meetup: Develop a Legal Search Application from Scratch using Milvus and DSPy! | dev.to | 2024-05-02

    Legal practitioners often need to find specific cases and clauses across thousands of dense documents. While traditional keyword-based search techniques are useful, they fail to fully capture semantic content of queries and case files. Vector search engines and large language models provide an intriguing alternative. In this talk, I will show you how to build a legal search application using the DSPy framework and the Milvus vector search engine.
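
    Here is a tiny sketch of the vector-search half of such an application using the pymilvus MilvusClient; the collection name, embedding dimension, and vectors are placeholders, and the DSPy side of the talk is omitted.

```python
# Minimal Milvus vector-search sketch (pip install pymilvus), using Milvus Lite
# via a local file. Collection name, dimension, and vectors are placeholders.
from pymilvus import MilvusClient

client = MilvusClient("legal_search.db")  # local Milvus Lite database file
client.create_collection(collection_name="clauses", dimension=4)

client.insert(
    collection_name="clauses",
    data=[
        {"id": 1, "vector": [0.1, 0.2, 0.3, 0.4], "text": "clause about liability"},
        {"id": 2, "vector": [0.4, 0.3, 0.2, 0.1], "text": "clause about damages"},
    ],
)

# In a real system the query vector would come from an embedding model.
hits = client.search(collection_name="clauses", data=[[0.1, 0.2, 0.3, 0.4]], limit=1)
print(hits)
```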

  • Mr.-Ranedeer-AI-Tutor

    A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

  • Project mention: The world's most-powerful AI model suddenly got 'lazier' and 'dumber.' A radical redesign of OpenAI's GPT-4 could be behind the decline in performance. | /r/ChatGPT | 2023-07-13
  • chatgpt-on-wechat

    A chatbot built on large language models. It supports integration with WeCom (Enterprise WeChat), WeChat Official Accounts, Feishu, DingTalk, and more, with a choice of GPT-3.5 / GPT-4.0 / Claude / ERNIE Bot / iFlytek Spark / Tongyi Qianwen / Gemini / GLM-4 / Kimi / LinkAI models. It can handle text, voice, and images, access the operating system and the internet, and supports custom enterprise customer-service bots built on your own knowledge base.

  • Flowise

    Drag & drop UI to build your customized LLM flow

  • Project mention: FLaNK Stack Weekly 12 February 2024 | dev.to | 2024-02-12
  • MindsDB

    The platform for customizing AI from enterprise data

  • Project mention: What’s the Difference Between Fine-tuning, Retraining, and RAG? | dev.to | 2024-04-08

    Check us out on GitHub.

  • LLaMA-Factory

    Unify Efficient Fine-Tuning of 100+ LLMs

  • Project mention: FLaNK-AIM Weekly 06 May 2024 | dev.to | 2024-05-06
  • LocalAI

    The free, open-source OpenAI alternative. Self-hosted, community-driven and local-first. A drop-in replacement for OpenAI that runs on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many other model architectures. It can generate text, audio, video, and images, and also offers voice-cloning capabilities.

  • Project mention: LocalAI: Self-hosted OpenAI alternative reaches 2.14.0 | news.ycombinator.com | 2024-05-03
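
    Because LocalAI exposes an OpenAI-compatible API, existing OpenAI client code can simply be pointed at it; a small sketch, assuming a server on the default port 8080 and a locally configured model alias (both are placeholders).

```python
# Sketch of calling a self-hosted LocalAI server through the OpenAI Python client,
# relying on its OpenAI-compatible API. Port and model name are assumptions that
# depend on how the LocalAI server was started and configured.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8080/v1", api_key="not-needed")

resp = client.chat.completions.create(
    model="gpt-4",  # alias mapped to whatever local model the server exposes
    messages=[{"role": "user", "content": "Say hello from a local model."}],
)
print(resp.choices[0].message.content)
```
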
  • vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

  • Project mention: AI leaderboards are no longer useful. It's time to switch to Pareto curves | news.ycombinator.com | 2024-04-30

    I guess the root cause of my claim is that OpenAI won't tell us whether or not GPT-3.5 is an MoE model, and I assumed it wasn't. Since GPT-3.5 is clearly nondeterministic at temp=0, I believed the nondeterminism was due to FPU stuff, and this effect was amplified with GPT-4's MoE. But if GPT-3.5 is also MoE then that's just wrong.

    What makes this especially tricky is that small models are truly 100% deterministic at temp=0 because the relative likelihoods are too coarse for FPU issues to be a factor. I had thought 3.5 was big enough that some of its token probabilities were too fine-grained for the FPU. But that's probably wrong.

    On the other hand, it's not just GPT, there are currently floating-point difficulties in vllm which significantly affect the determinism of any model run on it: https://github.com/vllm-project/vllm/issues/966 Note that a suggested fix is upcasting to float32. So it's possible that GPT-3.5 is using an especially low-precision float and introducing nondeterminism by saving money on compute costs.

    Sadly I do not have the money[1] to actually run a test to falsify any of this. It seems like this would be a good little research project.

    [1] Or the time, or the motivation :) But this stuff is expensive.
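
    The determinism question above is easy to poke at locally with vLLM's offline API; a small greedy-decoding sketch, where the model name is a placeholder and dtype="float32" echoes the upcasting suggestion from the linked issue (at extra memory cost).

```python
# Sketch of greedy (temperature=0) decoding with vLLM's offline LLM API.
# The model name is a placeholder; dtype="float32" mirrors the upcasting idea
# discussed in the determinism issue linked above.
from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m", dtype="float32")
params = SamplingParams(temperature=0.0, max_tokens=32)

outputs = llm.generate(["The capital of France is"], params)
for out in outputs:
    print(out.outputs[0].text)
```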

  • unilm

    Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

  • Project mention: The Era of 1-Bit LLMs: Training Tips, Code and FAQ [pdf] | news.ycombinator.com | 2024-03-21
  • open-webui

    User-friendly WebUI for LLMs (Formerly Ollama WebUI)

  • Project mention: Run Large and Small Language Models locally with ollama | dev.to | 2024-05-07

    Luckily, there are open-source projects like Open WebUI that provide a web-based experience similar to ChatGPT and that you can run locally and point at any model. To start the Open WebUI Docker container locally, run the command below in your terminal (make sure that ollama serve is still running).

  • semantic-kernel

    Integrate cutting-edge LLM technology quickly and easily into your apps

  • Project mention: #SemanticKernel – 📎Chat Service demo running Phi-2 LLM locally with #LMStudio | dev.to | 2024-02-08

    There is an amazing sample showing how to create your own LLM service class for use in Semantic Kernel. You can view the sample here: https://github.com/microsoft/semantic-kernel/blob/3451a4ebbc9db0d049f48804c12791c681a326cb/dotnet/samples/KernelSyntaxExamples/Example16_CustomLLM.cs

  • Chinese-LLaMA-Alpaca

    Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)

  • Project mention: Chinese-Alpaca-Plus-13B-GPTQ | /r/LocalLLaMA | 2023-05-30

    I'd like to share with you today the Chinese-Alpaca-Plus-13B-GPTQ model, which is a 4-bit GPTQ-quantised version of Yiming Cui's Chinese-LLaMA-Alpaca 13B for GPU inference.

  • mlc-llm

    Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

  • Project mention: FLaNK 04 March 2024 | dev.to | 2024-03-04
  • ChatGLM2-6B

    ChatGLM2-6B: An Open Bilingual Chat LLM | Open-Source Bilingual Dialogue Language Model

  • Project mention: Are We Overlooking China's Progress in AI? | /r/singularity | 2023-06-26
  • LLMs-from-scratch

    Implementing a ChatGPT-like LLM from scratch, step by step

  • Project mention: Finetune a GPT Model for Spam Detection on Your Laptop in Just 5 Minutes | news.ycombinator.com | 2024-05-03
  • peft

    🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

  • Project mention: LoftQ: LoRA-fine-tuning-aware Quantization | news.ycombinator.com | 2023-12-19
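
    For a sense of the library, here is a minimal LoRA setup with PEFT; the base model and target modules are placeholder choices for illustration.

```python
# Minimal LoRA fine-tuning setup with 🤗 PEFT (pip install peft transformers).
# The base model and target_modules are placeholders chosen for illustration.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained("facebook/opt-350m")
config = LoraConfig(
    r=8,                                  # rank of the low-rank adapters
    lora_alpha=16,
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, config)
model.print_trainable_parameters()        # only adapter weights are trainable
```
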
  • FastGPT

    FastGPT is a knowledge-base platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without extensive setup or configuration.

  • Project mention: FLaNK Stack Weekly 12 February 2024 | dev.to | 2024-02-12
  • dalai

    The simplest way to run LLaMA on your local machine

  • Project mention: Ask HN: What are the capabilities of consumer grade hardware to work with LLMs? | news.ycombinator.com | 2023-08-03

    I agree, I've definitely seen way more information about running image synthesis models like Stable Diffusion locally than I have LLMs. It's counterintuitive to me that Stable Diffusion takes less RAM than an LLM, especially considering it still needs the word vectors. Goes to show I know nothing.

    I guess it comes down to the requirement of a very high end (or multiple) GPU that makes it impractical for most vs just running it in Colab or something.

    Tho there are some efforts:

    https://github.com/cocktailpeanut/dalai

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Index

What are some of the best open-source LLM projects? This list will help you:

#   Project                 Stars
1   llama.cpp               57,463
2   MetaGPT                 39,468
3   llama_index             31,389
4   llm-course              29,169
5   dify                    27,030
6   Milvus                  26,979
7   Mr.-Ranedeer-AI-Tutor   26,708
8   chatgpt-on-wechat       25,142
9   Flowise                 24,426
10  MindsDB                 21,354
11  LLaMA-Factory           20,971
12  LocalAI                 19,862
13  vllm                    18,931
14  unilm                   18,407
15  open-webui              18,333
16  semantic-kernel         18,332
17  Chinese-LLaMA-Alpaca    17,348
18  mlc-llm                 17,053
19  ChatGLM2-6B             15,514
20  LLMs-from-scratch       14,440
21  peft                    13,962
22  FastGPT                 13,166
23  dalai                   13,060
