Python llama

Open-source Python projects categorized as llama

Top 23 Python llama Projects

  • LLaMA-Factory

    Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

    Project mention: ORPO, DPO, and PPO: Optimizing Models for Human Preferences | dev.to | 2024-11-08

    Implementation: ORPO has been integrated into popular fine-tuning libraries like TRL, Axolotl, and LLaMA-Factory.

  • vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Project mention: Running Phi 3 with vLLM and Ray Serve | dev.to | 2024-11-08

    vLLM stands for "virtual large language model". It is one of the open-source fast inference and serving libraries. As the name suggests, "virtual" borrows the concepts of virtual memory and paging from operating systems: its PagedAttention mechanism addresses the problem of maximizing resource utilization and provides faster token generation. Traditional LLM serving stores the large attention key and value tensors contiguously in GPU memory, leading to inefficient memory usage.
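    The paging idea above can be illustrated with a small sketch. This is a toy, pure-Python model of block-based KV-cache allocation, not vLLM's actual implementation (which manages GPU memory blocks in CUDA); the class and method names are hypothetical.

```python
# Toy sketch of PagedAttention-style KV-cache paging (illustrative only).
class BlockAllocator:
    """Hands out fixed-size cache blocks instead of one contiguous region."""
    def __init__(self, num_blocks: int, block_size: int):
        self.block_size = block_size
        self.free = list(range(num_blocks))

    def allocate(self) -> int:
        return self.free.pop()

    def release(self, block_id: int) -> None:
        self.free.append(block_id)


class SequenceCache:
    """Grows one sequence's logical KV cache a block at a time,
    like virtual-memory paging, so no space is reserved up front."""
    def __init__(self, allocator: BlockAllocator):
        self.allocator = allocator
        self.block_table = []   # logical -> physical block mapping
        self.num_tokens = 0

    def append_token(self) -> None:
        # Only grab a new physical block when the current one is full.
        if self.num_tokens % self.allocator.block_size == 0:
            self.block_table.append(self.allocator.allocate())
        self.num_tokens += 1


alloc = BlockAllocator(num_blocks=8, block_size=4)
seq = SequenceCache(alloc)
for _ in range(10):              # 10 tokens -> ceil(10/4) = 3 blocks
    seq.append_token()
print(len(seq.block_table))      # 3
```

    The payoff is that memory is committed per block as a sequence grows, instead of pre-allocating for the maximum context length.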

  • LLaVA

    [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

    Project mention: Show HN: LLM Aided OCR (Correcting Tesseract OCR Errors with LLMs) | news.ycombinator.com | 2024-08-09

    This package seems to use llama_cpp for local inference [1] so you can probably use anything supported by that [2]. However, I think it's just passing OCR output for correction - the language model doesn't actually see the original image.

    That said, there are some large language models you can run locally which accept image input. Phi-3-Vision [3], LLaVA [4], MiniCPM-V [5], etc.

    [1] - https://github.com/Dicklesworthstone/llm_aided_ocr/blob/main...

    [2] - https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#de...

    [3] - https://huggingface.co/microsoft/Phi-3-vision-128k-instruct

    [4] - https://github.com/haotian-liu/LLaVA

    [5] - https://github.com/OpenBMB/MiniCPM-V

  • unsloth

    Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory

    Project mention: Llama-3.3-70B-Instruct | news.ycombinator.com | 2024-12-06

    Hi,

    Yes you can. The community creates quantized variants of these that can run on consumer hardware. A 4-bit quantization of Llama 70B works pretty well on MacBook Pros; the Neural Engine with unified memory is quite solid for these. GPUs are a bit tougher because consumer GPU RAM is still fairly small.

    You can also fine-tune them. There are a lot of frameworks, like unsloth, that make this easier: https://github.com/unslothai/unsloth. Fine-tuning can be pretty tricky to get right; you need to be aware of things like learning rates, but there are good resources online where a lot of hobbyists have gotten things working. You do not need a PhD in ML to accomplish this. You will, however, need data that you can represent textually.

    Source: Director of Engineering for model serving at Databricks.
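    The back-of-envelope arithmetic behind the comment above can be made explicit. This sketch counts weight memory only (KV cache and activations add overhead on top), assuming 4-bit quantization stores half a byte per parameter.

```python
# Back-of-envelope memory estimate for quantized model weights.
# Weights only: KV cache and activation overhead are not included.
def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    return num_params * bits_per_param / 8 / 1e9

for bits in (16, 8, 4):
    print(f"70B at {bits:>2}-bit: {weight_memory_gb(70e9, bits):.0f} GB")
# 16-bit: 140 GB, 8-bit: 70 GB, 4-bit: 35 GB -- which is why a 4-bit
# 70B model fits in 48-64 GB of unified memory but not in the 24 GB
# of a typical high-end consumer GPU.
```

    This is why quantization, not just fine-tuning tricks, is what makes 70B-class models usable on laptops.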

  • Chinese-LLaMA-Alpaca

    Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)

  • fish-speech

    SOTA Open Source TTS

    Project mention: Generating audiobooks from E-books with Kokoro-82M | news.ycombinator.com | 2025-01-15
  • ChuanhuChatGPT

    GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

  • PaddleNLP

    👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.

  • ludwig

    Low-code framework for building custom LLMs, neural networks, and other AI models

    Project mention: Show HN: Toolkit for LLM Fine-Tuning, Ablating and Testing | news.ycombinator.com | 2024-04-07

    This is a great project, a little similar to https://github.com/ludwig-ai/ludwig, but it includes testing capabilities and ablation.

    Questions regarding the LLM testing aspect: how extensive is the test coverage for LLM use cases, and what is the current state of this area of the project? Do you offer any guarantees, or is it considered an open-ended problem?

    Would love to see more progress in this area!

  • OpenLLM

    Run any open-source LLMs, such as Llama, Mistral, as OpenAI compatible API endpoint in the cloud.

    Project mention: What is OpenLLM and what problem does it solve? | dev.to | 2024-05-25

    OpenLLM is a platform that lets developers leverage open-source large language models (LLMs). It is like a Swiss Army knife for LLMs: a set of tools that helps developers overcome common deployment hurdles.

  • shell_gpt

    A command-line productivity tool powered by AI large language models like GPT-4 that helps you accomplish your tasks faster and more efficiently.

    Project mention: T2x – a CLI tool for AI-first text operations | news.ycombinator.com | 2024-12-30

    I use a CLI tool 100x/day for miscellaneous things; it can hit OpenAI or a local LLM.

    https://github.com/TheR1D/shell_gpt

    Has nice shell integration.

  • petals

    🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

    Project mention: Serving AI from the Basement – 192GB of VRAM Setup | news.ycombinator.com | 2024-09-08
  • GPTCache

    Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

    Project mention: Ask HN: What are the drawbacks of caching LLM responses? | news.ycombinator.com | 2024-03-15

    Just found this: https://github.com/zilliztech/GPTCache which seems to address this idea/issue.
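    The idea behind a semantic cache is to return a cached answer when a new query's embedding is close enough to a previously seen one, rather than requiring an exact string match. This is a toy sketch of that mechanism; GPTCache's real implementation uses embedding models and a vector store, and the `embed` function here is a hypothetical stand-in.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

class SemanticCache:
    """Toy semantic cache keyed on embedding similarity."""
    def __init__(self, embed, threshold=0.9):
        self.embed = embed          # callable: text -> vector (stand-in)
        self.threshold = threshold
        self.entries = []           # list of (embedding, response)

    def get(self, query):
        qv = self.embed(query)
        best = max(self.entries, key=lambda e: cosine(qv, e[0]), default=None)
        if best and cosine(qv, best[0]) >= self.threshold:
            return best[1]          # cache hit: the LLM call is skipped
        return None                 # cache miss: caller queries the LLM

    def put(self, query, response):
        self.entries.append((self.embed(query), response))

# Dummy embeddings standing in for a real embedding model:
embed = {"capital of France": [1.0, 0.0],
         "France's capital?": [0.98, 0.2],
         "weather today": [0.0, 1.0]}.get
cache = SemanticCache(embed)
cache.put("capital of France", "Paris")
print(cache.get("France's capital?"))  # hit on a paraphrase: Paris
print(cache.get("weather today"))      # miss: None
```

    The main drawback, as the linked thread asks, is false hits: two queries can be semantically close while requiring different answers, so the threshold is a precision/recall trade-off.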

  • inference

    Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

    Project mention: GreptimeAI + Xinference - Efficient Deployment and Monitoring of Your LLM Applications | dev.to | 2024-01-24

    Xorbits Inference (Xinference) is an open-source platform to streamline the operation and integration of a wide array of AI models. With Xinference, you’re empowered to run inference using any open-source LLMs, embedding models, and multimodal models either in the cloud or on your own premises, and create robust AI-driven applications. It provides a RESTful API compatible with OpenAI API, Python SDK, CLI, and WebUI. Furthermore, it integrates third-party developer tools like LangChain, LlamaIndex, and Dify, facilitating model integration and development.

  • Baichuan-7B

    A large-scale 7B pretraining language model developed by BaiChuan-Inc.

  • lmdeploy

    LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

    Project mention: FLaNK-AIM Weekly 06 May 2024 | dev.to | 2024-05-06
  • mergekit

    Tools for merging pretrained large language models.

    Project mention: Language Models Are Super Mario: Absorbing Abilities from Homologous Models | news.ycombinator.com | 2024-04-06

    For others like me who'd not heard of merging before, this seems to be one such tool[0] (there may be others).

    [0] https://github.com/arcee-ai/mergekit
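    At its simplest, merging interpolates the weights of two homologous models parameter by parameter. This is a greatly simplified sketch of a linear merge, with toy dict-of-lists "checkpoints"; real tools like mergekit operate on full checkpoints and also offer methods such as SLERP, TIES, and DARE.

```python
# Toy linear merge of two homologous models' weights:
# merged = alpha * A + (1 - alpha) * B, parameter by parameter.
def linear_merge(weights_a, weights_b, alpha=0.5):
    assert weights_a.keys() == weights_b.keys(), "models must be homologous"
    return {
        name: [alpha * x + (1 - alpha) * y
               for x, y in zip(weights_a[name], weights_b[name])]
        for name in weights_a
    }

model_a = {"layer.0.weight": [1.0, 2.0], "layer.0.bias": [0.0, 0.0]}
model_b = {"layer.0.weight": [3.0, 4.0], "layer.0.bias": [1.0, 1.0]}
print(linear_merge(model_a, model_b))
# {'layer.0.weight': [2.0, 3.0], 'layer.0.bias': [0.5, 0.5]}
```

    The requirement that the models be homologous (same architecture, same parameter names and shapes) is why merging works between fine-tunes of the same base model but not across unrelated architectures.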

  • Huatuo-Llama-Med-Chinese

    Repo for BenTsao (original name: HuaTuo, 华驼): instruction-tuning large language models with Chinese medical knowledge.

  • h2o-llmstudio

    H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

  • InternGPT

    InternGPT (iGPT) is an open-source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com.

  • langroid

    Harness LLMs with Multi-Agent Programming

    Project mention: Understanding the BM25 full text search algorithm | news.ycombinator.com | 2024-11-19

    In the Langroid[1] LLM library we have a clean, extensible RAG implementation in the DocChatAgent[2]. It uses several retrieval techniques, including lexical retrieval (BM25, fuzzy search) and semantic retrieval (embeddings), plus re-ranking (cross-encoder, reciprocal rank fusion) as well as re-ranking for diversity and lost-in-the-middle mitigation.

    [1] Langroid - a multi-agent LLM framework from CMU/UW-Madison researchers https://github.com/langroid/langroid

    [2] DocChatAgent Implementation -
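    One of the fusion techniques mentioned above, reciprocal rank fusion (RRF), is simple enough to sketch in full: each document scores the sum of 1/(k + rank) over every ranked list it appears in, with k = 60 as the commonly used constant. This is the standard formula, not Langroid's specific implementation.

```python
# Reciprocal rank fusion: combine ranked result lists from different
# retrievers (e.g. BM25 and embedding search) into one ranking.
def rrf(ranked_lists, k=60):
    scores = {}
    for ranking in ranked_lists:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

bm25_hits = ["d1", "d2", "d3"]    # lexical retriever's ranking
dense_hits = ["d3", "d1", "d4"]   # embedding retriever's ranking
print(rrf([bm25_hits, dense_hits]))  # ['d1', 'd3', 'd2', 'd4']
```

    Documents ranked well by both retrievers (d1, d3) rise above documents that only one retriever found, which is the whole point of fusing lexical and semantic search.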

  • Video-LLaMA

    [EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

  • lightllm

    LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months, or since we started tracking (Dec 2020).


Python llama related posts

  • Google and Anthropic are working on AI agents - so I made an open source alternative

    3 projects | dev.to | 9 Jan 2025
  • T2x – a CLI tool for AI-first text operations

    6 projects | news.ycombinator.com | 30 Dec 2024
  • Llama-3.3-70B-Instruct

    4 projects | news.ycombinator.com | 6 Dec 2024
  • Fish Speech 1.5

    1 project | news.ycombinator.com | 4 Dec 2024
  • Build a Competitive Intelligence Tool Powered by AI

    3 projects | dev.to | 29 Nov 2024
  • Writing an AnythingLLM Custom Agent Skill to Trigger Make.com Webhooks

    5 projects | dev.to | 27 Nov 2024
  • AI Copilot for Data Analysis

    1 project | news.ycombinator.com | 13 Nov 2024

Index

What are some of the best open-source llama projects in Python? This list will help you:

Project Stars
1 LLaMA-Factory 37,832
2 vllm 33,579
3 LLaVA 21,014
4 unsloth 20,203
5 Chinese-LLaMA-Alpaca 18,623
6 fish-speech 18,294
7 ChuanhuChatGPT 15,341
8 PaddleNLP 12,282
9 ludwig 11,272
10 OpenLLM 10,368
11 shell_gpt 10,104
12 petals 9,340
13 GPTCache 7,333
14 inference 5,895
15 Baichuan-7B 5,680
16 lmdeploy 5,137
17 mergekit 5,097
18 Huatuo-Llama-Med-Chinese 4,539
19 h2o-llmstudio 4,097
20 InternGPT 3,213
21 langroid 2,912
22 Video-LLaMA 2,873
23 lightllm 2,746

