Top 23 Python llama2 Projects

open-interpreter

24 47,712 9.9 Python

A natural language interface for computers

Project mention: OpenInterpreter – Natural language interface to your computer | news.ycombinator.com | 2024-04-23

LLaVA

20 16,101 9.4 Python

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Project mention: Show HN: I Remade the Fake Google Gemini Demo, Except Using GPT-4 and It's Real | news.ycombinator.com | 2023-12-10

Update: For anyone else facing the commercial use question on LLaVA - it is licensed under Apache 2.0. Can be used commercially with attribution: https://github.com/haotian-liu/LLaVA/blob/main/LICENSE

h2ogpt

28 10,398 10.0 Python

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/

Project mention: Ask HN: How do I train a custom LLM/ChatGPT on my own documents in Dec 2023? | news.ycombinator.com | 2023-12-24

As others have said you want RAG.
The most feature complete implementation I've seen is h2ogpt[0] (not affiliated).
The code is kind of a mess (most of the logic is in an ~8000 line python file) but it supports ingestion of everything from YouTube videos to docx, pdf, etc - either offline or from the web interface. It uses langchain and a ton of additional open source libraries under the hood. It can run directly on Linux, via docker, or with one-click installers for Mac and Windows.
It has various model hosting implementations built in - transformers, exllama, llama.cpp as well as support for model serving frameworks like vLLM, HF TGI, etc or just OpenAI.
You can also define your preferred embedding model along with various other parameters but I've found the out of box defaults to be pretty sane and usable.
[0] - https://github.com/h2oai/h2ogpt
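
To make the RAG idea in the comment above concrete, here is a minimal sketch in plain Python. It is not how h2ogpt works internally: it swaps real embedding models and vector stores for a simple bag-of-words cosine similarity, and the function names (`tokenize`, `retrieve`, `build_prompt`) and the sample documents are hypothetical, chosen only for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(question, chunks, k=2):
    """Return the k document chunks most similar to the question."""
    q = Counter(tokenize(question))
    ranked = sorted(
        chunks,
        key=lambda c: cosine(q, Counter(tokenize(c))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(question, chunks, k=2):
    """Place the retrieved context ahead of the question (the 'augmented' part)."""
    context = "\n".join(f"- {c}" for c in retrieve(question, chunks, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


# Hypothetical sample documents, standing in for ingested files.
docs = [
    "Invoices are due within 30 days of the billing date.",
    "The office is closed on public holidays.",
    "Late invoices incur a 2 percent monthly fee.",
]
prompt = build_prompt("When are invoices due?", docs, k=2)
print(prompt)
```

The resulting prompt would then be sent to whichever model backend is in use; a real pipeline would typically replace the word-count vectors with a learned embedding model.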

petals

98 8,661 8.5 Python

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Project mention: Mistral Large | news.ycombinator.com | 2024-02-26

So how long until we can do an open source Mistral Large?
We could make a start on Petals or some other open source distributed training network cluster possibly?
[0] https://petals.dev/

Baichuan2

1 3,929 7.3 Python

A series of large language models developed by Baichuan Intelligent Technology

Project mention: Baichuan 2 | news.ycombinator.com | 2023-10-12

opencompass

1 2,481 9.7 Python

OpenCompass is an LLM evaluation platform, supporting a wide range of models (InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Project mention: Show HN: Times faster LLM evaluation with Bayesian optimization | news.ycombinator.com | 2024-02-13

Fair question.
Evaluate refers to the phase after training to check if the training is good.
Usually the flow goes training -> evaluation -> deployment (what you called inference). This project is aimed for evaluation. Evaluation can be slow (might even be slower than training if you're finetuning on a small domain specific subset)!
So there are [quite](https://github.com/microsoft/promptbench) [a](https://github.com/confident-ai/deepeval) [few](https://github.com/openai/evals) [frameworks](https://github.com/EleutherAI/lm-evaluation-harness) working on evaluation, however, all of them are quite slow, because LLM are slow if you don't have infinite money. [This](https://github.com/open-compass/opencompass) one tries to speed up by parallelizing on multiple computers, but none of them takes advantage of the fact that many evaluation queries might be similar and all try to evaluate on all given queries. And that's where this project might come in handy.
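
The observation that many evaluation queries are similar can be illustrated with a small sketch. This is not the Bayesian optimization approach of the project discussed in the comment, nor anything OpenCompass does; it is a much simpler stand-in that greedily groups near-duplicate queries by token overlap, scores one representative per group, and weights that result by group size. The function names, the threshold, and the `fake_model_is_correct` callback are all hypothetical.

```python
import re


def token_set(text):
    """Set of lowercase alphanumeric tokens in the text."""
    return frozenset(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a, b):
    """Jaccard similarity of two token sets."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def group_similar(queries, threshold=0.6):
    """Greedy grouping: a query joins the first group whose representative is similar enough."""
    groups = []  # each entry: (representative token set, list of query indices)
    for i, query in enumerate(queries):
        tokens = token_set(query)
        for rep_tokens, members in groups:
            if jaccard(tokens, rep_tokens) >= threshold:
                members.append(i)
                break
        else:
            groups.append((tokens, [i]))
    return [members for _, members in groups]


def estimate_accuracy(queries, is_correct, threshold=0.6):
    """Evaluate one representative per group; return (estimated accuracy, number of model calls)."""
    if not queries:
        return 0.0, 0
    groups = group_similar(queries, threshold)
    weighted_hits = sum(len(g) for g in groups if is_correct(queries[g[0]]))
    return weighted_hits / len(queries), len(groups)


# Hypothetical benchmark with near-duplicate queries.
queries = [
    "What is 2 + 2?",
    "what is 2 + 2",
    "Name the capital of France.",
    "Name the capital of France",
    "Translate 'hello' to Spanish.",
]


def fake_model_is_correct(query):
    """Stand-in for an expensive model call plus grading (assumption for illustration)."""
    return "capital" not in query.lower()


accuracy, calls = estimate_accuracy(queries, fake_model_is_correct)
print(f"estimated accuracy={accuracy:.2f} using {calls} of {len(queries)} calls")
```

The estimate is only as good as the assumption that similar queries get the same outcome, which is the trade-off any such shortcut has to manage.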

api-for-open-llm

1 1,952 9.5 Python

OpenAI-style API for open large language models, using LLMs just like ChatGPT! Support for LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3, etc. A unified backend interface for open-source large language models.

Project mention: FLaNK Stack Weekly for 14 Aug 2023 | dev.to | 2023-08-14

SolidGPT

9 1,948 8.8 Python

Developer AI Persona Search Agent

Project mention: Best coding AI to use with entire codebase | /r/ChatGPTCoding | 2023-12-10

Another thing to try is one of the repositories like SolidGPT: https://github.com/AI-Citizen/SolidGPT

DemoGPT

31 1,566 9.2 Python

Create 🦜️🔗 LangChain apps by just using prompts🌟 Star to support our work!

Project mention: Llama 2 Code Interpreter | news.ycombinator.com | 2023-07-23

LLMCompiler

2 1,056 7.6 Python

LLMCompiler: An LLM Compiler for Parallel Function Calling

Project mention: FLaNK Weekly 18 Dec 2023 | dev.to | 2023-12-18

autollm

1 908 9.0 Python

Ship RAG based LLM web apps in seconds.

Project mention: FLaNK Stack Weekly 06 Nov 2023 | dev.to | 2023-11-06

LLM-Finetuning-Toolkit

1 659 9.6 Python

Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.

Project mention: Show HN: Toolkit for LLM Fine-Tuning, Ablating and Testing | news.ycombinator.com | 2024-04-07

code-llama-for-vscode

5 506 4.6 Python

Use Code Llama with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.

Project mention: Stable Code 3B: Coding on the Edge | news.ycombinator.com | 2024-01-16

How are people using codellama and this in their workflows?
I found one option: https://github.com/xNul/code-llama-for-vscode
But I'm guessing there are others, and they might differ in how they provide context to the model.

Finetune_LLMs

2 438 8.5 Python

Repo for fine-tuning Causal LLMs

slowllama

4 413 8.3 Python

Finetune llama2-70b and codellama on MacBook Air without quantization

Project mention: AI — weekly megathread! | /r/artificial | 2023-10-15

slowllama: Finetune llama2-70b and codellama on MacBook Air without quantization.

hqq

7 409 9.3 Python

Official implementation of Half-Quadratic Quantization (HQQ)

Project mention: Half-Quadratic Quantization of Large Machine Learning Models | news.ycombinator.com | 2024-03-14

Owl

2 406 9.7 Python

A personal wearable AI that runs locally (by OwlAIProject)

Project mention: Limitless: Personalized AI powered by what you've seen, said, and heard | news.ycombinator.com | 2024-04-15

xllm

2 348 8.5 Python

🦖 X—LLM: Cutting Edge & Easy LLM Finetuning

Project mention: X–LLM: Cutting Edge and Easy LLM Finetuning | news.ycombinator.com | 2023-11-16

AnglE

12 341 9.3 Python

Angle-optimized Text Embeddings | 🔥 SOTA on STS and MTEB Leaderboard (by SeanLee97)

Project mention: FLaNK Stack Weekly 22 January 2024 | dev.to | 2024-01-22

chat.petals.dev

8 296 7.5 Python

💬 Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client

Project mention: Make no mistake—AI is owned by Big Tech | /r/transhumanism | 2023-12-07

ETA: https://chat.petals.dev

zeta

1 242 9.8 Python

Build high-performance AI models with modular building blocks (by kyegomez)

Project mention: Zetascale, Build high-performance AI models with modular building blocks | news.ycombinator.com | 2024-02-09

simulatrex-engine

3 208 7.4 Python

Enable decision-making based on simulations

Project mention: Simulatrex, an open-source Large Language Model based simulation framework | news.ycombinator.com | 2024-02-17

GoLLIE

1 204 9.6 Python

Guideline following Large Language Model for Information Extraction

Project mention: A LLM trained to follow annotation guidelines, for information extraction tasks | news.ycombinator.com | 2023-10-30

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python llama2 related posts

Limitless: Personalized AI powered by what you've seen, said, and heard
1 project | news.ycombinator.com | 15 Apr 2024
Do you Know! Llama ?
1 project | dev.to | 11 Apr 2024
Ask HN: What is the current (Apr. 2024) gold standard of running an LLM locally?
11 projects | news.ycombinator.com | 1 Apr 2024
Half-Quadratic Quantization of Large Machine Learning Models
1 project | news.ycombinator.com | 14 Mar 2024
Simulatrex, an open-source Large Language Model based simulation framework
1 project | news.ycombinator.com | 17 Feb 2024
A list of system prompts used for biomedical RAG (KG-RAG) using LLM
1 project | news.ycombinator.com | 10 Jan 2024
Ask HN: Do you know any new llama2.c implementations not mentioned in the repo
2 projects | news.ycombinator.com | 10 Jan 2024

Index

What are some of the best open-source llama2 projects in Python? This list will help you:

#	Project	Stars
1	open-interpreter	47,712
2	LLaVA	16,101
3	h2ogpt	10,398
4	petals	8,661
5	Baichuan2	3,929
6	opencompass	2,481
7	api-for-open-llm	1,952
8	SolidGPT	1,948
9	DemoGPT	1,566
10	LLMCompiler	1,056
11	autollm	908
12	LLM-Finetuning-Toolkit	659
13	code-llama-for-vscode	506
14	Finetune_LLMs	438
15	slowllama	413
16	hqq	409
17	Owl	406
18	xllm	348
19	AnglE	341
20	chat.petals.dev	296
21	zeta	242
22	simulatrex-engine	208
23	GoLLIE	204