Top 20 language-model Open-Source Projects

transformers

173 124,557 10.0 Python

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Project mention: AI enthusiasm #6 - Finetune any LLM you want💡 | dev.to | 2024-04-16

Most of this tutorial is based on Hugging Face course about Transformers and on Niels Rogge's Transformers tutorials: make sure to check their work and give them a star on GitHub, if you please ❤️

petals

98 8,631 8.5 Python

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Project mention: Mistral Large | news.ycombinator.com | 2024-02-26

So how long until we can do an open source Mistral Large?
We could make a start on Petals or some other open source distributed training network cluster possibly?
[0] https://petals.dev/

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
argos-translate

30 3,208 7.6 Python

Open-source offline translation library written in Python

Project mention: Fast and secure translation on your local machine with a GUI | news.ycombinator.com | 2024-04-13

Interestingly, I think this is actually related to the offline translation features built into Firefox. Both are products of "Project Bergamot", but the Mozilla-maintained version was later merged into the Firefox application:
https://browser.mt/
https://blog.mozilla.org/en/mozilla/local-translation-add-on...
https://hacks.mozilla.org/2022/06/training-efficient-neural-...
https://github.com/mozilla/firefox-translations
https://firefox-source-docs.mozilla.org/toolkit/components/t...
Extra webpage with screenshot and links, impossible to search for normally:
https://translatelocally.com/downloads/
Does one thing and does it well.
Oh— For downloading models, it's much easier to pipe/`xargs` `translateLocally --available-models` into `translateLocally -d` than go through the GUI.
---
Other self-hostable translation tools:
https://www.apertium.org/index.eng.html
- Traditional rule-based translation. Seems to work pretty well, but no good desktop frontend.
https://www.argosopentech.com/
- Works, but crashy desktop app.
https://libretranslate.com/
- API wrapping Argos Translate.
https://lingva.thedaviddelta.com/
- Google Translate scraper/privacy frontend.
https://euroglot.com/
- Proprietary, subscription trialware.

ecco

6 1,899 3.6 Jupyter Notebook

Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0).
FARM

3 1,723 0.0 Python

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
Awesome-LLM-Reasoning

1 1,062 7.3

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.

Project mention: Techbro says that GPT models will soon have over 9000 IQ in ~5 years | /r/SneerClub | 2023-05-04

Get-Things-Done-with-Prompt-Engineering-and-LangChain

18 922 8.2 Jupyter Notebook

LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading and indexing data, creating prompt templates, CSV agents, and using retrieval QA chains to query the custom data. Projects for using a private LLM (Llama 2) for chat with PDF files, tweets sentiment analysis.

Project mention: Get-Things-Done-with-Prompt-Engineering-and-LangChain: NEW Data - star count:617.0 | /r/algoprojects | 2023-12-10

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
tango

2 901 8.7 Python

A family of diffusion models for text-to-audio generation. (by declare-lab)

Project mention: [Research] [Project] Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model | /r/MachineLearning | 2023-05-04

Found relevant code at https://github.com/declare-lab/tango + all code implementations here

happy-transformer

1 497 9.0 Python

Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.
xmtf

2 493 5.9 Jupyter Notebook

Crosslingual Generalization through Multitask Finetuning
ontogpt

2 493 9.8 Jupyter Notebook

LLM-based ontological extraction tools, including SPIRES

Project mention: GPT-based ontological extraction tools, including SPIRES | news.ycombinator.com | 2023-07-24

adaptnlp

2 414 0.0 Jupyter Notebook

An easy to use Natural Language Processing library and framework for predicting, training, fine-tuning, and serving up state-of-the-art NLP models.
agency

3 374 8.2 Go

🕵️‍♂️ Library designed for developers eager to explore the potential of Large Language Models (LLMs) and other generative AI through a clean, effective, and Go-idiomatic approach. (by neurocult)

Project mention: Agency: Pure Go LangChain Alternative | news.ycombinator.com | 2023-11-27

I would, at the very least, wrap the errors being returned inside the process function https://github.com/neurocult/agency/blob/14b14e50a7570189388...
Or, I suppose the user must handle exception behavior in their custom `OperationHandler`

chat.petals.dev

8 296 7.5 Python

💬 Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client

Project mention: Make no mistake—AI is owned by Big Tech | /r/transhumanism | 2023-12-07

ETA: https://chat.petals.dev

extreme-bert

2 283 0.0 Python

ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT”.
datablations

6 282 6.9 Jupyter Notebook

Scaling Data-Constrained Language Models

Project mention: Gemini is only 1x Chinchilla, so it undertrained for production | /r/singularity | 2023-12-07

1x chinchilla means it's not really undertrained but that more could be squeezed without excessive difficulty https://arxiv.org/abs/2305.16264

voice-assistant-whisper-chatgpt

2 219 1.3 Jupyter Notebook

This repository will guide you to create your own Smart Virtual Assistant like Google Assistant using Open AI's ChatGPT, Whisper. The entire solution is created using Python & Gradio.
dsir

1 185 7.9 Python

DSIR large-scale data selection framework for language model training
AREkit

3 52 8.9 Python

Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML
code-representations-ml-brain

1 6 10.0 Python

[NeurIPS 2022] "Convergent Representations of Computer Programs in Human and Artificial Neural Networks" by Shashank Srikant*, Benjamin Lipkin*, Anna A. Ivanova, Evelina Fedorenko, Una-May O'Reilly.
SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2024-04-16.

language-models related posts

Mistral Large
4 projects | news.ycombinator.com | 26 Feb 2024
Gemini is only 1x Chinchilla, so it undertrained for production
1 project | /r/singularity | 7 Dec 2023
Can LLMs learn from a single example?
2 projects | news.ycombinator.com | 5 Sep 2023
Chinchilla’s Death
2 projects | news.ycombinator.com | 4 Sep 2023
GPT-based ontological extraction tools, including SPIRES
1 project | news.ycombinator.com | 24 Jul 2023
RWKV Pile+ seems to be training on far more tokens than any LLM ever has
1 project | /r/LocalLLaMA | 16 Jun 2023
[Research] [Project] Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model
2 projects | /r/MachineLearning | 4 May 2023
A note from our sponsor - WorkOS
workos.com | 19 Apr 2024

The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →

Index

What are some of the best open-source language-model projects? This list will help you:

	Project	Stars
1	transformers	124,557
2	petals	8,631
3	argos-translate	3,208
4	ecco	1,899
5	FARM	1,723
6	Awesome-LLM-Reasoning	1,062
7	Get-Things-Done-with-Prompt-Engineering-and-LangChain	922
8	tango	901
9	happy-transformer	497
10	xmtf	493
11	ontogpt	493
12	adaptnlp	414
13	agency	374
14	chat.petals.dev	296
15	extreme-bert	283
16	datablations	282
17	voice-assistant-whisper-chatgpt	219
18	dsir	185
19	AREkit	52
20	code-representations-ml-brain	6