Top 23 Python Transformer Projects

transformers

Mentions: 173 | Stars: 124,557 | Activity: 10.0 | Language: Python

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Project mention: AI enthusiasm #6 - Finetune any LLM you want💡 | dev.to | 2024-04-16

Most of this tutorial is based on the Hugging Face course on Transformers and on Niels Rogge's Transformers tutorials: make sure to check out their work and give them a star on GitHub, if you please ❤️

mmdetection

Mentions: 23 | Stars: 27,658 | Activity: 8.7 | Language: Python

OpenMMLab Detection Toolbox and Benchmark
vllm

Mentions: 30 | Stars: 17,656 | Activity: 9.9 | Language: Python

A high-throughput and memory-efficient inference and serving engine for LLMs

Project mention: Mistral AI Launches New 8x22B Moe Model | news.ycombinator.com | 2024-04-09

The easiest option is to use vllm (https://github.com/vllm-project/vllm) to run it on a couple of A100s, and you can benchmark it using this library (https://github.com/EleutherAI/lm-evaluation-harness)

best-of-ml-python

Mentions: 16 | Stars: 15,302 | Activity: 7.9 | Language: Python

🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
RWKV-LM

Mentions: 84 | Stars: 11,579 | Activity: 8.8 | Language: Python

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Project mention: Do LLMs need a context window? | news.ycombinator.com | 2023-12-25

https://github.com/BlinkDL/RWKV-LM#rwkv-discord-httpsdiscord... lists a number of implementations of various versions of RWKV.
https://github.com/BlinkDL/RWKV-LM#rwkv-parallelizable-rnn-w... :
> RWKV: Parallelizable RNN with Transformer-level LLM Performance (pronounced as "RwaKuv", from 4 major params: R W K V)
> RWKV is an RNN with Transformer-level LLM performance, which can also be directly trained like a GPT transformer (parallelizable). And it's 100% attention-free. You only need the hidden state at position t to compute the state at position t+1. You can use the "GPT" mode to quickly compute the hidden state for the "RNN" mode.
> So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding (using the final hidden state).
> "Our latest version is RWKV-6,*

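The quoted description says that an RNN-style model needs only the state at position t to compute the state at position t+1, and that a parallel "GPT" mode can produce the same states. The sketch below illustrates that idea with a toy scalar recurrence. It is not RWKV's actual formula: the update rule, the `decay` parameter, and the function names `rnn_mode` and `parallel_mode` are all assumptions made up for this example.

```python
# Toy illustration only: a scalar linear recurrence, NOT the RWKV update rule.
# Assumed rule (hypothetical): state[t] = decay * state[t-1] + x[t]


def rnn_mode(xs, decay):
    """Process inputs one at a time, carrying a single running state."""
    state = 0.0
    states = []
    for x in xs:
        # Only the previous state and the current input are needed.
        state = decay * state + x
        states.append(state)
    return states


def parallel_mode(xs, decay):
    """Compute each state directly from the inputs, with no carried state.

    Every position is independent of the others here, so in principle
    all positions could be evaluated at the same time.
    """
    return [
        sum(decay ** (t - i) * xs[i] for i in range(t + 1))
        for t in range(len(xs))
    ]


xs = [1.0, 2.0, 3.0, 4.0]
sequential = rnn_mode(xs, decay=0.5)
parallel = parallel_mode(xs, decay=0.5)
print(sequential)
print(parallel)
```

Both functions return the same list of states. The sequential version keeps constant memory per step, which is the property the quote associates with fast inference and low VRAM use; the parallel version trades that for position-wise independence, which is what makes training over a whole sequence parallelizable.
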
LaTeX-OCR

Mentions: 21 | Stars: 10,711 | Activity: 3.6 | Language: Python

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Project mention: Detexify LaTeX Handwriting Symbol Recognition | news.ycombinator.com | 2023-11-14

PaddleSpeech

Mentions: 6 | Stars: 10,069 | Activity: 7.6 | Language: Python

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

PaddlePaddle/PaddleSpeech

petals

Mentions: 98 | Stars: 8,631 | Activity: 8.5 | Language: Python

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Project mention: Mistral Large | news.ycombinator.com | 2024-02-26

So how long until we can do an open source Mistral Large?
We could make a start on Petals or some other open source distributed training network cluster possibly?
[0] https://petals.dev/

faster-whisper

Mentions: 22 | Stars: 8,578 | Activity: 8.3 | Language: Python

Faster Whisper transcription with CTranslate2

Project mention: Using Groq to Build a Real-Time Language Translation App | dev.to | 2024-04-05

For our real-time STT needs, we'll employ a fantastic library called faster-whisper.

PaddleSeg

Mentions: 17 | Stars: 8,227 | Activity: 7.9 | Language: Python

Easy-to-use image segmentation library with an awesome pre-trained model zoo, supporting a wide range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Matting, 3D Segmentation, etc.
LMFlow

Mentions: 10 | Stars: 7,975 | Activity: 9.5 | Language: Python

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Project mention: Your weekly machine learning digest | /r/learnmachinelearning | 2023-07-03

trax

Mentions: 6 | Stars: 7,948 | Activity: 4.7 | Language: Python

Trax — Deep Learning with Clear Code and Speed

Project mention: Replit's new Code LLM was trained in 1 week | news.ycombinator.com | 2023-05-03

and the implementation https://github.com/google/trax/blob/master/trax/models/resea... if you are interested.
Hope you get to look into this!

text-generation-inference

Mentions: 28 | Stars: 7,722 | Activity: 9.6 | Language: Python

Large Language Model Text Generation Inference

Project mention: Zephyr 141B, a Mixtral 8x22B fine-tune, is now available in Hugging Chat | news.ycombinator.com | 2024-04-12

I wanted to write that TGI inference engine is not Open Source anymore, but they have reverted the license back to Apache 2.0 for the new version TGI v2.0: https://github.com/huggingface/text-generation-inference/rel...
Good news!

jukebox

Mentions: 129 | Stars: 7,554 | Activity: 0.0 | Language: Python

Code for the paper "Jukebox: A Generative Model for Music"

Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

openai/jukebox: Music Generation

mmsegmentation

Mentions: 7 | Stars: 7,342 | Activity: 8.6 | Language: Python

OpenMMLab Semantic Segmentation Toolbox and Benchmark.
GPT2-Chinese

Mentions: 2 | Stars: 7,342 | Activity: 2.8 | Language: Python

Chinese version of GPT2 training code, using BERT tokenizer.
bertviz

Mentions: 15 | Stars: 6,356 | Activity: 3.9 | Language: Python

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Project mention: StreamingLLM: tiny tweak to KV LRU improves long conversations | news.ycombinator.com | 2024-02-13

This seems only to work cause large GPTs have redundant, undercomplex attentions. See this issue in BertViz about attention in Llama: https://github.com/jessevig/bertviz/issues/128

BERT-pytorch

Mentions: 1 | Stars: 5,979 | Activity: 0.0 | Language: Python

Google AI 2018 BERT PyTorch implementation
Informer2020

Mentions: 2 | Stars: 4,890 | Activity: 0.6 | Language: Python

The GitHub repository for the paper "Informer" accepted by AAAI 2021.
lm-evaluation-harness

Mentions: 34 | Stars: 4,848 | Activity: 9.9 | Language: Python

A framework for few-shot evaluation of language models.

Project mention: Mistral AI Launches New 8x22B Moe Model | news.ycombinator.com | 2024-04-09

The easiest option is to use vllm (https://github.com/vllm-project/vllm) to run it on a couple of A100s, and you can benchmark it using this library (https://github.com/EleutherAI/lm-evaluation-harness)

OpenPrompt

Mentions: 1 | Stars: 4,141 | Activity: 4.4 | Language: Python

An Open-Source Framework for Prompt-Learning.
manga-image-translator

Mentions: 12 | Stars: 4,127 | Activity: 9.4 | Language: Python

Translate manga/images: one-click translation of text in all kinds of images. https://cotrans.touhou.ai/

Project mention: [DISC] - The angel who came to pick me up is a Gal (Oneshot by Shiraishi Kouhei) | /r/manga | 2023-09-06

OCR works pretty good. ocr.space, ocr.best and cotrans.touhou.ai/ are all pretty nice.

SwinIR

Mentions: 27 | Stars: 4,060 | Activity: 0.0 | Language: Python

SwinIR: Image Restoration Using Swin Transformer (official repository)

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2024-04-16.

Python Transformer related posts

Mistral AI Launches New 8x22B Moe Model
4 projects | news.ycombinator.com | 9 Apr 2024
LLMs on your local Computer (Part 1)
7 projects | dev.to | 11 Mar 2024
Voxos.ai – An Open-Source Desktop Voice Assistant
7 projects | news.ycombinator.com | 19 Jan 2024
RAG Using Structured Data: Overview and Important Questions
5 projects | news.ycombinator.com | 10 Jan 2024
I made an Educational Transformer from scratch
1 project | /r/pytorch | 10 Dec 2023
How can I make a better tokenizer?
1 project | /r/MLQuestions | 7 Dec 2023
Detexify LaTeX Handwriting Symbol Recognition
5 projects | news.ycombinator.com | 14 Nov 2023

Index

What are some of the best open-source Transformer projects in Python? This list will help you:

Rank	Project	Stars
1	transformers	124,557
2	mmdetection	27,658
3	vllm	17,656
4	best-of-ml-python	15,302
5	RWKV-LM	11,579
6	LaTeX-OCR	10,711
7	PaddleSpeech	10,069
8	petals	8,631
9	faster-whisper	8,578
10	PaddleSeg	8,227
11	LMFlow	7,975
12	trax	7,948
13	text-generation-inference	7,722
14	jukebox	7,554
15	mmsegmentation	7,342
16	GPT2-Chinese	7,342
17	bertviz	6,356
18	BERT-pytorch	5,979
19	Informer2020	4,890
20	lm-evaluation-harness	4,848
21	OpenPrompt	4,141
22	manga-image-translator	4,127
23	SwinIR	4,060
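
The index is ordered by star count, and the counts are written with thousands separators. As a minimal sketch of how such a ranking could be reproduced, the snippet below parses the first five star counts from the table and sorts them in descending order. The `rows` list and the `parse_stars` helper are assumptions made for this example, not part of any tool the list itself uses.

```python
# First five entries of the index, deliberately shuffled, as (project, stars) text pairs.
rows = [
    ("vllm", "17,656"),
    ("transformers", "124,557"),
    ("RWKV-LM", "11,579"),
    ("mmdetection", "27,658"),
    ("best-of-ml-python", "15,302"),
]


def parse_stars(text):
    """Convert a count such as '124,557' to an integer (hypothetical helper)."""
    return int(text.replace(",", ""))


# Sort by numeric star count, highest first, as the list's note describes.
ranked = sorted(rows, key=lambda row: parse_stars(row[1]), reverse=True)

for rank, (project, stars) in enumerate(ranked, start=1):
    print(f"{rank}\t{project}\t{stars}")
```

Sorting on the parsed integer rather than the raw string matters here: a plain string sort would place "17,656" ahead of "124,557".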