Top 23 Python Whisper Projects

PaddleSpeech

6 10,120 7.6 Python

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

PaddlePaddle/PaddleSpeech

buzz

21 9,869 8.5 Python

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Project mention: Buzz: Transcribe and translate audio offline on your personal computer | news.ycombinator.com | 2024-03-21

whisperX

24 8,965 8.4 Python

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Project mention: Easy video transcription and subtitling with Whisper, FFmpeg, and Python | news.ycombinator.com | 2024-04-06

It uses this, which does support diarization: https://github.com/m-bain/whisperX

faster-whisper

22 8,723 8.3 Python

Faster Whisper transcription with CTranslate2

Project mention: Using Groq to Build a Real-Time Language Translation App | dev.to | 2024-04-05

For our real-time STT needs, we'll employ a fantastic library called faster-whisper.
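As a rough illustration of what using faster-whisper looks like, here is a minimal sketch for transcribing a single file. It is file-based rather than real-time, and the model size, the CPU device, and the int8 compute type are assumptions picked for modest hardware, not settings taken from the post above. The helper names `format_timestamp` and `transcribe_file` are hypothetical.

```python
def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT-style HH:MM:SS,mmm string."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def transcribe_file(path: str, model_size: str = "small") -> list[str]:
    """Transcribe one audio file and return one formatted line per segment."""
    # Imported lazily so format_timestamp works without the package installed.
    from faster_whisper import WhisperModel

    # Assumption: CPU with int8 quantization; adjust for your hardware.
    model = WhisperModel(model_size, device="cpu", compute_type="int8")
    segments, _info = model.transcribe(path, beam_size=5)
    # `segments` is a generator, so decoding happens as it is consumed here.
    return [
        f"[{format_timestamp(s.start)} -> {format_timestamp(s.end)}] {s.text.strip()}"
        for s in segments
    ]
```

Calling `transcribe_file("audio.mp3")` downloads the chosen model on first use, so the first run may take noticeably longer than later ones.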

distil-whisper

9 3,125 8.5 Python

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Project mention: FLaNK Stack 05 Feb 2024 | dev.to | 2024-02-05
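For readers unfamiliar with the metric in the description above: word error rate (WER) is usually computed as the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and the model output, divided by the number of reference words. The sketch below is a plain-Python version under that definition. It does no text normalization, and how distil-whisper's own evaluation normalizes text is not stated here, so treat it as an illustration only.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)
```

For example, dropping one word from a six-word reference gives a WER of 1/6, roughly 16.7%.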

FunASR

2 3,299 9.9 Python

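A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. A speech recognition toolkit with a rich set of high-performing open-source pretrained models; it supports speech recognition, voice endpoint detection, and text post-processing, and can be deployed as a service.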

Project mention: FunASR: Fundamental End-to-End Speech Recognition Toolkit | news.ycombinator.com | 2024-01-13

chatgpt-telegram-bot

3 2,686 8.8 Python

🤖 A Telegram bot that integrates with OpenAI's official ChatGPT APIs to provide answers, written in Python (by n3d1117)

Project mention: Are you selfhosting a ChatGPT alternative? | /r/selfhosted | 2023-06-09

inference

2 2,512 9.7 Python

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

Project mention: GreptimeAI + Xinference - Efficient Deployment and Monitoring of Your LLM Applications | dev.to | 2024-01-24

Xorbits Inference (Xinference) is an open-source platform to streamline the operation and integration of a wide array of AI models. With Xinference, you’re empowered to run inference using any open-source LLMs, embedding models, and multimodal models either in the cloud or on your own premises, and create robust AI-driven applications. It provides a RESTful API compatible with OpenAI API, Python SDK, CLI, and WebUI. Furthermore, it integrates third-party developer tools like LangChain, LlamaIndex, and Dify, facilitating model integration and development.

whisper-timestamped

2 1,501 8.3 Python

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Project mention: Show HN: AI Dub Tool I Made to Watch Foreign Language Videos with My 7-Year-Old | news.ycombinator.com | 2024-02-28

Yes. But Whisper's word-level timings are actually quite inaccurate out of the box. There are some Python libraries that mitigate that. I tested several of them. whisper-timestamped seems to be the best one. [0]
[0] https://github.com/linto-ai/whisper-timestamped
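A minimal sketch of how the word-level timings and confidence scores might be consumed. The loading and transcription calls follow the usage shown in the whisper-timestamped README. The result layout (a `segments` list whose entries carry a `words` list with `text`, `start`, `end`, and `confidence` keys) is an assumption based on that README, and `low_confidence_words` is a hypothetical helper, not part of the library.

```python
def low_confidence_words(result: dict, threshold: float = 0.5) -> list[tuple[str, float, float]]:
    """Return (text, start, end) for every word scored below `threshold`."""
    flagged = []
    for segment in result.get("segments", []):
        for word in segment.get("words", []):
            # Words without a confidence score are treated as fully confident.
            if word.get("confidence", 1.0) < threshold:
                flagged.append((word["text"], word["start"], word["end"]))
    return flagged


def transcribe_with_words(path: str, model_name: str = "tiny") -> dict:
    """Run whisper-timestamped on one file (needs the package installed)."""
    # Imported lazily so the helper above can be used on saved results.
    import whisper_timestamped as whisper

    audio = whisper.load_audio(path)
    model = whisper.load_model(model_name, device="cpu")
    return whisper.transcribe(model, audio)
```

Flagged words are a reasonable place to start when spot-checking subtitles by hand, since their timings are the most likely to be off.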

yt-whisper

3 1,313 0.0 Python

Using OpenAI's Whisper to automatically generate YouTube subtitles

auto-subtitle

4 1,173 2.1 Python

Automatically generate and overlay subtitles for any video.

WhisperLive

4 1,180 9.4 Python

A nearly-live implementation of OpenAI's Whisper.

Project mention: Show HN: WhisperFusion – Ultra-low latency conversations with an AI chatbot | news.ycombinator.com | 2024-01-29

Everything runs locally, we use:
- WhisperLive for the transcription - https://github.com/collabora/WhisperLive

subsai

3 1,051 8.1 Python

🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️

Project mention: Porting CP/M to the Brother SuperPowerNote Z80 laptop thing [video] | news.ycombinator.com | 2023-12-13

Adding Whisper subtitles was really easy and they're dramatically better than the automatic Google ones (I did it via https://github.com/abdeladim-s/subsai, which was really easy to use). So there is now a reasonably good transcript available in the video comments.

whisper.api

4 838 8.0 Python

This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.

Project mention: Do you know any quality FastAPI starter projects? | /r/flask | 2023-10-10

truss

3 833 9.6 Python

The simplest way to serve AI/ML models in production (by basetenlabs)

whisper-standalone-win

3 757 8.6 Python

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

Project mention: Question : is this a movie only tracker? | /r/Karagarga | 2023-07-03

On the other hand, if you need subtitles for a movie that doesn't have some. There are some automated solutions like Whisper that can do a very decent job in most cases : https://github.com/Purfview/whisper-standalone-win

whisper-playground

7 759 6.6 Python

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

whisper-ctranslate2

3 743 8.5 Python

Whisper command line client compatible with original OpenAI client based on CTranslate2.

Project mention: Firefox slow to load YouTube? Just another front in Google's war on ad blockers | news.ycombinator.com | 2023-12-12

Much better, actually. Try the large-v3 model, it's great. I use it via whisper-ctranslate2 which is a faster implementation.
https://github.com/Softcatala/whisper-ctranslate2

Whisper-WebUI

1 670 8.6 Python

A Web UI for easy subtitle generation using the Whisper model.

AI-Waifu-Vtuber

1 626 5.6 Python

AI Vtuber for Streaming on Youtube/Twitch

whisper_mic

2 615 7.3 Python

Project that allows one to use a microphone with OpenAI whisper.

agentchain

2 563 5.8 Python

Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks

pyannote-whisper

2 414 4.4 Python

Project mention: Summarization of long transcriptions | /r/LocalLLaMA | 2023-07-18

These will be 3-5 hour recordings of 4-5 people. I plan to use https://github.com/yinruiqing/pyannote-whisper to generate the transcript from the recording.

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python Whisper related posts

Easy video transcription and subtitling with Whisper, FFmpeg, and Python
1 project | news.ycombinator.com | 6 Apr 2024
SOTA ASR Tooling: Long-Form Transcription
1 project | news.ycombinator.com | 31 Mar 2024
Deploying whisperX on AWS SageMaker as Asynchronous Endpoint
2 projects | dev.to | 31 Mar 2024
Buzz: Transcribe and translate audio offline on your personal computer
1 project | news.ycombinator.com | 21 Mar 2024
Voxos.ai – An Open-Source Desktop Voice Assistant
7 projects | news.ycombinator.com | 19 Jan 2024
Jarvis: A Voice Virtual Assistant in Python (OpenAI, ElevenLabs, Deepgram)
7 projects | news.ycombinator.com | 18 Dec 2023
Whisper: Nvidia RTX 4090 vs. M1 Pro with MLX
10 projects | news.ycombinator.com | 13 Dec 2023

Index

What are some of the best open-source Whisper projects in Python? This list will help you:

#	Project	Stars
1	PaddleSpeech	10,120
2	buzz	9,869
3	whisperX	8,965
4	faster-whisper	8,723
5	distil-whisper	3,125
6	FunASR	3,299
7	chatgpt-telegram-bot	2,686
8	inference	2,512
9	whisper-timestamped	1,501
10	yt-whisper	1,313
11	auto-subtitle	1,173
12	WhisperLive	1,180
13	subsai	1,051
14	whisper.api	838
15	truss	833
16	whisper-standalone-win	757
17	whisper-playground	759
18	whisper-ctranslate2	743
19	Whisper-WebUI	670
20	AI-Waifu-Vtuber	626
21	whisper_mic	615
22	agentchain	563
23	pyannote-whisper	414
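
The note on this page says the list is ordered by GitHub stars. A small sketch for checking that against the index, using a few rows copied from the table above; `out_of_order_pairs` is a hypothetical helper. Adjacent rows that break descending order may simply reflect star counts captured at slightly different times.

```python
def out_of_order_pairs(rows: list[tuple[str, int]]) -> list[tuple[str, str]]:
    """Return adjacent (earlier, later) project pairs where the later has more stars."""
    return [
        (earlier[0], later[0])
        for earlier, later in zip(rows, rows[1:])
        if earlier[1] < later[1]
    ]


# A handful of (project, stars) rows taken from the index table.
rows = [
    ("faster-whisper", 8723),
    ("distil-whisper", 3125),
    ("FunASR", 3299),
    ("chatgpt-telegram-bot", 2686),
]

for earlier, later in out_of_order_pairs(rows):
    print(f"{earlier} is listed before {later} despite having fewer stars")
```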