Top 23 Python Whisper Projects
- PaddleSpeech: Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
- buzz: Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
- distil-whisper: Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
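- FunASR: A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. The toolkit includes a rich set of high-performing open-source pretrained models, supports speech recognition, voice activity detection, and text post-processing, and can be deployed as a service.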
- chatgpt-telegram-bot: 🤖 A Telegram bot that integrates with OpenAI's official ChatGPT APIs to provide answers, written in Python (by n3d1117)
- inference: Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
- whisper-timestamped: Multilingual Automatic Speech Recognition with word-level timestamps and confidence
- subsai: 🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️
- whisper.api: This project provides an API with user-level access support to transcribe speech to text using a fine-tuned and processed Whisper ASR model.
- whisper-standalone-win: Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
- whisper-playground: Build real-time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
- whisper-ctranslate2: Whisper command line client compatible with original OpenAI client based on CTranslate2.
- agentchain: Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks
PaddlePaddle/PaddleSpeech
Project mention: Buzz: Transcribe and translate audio offline on your personal computer | news.ycombinator.com | 2024-03-21
Project mention: Easy video transcription and subtitling with Whisper, FFmpeg, and Python | news.ycombinator.com | 2024-04-06
It uses this, which does support diarization: https://github.com/m-bain/whisperX
For our real-time STT needs, we'll employ a fantastic library called faster-whisper.
Project mention: FunASR: Fundamental End-to-End Speech Recognition Toolkit | news.ycombinator.com | 2024-01-13
Project mention: GreptimeAI + Xinference - Efficient Deployment and Monitoring of Your LLM Applications | dev.to | 2024-01-24
Xorbits Inference (Xinference) is an open-source platform to streamline the operation and integration of a wide array of AI models. With Xinference, you’re empowered to run inference using any open-source LLMs, embedding models, and multimodal models either in the cloud or on your own premises, and create robust AI-driven applications. It provides a RESTful API compatible with OpenAI API, Python SDK, CLI, and WebUI. Furthermore, it integrates third-party developer tools like LangChain, LlamaIndex, and Dify, facilitating model integration and development.
Project mention: Show HN: AI Dub Tool I Made to Watch Foreign Language Videos with My 7-Year-Old | news.ycombinator.com | 2024-02-28
Yes. But Whisper's word-level timings are actually quite inaccurate out of the box. There are some Python libraries that mitigate that. I tested several of them. whisper-timestamped seems to be the best one. [0]
[0] https://github.com/linto-ai/whisper-timestamped
Project mention: Show HN: WhisperFusion – Ultra-low latency conversations with an AI chatbot | news.ycombinator.com | 2024-01-29
Everything runs locally, we use:
- WhisperLive for the transcription - https://github.com/collabora/WhisperLive
Project mention: Porting CP/M to the Brother SuperPowerNote Z80 laptop thing [video] | news.ycombinator.com | 2023-12-13
Adding Whisper subtitles was really easy and they're dramatically better than the automatic Google ones (I did it via https://github.com/abdeladim-s/subsai, which was really easy to use). So there is now a reasonably good transcript available in the video comments.
On the other hand, if you need subtitles for a movie that doesn't have any, there are automated solutions like Whisper that can do a very decent job in most cases: https://github.com/Purfview/whisper-standalone-win
Project mention: Firefox slow to load YouTube? Just another front in Google's war on ad blockers | news.ycombinator.com | 2023-12-12
Much better, actually. Try the large-v3 model, it's great. I use it via whisper-ctranslate2 which is a faster implementation.
https://github.com/Softcatala/whisper-ctranslate2
These will be 3-5 hour recordings of 4-5 people. I plan to use https://github.com/yinruiqing/pyannote-whisper to generate the transcript from the recording.
Python Whisper related posts
- Easy video transcription and subtitling with Whisper, FFmpeg, and Python
- SOTA ASR Tooling: Long-Form Transcription
- Deploying whisperX on AWS SageMaker as Asynchronous Endpoint
- Buzz: Transcribe and translate audio offline on your personal computer
- Voxos.ai – An Open-Source Desktop Voice Assistant
- Jarvis: A Voice Virtual Assistant in Python (OpenAI, ElevenLabs, Deepgram)
- Whisper: Nvidia RTX 4090 vs. M1 Pro with MLX
Index
What are some of the best open-source Whisper projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | PaddleSpeech | 10,120 |
2 | buzz | 9,869 |
3 | whisperX | 8,965 |
4 | faster-whisper | 8,723 |
5 | distil-whisper | 3,125 |
6 | FunASR | 3,299 |
7 | chatgpt-telegram-bot | 2,686 |
8 | inference | 2,512 |
9 | whisper-timestamped | 1,501 |
10 | yt-whisper | 1,313 |
11 | auto-subtitle | 1,173 |
12 | WhisperLive | 1,180 |
13 | subsai | 1,051 |
14 | whisper.api | 838 |
15 | truss | 833 |
16 | whisper-standalone-win | 757 |
17 | whisper-playground | 759 |
18 | whisper-ctranslate2 | 743 |
19 | Whisper-WebUI | 670 |
20 | AI-Waifu-Vtuber | 626 |
21 | whisper_mic | 615 |
22 | agentchain | 563 |
23 | pyannote-whisper | 414 |