Top 23 Python speech-to-text Projects

whisperX

24 8,869 8.7 Python

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Project mention: Easy video transcription and subtitling with Whisper, FFmpeg, and Python | news.ycombinator.com | 2024-04-06

It uses this, which does support diarization: https://github.com/m-bain/whisperX

faster-whisper

22 8,723 8.3 Python

Faster Whisper transcription with CTranslate2

Project mention: Using Groq to Build a Real-Time Language Translation App | dev.to | 2024-04-05

For our real-time STT needs, we'll employ a fantastic library called faster-whisper.

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
SpeechRecognition

16 8,040 8.7 Python

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Project mention: help with script (beginner) | /r/learnpython | 2023-12-07

Start and Stop Listening Example

speechbrain

26 7,869 9.8 Python

A PyTorch-based Speech Toolkit

Project mention: SpeechBrain 1.0: A free and open-source AI toolkit for all things speech | news.ycombinator.com | 2024-02-28

pyvideotrans

1 5,556 9.7 Python

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，并添加配音

Project mention: FLaNK Stack Weekly 06 Nov 2023 | dev.to | 2023-11-06

lingvo

1 2,780 8.7 Python

Lingvo
kalliope

4 1,696 0.0 Python

Kalliope is a framework that will help you to create your own personal assistant.
InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
whisper-asr-webservice

11 1,617 7.7 Python

OpenAI Whisper ASR Webservice API

Project mention: How I converted a podcast into a knowledge base using Orama search and OpenAI whisper and Astro | dev.to | 2023-05-23

whisper-timestamped

2 1,501 8.3 Python

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Project mention: Show HN: AI Dub Tool I Made to Watch Foreign Language Videos with My 7-Year-Old | news.ycombinator.com | 2024-02-28

Yes. But Whisper's word-level timings are actually quite inaccurate out of the box. There are some Python libraries that mitigate that. I tested several of them. whisper-timestamped seems to be the best one. [0]
[0] https://github.com/linto-ai/whisper-timestamped

Dragonfire

2 1,372 0.0 Python

the open-source virtual assistant for Ubuntu based Linux distributions
dc_tts

4 1,150 0.0 Python

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
nonoCAPTCHA

1 897 0.0 Python

An asynchronized Python library to automate solving ReCAPTCHA v2 using audio
whisper-standalone-win

3 757 8.6 Python

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

Project mention: Question : is this a movie only tracker? | /r/Karagarga | 2023-07-03

On the other hand, if you need subtitles for a movie that doesn't have some. There are some automated solutions like Whisper that can do a very decent job in most cases : https://github.com/Purfview/whisper-standalone-win

whisper-playground

7 756 6.6 Python

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
whisper-ctranslate2

3 743 8.5 Python

Whisper command line client compatible with original OpenAI client based on CTranslate2.

Project mention: Firefox slow to load YouTube? Just another front in Google's war on ad blockers | news.ycombinator.com | 2023-12-12

Much better, actually. Try the large-v3 model, it's great. I use it via whisper-ctranslate2 which is a faster implementation.
https://github.com/Softcatala/whisper-ctranslate2

AI-Waifu-Vtuber

1 626 5.6 Python

AI Vtuber for Streaming on Youtube/Twitch
whisper_mic

2 615 7.3 Python

Project that allows one to use a microphone with OpenAI whisper.
speech-to-text-benchmark

5 585 3.8 Python

speech to text benchmark framework

Project mention: Speech-to-Text Benchmark | news.ycombinator.com | 2024-01-16

AutoSub

2 552 4.1 Python

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui (by abhirooptalasila)
cheetah

5 552 8.3 Python

On-device streaming speech-to-text engine powered by deep learning (by Picovoice)
leopard

15 405 8.7 Python

On-device speech-to-text engine powered by deep learning
edenai-apis

13 357 9.8 Python

Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines

Project mention: We're Building an Open-Source LLM/AI API Wrapper: Here's Why | news.ycombinator.com | 2023-08-28

HackerNoon featured our latest article in the "Future of AI" category
We explain how Eden AI contributes to the AI ecosystem in structuring AI and LLM APIs by creating the most accomplished Open-Source wrapper possible.
You can support us in reaching 1000 stars on Github here: https://github.com/edenai/edenai-apis

kaldi-active-grammar

10 329 0.0 Python

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Project mention: Ask HN: How do you get started with adding voice commands to a computer system? | news.ycombinator.com | 2023-11-21

https://github.com/dictation-toolbox/dragonfly
https://github.com/daanzu/kaldi-active-grammar

SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python speech-to-text related posts

Easy video transcription and subtitling with Whisper, FFmpeg, and Python
1 project | news.ycombinator.com | 6 Apr 2024
SOTA ASR Tooling: Long-Form Transcription
1 project | news.ycombinator.com | 31 Mar 2024
Deploying whisperX on AWS SageMaker as Asynchronous Endpoint
2 projects | dev.to | 31 Mar 2024
LLMs on your local Computer (Part 1)
7 projects | dev.to | 11 Mar 2024
SpeechBrain 1.0: A free and open-source AI toolkit for all things speech
1 project | news.ycombinator.com | 28 Feb 2024
Show HN: AI Dub Tool I Made to Watch Foreign Language Videos with My 7-Year-Old
1 project | news.ycombinator.com | 28 Feb 2024
Voxos.ai – An Open-Source Desktop Voice Assistant
7 projects | news.ycombinator.com | 19 Jan 2024
A note from our sponsor - InfluxDB
www.influxdata.com | 25 Apr 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source speech-to-text projects in Python? This list will help you:

	Project	Stars
1	whisperX	8,869
2	faster-whisper	8,723
3	SpeechRecognition	8,040
4	speechbrain	7,869
5	pyvideotrans	5,556
6	lingvo	2,780
7	kalliope	1,696
8	whisper-asr-webservice	1,617
9	whisper-timestamped	1,501
10	Dragonfire	1,372
11	dc_tts	1,150
12	nonoCAPTCHA	897
13	whisper-standalone-win	757
14	whisper-playground	756
15	whisper-ctranslate2	743
16	AI-Waifu-Vtuber	626
17	whisper_mic	615
18	speech-to-text-benchmark	585
19	AutoSub	552
20	cheetah	552
21	leopard	405
22	edenai-apis	357
23	kaldi-active-grammar	329