Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 23 Python speech-to-text Projects
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
SpeechRecognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
-
pyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并添加配音
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
-
whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
-
whisper-playground
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
-
whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
-
AutoSub
A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui (by abhirooptalasila)
-
edenai-apis
Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
-
kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: Easy video transcription and subtitling with Whisper, FFmpeg, and Python | news.ycombinator.com | 2024-04-06It uses this, which does support diarization: https://github.com/m-bain/whisperX
For our real-time STT needs, we'll employ a fantastic library called faster-whisper.
Start and Stop Listening Example
Project mention: SpeechBrain 1.0: A free and open-source AI toolkit for all things speech | news.ycombinator.com | 2024-02-28
Project mention: How I converted a podcast into a knowledge base using Orama search and OpenAI whisper and Astro | dev.to | 2023-05-23
Project mention: Show HN: AI Dub Tool I Made to Watch Foreign Language Videos with My 7-Year-Old | news.ycombinator.com | 2024-02-28Yes. But Whisper's word-level timings are actually quite inaccurate out of the box. There are some Python libraries that mitigate that. I tested several of them. whisper-timestamped seems to be the best one. [0]
[0] https://github.com/linto-ai/whisper-timestamped
On the other hand, if you need subtitles for a movie that doesn't have some. There are some automated solutions like Whisper that can do a very decent job in most cases : https://github.com/Purfview/whisper-standalone-win
Project mention: Firefox slow to load YouTube? Just another front in Google's war on ad blockers | news.ycombinator.com | 2023-12-12Much better, actually. Try the large-v3 model, it's great. I use it via whisper-ctranslate2 which is a faster implementation.
https://github.com/Softcatala/whisper-ctranslate2
Project mention: We're Building an Open-Source LLM/AI API Wrapper: Here's Why | news.ycombinator.com | 2023-08-28HackerNoon featured our latest article in the "Future of AI" category
We explain how Eden AI contributes to the AI ecosystem in structuring AI and LLM APIs by creating the most accomplished Open-Source wrapper possible.
You can support us in reaching 1000 stars on Github here: https://github.com/edenai/edenai-apis
Project mention: Ask HN: How do you get started with adding voice commands to a computer system? | news.ycombinator.com | 2023-11-21https://github.com/dictation-toolbox/dragonfly
https://github.com/daanzu/kaldi-active-grammar
Python speech-to-text related posts
- Easy video transcription and subtitling with Whisper, FFmpeg, and Python
- SOTA ASR Tooling: Long-Form Transcription
- Deploying whisperX on AWS SageMaker as Asynchronous Endpoint
- LLMs on your local Computer (Part 1)
- SpeechBrain 1.0: A free and open-source AI toolkit for all things speech
- Show HN: AI Dub Tool I Made to Watch Foreign Language Videos with My 7-Year-Old
- Voxos.ai – An Open-Source Desktop Voice Assistant
-
A note from our sponsor - InfluxDB
www.influxdata.com | 25 Apr 2024
Index
What are some of the best open-source speech-to-text projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | whisperX | 8,869 |
2 | faster-whisper | 8,723 |
3 | SpeechRecognition | 8,040 |
4 | speechbrain | 7,869 |
5 | pyvideotrans | 5,556 |
6 | lingvo | 2,780 |
7 | kalliope | 1,696 |
8 | whisper-asr-webservice | 1,617 |
9 | whisper-timestamped | 1,501 |
10 | Dragonfire | 1,372 |
11 | dc_tts | 1,150 |
12 | nonoCAPTCHA | 897 |
13 | whisper-standalone-win | 757 |
14 | whisper-playground | 756 |
15 | whisper-ctranslate2 | 743 |
16 | AI-Waifu-Vtuber | 626 |
17 | whisper_mic | 615 |
18 | speech-to-text-benchmark | 585 |
19 | AutoSub | 552 |
20 | cheetah | 552 |
21 | leopard | 405 |
22 | edenai-apis | 357 |
23 | kaldi-active-grammar | 329 |
Sponsored