Python speech-to-text

Open-source Python projects categorized as speech-to-text

Top 23 Python speech-to-text Projects

  • whisperX

    WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

  • Project mention: Easy video transcription and subtitling with Whisper, FFmpeg, and Python | news.ycombinator.com | 2024-04-06

    It uses this, which does support diarization: https://github.com/m-bain/whisperX

  • faster-whisper

    Faster Whisper transcription with CTranslate2

  • Project mention: Using Groq to Build a Real-Time Language Translation App | dev.to | 2024-04-05

    For our real-time STT needs, we'll employ a fantastic library called faster-whisper.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • SpeechRecognition

    Speech recognition module for Python, supporting several engines and APIs, online and offline.

  • Project mention: help with script (beginner) | /r/learnpython | 2023-12-07

    Start and Stop Listening Example

  • speechbrain

    A PyTorch-based Speech Toolkit

  • Project mention: SpeechBrain 1.0: A free and open-source AI toolkit for all things speech | news.ycombinator.com | 2024-02-28
  • pyvideotrans

    Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并添加配音

  • Project mention: FLaNK Stack Weekly 06 Nov 2023 | dev.to | 2023-11-06
  • lingvo

    Lingvo

  • kalliope

    Kalliope is a framework that will help you to create your own personal assistant.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • whisper-asr-webservice

    OpenAI Whisper ASR Webservice API

  • Project mention: How I converted a podcast into a knowledge base using Orama search and OpenAI whisper and Astro | dev.to | 2023-05-23
  • whisper-timestamped

    Multilingual Automatic Speech Recognition with word-level timestamps and confidence

  • Project mention: Show HN: AI Dub Tool I Made to Watch Foreign Language Videos with My 7-Year-Old | news.ycombinator.com | 2024-02-28

    Yes. But Whisper's word-level timings are actually quite inaccurate out of the box. There are some Python libraries that mitigate that. I tested several of them. whisper-timestamped seems to be the best one. [0]

    [0] https://github.com/linto-ai/whisper-timestamped

  • Dragonfire

    the open-source virtual assistant for Ubuntu based Linux distributions

  • dc_tts

    A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

  • nonoCAPTCHA

    An asynchronized Python library to automate solving ReCAPTCHA v2 using audio

  • whisper-standalone-win

    Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

  • Project mention: Question : is this a movie only tracker? | /r/Karagarga | 2023-07-03

    On the other hand, if you need subtitles for a movie that doesn't have some. There are some automated solutions like Whisper that can do a very decent job in most cases : https://github.com/Purfview/whisper-standalone-win

  • whisper-playground

    Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

  • whisper-ctranslate2

    Whisper command line client compatible with original OpenAI client based on CTranslate2.

  • Project mention: Firefox slow to load YouTube? Just another front in Google's war on ad blockers | news.ycombinator.com | 2023-12-12

    Much better, actually. Try the large-v3 model, it's great. I use it via whisper-ctranslate2 which is a faster implementation.

    https://github.com/Softcatala/whisper-ctranslate2

  • AI-Waifu-Vtuber

    AI Vtuber for Streaming on Youtube/Twitch

  • whisper_mic

    Project that allows one to use a microphone with OpenAI whisper.

  • speech-to-text-benchmark

    speech to text benchmark framework

  • Project mention: Speech-to-Text Benchmark | news.ycombinator.com | 2024-01-16
  • AutoSub

    A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui (by abhirooptalasila)

  • cheetah

    On-device streaming speech-to-text engine powered by deep learning (by Picovoice)

  • leopard

    On-device speech-to-text engine powered by deep learning

  • edenai-apis

    Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines

  • Project mention: We're Building an Open-Source LLM/AI API Wrapper: Here's Why | news.ycombinator.com | 2023-08-28

    HackerNoon featured our latest article in the "Future of AI" category

    We explain how Eden AI contributes to the AI ecosystem in structuring AI and LLM APIs by creating the most accomplished Open-Source wrapper possible.

    You can support us in reaching 1000 stars on Github here: https://github.com/edenai/edenai-apis

  • kaldi-active-grammar

    Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

  • Project mention: Ask HN: How do you get started with adding voice commands to a computer system? | news.ycombinator.com | 2023-11-21

    https://github.com/dictation-toolbox/dragonfly

    https://github.com/daanzu/kaldi-active-grammar

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python speech-to-text related posts

Index

What are some of the best open-source speech-to-text projects in Python? This list will help you:

Project Stars
1 whisperX 8,869
2 faster-whisper 8,723
3 SpeechRecognition 8,040
4 speechbrain 7,869
5 pyvideotrans 5,556
6 lingvo 2,780
7 kalliope 1,696
8 whisper-asr-webservice 1,617
9 whisper-timestamped 1,501
10 Dragonfire 1,372
11 dc_tts 1,150
12 nonoCAPTCHA 897
13 whisper-standalone-win 757
14 whisper-playground 756
15 whisper-ctranslate2 743
16 AI-Waifu-Vtuber 626
17 whisper_mic 615
18 speech-to-text-benchmark 585
19 AutoSub 552
20 cheetah 552
21 leopard 405
22 edenai-apis 357
23 kaldi-active-grammar 329

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com