Python voice-recognition

Open-source Python projects categorized as voice-recognition

Top 22 Python voice-recognition Projects

  • PaddleSpeech

    Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

  • Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

    PaddlePaddle/PaddleSpeech

  • speechbrain

    A PyTorch-based Speech Toolkit

  • Project mention: SpeechBrain 1.0: A free and open-source AI toolkit for all things speech | news.ycombinator.com | 2024-02-28
  • silero-vad

    Silero VAD: pre-trained enterprise-grade Voice Activity Detector

  • Project mention: New models and developer products announced at OpenAI DevDay | news.ycombinator.com | 2023-11-06

    >How do you detect speech starting and stopping?

    https://github.com/snakers4/silero-vad

  • WhisperLive

    A nearly-live implementation of OpenAI's Whisper.

  • Project mention: Show HN: WhisperFusion – Ultra-low latency conversations with an AI chatbot | news.ycombinator.com | 2024-01-29

    Everything runs locally, we use:

    - WhisperLive for the transcription - https://github.com/collabora/WhisperLive

  • Python-ai-assistant

    Python AI assistant 🧠

  • Project mention: Jarvis: A Voice Virtual Assistant in Python (OpenAI, ElevenLabs, Deepgram) | news.ycombinator.com | 2023-12-18

    There is another one (Also Jarvis) that's been around for a while and is more useful, wonder if they can combine forces? https://github.com/ggeop/Python-ai-assistant

    Not sure if anyone has noticed but OpenAI now has a mobile app (I've been using the PWA all this time) and the voice assistant on there is really strong. Sounds good, fast, and seems to even run a pass on my voice before it submits the query.

  • mycroft-precise

    A lightweight, simple-to-use, RNN wake word listener

  • rhino

    On-device Speech-to-Intent engine powered by deep learning (by Picovoice)

  • speech-to-text-benchmark

    speech to text benchmark framework

  • Project mention: Speech-to-Text Benchmark | news.ycombinator.com | 2024-01-16
  • cheetah

    On-device streaming speech-to-text engine powered by deep learning (by Picovoice)

  • picovoice

    On-device voice assistant platform powered by deep learning

  • leopard

    On-device speech-to-text engine powered by deep learning

  • Caster

    Dragonfly-Based Voice Programming and Accessibility Toolkit

  • gpt-voice-conversation-chatbot

    Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.

  • LiveWhisper

    A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.

  • chatgpt-voice-assistant

    A chatbot that uses speech to text for input, sends the text to OpenAI's ChatGPT text generation model and speaks the response using text to speech.

  • Project mention: ChatGPT Voice Assistant | news.ycombinator.com | 2023-06-13
  • M.I.L.E.S

    M.I.L.E.S, a GPT-4-Turbo voice assistant, self-adapts its prompts and AI model, can play any Spotify song, adjusts system and Spotify volume, performs calculations, browses the web and internet, searches global weather, delivers date and time, autonomously chooses and retains long-term memories. Available for macOS and Windows.

  • Project mention: Show HN: I made M.I.L.E.S, the worlds best voice assistant | news.ycombinator.com | 2024-01-06
  • gpt_chatbot

    This chatbot lets you use your microphone to communicate with GPT-4. It uses OpenAI text-to-speech to respond with a voice, and Pinecone to store long-term information and retrieve it as context. API keys for OpenAI and Pinecone are required. Tested on Windows.

  • octopus

    On-device Speech-to-Index engine powered by deep learning (by Picovoice)

  • autosrt

    Offline srt producer gui with whisper.cpp

  • Universal-MacAssistant

    Advanced Personal Assistant created for macOS that utilises AppleScripts, Siri and more.

  • Project mention: Your AI MacOS Voice Assistant | /r/coolgithubprojects | 2023-07-03
  • ameli-ai

    Ameli, a cross platform personal voice assistant for Windows/Linux/MacOS/Android/iOS

  • hollow-knight-voice-commands

    A fun little python tool to play Hollow Knight with only voice commands

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Index

What are some of the best open-source voice-recognition projects in Python? This list will help you:

 #   Project                          Stars
 1   PaddleSpeech                     10,120
 2   speechbrain                      7,869
 3   silero-vad                       2,829
 4   WhisperLive                      1,180
 5   Python-ai-assistant              853
 6   mycroft-precise                  793
 7   rhino                            591
 8   speech-to-text-benchmark         586
 9   cheetah                          552
 10  picovoice                        497
 11  leopard                          406
 12  Caster                           328
 13  gpt-voice-conversation-chatbot   283
 14  LiveWhisper                      283
 15  chatgpt-voice-assistant          105
 16  M.I.L.E.S                        65
 17  gpt_chatbot                      53
 18  octopus                          34
 19  autosrt                          23
 20  Universal-MacAssistant           9
 21  ameli-ai                         6
 22  hollow-knight-voice-commands     1
