Top 22 Python voice-recognition Projects

PaddleSpeech

6 10,120 7.6 Python

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

PaddlePaddle/PaddleSpeech

speechbrain

26 7,869 9.8 Python

A PyTorch-based Speech Toolkit

Project mention: SpeechBrain 1.0: A free and open-source AI toolkit for all things speech | news.ycombinator.com | 2024-02-28

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
silero-vad

10 2,829 6.9 Python

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Project mention: New models and developer products announced at OpenAI DevDay | news.ycombinator.com | 2023-11-06

>How do you detect speech starting and stopping?
https://github.com/snakers4/silero-vad

WhisperLive

4 1,180 9.4 Python

A nearly-live implementation of OpenAI's Whisper.

Project mention: Show HN: WhisperFusion – Ultra-low latency conversations with an AI chatbot | news.ycombinator.com | 2024-01-29

Everything runs locally, we use:
- WhisperLive for the transcription - https://github.com/collabora/WhisperLive

Python-ai-assistant

1 853 0.0 Python

Python AI assistant 🧠

Project mention: Jarvis: A Voice Virtual Assistant in Python (OpenAI, ElevenLabs, Deepgram) | news.ycombinator.com | 2023-12-18

There is another one (Also Jarvis) that's been around for a while and is more useful, wonder if they can combine forces? https://github.com/ggeop/Python-ai-assistant
Not sure if anyone has noticed but OpenAI now has a mobile app (I've been using the PWA all this time) and the voice assistant on there is really strong. Sounds good, fast, and seems to even run a pass on my voice before it submits the query.

mycroft-precise

3 793 0.0 Python

A lightweight, simple-to-use, RNN wake word listener
rhino

5 591 8.9 Python

On-device Speech-to-Intent engine powered by deep learning (by Picovoice)
WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
speech-to-text-benchmark

5 586 3.8 Python

speech to text benchmark framework

Project mention: Speech-to-Text Benchmark | news.ycombinator.com | 2024-01-16

cheetah

5 552 8.3 Python

On-device streaming speech-to-text engine powered by deep learning (by Picovoice)
picovoice

13 497 8.9 Python

On-device voice assistant platform powered by deep learning
leopard

15 406 8.6 Python

On-device speech-to-text engine powered by deep learning
Caster

7 328 2.9 Python

Dragonfly-Based Voice Programming and Accessibility Toolkit
gpt-voice-conversation-chatbot

4 283 5.4 Python

Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.
LiveWhisper

2 283 0.0 Python

A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
chatgpt-voice-assistant

5 105 5.7 Python

A chatbot that uses speech to text for input, sends the text to OpenAI's ChatGPT text generation model and speaks the response using text to speech.

Project mention: ChatGPT Voice Assistant | news.ycombinator.com | 2023-06-13

M.I.L.E.S

1 65 9.1 Python

M.I.L.E.S, a GPT-4-Turbo voice assistant, self-adapts its prompts and AI model, can play any Spotify song, adjusts system and Spotify volume, performs calculations, browses the web and internet, searches global weather, delivers date and time, autonomously chooses and retains long-term memories. Available for macOS and Windows.

Project mention: Show HN: I made M.I.L.E.S, the worlds best voice assistant | news.ycombinator.com | 2024-01-06

gpt_chatbot

1 53 6.8 Python

This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses Pinecone to store long term information and retrieves it to create context. API keys for OpenAI and Pinecone required. Tested on Windows
octopus

2 34 6.5 Python

On-device Speech-to-Index engine powered by deep learning (by Picovoice)
autosrt

2 23 4.8 Python

Offline srt producer gui with whisper.cpp
Universal-MacAssistant

2 9 8.3 Python

Advanced Personal Assistant created for macOS that utilises AppleScripts, Siri and more.

Project mention: Your AI MacOS Voice Assistant | /r/coolgithubprojects | 2023-07-03

ameli-ai

1 6 0.0 Python

Ameli, a cross platform personal voice assistant for Windows/Linux/MacOS/Android/iOS
hollow-knight-voice-commands

1 1 3.9 Python

A fun little python tool to play Hollow Knight with only voice commands
SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python voice-recognition related posts

Speech-to-Text Benchmark
1 project | news.ycombinator.com | 16 Jan 2024
New models and developer products announced at OpenAI DevDay
8 projects | news.ycombinator.com | 6 Nov 2023
[Discussion] Video Translation Task
2 projects | /r/MachineLearning | 13 Jul 2023
Your AI MacOS Voice Assistant
1 project | /r/coolgithubprojects | 3 Jul 2023
Apollo dev posts backend code to Git to disprove Reddit’s claims of scrapping and inefficiency
4 projects | /r/webdev | 9 Jun 2023
I made a simple gui to use whisper.cpp in python.
2 projects | /r/Python | 13 Apr 2023
Automatic Speech Recognition with AWS Lambda and Leopard
2 projects | dev.to | 1 Feb 2023
A note from our sponsor - SaaSHub
www.saashub.com | 26 Apr 2024

SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source voice-recognition projects in Python? This list will help you:

	Project	Stars
1	PaddleSpeech	10,120
2	speechbrain	7,869
3	silero-vad	2,829
4	WhisperLive	1,180
5	Python-ai-assistant	853
6	mycroft-precise	793
7	rhino	591
8	speech-to-text-benchmark	586
9	cheetah	552
10	picovoice	497
11	leopard	406
12	Caster	328
13	gpt-voice-conversation-chatbot	283
14	LiveWhisper	283
15	chatgpt-voice-assistant	105
16	M.I.L.E.S	65
17	gpt_chatbot	53
18	octopus	34
19	autosrt	23
20	Universal-MacAssistant	9
21	ameli-ai	6
22	hollow-knight-voice-commands	1