SaaSHub helps you find the best software and product alternatives Learn more →
Top 22 Python voice-recognition Projects
-
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
gpt-voice-conversation-chatbot
Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.
-
LiveWhisper
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
-
chatgpt-voice-assistant
A chatbot that uses speech to text for input, sends the text to OpenAI's ChatGPT text generation model and speaks the response using text to speech.
-
M.I.L.E.S
M.I.L.E.S, a GPT-4-Turbo voice assistant, self-adapts its prompts and AI model, can play any Spotify song, adjusts system and Spotify volume, performs calculations, browses the web and internet, searches global weather, delivers date and time, autonomously chooses and retains long-term memories. Available for macOS and Windows.
-
gpt_chatbot
This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses Pinecone to store long term information and retrieves it to create context. API keys for OpenAI and Pinecone required. Tested on Windows
-
Universal-MacAssistant
Advanced Personal Assistant created for macOS that utilises AppleScripts, Siri and more.
-
hollow-knight-voice-commands
A fun little python tool to play Hollow Knight with only voice commands
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
PaddlePaddle/PaddleSpeech
Project mention: SpeechBrain 1.0: A free and open-source AI toolkit for all things speech | news.ycombinator.com | 2024-02-28
Project mention: New models and developer products announced at OpenAI DevDay | news.ycombinator.com | 2023-11-06>How do you detect speech starting and stopping?
https://github.com/snakers4/silero-vad
Project mention: Show HN: WhisperFusion – Ultra-low latency conversations with an AI chatbot | news.ycombinator.com | 2024-01-29Everything runs locally, we use:
- WhisperLive for the transcription - https://github.com/collabora/WhisperLive
Project mention: Jarvis: A Voice Virtual Assistant in Python (OpenAI, ElevenLabs, Deepgram) | news.ycombinator.com | 2023-12-18There is another one (Also Jarvis) that's been around for a while and is more useful, wonder if they can combine forces? https://github.com/ggeop/Python-ai-assistant
Not sure if anyone has noticed but OpenAI now has a mobile app (I've been using the PWA all this time) and the voice assistant on there is really strong. Sounds good, fast, and seems to even run a pass on my voice before it submits the query.
Project mention: Show HN: I made M.I.L.E.S, the worlds best voice assistant | news.ycombinator.com | 2024-01-06
Python voice-recognition related posts
- Speech-to-Text Benchmark
- New models and developer products announced at OpenAI DevDay
- [Discussion] Video Translation Task
- Your AI MacOS Voice Assistant
- Apollo dev posts backend code to Git to disprove Reddit’s claims of scrapping and inefficiency
- I made a simple gui to use whisper.cpp in python.
- Automatic Speech Recognition with AWS Lambda and Leopard
-
A note from our sponsor - SaaSHub
www.saashub.com | 26 Apr 2024
Index
What are some of the best open-source voice-recognition projects in Python? This list will help you:
Project | Stars | |
---|---|---|
1 | PaddleSpeech | 10,120 |
2 | speechbrain | 7,869 |
3 | silero-vad | 2,829 |
4 | WhisperLive | 1,180 |
5 | Python-ai-assistant | 853 |
6 | mycroft-precise | 793 |
7 | rhino | 591 |
8 | speech-to-text-benchmark | 586 |
9 | cheetah | 552 |
10 | picovoice | 497 |
11 | leopard | 406 |
12 | Caster | 328 |
13 | gpt-voice-conversation-chatbot | 283 |
14 | LiveWhisper | 283 |
15 | chatgpt-voice-assistant | 105 |
16 | M.I.L.E.S | 65 |
17 | gpt_chatbot | 53 |
18 | octopus | 34 |
19 | autosrt | 23 |
20 | Universal-MacAssistant | 9 |
21 | ameli-ai | 6 |
22 | hollow-knight-voice-commands | 1 |
Sponsored