The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Top 23 voice-recognition Open-Source Projects
-
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
PaddlePaddle/PaddleSpeech
-
Project mention: SpeechBrain 1.0: A free and open-source AI toolkit for all things speech | news.ycombinator.com | 2024-02-28
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Project mention: Apollo dev posts backend code to Git to disprove Reddit’s claims of scrapping and inefficiency | /r/webdev | 2023-06-09 -
Project mention: New models and developer products announced at OpenAI DevDay | news.ycombinator.com | 2023-11-06
>How do you detect speech starting and stopping?
-
STT
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Project mention: Rest in Peas: The Unrecognized Death of Speech Recognition (2010) | news.ycombinator.com | 2023-05-04What has happened since then? I know Common Voice has come and gone https://en.wikipedia.org/wiki/Common_Voice https://github.com/coqui-ai/STT
And I've seen some neural approaches too
No idea where to look for comparisons though.
-
voice
:microphone: React Native Voice Recognition library for iOS and Android (Online and Offline Support) (by react-native-voice)
-
voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Project mention: Where to begin - ML model for speech recognition | /r/learnmachinelearning | 2023-04-04- https://github.com/jim-schwoebel/voice_datasets
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
Project mention: Show HN: WhisperFusion – Ultra-low latency conversations with an AI chatbot | news.ycombinator.com | 2024-01-29
Everything runs locally, we use:
- WhisperLive for the transcription - https://github.com/collabora/WhisperLive
-
Project mention: Jarvis: A Voice Virtual Assistant in Python (OpenAI, ElevenLabs, Deepgram) | news.ycombinator.com | 2023-12-18
There is another one (Also Jarvis) that's been around for a while and is more useful, wonder if they can combine forces? https://github.com/ggeop/Python-ai-assistant
Not sure if anyone has noticed but OpenAI now has a mobile app (I've been using the PWA all this time) and the voice assistant on there is really strong. Sounds good, fast, and seems to even run a pass on my voice before it submits the query.
-
-
EDDiscovery
-
-
-
-
Voice Overlay iOS
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
-
-
-
-
-
-
-
gpt-voice-conversation-chatbot
Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.
-
LiveWhisper
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
voice-recognition related posts
- Speech-to-Text Benchmark
- New models and developer products announced at OpenAI DevDay
- [Discussion] Video Translation Task
- Your AI MacOS Voice Assistant
- Apollo dev posts backend code to Git to disprove Reddit’s claims of scrapping and inefficiency
- Working Vosk model?
- What are the aplications of rust in machine learning ?
-
A note from our sponsor - WorkOS
workos.com | 28 Mar 2024
Index
What are some of the best open-source voice-recognition projects? This list will help you:
Project | Stars | |
---|---|---|
1 | PaddleSpeech | 9,957 |
2 | speechbrain | 7,694 |
3 | vosk-api | 6,894 |
4 | silero-vad | 2,666 |
5 | STT | 2,092 |
6 | voice | 1,694 |
7 | voice_datasets | 1,502 |
8 | WhisperLive | 994 |
9 | Python-ai-assistant | 841 |
10 | mycroft-precise | 785 |
11 | EDDiscovery | 744 |
12 | rhino | 588 |
13 | speech-to-text-benchmark | 581 |
14 | cheetah | 546 |
15 | Voice Overlay iOS | 531 |
16 | picovoice | 484 |
17 | SwiftSpeech | 404 |
18 | leopard | 401 |
19 | vosk | 348 |
20 | Caster | 327 |
21 | FDSoundActivatedRecorder | 284 |
22 | gpt-voice-conversation-chatbot | 277 |
23 | LiveWhisper | 271 |