voice-recognition

Top 23 voice-recognition Open-Source Projects

  • PaddleSpeech

    Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

    Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

    PaddlePaddle/PaddleSpeech

  • speechbrain

    A PyTorch-based Speech Toolkit

    Project mention: SpeechBrain 1.0: A free and open-source AI toolkit for all things speech | news.ycombinator.com | 2024-02-28
  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

  • vosk-api

    Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

    Project mention: Apollo dev posts backend code to Git to disprove Reddit’s claims of scrapping and inefficiency | /r/webdev | 2023-06-09
  • silero-vad

    Silero VAD: pre-trained enterprise-grade Voice Activity Detector

    Project mention: New models and developer products announced at OpenAI DevDay | news.ycombinator.com | 2023-11-06

    >How do you detect speech starting and stopping?

    https://github.com/snakers4/silero-vad

  • STT

    🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

    Project mention: Rest in Peas: The Unrecognized Death of Speech Recognition (2010) | news.ycombinator.com | 2023-05-04

    What has happened since then? I know Common Voice has come and gone https://en.wikipedia.org/wiki/Common_Voice https://github.com/coqui-ai/STT

    And I've seen some neural approaches too

    No idea where to look for comparisons though.

  • voice

    :microphone: React Native Voice Recognition library for iOS and Android (Online and Offline Support) (by react-native-voice)

  • voice_datasets

    🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

    Project mention: Where to begin - ML model for speech recognition | /r/learnmachinelearning | 2023-04-04

    - https://github.com/jim-schwoebel/voice_datasets

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

  • WhisperLive

    A nearly-live implementation of OpenAI's Whisper.

    Project mention: Show HN: WhisperFusion – Ultra-low latency conversations with an AI chatbot | news.ycombinator.com | 2024-01-29

    Everything runs locally, we use:

    - WhisperLive for the transcription - https://github.com/collabora/WhisperLive

  • Python-ai-assistant

    Python AI assistant 🧠

    Project mention: Jarvis: A Voice Virtual Assistant in Python (OpenAI, ElevenLabs, Deepgram) | news.ycombinator.com | 2023-12-18

    There is another one (Also Jarvis) that's been around for a while and is more useful, wonder if they can combine forces? https://github.com/ggeop/Python-ai-assistant

    Not sure if anyone has noticed but OpenAI now has a mobile app (I've been using the PWA all this time) and the voice assistant on there is really strong. Sounds good, fast, and seems to even run a pass on my voice before it submits the query.

  • mycroft-precise

    A lightweight, simple-to-use, RNN wake word listener

  • EDDiscovery

    Captains log and 3d star map for Elite Dangerous

    Project mention: What are your must-have plugins/resources for ED? | /r/EliteDangerous | 2023-11-29

    EDDiscovery

  • rhino

    On-device Speech-to-Intent engine powered by deep learning (by Picovoice)

  • speech-to-text-benchmark

    speech to text benchmark framework

    Project mention: Speech-to-Text Benchmark | news.ycombinator.com | 2024-01-16
  • cheetah

    On-device streaming speech-to-text engine powered by deep learning (by Picovoice)

  • Voice Overlay iOS

    🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

  • picovoice

    On-device voice assistant platform powered by deep learning

  • SwiftSpeech

    A speech recognition framework designed for SwiftUI.

  • leopard

    On-device speech-to-text engine powered by deep learning

  • vosk

    VOSK Speech Recognition Toolkit

  • Caster

    Dragonfly-Based Voice Programming and Accessibility Toolkit

  • FDSoundActivatedRecorder

    Start recording when the user speaks

  • gpt-voice-conversation-chatbot

    Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.

  • LiveWhisper

    A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2024-02-28.

voice-recognition related posts

Index

What are some of the best open-source voice-recognition projects? This list will help you:

Project Stars
1 PaddleSpeech 9,957
2 speechbrain 7,694
3 vosk-api 6,894
4 silero-vad 2,666
5 STT 2,092
6 voice 1,694
7 voice_datasets 1,502
8 WhisperLive 994
9 Python-ai-assistant 841
10 mycroft-precise 785
11 EDDiscovery 744
12 rhino 588
13 speech-to-text-benchmark 581
14 cheetah 546
15 Voice Overlay iOS 531
16 picovoice 484
17 SwiftSpeech 404
18 leopard 401
19 vosk 348
20 Caster 327
21 FDSoundActivatedRecorder 284
22 gpt-voice-conversation-chatbot 277
23 LiveWhisper 271
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com