vosk-server
ovos-stt-plugin-vosk
vosk-server | ovos-stt-plugin-vosk | |
---|---|---|
4 | 1 | |
837 | 14 | |
1.1% | - | |
5.5 | 2.9 | |
24 days ago | 4 months ago | |
Python | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
vosk-server
- Self-hosted audio transcription?
-
Open Source ASR with user-specific custom vocabularies?
Through my research, the most promising real-time transcription options appear to be Vosk or Kaldi Gstreamer. Iβve set them both up & they appear to work well for general transcription, but Iβm not sure how to handle the user-specific custom vocabularies.
- Voice2json: Offline speech and intent recognition on Linux
- Connecting vosk python model with react
ovos-stt-plugin-vosk
-
Slow responses from picroft
for STT there is streaming support which should improve things, google cloud is supported in mycroft-core, but there are some plugins out there that support streaming like vosk
What are some alternatives?
vosk-api - Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
wenet - Production First and Production Ready End-to-End Speech Recognition Toolkit
common-voice - Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
vosk-browser - A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
kaldi-gstreamer-server - Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
pykaldi - A Python wrapper for Kaldi
TTS - πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
werpy - ππ¦ Rapidly calculate and analyze the Word Error Rate (WER) with this powerful yet lightweight Python package.
julius - Open-Source Large Vocabulary Continuous Speech Recognition Engine
mock-backend - A Flask personal backend alternative for running your own version of https://home.mycroft.ai
vosk-android-demo - Offline speech recognition for Android with Vosk library.
elograf - Utility for launching and configuring nerd-dictation