pocketsphinx
C_to_Python_translator
pocketsphinx | C_to_Python_translator | |
---|---|---|
6 | 1 | |
3,745 | 27 | |
0.9% | - | |
7.4 | 0.0 | |
about 1 month ago | almost 3 years ago | |
C | C | |
GNU General Public License v3.0 or later | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pocketsphinx
- [Discussion] Looking for an Open-Source Speech to Text model (english) that captures filler words, pauses and also records timestamps for each word.
-
I Created A Web Speech API NPM Package Called SpeechKit
There are espeak-ng https://github.com/espeak-ng/espeak-ng and pocketsphinx https://github.com/cmusphinx/pocketsphinx which can be used locally without making external requests.
-
"Why not just transcribe the audio?" I thought
And so I installed PocketSphinx, "one of Carnegie Mellon University's open source large vocabulary, speaker-independent continuous speech recognition engines."
-
How to train large deep learning models as a startup
- https://github.com/cmusphinx/pocketsphinx
This avoids having to stream audio 24x7 to a cloud model which would be super expensive. This being said, I'm pretty sure what the Alexa does, for example, is send any positive wake word to a cloud model (that is bigger and more accurate) to verify the prediction of the local wake word detection model AFAIK.
- Speech recognition library for financial markets
-
Speech recognition
PocketSphinx is generally regarded among voice assistant communities as a less reliable, but straight OOTB, alternative to a robust listener. It's a good solution when you want multiple hotwords (or just aren't in a position to train even one word.)
C_to_Python_translator
What are some alternatives?
vosk - VOSK Speech Recognition Toolkit
snowboy - Future versions with model training module will be maintained through a forked version here: https://github.com/seasalt-ai/snowboy
vosk-api - Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Spoken-Keyword-Spotting - In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keyword Spotting task.
localcroft - Bits for locally-served Mycroft instances
determined - Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
xla - Enabling PyTorch on XLA Devices (e.g. Google TPU)
speech-kit - Simplifying the Speech Synthesis and Speech Recognition engines for Javascript. Listen for commands and perform callback actions, make the browser speak and transcribe your speech!
espeak-ng - eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
GoogleNetworkSpeechSynthesis - Google's Network Speech Synthesis: Bring your own Google API key and proxy
AudioWorkletStream - fetch() => ReadableStream => AudioWorklet