pocketsphinx-python
allosaurus
pocketsphinx-python | allosaurus | |
---|---|---|
1 | 2 | |
367 | 507 | |
- | - | |
0.0 | 0.0 | |
10 months ago | 8 days ago | |
Python | Python | |
GNU General Public License v3.0 or later | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pocketsphinx-python
allosaurus
-
Complete table of all IPA vowels' formant frequencies
Thank you for a great reply! If I catch your drift, how does this bode with phonemic transcription? Suppose we have an automatic phone recognizer tool such as Allosaurus.
-
Python and Speech recognition
And for phonemes recognition: - this looks like it could be useful (I'm sure you won't mind if it's "phones" instead of "phonemes"): https://github.com/xinjli/allosaurus - about using standard speech recognition tools: https://cmusphinx.github.io/wiki/phonemerecognition/
What are some alternatives?
kaldi-active-grammar - Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
SpeechRecognition - Speech recognition module for Python, supporting several engines and APIs, online and offline.
pyaudio - http://people.csail.mit.edu/hubert/pyaudio/
common-voice - Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
PaddleSpeech - Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
edgedict - Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
TTS - πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
forced-alignment-tools - A collection of links and notes on forced alignment tools
SpeechLoop - Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
lingvo - Lingvo