lingvo
allosaurus
Our great sponsors
lingvo | allosaurus | |
---|---|---|
1 | 2 | |
2,781 | 505 | |
0.2% | - | |
8.7 | 0.0 | |
11 days ago | 10 months ago | |
Python | Python | |
Apache License 2.0 | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
lingvo
-
Voice assistant that can be taught how to swear (Part 1)
To calculate the Word Error Rate I took a python script from the tensorflow/lingvo project and rewrote it in js. In essence, it is just a simple solution of the Edit Distance task, in addition to error calculation for each of the three types: deletion, insertion, and replacement. In the end, I did not the most intelligent method of comparing texts, and yet it was sufficient enough to later on add parameters to queries to Speech-to-Tex.
allosaurus
-
Complete table of all IPA vowels' formant frequencies
Thank you for a great reply! If I catch your drift, how does this bode with phonemic transcription? Suppose we have an automatic phone recognizer tool such as Allosaurus.
-
Python and Speech recognition
And for phonemes recognition: - this looks like it could be useful (I'm sure you won't mind if it's "phones" instead of "phonemes"): https://github.com/xinjli/allosaurus - about using standard speech recognition tools: https://cmusphinx.github.io/wiki/phonemerecognition/
What are some alternatives?
TTS-Voice-Wizard - Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)
SpeechRecognition - Speech recognition module for Python, supporting several engines and APIs, online and offline.
seq2seq - A general-purpose encoder-decoder framework for Tensorflow
common-voice - Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
awesome-speech-recognition-speech-synthesis-papers - Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
edgedict - Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Mava - 🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
pocketsphinx-python - Python interface to CMU Sphinxbase and Pocketsphinx libraries
deepspeech-playbook - A crash course for training speech recognition models using DeepSpeech.
forced-alignment-tools - A collection of links and notes on forced alignment tools
SpeechLoop - Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?