awesome-speech-recognition-speech-synthesis-papers
lingvo
Our great sponsors
awesome-speech-recognition-speech-synthesis-papers | lingvo | |
---|---|---|
- | 1 | |
2,870 | 2,780 | |
- | 0.2% | |
3.5 | 8.7 | |
6 months ago | 16 days ago | |
Python | ||
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
awesome-speech-recognition-speech-synthesis-papers
We haven't tracked posts mentioning awesome-speech-recognition-speech-synthesis-papers yet.
Tracking mentions began in Dec 2020.
lingvo
-
Voice assistant that can be taught how to swear (Part 1)
To calculate the Word Error Rate I took a python script from the tensorflow/lingvo project and rewrote it in js. In essence, it is just a simple solution of the Edit Distance task, in addition to error calculation for each of the three types: deletion, insertion, and replacement. In the end, I did not the most intelligent method of comparing texts, and yet it was sufficient enough to later on add parameters to queries to Speech-to-Tex.
What are some alternatives?
pytorch-seq2seq - Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
TTS-Voice-Wizard - Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)
CodeSearchNet - Datasets, tools, and benchmarks for representation learning of code.
seq2seq - A general-purpose encoder-decoder framework for Tensorflow
Best_AI_paper_2020 - A curated list of the latest breakthroughs in AI by release date with a clear video explanation, link to a more in-depth article, andĀ code
allosaurus - Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Awesome-Efficient-LLM - A curated list for Efficient Large Language Models
Mava - š¦ A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
nnsvs-english-support - The Original Support for English NNSVS Dataset Creation
deepspeech-playbook - A crash course for training speech recognition models using DeepSpeech.
pocketsphinx-python - Python interface to CMU Sphinxbase and Pocketsphinx libraries