lingvo
deepspeech-playbook
Our great sponsors
lingvo | deepspeech-playbook | |
---|---|---|
1 | 1 | |
2,780 | 23 | |
0.2% | - | |
8.7 | 0.0 | |
14 days ago | almost 3 years ago | |
Python | ||
Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
lingvo
-
Voice assistant that can be taught how to swear (Part 1)
To calculate the Word Error Rate I took a python script from the tensorflow/lingvo project and rewrote it in js. In essence, it is just a simple solution of the Edit Distance task, in addition to error calculation for each of the three types: deletion, insertion, and replacement. In the end, I did not the most intelligent method of comparing texts, and yet it was sufficient enough to later on add parameters to queries to Speech-to-Tex.
deepspeech-playbook
-
DeepSpeech PlayBook v1.0 Alpha now available for feedback and testing
The PlayBook is written in MarkDown, and we welcome Issues and PRs to the GitHub repository.
What are some alternatives?
TTS-Voice-Wizard - Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)
DeepSpeech - Install Mozilla DeepSpeech on a Raspberry Pi 4
seq2seq - A general-purpose encoder-decoder framework for Tensorflow
awesome-speech-recognition-speech-synthesis-papers - Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
allosaurus - Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
LocalSTT - Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech
vosk-api - Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Mava - 🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
DolphinAttack - Inaudible Voice Commands
pocketsphinx-python - Python interface to CMU Sphinxbase and Pocketsphinx libraries
spinorama - A library to display and compare spinorama (speakers measurements) graphs.