DNS-Challenge
vakyansh-models
DNS-Challenge | vakyansh-models | |
---|---|---|
2 | 2 | |
973 | 267 | |
1.8% | 0.0% | |
4.4 | 0.0 | |
about 1 month ago | over 1 year ago | |
Python | ||
Creative Commons Attribution 4.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
DNS-Challenge
-
Mozilla Common Voice Adds 16 New Languages and 4,600 New Hours of Speech
Is anyone aware of classification (e.g. word prediction) datasets for low-resource and endangered languages?
If so, we would like to use it for the HEAR NeurIPS competition: https://github.com/microsoft/DNS-Challenge/tree/master/datas...
The challenge is restricted only to classification tasks, and sequence modeling like full ASR is unfortunately beyond the scope of the competition.
-
How to clone while skipping some of the directories?
Repository
vakyansh-models
What are some alternatives?
flashlight - A C++ standalone library for machine learning
STT - 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
DeepSpeech - DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
TTS - 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
NeMo - A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
PaddleSpeech - Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
edgedict - Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
common-voice - Common Voice is part of Mozilla's initiative to help teach machines how real people speak.