Metric | WOLOF-ASR-Wav2Vec2 | vid2cleantxt
---|---|---
Mentions | 2 | 1
Stars | 12 | 156
Growth | - | -
Activity | 0.0 | 0.0
Last commit | over 2 years ago | over 1 year ago
Language | Jupyter Notebook | Jupyter Notebook
License | Apache License 2.0 | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
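As a rough illustration of the growth metric described above, here is a minimal Python sketch. The function name `month_over_month_growth` and the formula (relative change in star count between two monthly snapshots) are assumptions made for illustration; the page does not state how the site actually computes the value.

```python
from typing import Optional


def month_over_month_growth(stars_last_month: int, stars_now: int) -> Optional[float]:
    """Relative change in stars between two monthly snapshots (hypothetical formula).

    Returns a fraction (0.1 means 10% growth), or None when there is no
    previous-month baseline to compare against.
    """
    if stars_last_month <= 0:
        # No meaningful baseline, so growth is undefined (shown as "-" in the table).
        return None
    return (stars_now - stars_last_month) / stars_last_month


if __name__ == "__main__":
    # Star counts here are made-up inputs, not data from either project.
    print(month_over_month_growth(100, 110))
    print(month_over_month_growth(0, 12))
```

Returning `None` instead of raising keeps the "no data" case explicit, which matches how the table shows a dash rather than a number when growth is unavailable.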
WOLOF-ASR-Wav2Vec2
My first contribution to Hugging Face
I have fine-tuned wav2vec2 large XLSR-53 on a Wolof audio dataset. For more information, visit here. You can also check my GitHub repo and my Kaggle notebook.
- [P] Finetuning Facebook wav2vec2 large xlsr model on Wolof audio data
vid2cleantxt
What are some alternatives?
awesome-deep-learning-music - List of articles related to deep learning applied to music
SpecVQGAN - Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
awesome-python-applications - 💿 Free software that works great, and also happens to be open-source Python.
PipeWire-Guide - PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.
essentia - C++ library for audio and music analysis, description and synthesis, including Python bindings
distil-whisper - Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
auto-editor - Auto-Editor: Effort free video editing!
web-whisper - OpenAI's Whisper Audio to text transcription right into your web browser! An open source AI subtitling suite.
OTTO - Sampler, Sequencer, Multi-engine synth and effects - in a box! [WIP]
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
beep - A little package that brings sound to any Go application. Suitable for playback and audio-processing.
web-speech-synthesis-and-recognition - Speech to Text and Text to Speech on a web browser