julius
vid2cleantxt
julius | vid2cleantxt | |
---|---|---|
1 | 1 | |
1,778 | 156 | |
0.6% | - | |
2.7 | 0.0 | |
3 days ago | over 1 year ago | |
C | Jupyter Notebook | |
BSD 3-clause "New" or "Revised" License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
julius
vid2cleantxt
What are some alternatives?
vosk-server - WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
SpecVQGAN - Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
TTS - πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
PipeWire-Guide - PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.
soloud - Free, easy, portable audio engine for games
distil-whisper - Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
common-voice - Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
web-whisper - OpenAI's Whisper Audio to text transcription right into your web browser! An open source AI subtitling suite.
Tenacity - Tenacity is an easy-to-use, privacy-friendly, FLOSS, cross-platform multi-track audio editor/recorder for Windows, macOS, Linux and other operating systems. Project currently on an indefinite hiatus.
WOLOF-ASR-Wav2Vec2 - Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.
praat - Praat: Doing Phonetics By Computer
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple