wenet
DeepFilterNet
wenet | DeepFilterNet | |
---|---|---|
5 | 10 | |
3,699 | 1,933 | |
1.6% | - | |
9.6 | 8.9 | |
1 day ago | 5 days ago | |
Python | Python | |
Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
wenet
-
Open Source Libraries
wenet-e2e/wenet
-
Deploying speech recognition models at scale
Try wenet wenet
-
Ask HN: Are there any good open source Text-to-Speech tools?
For STT, take a look at Wenet: https://github.com/wenet-e2e/wenet
They provide support for running in a Raspberry Pi and it runs in real-time. I have tried the desktop version and the quality is good enough when the audio is clean.
- Project Alice – an open source virtual assistant that can run offline
- Wenet results on Gigaspeech - on par with best results (Espnet). Pretrained model is available .
DeepFilterNet
-
Anyone know of a good TTS pipeline for raw speech data?
You mean remove background noise and transcribe? Then you can use DeepFilterNet to remove noise, and Whisper to transcribe.
-
Open Source Libraries
Rikorose/DeepFilterNet: A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) using on Deep Filtering
- DeepFilterNet: Noise supression using deep filtering
-
Linux Audio Noise suppression using deep filtering in Rust
It looks like the library in Rust is using `tract-onnx` to do the inference: https://github.com/Rikorose/DeepFilterNet/blob/2a84d2a1750a5... I am wondering whether using Python for research, training in big data center, and Rust at edge for efficient inference would be a trend in the future. We do have a larger community of C++ right now for inference (e.g. ggml). But Rust crate as component to build applications of AI is joy to use.
-
Real-Time Noise Suppression for PipeWire writen in Rust
Repo: https://github.com/Rikorose/DeepFilterNet
What are some alternatives?
vosk-api - Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
NoiseTorch - Real-time microphone noise suppression on Linux.
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
audio-webui - A webui for different audio related Neural Networks
FasterTransformer - Transformer related optimization, including BERT, GPT
noise-repellent - Lv2 suite of plugins for broadband noise reduction
whisper - Robust Speech Recognition via Large-Scale Weak Supervision
PiDTLN - Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi
fstalign - An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.
rnnoise - Recurrent neural network for audio noise reduction
functorch - functorch is a prototype of JAX-like composable function transforms for PyTorch. [Moved to: https://github.com/pytorch/functorch]
noise-suppression-for-voice - Noise suppression plugin based on Xiph's RNNoise