DeepFilterNet
wenet
DeepFilterNet | wenet | |
---|---|---|
10 | 5 | |
1,969 | 3,712 | |
- | 2.0% | |
8.9 | 9.6 | |
9 days ago | 7 days ago | |
Python | Python | |
GNU General Public License v3.0 or later | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
DeepFilterNet
-
Anyone know of a good TTS pipeline for raw speech data?
You mean remove background noise and transcribe? Then you can use DeepFilterNet to remove noise, and Whisper to transcribe.
-
Open Source Libraries
Rikorose/DeepFilterNet: A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) using on Deep Filtering
- DeepFilterNet: Noise supression using deep filtering
-
Linux Audio Noise suppression using deep filtering in Rust
It looks like the library in Rust is using `tract-onnx` to do the inference: https://github.com/Rikorose/DeepFilterNet/blob/2a84d2a1750a5... I am wondering whether using Python for research, training in big data center, and Rust at edge for efficient inference would be a trend in the future. We do have a larger community of C++ right now for inference (e.g. ggml). But Rust crate as component to build applications of AI is joy to use.
-
Real-Time Noise Suppression for PipeWire writen in Rust
Repo: https://github.com/Rikorose/DeepFilterNet
wenet
-
Open Source Libraries
wenet-e2e/wenet
-
Deploying speech recognition models at scale
Try wenet wenet
-
Ask HN: Are there any good open source Text-to-Speech tools?
For STT, take a look at Wenet: https://github.com/wenet-e2e/wenet
They provide support for running in a Raspberry Pi and it runs in real-time. I have tried the desktop version and the quality is good enough when the audio is clean.
- Project Alice – an open source virtual assistant that can run offline
- Wenet results on Gigaspeech - on par with best results (Espnet). Pretrained model is available .
What are some alternatives?
NoiseTorch - Real-time microphone noise suppression on Linux.
vosk-api - Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
audio-webui - A webui for different audio related Neural Networks
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
noise-repellent - Lv2 suite of plugins for broadband noise reduction
FasterTransformer - Transformer related optimization, including BERT, GPT
PiDTLN - Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi
whisper - Robust Speech Recognition via Large-Scale Weak Supervision
rnnoise - Recurrent neural network for audio noise reduction
fstalign - An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.
noise-suppression-for-voice - Noise suppression plugin based on Xiph's RNNoise
functorch - functorch is a prototype of JAX-like composable function transforms for PyTorch. [Moved to: https://github.com/pytorch/functorch]