DeepFilterNet
PiDTLN
DeepFilterNet | PiDTLN | |
---|---|---|
10 | 1 | |
1,969 | 52 | |
- | - | |
8.9 | 10.0 | |
9 days ago | over 2 years ago | |
Python | Python | |
GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
DeepFilterNet
-
Anyone know of a good TTS pipeline for raw speech data?
You mean remove background noise and transcribe? Then you can use DeepFilterNet to remove noise, and Whisper to transcribe.
-
Open Source Libraries
Rikorose/DeepFilterNet: A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) using on Deep Filtering
- DeepFilterNet: Noise supression using deep filtering
-
Linux Audio Noise suppression using deep filtering in Rust
It looks like the library in Rust is using `tract-onnx` to do the inference: https://github.com/Rikorose/DeepFilterNet/blob/2a84d2a1750a5... I am wondering whether using Python for research, training in big data center, and Rust at edge for efficient inference would be a trend in the future. We do have a larger community of C++ right now for inference (e.g. ggml). But Rust crate as component to build applications of AI is joy to use.
-
Real-Time Noise Suppression for PipeWire writen in Rust
Repo: https://github.com/Rikorose/DeepFilterNet
PiDTLN
-
Open Source Libraries
SaneBow/PiDTLN: DTLN model for noise suppression and acoustic echo cancellation on Raspberry Pi
What are some alternatives?
NoiseTorch - Real-time microphone noise suppression on Linux.
basic-pitch - A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
audio-webui - A webui for different audio related Neural Networks
noise-repellent - Lv2 suite of plugins for broadband noise reduction
jukebox - Code for the paper "Jukebox: A Generative Model for Music"
wenet - Production First and Production Ready End-to-End Speech Recognition Toolkit
audiocraft - Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
rnnoise - Recurrent neural network for audio noise reduction
whisper.cpp - Port of OpenAI's Whisper model in C/C++
noise-suppression-for-voice - Noise suppression plugin based on Xiph's RNNoise
PaddleSpeech - Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.