DeepFilterNet
fish-diffusion
DeepFilterNet | fish-diffusion | |
---|---|---|
10 | 1 | |
1,969 | 579 | |
- | 4.0% | |
8.9 | 7.9 | |
8 days ago | 8 days ago | |
Python | Python | |
GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
DeepFilterNet
-
Anyone know of a good TTS pipeline for raw speech data?
You mean remove background noise and transcribe? Then you can use DeepFilterNet to remove noise, and Whisper to transcribe.
-
Open Source Libraries
Rikorose/DeepFilterNet: A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) using on Deep Filtering
- DeepFilterNet: Noise supression using deep filtering
-
Linux Audio Noise suppression using deep filtering in Rust
It looks like the library in Rust is using `tract-onnx` to do the inference: https://github.com/Rikorose/DeepFilterNet/blob/2a84d2a1750a5... I am wondering whether using Python for research, training in big data center, and Rust at edge for efficient inference would be a trend in the future. We do have a larger community of C++ right now for inference (e.g. ggml). But Rust crate as component to build applications of AI is joy to use.
-
Real-Time Noise Suppression for PipeWire writen in Rust
Repo: https://github.com/Rikorose/DeepFilterNet
fish-diffusion
-
Open Source Libraries
fishaudio/fish-diffusion: Singing Voice Conversion
What are some alternatives?
NoiseTorch - Real-time microphone noise suppression on Linux.
audio-webui - A webui for different audio related Neural Networks
tortoise-tts - A multi-voice TTS system trained with an emphasis on quality
noise-repellent - Lv2 suite of plugins for broadband noise reduction
TTS - πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
PiDTLN - Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi
jukebox - Code for the paper "Jukebox: A Generative Model for Music"
wenet - Production First and Production Ready End-to-End Speech Recognition Toolkit
minDiffusion - Self-contained, minimalistic implementation of diffusion models with Pytorch.
rnnoise - Recurrent neural network for audio noise reduction
basic-pitch - A lightweight yet powerful audio-to-MIDI converter with pitch bend detection