Wave-U-Net-for-Speech-Enhancement vs asteroid
| | Wave-U-Net-for-Speech-Enhancement | asteroid |
|---|---|---|
| Mentions | 1 | 2 |
| Stars | 302 | 2,111 |
| Growth | - | 1.7% |
| Activity | 0.0 | 5.5 |
| Last commit | over 1 year ago | 24 days ago |
| Language | Python | Python |
| License | MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
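The exact formula behind the activity number is not given here, so the following is only an illustrative sketch of how a recency-weighted, percentile-based score could behave. The exponential decay, the 30-day half-life, and the function names `recency_weighted_score` and `activity_rating` are all assumptions made for this example, not the site's actual method.

```python
from bisect import bisect_left


def recency_weighted_score(commit_ages_days, half_life_days=30.0):
    """Sum per-commit weights that halve every `half_life_days`.

    Assumption: exponential decay with a 30-day half-life. The real
    weighting scheme is not documented in this page.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_rating(score, all_scores):
    """Map a raw score to a 0-10 scale by percentile rank.

    A rating of 9.0 means 90% of tracked projects have a strictly lower
    raw score, i.e. the project is in the top 10%.
    """
    ranked = sorted(all_scores)
    below = bisect_left(ranked, score)  # projects with a strictly lower score
    return round(10.0 * below / len(ranked), 1)


# Hypothetical data: commit ages in days for two made-up projects.
recent_project = recency_weighted_score([1, 3, 10])
stale_project = recency_weighted_score([400, 420, 500])
```

Under these assumptions, a project whose commits are all more than a year old ends up with a raw score near zero, which is consistent with an activity of 0.0 next to a last commit of "over 1 year ago".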
Wave-U-Net-for-Speech-Enhancement
- [P] Malaya-Speech, Speech-Toolkit library for Malay language, powered by Deep Learning Tensorflow
  Speech Enhancement UNET, https://github.com/haoxiangsnr/Wave-U-Net-for-Speech-Enhancement
asteroid
- Speech separation
- [D] Is it possible to extract certain sounds from a mixture of sounds and noise
  Here is a link to a tutorial that will teach you everything you need to know about source separation, from data, to losses, to commonly used model architectures. That tutorial is built around the nussl source separation library, but other good libraries exist as well, such as asteroid.
What are some alternatives?
Pytorch-UNet - PyTorch implementation of the U-Net for image semantic segmentation with high quality images
nussl - A flexible source separation library in Python
malaya-speech - Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/
Conv-TasNet - A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
unet-nested-multiple-classification - This repository contains code for a multiple classification image segmentation model based on UNet and UNet++
Speech-Separation-Paper-Tutorial - Must-read papers for speech separation based on neural networks
pyannote-audio - Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
mmf - A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
espnet - End-to-End Speech Processing Toolkit
voicefilter - Unofficial PyTorch implementation of Google AI's VoiceFilter system
segmentation_models.pytorch - Segmentation models with pretrained backbones. PyTorch.
mayavoz - PyTorch-based speech enhancement toolkit.