Metric | asteroid | Wave-U-Net-for-Speech-Enhancement
---|---|---
Mentions | 2 | 1
Stars | 2,118 | 302
Growth | 2.0% | -
Activity | 5.5 | 0.0
Last commit | 28 days ago | over 1 year ago
Language | Python | Python
License | MIT License | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
asteroid
- Speech separation
- [D] Is it possible to extract certain sounds from a mixture of sounds and noise
Here is a link to a tutorial that will teach you everything you need to know about source separation, from data, to losses, to commonly used model architectures. That tutorial is built around the nussl source separation library, but some other nice ones exist as well, such as asteroid.
Wave-U-Net-for-Speech-Enhancement
- [P] Malaya-Speech, Speech-Toolkit library for Malay language, powered by Deep Learning Tensorflow
Speech Enhancement UNET, https://github.com/haoxiangsnr/Wave-U-Net-for-Speech-Enhancement
What are some alternatives?
nussl - A flexible source separation library in Python
Pytorch-UNet - PyTorch implementation of the U-Net for image semantic segmentation with high quality images
Conv-TasNet - A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
malaya-speech - Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/
Speech-Separation-Paper-Tutorial - A must-read paper for speech separation based on neural networks
unet-nested-multiple-classification - This repository contains code for a multiple classification image segmentation model based on UNet and UNet++
mmf - A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
pyannote-audio - Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
voicefilter - Unofficial PyTorch implementation of Google AI's VoiceFilter system
espnet - End-to-End Speech Processing Toolkit
mayavoz - Pytorch based speech enhancement toolkit.
segmentation_models.pytorch - Segmentation models with pretrained backbones. PyTorch.