Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT). (by kaituoxu)
voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system (by maum-ai)
Conv-TasNet | voicefilter | |
---|---|---|
2 | 1 | |
632 | 1,031 | |
- | 1.0% | |
1.3 | 0.0 | |
about 1 year ago | 3 months ago | |
Python | Python | |
MIT License | - |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Conv-TasNet
Posts with mentions or reviews of Conv-TasNet.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-04-08.
-
Conv-tasnet
The makefile in question is here: https://github.com/kaituoxu/Conv-TasNet/blob/master/tools/Makefile
voicefilter
Posts with mentions or reviews of voicefilter.
We have used some of these posts to build our list of alternatives
and similar projects.
-
What does width mean in this context? I can't find it anywhere. It's not stride because it would be in compatible with the dilation rate in layer 4. Any ideas? This is from the VoiceFilter paper
Here's the link: https://github.com/mindslab-ai/voicefilter/blob/master/model/model.py
What are some alternatives?
When comparing Conv-TasNet and voicefilter you can also consider the following projects:
pytorch-tutorial - PyTorch Tutorial for Deep Learning Researchers
asteroid - The PyTorch-based audio source separation toolkit for researchers
Real-Time-Voice-Cloning - Clone a voice in 5 seconds to generate arbitrary speech in real-time
UniSpeech - UniSpeech - Large Scale Self-Supervised Learning for Speech
speechbrain - A PyTorch-based Speech Toolkit
espnet - End-to-End Speech Processing Toolkit
nussl - A flexible source separation library in Python
demucs - Code for the paper Hybrid Spectrogram and Waveform Source Separation, but the goddamm motherfucker doesn't work.
Chocolatey - Chocolatey - the package manager for Windows