Our great sponsors
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
-
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
-
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
-
Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
-
demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation, but the goddamm motherfucker doesn't work.
-
PiDTLN
Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
coqui-ai/TTS
neonbjb/tortoise-tts
suno-ai/bark
rhasspy/piper
shivammehta25/Matcha-TTS
openai/whisper
ggerganov/whisper.cpp
guillaumekln/faster-whisper
wenet-e2e/wenet
facebookresearch/seamless_communication: Speech translation
pyannote/pyannote-audio
PaddlePaddle/PaddleSpeech
gitmylo/audio-webui
facebookresearch/audiocraft/MUSICGEN: Music Generation
openai/jukebox: Music Generation
RVC-Project/Retrieval-based-Voice-Conversion-WebUI: Singing Voice Conversion
fishaudio/fish-diffusion: Singing Voice Conversion
facebookresearch/demucs: Stem seperation
Anjok07/UltimateVocalRemoverGUI: Vocal isolation
Rikorose/DeepFilterNet: A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) using on Deep Filtering
SaneBow/PiDTLN: DTLN model for noise suppression and acoustic echo cancellation on Raspberry Pi
haoheliu/versatile_audio_super_resolution: any -> 48kHz high fidelity Enhancer
spotify/basic-pitch: Audio to midi converter
spotify/pedalboard: audio effects for Python and TensorFlow
librosa/librosa: Python library for audio and music analysis