Top 10 Python speech-enhancement Projects

espnet

15 mentions | 7,916 stars | activity 10.0 | Python

End-to-End Speech Processing Toolkit

Project mention: WhisperSpeech – An Open Source text-to-speech system built by inverting Whisper | news.ycombinator.com | 2024-01-17

You might check out this list from espnet. They list the different corpora they use to train their models, sorted by language and task (ASR, TTS, etc.):
https://github.com/espnet/espnet/blob/master/egs2/README.md

speechbrain

26 mentions | 7,914 stars | activity 9.8 | Python

A PyTorch-based Speech Toolkit

Project mention: SpeechBrain 1.0: A free and open-source AI toolkit for all things speech | news.ycombinator.com | 2024-02-28

asteroid

2 mentions | 2,118 stars | activity 5.5 | Python

The PyTorch-based audio source separation toolkit for researchers

DeepFilterNet

10 mentions | 1,952 stars | activity 8.9 | Python

Noise suppression using deep filtering

Project mention: Anyone know of a good TTS pipeline for raw speech data? | /r/AudioAI | 2023-10-03

You mean remove background noise and transcribe? Then you can use DeepFilterNet to remove noise, and Whisper to transcribe.

resemble-enhance

3 mentions | 931 stars | activity 6.3 | Python

AI-powered speech denoising and enhancement

Project mention: Ask HN: Who is hiring? (February 2024) | news.ycombinator.com | 2024-02-01

Resemble AI | San Francisco Bay Area (office in Santa Clara, CA) | Full-Time | Full-Stack Engineer, Frontend Engineer, Product Designer
Resemble AI creates high-quality synthetic voices that capture human emotion. We're a venture-backed, high-growth startup that's looking to shake up an entire industry with state-of-the-art AI. Our product changes the way that thousands of brands, media companies, creative agencies, and game studios create speech content. We believe that the way to build an enticing product and a solid team is by encouraging innovation and enabling continuous education. That's why every Friday is a day that you can use to work on anything you want, Resemble-related or not.
Recently, we open-sourced a state-of-the-art speech enhancement model: https://github.com/resemble-ai/resemble-enhance
We're hiring for three roles:
Full Stack Engineer - Can you break the entire stack? You're the right person for this job. Work on our Rails app, with sprinkles of React, and Python for the deep learning. Everything is dockerized, and we use Kubernetes to deploy.
Frontend Engineer - We're hiring a Frontend Engineer proficient in React, TypeScript, and Ruby on Rails to shape our user experience. Join our team to develop user-friendly interfaces and collaborate on building exceptional web experiences.
Product Designer - As a Product Designer, you will lead the end-to-end design process, from concept to implementation, ensuring a seamless and delightful user experience. You will collaborate with cross-functional teams to define product vision, conduct user research, create visually compelling interfaces, and develop interactive prototypes.
If interested, reach out directly to me: zohaib [at] resemble.ai

voicefixer

2 mentions | 917 stars | activity 5.4 | Python

General Speech Restoration

Project mention: Linux Audio Noise suppression using deep filtering in Rust | news.ycombinator.com | 2023-06-06

mayavoz

6 mentions | 317 stars | activity 0.0 | Python

PyTorch-based speech enhancement toolkit.

Wave-U-Net-for-Speech-Enhancement

1 mention | 302 stars | activity 0.0 | Python

A PyTorch implementation of Wave-U-Net, adapted to speech enhancement.

Neural-Speech-Dereverberation

1 mention | 92 stars | activity 1.8 | Python

Machine and Deep Learning models for speech dereverberation

NLP-Guide

2 mentions | 66 stars | activity 3.5 | Python

A guide to Natural Language Processing (NLP), covering topics such as tokenization, part-of-speech (POS) tagging, machine translation, named entity recognition (NER), classification, and sentiment analysis.

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python speech-enhancement related posts

SpeechBrain 1.0: A free and open-source AI toolkit for all things speech

1 project | news.ycombinator.com | 28 Feb 2024

Anyone know of a good TTS pipeline for raw speech data?

2 projects | /r/AudioAI | 3 Oct 2023

DeepFilterNet: Noise suppression using deep filtering

1 project | /r/patient_hackernews | 7 Jun 2023

DeepFilterNet: Noise suppression using deep filtering

1 project | /r/hackernews | 7 Jun 2023

Linux Audio Noise suppression using deep filtering in Rust

1 project | /r/hypeurls | 6 Jun 2023

[D] Training ASR model using SpeechBrain

1 project | /r/MachineLearning | 5 Jun 2023

Specific Voice recognition

1 project | /r/learnpython | 13 Jan 2023

Index

What are some of the best open-source speech-enhancement projects in Python? This list will help you:

Rank	Project	Stars
1	espnet	7,916
2	speechbrain	7,914
3	asteroid	2,118
4	DeepFilterNet	1,952
5	resemble-enhance	931
6	voicefixer	917
7	mayavoz	317
8	Wave-U-Net-for-Speech-Enhancement	302
9	Neural-Speech-Dereverberation	92
10	NLP-Guide	66
