Top 23 Python Tt Projects

Real-Time-Voice-Cloning

96 50,652 0.0 Python

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Project mention: FLaNK Stack Weekly 12 February 2024 | dev.to | 2024-02-12

MockingBird

9 33,796 5.8 Python

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
TTS

231 28,959 9.5 Python

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Project mention: OpenAI deems its voice cloning tool too risky for general release | news.ycombinator.com | 2024-03-31

lol this marketing technique is getting very old. https://github.com/coqui-ai/TTS is already amazing and open source.

OpenVoice

12 17,263 8.8 Python

Instant voice cloning by MyShell.

Project mention: Ask HN: Voice ID adoption at financial institutions | news.ycombinator.com | 2024-04-03

Given the inevitability of easy voice cloning[1], it seems irresponsible to be using voice as a positive authentication signal.
Unfortunately, major US financial institutions seem to be ramping up adoption of this technology[2].
Am I missing something?
[1] https://github.com/myshell-ai/OpenVoice

PaddleSpeech

6 10,120 7.6 Python

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

PaddlePaddle/PaddleSpeech

NeMo

29 10,021 9.8 Python

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Project mention: [P] Making a TTS voice, HK-47 from Kotor using Tortoise (Ideally WaveRNN) | /r/MachineLearning | 2023-07-06

I don't test WaveRNN but from the ones that I know the best that is open source is FastPitch. And it's easy to use, here is the tutorial for voice cloning.

VALL-E-X

2 7,138 8.8 Python

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Project mention: FLaNK Stack Weekly 12 February 2024 | dev.to | 2024-02-12

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
EmotiVoice

5 6,270 8.9 Python

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Project mention: FLaNK Stack Weekly 12 February 2024 | dev.to | 2024-02-12

vits

6 6,230 0.0 Python

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Project mention: [D] TTS systems to download & run offline | /r/MachineLearning | 2023-05-14

And the voice encapsulation system VITS https://github.com/jaywalnut310/vits

DiffSinger

1 4,102 2.5 Python

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
TensorFlowTTS

6 3,697 0.0 Python

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Project mention: Ask HN: On-Device Text to Speech | news.ycombinator.com | 2023-08-31

Hey HN, has anyone found a viable solution for doing this locally and offline on iOS? I'd like to offer a privacy-friendly text to speech feature to my App, and Apple's speech synthesis sounds awful compared to some newer models and TTS engines. The only thing I've found is an older TensorflowTTS example here: https://github.com/TensorSpeech/TensorFlowTTS/tree/master/examples/ios
Any pointers or tips appreciated.

edge-tts

4 3,503 6.4 Python

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Project mention: [discussion] text to voice generation for textbooks (non-math part) | /r/MachineLearning | 2023-12-01

i would very much like to use it to turn the text parts of a book into an audio where i could listen to it while reading. i used edge's tts for speech by giving a paragraph to clipboard and to edge-tts in order to listen the text but it causes two problems: 1. you need internet connection and have the book opened 2. can only do paragraph by paragraph, and is prone to errors or sometimes if you use it too much it wont convert the full text afterwards.

tacotron

3 2,921 0.0 Python

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
vall-e

3 2,868 0.0 Python

An unofficial PyTorch implementation of the audio LM VALL-E
lingvo

1 2,781 8.7 Python

Lingvo
aeneas

4 2,379 0.0 Python

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
gTTS

3 2,139 7.6 Python

Python library and CLI tool to interface with Google Translate's text-to-speech API

Project mention: Using Groq to Build a Real-Time Language Translation App | dev.to | 2024-04-05

For our real-time TTS needs, we'll employ the fantastic library called gTTS.

WaveRNN

5 2,086 0.0 Python

WaveRNN Vocoder + TTS
hifi-gan

5 1,744 0.0 Python

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
tts-generation-webui

5 1,260 8.6 Python

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)

Project mention: OpenVoice: Versatile Instant Voice Cloning | news.ycombinator.com | 2024-03-29

https://github.com/rsxdalv/tts-generation-webui

dc_tts

4 1,150 0.0 Python

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
TransformerTTS

1 1,096 0.0 Python

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
NATSpeech

4 944 1.8 Python

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Tts related posts

Using Groq to Build a Real-Time Language Translation App
3 projects | dev.to | 5 Apr 2024
Ask HN: Voice ID adoption at financial institutions
1 project | news.ycombinator.com | 3 Apr 2024
OpenVoice: Versatile Instant Voice Cloning
7 projects | news.ycombinator.com | 29 Mar 2024
OpenAI: Navigating the Challenges and Opportunities of Synthetic Voices
1 project | news.ycombinator.com | 29 Mar 2024
WhisperSpeech – An Open Source text-to-speech system built by inverting Whisper
9 projects | news.ycombinator.com | 17 Jan 2024
Building a local AI smart Home Assistant
11 projects | news.ycombinator.com | 13 Jan 2024
OpenVoice: Versatile Instant Voice Cloning
1 project | news.ycombinator.com | 4 Jan 2024
A note from our sponsor - WorkOS
workos.com | 23 Apr 2024

The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →

Index

What are some of the best open-source Tt projects in Python? This list will help you:

	Project	Stars
1	Real-Time-Voice-Cloning	50,652
2	MockingBird	33,796
3	TTS	28,959
4	OpenVoice	17,263
5	PaddleSpeech	10,120
6	NeMo	10,021
7	VALL-E-X	7,138
8	EmotiVoice	6,270
9	vits	6,230
10	DiffSinger	4,102
11	TensorFlowTTS	3,697
12	edge-tts	3,503
13	tacotron	2,921
14	vall-e	2,868
15	lingvo	2,781
16	aeneas	2,379
17	gTTS	2,139
18	WaveRNN	2,086
19	hifi-gan	1,744
20	tts-generation-webui	1,260
21	dc_tts	1,150
22	TransformerTTS	1,096
23	NATSpeech	944