Top 23 Python text-to-speech Projects

MockingBird

9 33,796 5.8 Python

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
TTS

231 28,959 9.5 Python

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Project mention: OpenAI deems its voice cloning tool too risky for general release | news.ycombinator.com | 2024-03-31

lol this marketing technique is getting very old. https://github.com/coqui-ai/TTS is already amazing and open source.

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
OpenVoice

12 17,263 8.8 Python

Instant voice cloning by MyShell.

Project mention: Ask HN: Voice ID adoption at financial institutions | news.ycombinator.com | 2024-04-03

Given the inevitability of easy voice cloning[1], it seems irresponsible to be using voice as a positive authentication signal.
Unfortunately, major US financial institutions seem to be ramping up adoption of this technology[2].
Am I missing something?
[1] https://github.com/myshell-ai/OpenVoice

VALL-E-X

2 7,138 8.8 Python

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Project mention: FLaNK Stack Weekly 12 February 2024 | dev.to | 2024-02-12

EmotiVoice

5 6,270 8.9 Python

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Project mention: FLaNK Stack Weekly 12 February 2024 | dev.to | 2024-02-12

vits

6 6,230 0.0 Python

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Project mention: [D] TTS systems to download & run offline | /r/MachineLearning | 2023-05-14

And the voice encapsulation system VITS https://github.com/jaywalnut310/vits

pyvideotrans

1 5,556 9.7 Python

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，并添加配音

Project mention: FLaNK Stack Weekly 06 Nov 2023 | dev.to | 2023-11-06

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
DiffSinger

1 4,102 2.5 Python

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
Amphion

4 3,864 8.7 Python

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Project mention: FLaNK Stack Weekly 11 Dec 2023 | dev.to | 2023-12-11

TensorFlowTTS

6 3,697 0.0 Python

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Project mention: Ask HN: On-Device Text to Speech | news.ycombinator.com | 2023-08-31

Hey HN, has anyone found a viable solution for doing this locally and offline on iOS? I'd like to offer a privacy-friendly text to speech feature to my App, and Apple's speech synthesis sounds awful compared to some newer models and TTS engines. The only thing I've found is an older TensorflowTTS example here: https://github.com/TensorSpeech/TensorFlowTTS/tree/master/examples/ios
Any pointers or tips appreciated.

edge-tts

4 3,503 6.4 Python

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Project mention: [discussion] text to voice generation for textbooks (non-math part) | /r/MachineLearning | 2023-12-01

i would very much like to use it to turn the text parts of a book into an audio where i could listen to it while reading. i used edge's tts for speech by giving a paragraph to clipboard and to edge-tts in order to listen the text but it causes two problems: 1. you need internet connection and have the book opened 2. can only do paragraph by paragraph, and is prone to errors or sometimes if you use it too much it wont convert the full text afterwards.

Awesome-Prompt-Engineering

9 3,196 5.8 Python

This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc

Project mention: AI lessons | /r/ChatGPT | 2023-05-09

Yes, there are a lot of different resources online, especially for generative AI. The Awesome Prompt Engineering github is probably a good place to start https://github.com/promptslab/Awesome-Prompt-Engineering. If you're focusing directly on OpenAI's models then the OpenAI Prompt Engineering Guide would be my recommendation https://help.openai.com/en/articles/6654000-best-practices-for-prompt-engineering-with-openai-api.

vall-e

3 2,868 0.0 Python

An unofficial PyTorch implementation of the audio LM VALL-E
bark-with-voice-clone

19 2,798 7.5 Python

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Project mention: I've open sourced my Flutter plugin to run on-device LLMs on any platform. TestFlight builds available now. | /r/FlutterDev | 2023-12-08

And more stuff I’m often checking back on: - https://github.com/staghado/vit.cpp - https://github.com/serp-ai/bark-with-voice-clone - https://github.com/leejet/stable-diffusion.cpp (generate images) - etc … there’s too much fun stuff out there. Wish I had more free time haha.

aeneas

4 2,379 0.0 Python

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Tacotron-2

1 2,231 0.0 Python

DeepMind's Tacotron-2 Tensorflow implementation
gTTS

3 2,139 7.6 Python

Python library and CLI tool to interface with Google Translate's text-to-speech API

Project mention: Using Groq to Build a Real-Time Language Translation App | dev.to | 2024-04-05

For our real-time TTS needs, we'll employ the fantastic library called gTTS.

WaveRNN

5 2,086 0.0 Python

WaveRNN Vocoder + TTS
pyttsx3

11 1,898 0.0 Python

Offline Text To Speech synthesis for python

Project mention: Pyttsx3 help | /r/learnpython | 2023-05-26

Looks like a segfault to me. This current issue may help you solve it (scroll to the bottom to get a suggested fix).

elevenlabs-python

40 1,822 8.2 Python

The official Python API for ElevenLabs Text to Speech.

Project mention: babel_fish - real time language translation | dev.to | 2024-04-14

Since a true Babel fish doesn't produce text we also used ElevenLabs for text-to-speech.

hifi-gan

5 1,744 0.0 Python

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Dragonfire

2 1,372 0.0 Python

the open-source virtual assistant for Ubuntu based Linux distributions
tts-generation-webui

5 1,260 8.6 Python

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)

Project mention: OpenVoice: Versatile Instant Voice Cloning | news.ycombinator.com | 2024-03-29

https://github.com/rsxdalv/tts-generation-webui

SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python text-to-speech related posts

babel_fish - real time language translation
3 projects | dev.to | 14 Apr 2024
Ask HN: Voice ID adoption at financial institutions
1 project | news.ycombinator.com | 3 Apr 2024
OpenVoice: Versatile Instant Voice Cloning
7 projects | news.ycombinator.com | 29 Mar 2024
OpenAI deems its voice cloning tool too risky for general release
1 project | news.ycombinator.com | 31 Mar 2024
OpenAI: Navigating the Challenges and Opportunities of Synthetic Voices
1 project | news.ycombinator.com | 29 Mar 2024
English for Brazilians: A Fresh Start
1 project | dev.to | 27 Feb 2024
Base TTS (Amazon): The largest text-to-speech model to-date
3 projects | news.ycombinator.com | 14 Feb 2024
A note from our sponsor - WorkOS
workos.com | 23 Apr 2024

The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →

Index

What are some of the best open-source text-to-speech projects in Python? This list will help you:

	Project	Stars
1	MockingBird	33,796
2	TTS	28,959
3	OpenVoice	17,263
4	VALL-E-X	7,138
5	EmotiVoice	6,270
6	vits	6,230
7	pyvideotrans	5,556
8	DiffSinger	4,102
9	Amphion	3,864
10	TensorFlowTTS	3,697
11	edge-tts	3,503
12	Awesome-Prompt-Engineering	3,196
13	vall-e	2,868
14	bark-with-voice-clone	2,798
15	aeneas	2,379
16	Tacotron-2	2,231
17	gTTS	2,139
18	WaveRNN	2,086
19	pyttsx3	1,898
20	elevenlabs-python	1,822
21	hifi-gan	1,744
22	Dragonfire	1,372
23	tts-generation-webui	1,260