Top 23 Python TTS Projects
-
MockingBird
Clone a voice in 5 seconds to generate arbitrary speech in real-time
-
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
-
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
-
VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
-
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
-
DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
-
TensorFlowTTS
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)
-
edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
-
tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
-
aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
-
tts-generation-webui
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)
-
TransformerTTS
Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
-
NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
Project mention: OpenAI deems its voice cloning tool too risky for general release | news.ycombinator.com | 2024-03-31
lol this marketing technique is getting very old. https://github.com/coqui-ai/TTS is already amazing and open source.
Project mention: Ask HN: Voice ID adoption at financial institutions | news.ycombinator.com | 2024-04-03
Given the inevitability of easy voice cloning[1], it seems irresponsible to be using voice as a positive authentication signal.
Unfortunately, major US financial institutions seem to be ramping up adoption of this technology[2].
Am I missing something?
[1] https://github.com/myshell-ai/OpenVoice
PaddlePaddle/PaddleSpeech
Project mention: [P] Making a TTS voice, HK-47 from Kotor using Tortoise (Ideally WaveRNN) | /r/MachineLearning | 2023-07-06
I haven't tested WaveRNN, but of the ones I know, the best open-source option is FastPitch. It's easy to use; here is the tutorial for voice cloning.
And the voice encapsulation system VITS https://github.com/jaywalnut310/vits
Hey HN, has anyone found a viable solution for doing this locally and offline on iOS? I'd like to offer a privacy-friendly text to speech feature to my App, and Apple's speech synthesis sounds awful compared to some newer models and TTS engines. The only thing I've found is an older TensorflowTTS example here: https://github.com/TensorSpeech/TensorFlowTTS/tree/master/examples/ios
Any pointers or tips appreciated.
Project mention: [discussion] text to voice generation for textbooks (non-math part) | /r/MachineLearning | 2023-12-01
i would very much like to use it to turn the text parts of a book into audio that i could listen to while reading. i used edge's tts by copying a paragraph to the clipboard and passing it to edge-tts to listen to the text, but it causes two problems: 1. you need an internet connection and have to keep the book open 2. it can only go paragraph by paragraph, and is prone to errors, or sometimes if you use it too much it won't convert the full text afterwards.
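The paragraph-by-paragraph workflow described in the mention above could be scripted rather than driven through the clipboard. The following is a minimal sketch, not the poster's method: `split_into_chunks`, `synthesize_book`, the `max_chars` value of 2000, the output file naming, and the `en-US-AriaNeural` voice are all illustrative assumptions. It relies on the `edge_tts.Communicate(text, voice)` class and its `save()` coroutine from the edge-tts package, and it still requires an internet connection, so it only addresses the second problem.

```python
import re


def split_into_chunks(text, max_chars=2000):
    """Group paragraphs (separated by blank lines) into chunks of at most
    max_chars characters. A single paragraph longer than max_chars is kept
    whole rather than cut mid-sentence. max_chars is an assumed value, not a
    documented service limit."""
    paragraphs = [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]
    chunks, current = [], ""
    for para in paragraphs:
        candidate = f"{current}\n\n{para}" if current else para
        if len(candidate) <= max_chars or not current:
            current = candidate
        else:
            chunks.append(current)
            current = para
    if current:
        chunks.append(current)
    return chunks


async def synthesize_book(text, out_prefix="chunk", voice="en-US-AriaNeural"):
    """Write one MP3 per chunk, e.g. chunk_001.mp3, chunk_002.mp3, ...
    Returns the list of file names written."""
    # Imported lazily so the chunking helper works without the package.
    import edge_tts  # third-party: pip install edge-tts

    written = []
    for i, chunk in enumerate(split_into_chunks(text), start=1):
        path = f"{out_prefix}_{i:03d}.mp3"
        await edge_tts.Communicate(chunk, voice).save(path)
        written.append(path)
    return written

# Usage (hypothetical file name):
#   import asyncio
#   asyncio.run(synthesize_book(open("book.txt", encoding="utf-8").read()))
```

Writing one file per chunk means a failed request costs only that chunk, which can be retried without redoing the whole book.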
For our real-time TTS needs, we'll employ the fantastic library called gTTS.
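As a rough illustration of what using gTTS looks like, here is a minimal sketch. The helper name `speak_to_file` and the default output path are assumptions made for this example; the only library calls used are the `gTTS(text=..., lang=...)` constructor and its `save()` method. gTTS sends the text to Google Translate's text-to-speech service, so it needs network access.

```python
def speak_to_file(text, path="speech.mp3", lang="en"):
    """Synthesize text with gTTS and write the result to an MP3 file.
    Returns the path that was written."""
    if not text.strip():
        raise ValueError("text must not be empty")
    # Imported lazily; third-party package: pip install gTTS
    from gtts import gTTS

    gTTS(text=text, lang=lang).save(path)
    return path

# Usage:
#   speak_to_file("Hello from gTTS", "hello.mp3")
```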
https://github.com/rsxdalv/tts-generation-webui
Python TTS related posts
- Using Groq to Build a Real-Time Language Translation App
- Ask HN: Voice ID adoption at financial institutions
- OpenVoice: Versatile Instant Voice Cloning
- OpenAI: Navigating the Challenges and Opportunities of Synthetic Voices
- WhisperSpeech - An Open Source text-to-speech system built by inverting Whisper
- Building a local AI smart Home Assistant
Index
What are some of the best open-source TTS projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | Real-Time-Voice-Cloning | 50,652 |
2 | MockingBird | 33,796 |
3 | TTS | 28,959 |
4 | OpenVoice | 17,263 |
5 | PaddleSpeech | 10,120 |
6 | NeMo | 10,021 |
7 | VALL-E-X | 7,138 |
8 | EmotiVoice | 6,270 |
9 | vits | 6,230 |
10 | DiffSinger | 4,102 |
11 | TensorFlowTTS | 3,697 |
12 | edge-tts | 3,503 |
13 | tacotron | 2,921 |
14 | vall-e | 2,868 |
15 | lingvo | 2,781 |
16 | aeneas | 2,379 |
17 | gTTS | 2,139 |
18 | WaveRNN | 2,086 |
19 | hifi-gan | 1,744 |
20 | tts-generation-webui | 1,260 |
21 | dc_tts | 1,150 |
22 | TransformerTTS | 1,096 |
23 | NATSpeech | 944 |