Python text-to-speech

Open-source Python projects categorized as text-to-speech

Top 23 Python text-to-speech Projects

  • MockingBird

    🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

  • TTS

    🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

  • Project mention: OpenAI deems its voice cloning tool too risky for general release | news.ycombinator.com | 2024-03-31

    lol this marketing technique is getting very old. https://github.com/coqui-ai/TTS is already amazing and open source.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • OpenVoice

    Instant voice cloning by MyShell.

  • Project mention: Ask HN: Voice ID adoption at financial institutions | news.ycombinator.com | 2024-04-03

    Given the inevitability of easy voice cloning[1], it seems irresponsible to be using voice as a positive authentication signal.

    Unfortunately, major US financial institutions seem to be ramping up adoption of this technology[2].

    Am I missing something?

    [1] https://github.com/myshell-ai/OpenVoice

  • VALL-E-X

    An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

  • Project mention: FLaNK Stack Weekly 12 February 2024 | dev.to | 2024-02-12
  • EmotiVoice

    EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

  • Project mention: FLaNK Stack Weekly 12 February 2024 | dev.to | 2024-02-12
  • vits

    VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

  • Project mention: [D] TTS systems to download & run offline | /r/MachineLearning | 2023-05-14

    And the voice encapsulation system VITS https://github.com/jaywalnut310/vits

  • pyvideotrans

    Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并添加配音

  • Project mention: FLaNK Stack Weekly 06 Nov 2023 | dev.to | 2023-11-06
  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • DiffSinger

    DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

  • Amphion

    Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

  • Project mention: FLaNK Stack Weekly 11 Dec 2023 | dev.to | 2023-12-11
  • TensorFlowTTS

    :stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

  • Project mention: Ask HN: On-Device Text to Speech | news.ycombinator.com | 2023-08-31

    Hey HN, has anyone found a viable solution for doing this locally and offline on iOS? I'd like to offer a privacy-friendly text to speech feature to my App, and Apple's speech synthesis sounds awful compared to some newer models and TTS engines. The only thing I've found is an older TensorflowTTS example here: https://github.com/TensorSpeech/TensorFlowTTS/tree/master/examples/ios

    Any pointers or tips appreciated.

  • edge-tts

    Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

  • Project mention: [discussion] text to voice generation for textbooks (non-math part) | /r/MachineLearning | 2023-12-01

    i would very much like to use it to turn the text parts of a book into an audio where i could listen to it while reading. i used edge's tts for speech by giving a paragraph to clipboard and to edge-tts in order to listen the text but it causes two problems: 1. you need internet connection and have the book opened 2. can only do paragraph by paragraph, and is prone to errors or sometimes if you use it too much it wont convert the full text afterwards.

  • Awesome-Prompt-Engineering

    This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc

  • Project mention: AI lessons | /r/ChatGPT | 2023-05-09

    Yes, there are a lot of different resources online, especially for generative AI. The Awesome Prompt Engineering github is probably a good place to start https://github.com/promptslab/Awesome-Prompt-Engineering. If you're focusing directly on OpenAI's models then the OpenAI Prompt Engineering Guide would be my recommendation https://help.openai.com/en/articles/6654000-best-practices-for-prompt-engineering-with-openai-api.

  • vall-e

    An unofficial PyTorch implementation of the audio LM VALL-E

  • bark-with-voice-clone

    🔊 Text-prompted Generative Audio Model - With the ability to clone voices

  • Project mention: I've open sourced my Flutter plugin to run on-device LLMs on any platform. TestFlight builds available now. | /r/FlutterDev | 2023-12-08

    And more stuff I’m often checking back on: - https://github.com/staghado/vit.cpp - https://github.com/serp-ai/bark-with-voice-clone - https://github.com/leejet/stable-diffusion.cpp (generate images) - etc … there’s too much fun stuff out there. Wish I had more free time haha.

  • aeneas

    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

  • Tacotron-2

    DeepMind's Tacotron-2 Tensorflow implementation

  • gTTS

    Python library and CLI tool to interface with Google Translate's text-to-speech API

  • Project mention: Using Groq to Build a Real-Time Language Translation App | dev.to | 2024-04-05

    For our real-time TTS needs, we'll employ the fantastic library called gTTS.

  • WaveRNN

    WaveRNN Vocoder + TTS

  • pyttsx3

    Offline Text To Speech synthesis for python

  • Project mention: Pyttsx3 help | /r/learnpython | 2023-05-26

    Looks like a segfault to me. This current issue may help you solve it (scroll to the bottom to get a suggested fix).

  • elevenlabs-python

    The official Python API for ElevenLabs Text to Speech.

  • Project mention: babel_fish - real time language translation | dev.to | 2024-04-14

    Since a true Babel fish doesn't produce text we also used ElevenLabs for text-to-speech.

  • hifi-gan

    HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

  • Dragonfire

    the open-source virtual assistant for Ubuntu based Linux distributions

  • tts-generation-webui

    TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)

  • Project mention: OpenVoice: Versatile Instant Voice Cloning | news.ycombinator.com | 2024-03-29

    https://github.com/rsxdalv/tts-generation-webui

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python text-to-speech related posts

Index

What are some of the best open-source text-to-speech projects in Python? This list will help you:

Project Stars
1 MockingBird 33,796
2 TTS 28,959
3 OpenVoice 17,263
4 VALL-E-X 7,138
5 EmotiVoice 6,270
6 vits 6,230
7 pyvideotrans 5,556
8 DiffSinger 4,102
9 Amphion 3,864
10 TensorFlowTTS 3,697
11 edge-tts 3,503
12 Awesome-Prompt-Engineering 3,196
13 vall-e 2,868
14 bark-with-voice-clone 2,798
15 aeneas 2,379
16 Tacotron-2 2,231
17 gTTS 2,139
18 WaveRNN 2,086
19 pyttsx3 1,898
20 elevenlabs-python 1,822
21 hifi-gan 1,744
22 Dragonfire 1,372
23 tts-generation-webui 1,260

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com