tortoise-tts-modal-api
OpenVoice
| | tortoise-tts-modal-api | OpenVoice |
|---|---|---|
| Mentions | 6 | 14 |
| Stars | 111 | 26,595 |
| Growth | 0.9% | 21.7% |
| Activity | 0.0 | 8.7 |
| Last commit | 9 months ago | 9 days ago |
| Language | Python | Python |
| License | Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tortoise-tts-modal-api
- [D] Any model like VALL-E available currently?
We recently put up a free, open-source API for Tortoise. We will be adding improvements over time and appreciate contributions: https://github.com/metavoicexyz/tortoise-tts-modal-api
- Open-source, serverless text-to-speech
- Open-source serverless text-to-speech
- [P] Built an at-cost, pay per second, open-source API for Tortoise text-to-speech (best I've heard!)
It can be used via a UI on: https://tts.themetavoice.xyz
- Show HN: Tortoise TTS as an at-cost open-source pay-per-second API
Tortoise TTS is the best TTS available today. We built an open-source, at cost, pay per second API for it. The quality of intonation it generates is unparalleled, and we hope our at-cost API will make it easier for people to build on top!
This allows folks to run Tortoise via a single API call, at a cost of $0.03/query. The WAV file is downloadable, and we apply no restrictions.
We're open-sourcing all our work: we made Tortoise run 30% faster, and have more improvements coming. If you're keen to contribute, we can help with ideas, pointers, compute and data; just DM us. Our fork with the improvements can be found at https://github.com/metavoicexyz/tortoise-tts. The deployment code can be found at https://github.com/metavoicexyz/tortoise-tts-modal-api.
There are already great alternatives for using Tortoise: i) @mdnest_r's awesome Huggingface Spaces, ii) the original Google Colab, iii) hosting it yourself. Our work should accelerate those who need an API, don't want to spend time/$ on hosting, and need scalable infra backing them.
We're especially excited about combining text-to-speech with content generated from LLMs, and about how it fits into video creation tools.
Tortoise in its current form is also inaccessible to non-technical users, which is why we are also providing a simple UI on top (also "at-cost"): https://tts.themetavoice.xyz
To use, generate an API key on https://tts.themetavoice.xyz and call via POST request. Or use the web UI. Or run your own deployment.
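As a rough illustration of the "call via POST request" step, here is a minimal sketch using only the Python standard library. The post does not document the endpoint path, the authentication header, or the payload fields, so `API_URL`, the `X-API-Key` header, and the `text`/`voice` fields below are all placeholders and assumptions; check the deployment repository for the real values. The response is also assumed to be raw WAV bytes.

```python
import json
import urllib.request

# Placeholder values: the post does not give the endpoint path, header name,
# or payload fields. Replace them with those documented in the repository.
API_URL = "https://example.com/tts"  # hypothetical endpoint, not the real one
API_KEY = "YOUR_API_KEY"             # key generated on the web UI


def build_tts_request(text: str, voice: str = "default") -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for one TTS query."""
    # The "text" and "voice" field names are assumptions for illustration.
    payload = json.dumps({"text": text, "voice": voice}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=payload,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "X-API-Key": API_KEY,  # hypothetical header name
        },
    )


def synthesize(text: str, out_path: str = "output.wav") -> str:
    """Send the request and save the response body, assumed to be WAV bytes."""
    request = build_tts_request(text)
    # Generation can be slow, so allow a generous timeout.
    with urllib.request.urlopen(request, timeout=300) as response:
        audio = response.read()
    with open(out_path, "wb") as f:
        f.write(audio)
    return out_path
```

Once the placeholders are replaced, calling `synthesize("Hello from Tortoise")` would write the returned audio to `output.wav`. Building the request separately from sending it makes it easy to inspect the URL, headers, and body before spending a paid query.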
OpenVoice
- OpenVoice: Instant Voice Cloning
- OpenVoice V2 Released
- Ask HN: Voice ID adoption at financial institutions
Given the inevitability of easy voice cloning[1], it seems irresponsible to be using voice as a positive authentication signal.
Unfortunately, major US financial institutions seem to be ramping up adoption of this technology[2].
Am I missing something?
[1] https://github.com/myshell-ai/OpenVoice
- OpenAI: Navigating the Challenges and Opportunities of Synthetic Voices
They might have been forced to give a signal after this rose on HN today:
https://research.myshell.ai/open-voice
https://news.ycombinator.com/item?id=39861578
- OpenVoice: Versatile Instant Voice Cloning
- FLaNK Weekly 31 December 2023
What are some alternatives?
unilm - Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
tortoise-tts - A multi-voice TTS system trained with an emphasis on quality
piper - A fast, local neural text to speech system
tortoise-tts-Windows - A multi-voice TTS system trained with an emphasis on quality
FLiPStackWeekly - FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
MockingBird - Clone a voice in 5 seconds to generate arbitrary speech in real-time
Stirling-PDF - #1 Locally hosted web application that allows you to perform various operations on PDF files
FLaNK-Ice - Apache Iceberg - Cloud Data Lakehouse
JavaOnRaspberryPi - Sources and scripts for the book "Getting started with Java on the Raspberry Pi"
temporian - Temporian is an open-source Python library for preprocessing and feature engineering temporal data for machine learning applications
awesome-ml - Curated list of useful LLM / Analytics / Datascience resources