tortoise-tts-modal-api
MockingBird
tortoise-tts-modal-api | MockingBird | |
---|---|---|
6 | 9 | |
111 | 34,282 | |
0.9% | - | |
0.0 | 4.9 | |
9 months ago | 25 days ago | |
Python | Python | |
Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tortoise-tts-modal-api
-
[D] Any model like VALL-E available currently?
We put up a free, open source API for tortoise recently, will be adding improvements to this over time & appreciate contributions: https://github.com/metavoicexyz/tortoise-tts-modal-api
- Open-source, serverless text-to-speech
- Open-source serverless text-to-speech
-
[P] Built an at-cost, pay per second, open-source API for Tortoise text-to-speech (best I've heard!)
It can be used via a UI on: https://tts.themetavoice.xyz
-
Show HN: Tortoise TTS as an at-cost open-source pay-per-second API
Tortoise TTS is the best TTS available today. We built an open-source, at cost, pay per second API for it. The quality of intonation it generates is unparalleled, and we hope our at-cost API will make it easier for people to build on top!
This allows folks to run via a single API call - it costs $0.03/query. The WAV file is downloadable, we apply no restrictions.
We're open-sourcing all our work — we made Tortoise run 30% faster, and have more improvements coming. If you're keen to contribute we can help with ideas, pointers, compute and data; just DM us. Our fork with the improvements can be found at https://github.com/metavoicexyz/tortoise-tts. The deployment code can be found at https://github.com/metavoicexyz/tortoise-tts-modal-api.
There are already great alternatives for using : i) @mdnest_r's awesome Huggingface Spaces, ii) original Google Colab, iii) host it yourself. Our work should accelerate those who need an API, don't want to spend time/$ hosting and need a scalable infra backing them.
We're especially excited about combining text-to-speech with content generated from LLMs, and about how it fits into video creation tools.
Tortoise in its current form is also inaccessible to non-technical users, which is why we are also providing a simple UI on top (also "at-cost"): https://tts.themetavoice.xyz
To use, generate an API key on https://tts.themetavoice.xyz and call via POST request. Or use the web UI. Or run your own deployment.
MockingBird
-
TIL cyber criminals with the help of A.I voice cloning software, used a deepfaked voice of a company executive to fool a Emirati bank manager to transfer 35 million dollars into their personal accounts. The bank manager had recognized the executive's voice from having worked with him before.
Actually, there are already open source implementations available, for example, the MockingBird project on GitHub. It supports English and Mandarin Chinese. For those with enough computation power and willingness to try, you can even make your own voice dataset and train the model to generate ‘your’ sound, simply following the project docs.
-
Need a deep fake for my girlfriend
piece of recordings MockingBird
-
在GitHub上看到一个有意思的project,可以导入声音素材用ai训练,训练完成后可以克隆声音并且让他读一些自定义文本,有无码老嗨来分析一下用这个工具制造乳制品的可行性有多大?
项目地址:https://github.com/babysor/MockingBird
-
Hacker News top posts: Dec 28, 2021
Mocking Bird – Realtime Voice Clone for Chinese\ (45 comments)
- Mocking Bird – Realtime Voice Clone for Chinese
- Clone a voice in 5 seconds to generate arbitrary speech in real-time
-
Danke schön Sir
I wonder if there is a way to tie in this AI voice changer. https://github.com/babysor/MockingBird
-
Top 10 trending github repos of the week🌟.
View on GitHub
- [P] Clone a voice in 5 seconds to generate arbitrary speech in real-time
What are some alternatives?
unilm - Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Real-Time-Voice-Cloning - Clone a voice in 5 seconds to generate arbitrary speech in real-time
tortoise-tts - A multi-voice TTS system trained with an emphasis on quality
cuckoo - Cuckoo Sandbox is an automated dynamic malware analysis system
tortoise-tts-Windows - A multi-voice TTS system trained with an emphasis on quality
Speech-enhancement - Deep learning for audio denoising
SimSwap - An arbitrary face-swapping framework on images and videos with one single trained model!
uptrace - Open source APM: OpenTelemetry traces, metrics, and logs
CombineExpectations - Utilities for tests that wait for Combine publishers
Cuckoo - Boilerplate-free mocking framework for Swift!
Mockingjay - An elegant library for stubbing HTTP requests with ease in Swift
Guava - A Swift test double library. Guava - looks like an apple but it's not.