Flowtron Alternatives
Similar projects and alternatives to flowtron
-
Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
-
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
-
tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
-
TensorFlowTTS
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)
-
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
-
espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
-
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
-
radtts
Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained Control over Low Dimensional (F0 and Energy) Speech Attributes.
-
STYLER
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021 (by keonlee9420)
-
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
-
DiffSinger
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech) (by keonlee9420)
flowtron reviews and mentions
- [D] What is the best open source text to speech model?
- A thought: we need language and voice synthesis models as free as Stable Diffusion
-
Ask HN: Best FOSS software to read text aloud
If you want free (as in open source) software, the NVIDIA research GitHub also has some good tools. For example: https://github.com/NVIDIA/flowtron
-
Visas Marr on the tragedy of Darth Plagueis
The voice in this video was synthesized using a Flowtron model trained on Visas' speech patterns (https://github.com/NVIDIA/flowtron).
-
Bastila Shan reads the Sith and Jedi Codes
The voice lines in this video were created using a Flowtron Text-to-Speech (TTS) model trained on Bastila's voice patterns to read the Sith and Jedi Codes. For more information: https://github.com/NVIDIA/flowtron I created a small tutorial on how to use it on Google Colab: https://www.youtube.com/watch?v=1Bmg1c5U5Bg
-
I created a Text-to-Speech model based on Bastila's voice patterns.
For more information on Flowtron: https://github.com/NVIDIA/flowtron/
Stats
NVIDIA/flowtron is an open source project licensed under Apache License 2.0 which is an OSI approved license.
The primary programming language of flowtron is Jupyter Notebook.