Hifi-gan Alternatives

Similar projects and alternatives to hifi-gan

tortoise-tts

144 11,686 8.2 Jupyter Notebook hifi-gan VS tortoise-tts

A multi-voice TTS system trained with an emphasis on quality
NeMo

29 9,951 9.8 Python hifi-gan VS NeMo

NeMo: a framework for generative AI
tacotron2

28 4,882 0.0 Jupyter Notebook hifi-gan VS tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference
speechbrain

26 7,836 9.8 Python hifi-gan VS speechbrain

A PyTorch-based Speech Toolkit
WaveRNN

5 2,086 0.0 Python hifi-gan VS WaveRNN

WaveRNN Vocoder + TTS
wavegrad

1 265 1.8 Python hifi-gan VS wavegrad

A fast, high-quality neural vocoder.
Parallel-Tacotron2

1 184 0.0 Python hifi-gan VS Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
diffwave

3 720 1.5 Python hifi-gan VS diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
PaddleSpeech

6 10,069 7.6 Python hifi-gan VS PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
vits

6 6,206 0.0 Python hifi-gan VS vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
flowtron

6 882 0.0 Jupyter Notebook hifi-gan VS flowtron

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
TTS

231 28,959 9.5 Python hifi-gan VS TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
TensorFlowTTS

6 3,690 0.0 Python hifi-gan VS TensorFlowTTS

TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
FastSpeech2

4 1,604 0.0 Python hifi-gan VS FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
mlp-singer

2 113 0.0 Python hifi-gan VS mlp-singer

Official implementation of MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis (IEEE MLSP 2021)
tacotron

3 2,919 0.0 Python hifi-gan VS tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
STYLER

3 150 1.8 Python hifi-gan VS STYLER

Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021 (by keonlee9420)
so-vits-svc-fork

16 8,287 9.4 Python hifi-gan VS so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.
waveglow

2 2,218 0.0 Python hifi-gan VS waveglow

A Flow-based Generative Network for Speech Synthesis
Speech-Backbones

1 518 0.0 Jupyter Notebook hifi-gan VS Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a better hifi-gan alternative or higher similarity.

hifi-gan reviews and mentions

Posts with mentions or reviews of hifi-gan. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-04-13.

[D] What is the best open source text to speech model?
15 projects | /r/MachineLearning | 13 Apr 2023
I made Lisa-nee TTS (Imai Lisa)
2 projects | /r/BanGDream | 4 Feb 2023
HiFi-GAN: Generative Adversarial Networks for Efficient and Hi-Fi Speech Synth
1 project | news.ycombinator.com | 28 Dec 2021
[2108.13320] Neural HMMs are all you need (for high-quality attention-free TTS)
1 project | /r/speechtech | 1 Sep 2021

It will be interesting to see if the artefacts you noticed persist once we've trained the model for longer and switch to a better vocoder such as HiFi-GAN. (The paper and audio examples use WaveGlow since that's the default of the repository we compared ourselves to.) That said, "choppiness" sounds to me like it might be related to the temporal evolution, in which case it's something that a non-causal, convolutional post-net might be able to smooth over.
The dangers of AI
1 project | /r/videos | 25 Jan 2021

Hey, as far as I know this paper is the current SoTA on public data that is open source. Github is here. If you are interested in really getting into speech synthesis, this page has everything (modern stuff on the bottom.)

Stats

Basic hifi-gan repo stats

Mentions

Stars: 1,744

Activity: 0.0

Last Commit: 9 months ago

jik876/hifi-gan is an open source project licensed under the MIT License, which is an OSI-approved license.

The primary programming language of hifi-gan is Python.
