Python vits

Open-source Python projects categorized as vits

Top 4 Python vit Projects

  • Retrieval-based-Voice-Conversion-WebUI

    Easily train a good VC model with voice data <= 10 mins!

    Project mention: Train an AI voice model in 10 minutes | news.ycombinator.com | 2024-06-06
  • Scout Monitoring

    Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

    Scout Monitoring logo
  • so-vits-svc-fork

    so-vits-svc fork with realtime support, improved interface and more features.

  • fish-speech

    Brand new TTS solution

    Project mention: Fish Speech TTS: clone OpenAI TTS in 30 minutes | news.ycombinator.com | 2024-05-22

    While we are still figuring out ways to improve the agent's emotional response to OpenAI GPT-4 level, we have already made significant progress in aligning OpenAI's TTS performance. To begin this experiment, we collected 10 hours of OpenAI TTS data to perform supervised fine-tuning (SFT) on both the LLM and VITS models, which took approximately 30 minutes. After that, we used 15 seconds of audio as a prompt during inference.

    Demos Available: https://firefly-ai.notion.site/OpenAI-Examples-34975ae263a9496c84e89fb7b1ea25a4?pvs=4

    As you can see, the model's emotion, rhythm, accent, and timbre match the OpenAI speakers, though there is some degradation in audio quality, which we are working on. To avoid any legal issues, we are unable to release the fine-tuned model, but I believe everyone can tune Fish Speech to this level within hours and for around $20.

    Our experiment shows that with only 25 seconds of prompts (few-shot learning), without any fine-tuning, the model can mimic most behaviors except for how it reads numbers. To the best of our knowledge, you can clone how someone speaks in English, Chinese, and Japanese with 30 minutes of data using this framework.

    Repo: https://github.com/fishaudio/fish-speech

  • Amphion

    Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

    Project mention: FLaNK Stack Weekly 11 Dec 2023 | dev.to | 2023-12-11
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python vits discussion

Log in or Post with

Python vits related posts

Index

What are some of the best open-source vit projects in Python? This list will help you:

Project Stars
1 Retrieval-based-Voice-Conversion-WebUI 21,055
2 so-vits-svc-fork 8,528
3 fish-speech 5,492
4 Amphion 4,201

Sponsored
Free Django app performance insights with Scout Monitoring
Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
www.scoutapm.com