AutoPST
Amphion
Metric | AutoPST | Amphion
---|---|---
Mentions | 1 | 4
Stars | 248 | 3,956
Growth | - | 6.1%
Activity | 0.0 | 8.6
Last commit | over 1 year ago | 4 days ago
Language | Python | Python
License | MIT License | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
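The metric definitions above can be illustrated with a small sketch. The exact formulas are not given here, so everything below is an assumption made for illustration: the exponential decay with a 30-day half-life, the percentile-to-rating mapping, and the function names `month_over_month_growth`, `weighted_commit_score`, and `activity_rating` are all hypothetical.

```python
from datetime import date


def month_over_month_growth(stars_last_month: int, stars_now: int) -> float:
    """Percentage change in stars between two monthly snapshots."""
    if stars_last_month <= 0:
        raise ValueError("previous star count must be positive")
    return 100.0 * (stars_now - stars_last_month) / stars_last_month


def weighted_commit_score(commit_dates, today, half_life_days=30.0):
    """Sum of per-commit weights; a commit's weight halves every half_life_days.

    The decay shape and half-life are assumptions, chosen only so that
    recent commits count more than older ones.
    """
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity_rating(score, all_scores):
    """Map a raw score to a 0-10 rating.

    The rating is ten times the fraction of tracked projects that score
    strictly lower, so 9.0 means the project is in the top 10%.
    """
    if not all_scores:
        raise ValueError("need at least one tracked project")
    below = sum(1 for s in all_scores if s < score)
    return round(10.0 * below / len(all_scores), 1)
```

Under these assumptions, a project whose last commit is more than a year old contributes almost nothing to its weighted score, which is consistent with an activity of 0.0 in the table.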
AutoPST
- [D] is there a voice-cloning tool that can "revoice" a spoken performance but make it sound like another person?
Just want to point out that the ML task you mentioned ("hiring a voice actor and then using an AI tool to 'revoice' the spoken audio") is known as voice conversion. In my opinion, state-of-the-art voice conversion as of 2022 (e.g. AutoVC/AutoPST, https://github.com/auspicious3000/AutoPST) can still sound a bit garbled and occasionally robotic, but it preserves rhythm and emotion better than a custom-trained TTS model.
Amphion
- FLaNK Stack Weekly 11 Dec 2023
- Technique makes Taylor Swift to sing perfect Mandarin Chinese song
- Novel vocoder for high-quality audio generation
Code: https://github.com/open-mmlab/Amphion/blob/main/models/vocod...
What are some alternatives?
tortoise-tts - A multi-voice TTS system trained with an emphasis on quality
VALL-E-X - An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io
autovc - AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
vall-e - An unofficial PyTorch implementation of the audio LM VALL-E
canopy - Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
furnace - a multi-system chiptune tracker compatible with DefleMask modules
Retrieval-based-Voice-Conversion-WebUI - Easily train a good VC model with voice data <= 10 mins!
Gooey - Turn (almost) any Python command line program into a full GUI application with one line
table-transformer - Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
ava - All-in-one desktop app for running LLMs locally.
ast-grep - ⚡A CLI tool for code structural search, lint and rewriting. Written in Rust
FLiPStackWeekly - FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...