wtpsplit
razdel
wtpsplit | razdel | |
---|---|---|
1 | 1 | |
499 | 246 | |
- | 1.2% | |
7.4 | 2.1 | |
5 days ago | 10 months ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
wtpsplit
-
Typo correction using NLP
Source: I'm the author of nlprule and nnsplit which are quite well used for grammatical error correction and sentence boundary detection, respectively.
razdel
-
Silero V3: fast high-quality text-to-speech in 20 languages with 173 voices
Also currently we abandoned batching, so GPUs are not really required at all.
> the quality (as in: what I'm hearing, not a formally measured metric) is good but (YMMV) not as good as turtle.
I believe the compute required during training and inference … may differ by 3 or 4 orders of magnitude (!).
Also note, that some speakers and languages just sound better due to high quality of source material and the amount of work invested and polish.
> it breaks with strange error messages if the text you feed it is too long
Well, there should be a warning somewhere, but it works with text no longer than 512-1024 symbols.
> there is mention of "a model for text repunctuation and recapitalization", which I wonder if it could be used to break a very long text (eg a book) into pieces that can be digested by the tts engine
This model only restores some punctuation marks and capital letters.
There are libraries like razdel for this - https://github.com/natasha/razdel
What are some alternatives?
SymSpell - SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
tangram - Tangram makes it easy for programmers to train, deploy, and monitor machine learning models.
ttsprech - Simple text2speech for the command line
tch-rs - Rust bindings for the C++ api of PyTorch.
Voice-Cloning-App - A Python/Pytorch app for easily synthesising human voices
NLP-progress - Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python
tangram - Tangram is an all-in-one automated machine learning framework. [Moved to: https://github.com/tangramdotdev/tangram]
add-stress-to-epub - A program that sets the stress and the letter ё of Russian text and ebooks using Wiktionary data and grammar analysis.
nlprule - A fast, low-resource Natural Language Processing and Text Correction library written in Rust.
pySBD - 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.