Metric | add-stress-to-epub | razdel
---|---|---
Mentions | 4 | 1
Stars | 19 | 246
Growth | - | 1.2%
Activity | 8.7 | 2.1
Last commit | 3 months ago | 10 months ago
Language | Python | Python
License | GNU Affero General Public License v3.0 | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
add-stress-to-epub
- Text resources with IPA?
You can take any ebook and use my program to add stress marks: https://github.com/Vuizur/add-stress-to-epub
- How can I find the stress in a Russian word?
Simply read ebooks with stress marks added by my tool: https://github.com/Vuizur/add-stress-to-epub and you won't have that problem anymore.
- Eugene Onegin with stress marks
Using the Eugene Onegin Wikisource text run through my stress-marking application, I get the following result (EPUB format): https://drive.google.com/file/d/1d1hVQEyegqDm3\_GgY-O4ZCZ-ik5W\_T1N/view?usp=sharing
- I developed an open source program to add stress marks to an ebook (link in comments)
In my opinion, a program like this should have existed a long time ago, but I couldn't find one, so here is my version: https://github.com/Vuizur/add-stress-to-epub
razdel
- Silero V3: fast high-quality text-to-speech in 20 languages with 173 voices
Also, we have currently abandoned batching, so GPUs are not really required at all.
> the quality (as in: what I'm hearing, not a formally measured metric) is good but (YMMV) not as good as turtle.
I believe the compute required during training and inference … may differ by 3 or 4 orders of magnitude (!).
Also note that some speakers and languages simply sound better due to the high quality of the source material and the amount of work and polish invested.
> it breaks with strange error messages if the text you feed it is too long
Well, there should be a warning somewhere, but it only works with text no longer than 512-1024 characters.
> there is mention of "a model for text repunctuation and recapitalization", which I wonder if it could be used to break a very long text (e.g. a book) into pieces that can be digested by the TTS engine
This model only restores some punctuation marks and capital letters; it does not split text.
For sentence segmentation there are libraries like razdel: https://github.com/natasha/razdel
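To make the advice about input length concrete, here is a minimal sketch of breaking a long text into chunks that stay under a character limit by packing whole sentences together. The names `chunk_text`, `naive_sentences` and `razdel_sentences`, and the default limit of 512, are assumptions made for illustration and are not part of Silero or razdel. The regex splitter is only a crude stand-in; when razdel is installed, its `sentenize` function (aimed at Russian text) can be plugged in instead.

```python
import re
from typing import Callable, List


def naive_sentences(text: str) -> List[str]:
    """Crude stand-in splitter: break after ., !, ? or … followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?…])\s+", text.strip()) if s]


def razdel_sentences(text: str) -> List[str]:
    """Optional splitter based on razdel (requires `pip install razdel`)."""
    from razdel import sentenize  # imported lazily so the sketch runs without razdel

    return [s.text for s in sentenize(text)]


def chunk_text(
    text: str,
    max_chars: int = 512,  # assumed limit, taken from the lower bound mentioned above
    split_sentences: Callable[[str], List[str]] = naive_sentences,
) -> List[str]:
    """Pack whole sentences into chunks of at most max_chars characters."""
    chunks: List[str] = []
    current = ""
    for sentence in split_sentences(text):
        # A single sentence longer than the limit is hard-wrapped.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            # Adding this sentence would overflow, so start a new chunk.
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Each resulting chunk could then be passed to the TTS engine separately, for example `chunk_text(book_text, max_chars=512, split_sentences=razdel_sentences)` for Russian text, where `book_text` is a hypothetical string holding the book.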
What are some alternatives?
transcript_to_ebook - Transcript to ebook is a tool that helps you get transcripts of your favorite YouTube videos in various formats.
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
ttsprech - Simple text2speech for the command line
Voice-Cloning-App - A Python/Pytorch app for easily synthesising human voices
wtpsplit - Code for Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation
spaCy - 💫 Industrial-strength Natural Language Processing (NLP) in Python
pySBD - 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.