praat | wsay | |
---|---|---|
4 | 2 | |
1,384 | 133 | |
2.0% | - | |
9.4 | 5.6 | |
2 days ago | 19 days ago | |
C++ | C++ | |
- | BSD 3-clause "New" or "Revised" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
praat
-
Praat: Doing Phonetics by Computer
This brings back memories.
I worked my way through some of its source code many years ago during my post-graduate studies and it was very _strange_. I see it is now on GitHub [0].
They used C macros to implement object oriented programming, with symbols like `me` and `my` and `thee` scattered throughout the source code. It seems the code has been converted to C++ (IIRC it used to be in C), but I still see the `my` keyword in there.
They have their own BASIC-like scripting language. The weirdest property for me was that it allowed for whitespace in the identifiers. Just look at the example in [1]: The `Create simple Matrix` is actually a function in the scripting language that constructs a matrix object. The function name corresponds to a menu item and IIRC they used some more preprocessor magic to reuse the same code for the menus on the GUI and the functions in the scripting language.
I don't think you're supposed to write the scripts by hand. Rather it recorded your actions as you worked your way through the GUI and then you could export and modify those recordings as scripts.
They also implemented their own cross platform GUI toolkit rather than using one of the existing cross-platform GUI toolkits, so it works on Windows, Linux (or any X Windows I believe) and MacOS.
[0]: https://github.com/praat/praat
-
Yllish knots are recorded sound, not written language
IPA is great, but even it fails to perfectly transcribe the subtleties of sound. Everyone has their own unique auditory space β your schwa is not the same as mine. Think about the range of sounds that could reasonably transcribed as a certain vowel or consonant, you can use a tool like Praat (https://github.com/praat/praat) to visualise it.
-
"point number too large" error in Praat
I wrote a script that worked beautifully on all of my test files (yay!). When I went to run it on all 240 real files, it kicked up this error on file 6. I am measuring VOT by having two point tiers - one marking the burst and one marking the onset of voicing. The script is just doing simple subtraction. I thought, maybe this is a negative VOT and Praat doesn't like that for some reason, but I checked and that is not the case. When I Google the error, I just get a script written by Paul Boersma that has this line in it, but I am not sure what it is doing there (https://github.com/praat/praat/blob/master/fon/praat_TextGrid_init.cpp).
-
Porting some c++ code
From here
wsay
-
Is there a better way to use Text to Speech?
I'm sorry, Dave. I'm afraid I can't do that... Info about wsay-- https://github.com/p-groarke/wsay info about Nircmd-- https://www.nirsoft.net/utils/nircmd.html */
-
Convert Microsoft Text to Speech into MP3
Try this https://github.com/p-groarke/wsay/releases
What are some alternatives?
julius - Open-Source Large Vocabulary Continuous Speech Recognition Engine
Reset-Windows-Update-Tool - Troubleshooting Tool with Windows Updates (Developed in Dev-C++).
SpeechLoop - Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
drachtio-freeswitch-modules - A collection of open-sourced freeswitch modules that I use in various drachtio applications
silero-models - Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
TTS - πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
TensorVox - Desktop application for neural speech synthesis written in C++
aeneas - aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
RHVoice - a free and open source speech synthesizer for Russian and other languages