forced-alignment-tools
A collection of links and notes on forced alignment tools (by pettarin)
allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages (by xinjli)
forced-alignment-tools | allosaurus | |
---|---|---|
2 | 2 | |
831 | 507 | |
- | - | |
0.0 | 0.0 | |
over 2 years ago | 7 days ago | |
Python | Python | |
GNU General Public License v3.0 or later | GNU General Public License v3.0 only |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
forced-alignment-tools
Posts with mentions or reviews of forced-alignment-tools.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-02-22.
- What's the state-of-the-art, word-level forced alignment tool that is ok to use commercially?
-
Python and Speech recognition
Since you know that you have one or two phonemes in each recordings (one for vowel, two for a consonant) you will be able to find where on the recordings the utterances takes place. Which is a simplified approach of "forced alignment".
allosaurus
Posts with mentions or reviews of allosaurus.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-02-22.
-
Complete table of all IPA vowels' formant frequencies
Thank you for a great reply! If I catch your drift, how does this bode with phonemic transcription? Suppose we have an automatic phone recognizer tool such as Allosaurus.
-
Python and Speech recognition
And for phonemes recognition: - this looks like it could be useful (I'm sure you won't mind if it's "phones" instead of "phonemes"): https://github.com/xinjli/allosaurus - about using standard speech recognition tools: https://cmusphinx.github.io/wiki/phonemerecognition/
What are some alternatives?
When comparing forced-alignment-tools and allosaurus you can also consider the following projects:
common-voice - Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
SpeechRecognition - Speech recognition module for Python, supporting several engines and APIs, online and offline.
edgedict - Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
pocketsphinx-python - Python interface to CMU Sphinxbase and Pocketsphinx libraries
lingvo - Lingvo
SpeechLoop - Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
wikipron - Massively multilingual pronunciation mining
diffwave - DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.