Our great sponsors
-
common-voice
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
-
silero-models
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
How can it be used for transcription?
In their website I only see an interface for either uploading audio or submitting transcriptions:
https://commonvoice.mozilla.org/es
The Github repo they mention (https://github.com/common-voice/common-voice) seems to be just that sample collection software. I do not see where I can download the software to transcribe audio.
Some months ago I tried the Silero Models: https://github.com/snakers4/silero-models
With the audio sources I had, in English, the transcription had many mistakes. The good side is that installing and running the software worked as described in their documentation, so maybe it’s worth giving it a try by yourself.
Related posts
- Mozilla Common Voice - Korean Language is live - Help Build a Korean Corpus for Training AI/Navi/etc
- Show HN: AI Dub Tool I Made to Watch Foreign Language Videos with My 7-Year-Old
- Common Voice
- Now I Can Just Print That Video
- Distil-Whisper: distilled version of Whisper that is 6 times faster, 49% smaller