athena
STT
athena | STT | |
---|---|---|
- | 11 | |
952 | 2,309 | |
0.0% | 0.7% | |
0.0 | 0.6 | |
about 2 years ago | 11 months ago | |
C++ | C++ | |
Apache License 2.0 | Mozilla Public License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
athena
We haven't tracked posts mentioning athena yet.
Tracking mentions began in Dec 2020.
STT
-
Rest in Peas: The Unrecognized Death of Speech Recognition (2010)
What has happened since then? I know Common Voice has come and gone https://en.wikipedia.org/wiki/Common_Voice https://github.com/coqui-ai/STT
And I've seen some neural approaches too
No idea where to look for comparisons though.
-
Numen - FOSS voice control for handsfree computing
I basically just used coqui stt https://github.com/coqui-ai/STT
-
Are there any OCR and Speech-to-Text services that are privacy friendly?
This speech-to-text works well: https://github.com/coqui-ai/STT. openai's "whisper" is probably better but I haven't tried it: https://towardsdatascience.com/transcribe-audio-files-with-openais-whisper-e973ae348aa7
-
Introducing Whisper
I use two SST to live-translate audio that I listen to so I can look back (in paragraph form) to see things that I or the youtube has previously said: https://github.com/coqui-ai/STT https://github.com/ratwithacompiler/OBS-captions-plugin
-
You can now tether any prod Vector to Wire's Open Source Escape Pod • thedroidyouarelookingfor
I did have to install Coqui STT and go-asticoqui manually before i was able to run Chipper.
-
Currently working on a custom Virtual Assistant ('Randy') to help automate things in my shed (mainly CNC equipment) and also perform basic tasks. This morning I was able to get it to publish events on my google calendar.
What do you use as STT? I have heard good things about coqui (https://github.com/coqui-ai/STT) and will use it for my Assistant-build.
- Speech to Text Best Resource
-
I put together a tutorial and overview on how to use DeepSpeech to do Speech Recognition in Python
If anyone is looking for a maintained version of DeepSpeech, checkout Coqui's repositories for STT and TTS. Coqui is lead by the engineers that used to work on DeepSpeech at Mozilla.
-
CoquiTTS: 🐸💬 - Open Source Text-to-Speech framework.
Link: https://github.com/coqui-ai/STT
- Mozilla Common Voice Adds 16 New Languages and 4,600 New Hours of Speech
What are some alternatives?
OpenSeq2Seq - Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
TTS - 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
intel-extension-for-transformers - ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
OBS-captions-plugin - Closed Captioning OBS plugin using Google Speech Recognition
wenet - Production First and Production Ready End-to-End Speech Recognition Toolkit
DeepSpeech - DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
lingvo - Lingvo
vosk-api - Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
drachtio-freeswitch-modules - A collection of open-sourced freeswitch modules that I use in various drachtio applications
NeMo - A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
kaldi-offline-transcriber - Offline transcription system for Estonian using Kaldi
flashlight - A C++ standalone library for machine learning