| | Caster | voice_datasets |
|---|---|---|
| Mentions | 7 | 3 |
| Stars | 329 | 1,548 |
| Growth | 0.6% | - |
| Activity | 2.9 | 3.5 |
| Last commit | about 1 month ago | about 2 months ago |
| Language | Python | - |
| License | GNU General Public License v3.0 or later | - |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
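The page does not give the formula behind the activity number, so the following is only a minimal sketch of one way such a score could work. It assumes an exponential decay with a hypothetical 30-day half-life for weighting commits, and a percentile rank scaled to 0-10 for the "relative" part. The names `raw_activity`, `relative_activity`, and `HALF_LIFE_DAYS` are illustrative and not taken from the site.

```python
from datetime import datetime, timedelta

# Hypothetical half-life: a commit's weight halves every 30 days (assumption).
HALF_LIFE_DAYS = 30.0


def raw_activity(commit_dates, now, half_life_days=HALF_LIFE_DAYS):
    """Sum per-commit weights so that recent commits count more than older ones."""
    total = 0.0
    for committed_at in commit_dates:
        age_days = max((now - committed_at).total_seconds() / 86400.0, 0.0)
        total += 0.5 ** (age_days / half_life_days)
    return total


def relative_activity(raw_scores):
    """Map raw scores to a 0-10 scale by percentile rank among all tracked projects."""
    count = len(raw_scores)
    ranked = {}
    for name, score in raw_scores.items():
        # Number of tracked projects that are strictly less active than this one.
        below = sum(1 for other in raw_scores.values() if other < score)
        ranked[name] = round(10.0 * below / count, 1)
    return ranked


if __name__ == "__main__":
    now = datetime(2024, 1, 1)
    commits = [now - timedelta(days=d) for d in (1, 10, 45)]
    print(raw_activity(commits, now))
```

Under these assumptions, a project that is more active than 90% of the tracked projects gets a score of 9.0, which matches the "top 10%" reading in the example above.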
Caster
- Ask HN: I'm disabled and out of money. Now what?
- Is there a Foundry VTT module that helps people who have difficulty moving their hands and fingers?
- Dragonfly-Based Voice Programming and Accessibility Toolkit
- Ask HN: Who Wants to Collaborate?
Unfortunately, Dragon development has mostly stalled for the last 5 years (Dragon 15 was a leap forward, but that was quite some time ago now).
You can still make use of it via Dragonfly (see also Caster[0]), as mentioned by a sibling comment, or by using Talon[1] or Vocola.
Having used a computer 90% hands-free for about a year and a half back in 2019, I chose Dragonfly then, but would probably choose Talon nowadays: less futzing about, and it has alternative speech engine options.
I also recommend looking into eye tracking: the Tobii gaming products[2] work well for general computer mousing with software like Talon or Precision Gaze[3], well enough for me to make a hands-free mod[4] for Factorio, for example.
[0]: https://github.com/dictation-toolbox/Caster
- How can I make Mycroft recognize non verbal audio sounds to command it?
- Linux Voice recognition/dictation/voice assistant/ one handed operation?
- Any programmers using dictation?
So I found this thing called Caster today that might save my job. It does allow you to format code with Dragon and navigate VS Code (albeit poorly). It's also open source, so you can add features.
voice_datasets
- Where to begin - ML model for speech recognition
  - https://github.com/jim-schwoebel/voice_datasets
- List of filler words?
I also came across this list of voice datasets: https://github.com/jim-schwoebel/voice_datasets
- Datasets containing human-written transcripts of videos or audio files
What are some alternatives?
kaldi-active-grammar - Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
izabela-desktop - A proof of concept text-to-speech application allowing global typing. Can be used over applications such as voice chats, games and much more.
dragonfly - Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx
talk2windows - Add voice commands to control the Windows 10+ desktop.
silero-vad - Silero VAD: pre-trained enterprise-grade Voice Activity Detector
video-game-text-dataset - Collection of videogame text datasets from Library of Codexes. Text data from 30+ different videogame series.
rhino - Rhino is an open-source implementation of JavaScript written entirely in Java
porcupine-web-vaadin-demo - Example of using Picovoice Porcupine wake word detection in Vaadin 24
Common-Voice - Audio Classification with machine learning
um_detector - detector for filler words
Speech-Recognition - Speech Recognition library for adding Voice Commands and Controls to all your applications. Whether you are building web apps, native apps or desktop apps, this technology can be integrated into any system with an internet connection.