voice_datasets
Caster
voice_datasets | Caster | |
---|---|---|
3 | 7 | |
1,551 | 332 | |
- | 1.5% | |
3.5 | 2.9 | |
about 2 months ago | about 1 month ago | |
Python | ||
- | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
voice_datasets
-
Where to begin - ML model for speech recognition
- https://github.com/jim-schwoebel/voice_datasets
-
List of filler words?
I also came across this list of voice datasets: https://github.com/jim-schwoebel/voice_datasets
- Datasets containing human-written transcripts of videos or audio files
Caster
- Ask HN: I'm disabled and out of money. Now what?
- Is there a Foundry VTT module that helps people who have difficulty moving their hands and fingers?
- Dragonfly-Based Voice Programming and Accessibility Toolkit
-
Ask HN: Who Wants to Collaborate?
Unfortunately Dragon development has mostly stalled for the last 5 years (Dragon 15 was a leap forward but that was quite some time ago now).
You can still make use of it via Dragonfly (see also Caster[0]) as mentioned by a sibling comment or by using Talon[1] or Vocola.
Having used a computer 90% hands free for about a year and a half back in 2019, I chose Dragonfly then, but would probably choose Talon nowadays - less futsing about and it has alternative speech engine options.
I also recommend looking into eye tracking: the Tobii gaming products[2] work well for general computer mousing with some software like Talon or Precision Gaze[3] - well enough for me to make a hands free mod[4] for Factorio, for example.
[0]: https://github.com/dictation-toolbox/Caster
- How can I make Mycroft recognize non verbal audio sounds to command it?
- Linux Voice recognition/dictation/voice assistant/ one handed operation?
-
Any programmers using dictation?
so I found this thing called Caster today that miiight save my job. it does allow you to format code with Dragon and navigate VS Code (albeit poorly.) It's also open-source, so you can add features.
What are some alternatives?
izabela-desktop - A proof of concept text-to-speech application allowing global typing. Can be used over applications such as voice chats, games and much more.
kaldi-active-grammar - Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
talk2windows - Add voice commands to control the Windows 10+ desktop.
dragonfly - Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx
video-game-text-dataset - Collection of videogame text datasets from Library of Codexes. Text data from 30+ different videogame series.
silero-vad - Silero VAD: pre-trained enterprise-grade Voice Activity Detector
porcupine-web-vaadin-demo - Example of using Picovoice Porcupine wake word detection in Vaadin 24
rhino - Rhino is an open-source implementation of JavaScript written entirely in Java
um_detector - detector for filler words
Common-Voice - Audio Classification with machine learning
Speech-Recognition - Speech Recognition library for adding Voice Commands and Controls to all your applications. Whether you are building web apps, native apps or desktop apps, this technology can be integrated into any system with an internet connection.