The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Kaldi-active-grammar Alternatives
Similar projects and alternatives to kaldi-active-grammar
-
dragonfly
Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx (by dictation-toolbox)
-
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
-
nerd-dictation
Simple, hackable offline speech to text - using the VOSK-API.
-
pocketsphinx-python
Discontinued Python interface to CMU Sphinxbase and Pocketsphinx libraries
-
mycroft-precise
A lightweight, simple-to-use, RNN wake word listener
-
concise-encoding
The secure data format for a modern world
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
futurecoder
100% free and interactive Python course for beginners
-
Common-Voice
Audio Classification with machine learning (by dachosen1)
-
whisper-writer
💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
-
todomvc
Helping you select an MV* framework - Todo apps for React.js, Ember.js, Angular, and many more
-
-
awayto
Awayto is a curated development platform, producing great value with minimal investment. With all the ways there are to reach a solution, it's important to understand the landscape of tools to use.
-
hammerspoon
A hammerspoon config with a bunch of custom spoons (sleep timer, resolution changer, paywall buster, safari hotkey utilities, window management with undo, etc). (by nonissue)
-
realworld
"The mother of all demo apps" — Exemplary fullstack Medium.com clone powered by React, Angular, Node, Django, and many more
-
Joplin
Joplin - the secure note taking and to-do app with synchronisation capabilities for Windows, macOS, Linux, Android and iOS.
-
-
Phaser
Discontinued Phaser is a fun, free and fast 2D game framework for making HTML5 games for desktop and mobile web browsers, supporting Canvas and WebGL rendering. [Moved to: https://github.com/phaserjs/phaser] (by photonstorm)
-
ShellCheck
ShellCheck, a static analysis tool for shell scripts
-
Typesense
Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
kaldi-active-grammar reviews and mentions
- Ask HN: How do you get started with adding voice commands to a computer system?
- AMD Screws Gamers: Sponsorships Likely Block DLSS
- Software I’m Thankful For
-
Why, in 2022, is there no high quality method for voice control of a PC?
"Everything other than talon has terrible latency": False! I develop kaldi-active-grammar (https://github.com/daanzu/kaldi-active-grammar), a free and open source speech recognition backend, which has extremely low latency. You can adjust how aggressive the VAD (voice activity detection) is to suit your preference, but the speech engine latency can be almost negligible, especially for voice commands (vs prose dictation). However, I agree that "most existing speech recognition engines were not designed with the kind of latency you want for quick one syllable commands", and that low latency is pivotal to being productive with voice commands. I also agree with your other points.
With an open system/engine, you can train your own personal speech model. For kaldi-active-grammar (https://github.com/daanzu/kaldi-active-grammar), you can do so without all that much difficulty, although the process/documentation could certainly use improvement.
I bootstrapped my personal speech model by retaining the commands from me using WSR. My voice is quite abnormal, and it took only 10 hours of speech data to train a model orders of magnitude more accurate than any generic model I've ever used. And of course, I retain much of my usage now with Kaldi, so my model improves more and more over time. A virtuous flywheel!
-
Ask HN: Anyone voice code? I had a stroke and can't use my left side
I have been coding entirely by voice for approximately 10 years now (by hand long before that). Most of that time I have been using the Dragonfly (https://github.com/dictation-toolbox/dragonfly) library to construct my own customized voice coding system. The library is highly flexible and open source, allowing you to easily customize everything to suit what you need to be productive. It is perhaps the power user analogue to Dragon Naturally Speaking. With it, you can certainly be highly productive coding by voice. In fact, I develop kaldi-active-grammar (https://github.com/daanzu/kaldi-active-grammar), a free and open source speech recognition backend usable by Dragonfly, itself entirely by voice. There's also a community of voice coders using Dragonfly and other tools that build on top of it, such as Caster (https://github.com/dictation-toolbox/Caster).
-
Ask HN: Who Wants to Collaborate?
- Demo: https://www.youtube.com/watch?v=Qk1mGbIJx3s / Software: https://github.com/daanzu/kaldi-active-grammar
Far field audio is usually harder for any speech system to get correct, so having a good quality mic and using it nearby will _usually_ help with the transcription quality. As a long time Linux user, I would love to see it get some more powerful voice tools - really hope that this opens up over the next few years. Feel free to drop me an email (on my profile) happy to help with setup on any of the above.
- How can I make Mycroft recognize non verbal audio sounds to command it?
- Linux Voice recognition/dictation/voice assistant/ one handed operation?
-
Disabled computer science student ISO advice about single-handed keyboards
kaldi repo: https://github.com/daanzu/kaldi-active-grammar
-
A note from our sponsor - WorkOS
workos.com | 28 Mar 2024
Stats
daanzu/kaldi-active-grammar is an open source project licensed under GNU Affero General Public License v3.0 which is an OSI approved license.
The primary programming language of kaldi-active-grammar is Python.
Popular Comparisons
- kaldi-active-grammar VS silero-vad
- kaldi-active-grammar VS nerd-dictation
- kaldi-active-grammar VS pocketsphinx-python
- kaldi-active-grammar VS mycroft-precise
- kaldi-active-grammar VS Caster
- kaldi-active-grammar VS dragonfly
- kaldi-active-grammar VS Common-Voice
- kaldi-active-grammar VS whisper-writer
- kaldi-active-grammar VS futurecoder
- kaldi-active-grammar VS cursorless