Caster vs silero-vad

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

Caster		silero-vad
	Project
7	Mentions	10
329	Stars	2,866
0.6%	Growth	-
2.9	Activity	6.9
about 1 month ago	Latest Commit	12 days ago
Python	Language	Python
GNU General Public License v3.0 or later	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

Caster

Posts with mentions or reviews of Caster. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-07-04.

Ask HN: I'm disabled and out of money. Now what?
2 projects | news.ycombinator.com | 4 Jul 2022
Is there a Foundry VTT module that helps people who have difficulty moving their hands and fingers?
1 project | /r/FoundryVTT | 18 May 2022
Dragonfly-Based Voice Programming and Accessibility Toolkit
1 project | news.ycombinator.com | 14 Jan 2022
Ask HN: Who Wants to Collaborate?
58 projects | news.ycombinator.com | 1 Jan 2022

Unfortunately Dragon development has mostly stalled for the last 5 years (Dragon 15 was a leap forward but that was quite some time ago now).
You can still make use of it via Dragonfly (see also Caster[0]) as mentioned by a sibling comment or by using Talon[1] or Vocola.
Having used a computer 90% hands free for about a year and a half back in 2019, I chose Dragonfly then, but would probably choose Talon nowadays - less futsing about and it has alternative speech engine options.
I also recommend looking into eye tracking: the Tobii gaming products[2] work well for general computer mousing with some software like Talon or Precision Gaze[3] - well enough for me to make a hands free mod[4] for Factorio, for example.
[0]: https://github.com/dictation-toolbox/Caster
How can I make Mycroft recognize non verbal audio sounds to command it?
3 projects | /r/Mycroftai | 29 Jul 2021
Linux Voice recognition/dictation/voice assistant/ one handed operation?
4 projects | /r/linuxquestions | 27 Jul 2021
Any programmers using dictation?
1 project | /r/disability | 2 Jul 2021

so I found this thing called Caster today that miiight save my job. it does allow you to format code with Dragon and navigate VS Code (albeit poorly.) It's also open-source, so you can add features.

silero-vad

Posts with mentions or reviews of silero-vad. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-11-06.

New models and developer products announced at OpenAI DevDay
8 projects | news.ycombinator.com | 6 Nov 2023

>How do you detect speech starting and stopping?
https://github.com/snakers4/silero-vad
[Discussion] Video Translation Task
2 projects | /r/MachineLearning | 13 Jul 2023

you could look into https://github.com/guillaumekln/faster-whisper especially the VAD section (Voice Activity Detector) using https://github.com/snakers4/silero-vad
Using Whisper to transcribe the entire Forensic Files series
5 projects | /r/DataHoarder | 4 Jun 2023

I also had the same synchronization issue, so I wrote a WebUI/CLI that uses Silero-VAD that first splits the audio whenever there a silent portion (or every 30 seconds), and I haven't experienced it since:
Whisper - A new free AI model from OpenAI that can transcribe Japanese (and many other languages) at up to "human level" accuracy
5 projects | /r/LearnJapanese | 22 Sep 2022

By the way, I've updated the WebUI to now also support using Silero VAD to break up the audio into distinct sections, and run Whisper on each section and then combine them into one single transcript/SRT file.
[P] A more detailed post about Silero VAD on The Gradient
1 project | /r/MachineLearning | 19 Feb 2022

The VAD is always available on Github
Silero VAD: pre-trained enterprise-grade voice activity detector
1 project | news.ycombinator.com | 30 Dec 2021
[P] Silero VAD: One voice detector to rule them all
2 projects | /r/MachineLearning | 18 Dec 2021

I also pinned some interesting comments here regarding mobile and IOT usage here - https://github.com/snakers4/silero-vad/issues/37
One voice detector to rule them all
1 project | news.ycombinator.com | 7 Dec 2021

What are some alternatives?

When comparing Caster and silero-vad you can also consider the following projects:

kaldi-active-grammar - Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

whisper - Robust Speech Recognition via Large-Scale Weak Supervision

dragonfly - Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx

cheetah - On-device streaming speech-to-text engine powered by deep learning

voice_datasets - 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

rhino - Rhino is an open-source implementation of JavaScript written entirely in Java

GassistPi - Google Assistant for Single Board Computers

Common-Voice - Audio Classification with machine learning

mr-robot - A multi-utility discord bot. Playback hilarious voice tracks on-demand, wiki for anything, turn on/off IoT enabled devices, and more!

talk2windows - Add voice commands to control the Windows 10+ desktop.

hollow-knight-voice-commands - A fun little python tool to play Hollow Knight with only voice commands

Caster vs kaldi-active-grammar silero-vad vs whisper Caster vs dragonfly silero-vad vs cheetah Caster vs voice_datasets silero-vad vs kaldi-active-grammar Caster vs rhino silero-vad vs GassistPi Caster vs Common-Voice silero-vad vs mr-robot Caster vs talk2windows silero-vad vs hollow-knight-voice-commands

Compare Caster vs silero-vad and see what are their differences.

Caster

silero-vad

Caster

silero-vad

What are some alternatives?