nerd-dictation vs STT-examples

nerd-dictation

Simple, hackable offline speech to text - using the VOSK-API. (by ideasman42)

Suggest topics

Source Code

Suggest alternative

Edit details

STT-examples

🐸STT integration examples (by coqui-ai)

Suggest topics

Source Code

github.com

Suggest alternative

Edit details

WorkOS - The modern identity platform for B2B SaaS

The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

workos.com

featured

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

nerd-dictation		STT-examples
	Project
28	Mentions	5
1,164	Stars	111
-	Growth	2.7%
2.9	Activity	0.0
about 1 month ago	Latest Commit	over 1 year ago
Python	Language	Python
GNU General Public License v3.0 only	License	Mozilla Public License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

nerd-dictation

Posts with mentions or reviews of nerd-dictation. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-07-07.

why nerd-dictation support in NixOS is stuck ?
2 projects | /r/NixOS | 7 Jul 2023
Is anyone doing always-on voice to text with a local llama at home?
5 projects | /r/LocalLLaMA | 25 Jun 2023
Apollo dev posts backend code to Git to disprove Reddit’s claims of scrapping and inefficiency
4 projects | /r/webdev | 9 Jun 2023

nerd-dictation
How to use notion in gnome
1 project | /r/gnome | 1 May 2023

There's no built-in way of doing this in GNOME, but you might already get a bit further with tools like https://github.com/ideasman42/nerd-dictation
What voice transcriber do you use?
1 project | /r/foss | 20 Apr 2023
Disability accessibility tools for Linux such as eyetrackers and voice commands?
2 projects | /r/linux | 19 Feb 2023

I'm not familiar with Talon so I don't know if this is a suitable suggestion but nerd-dictation seemed to have been well received here when it was last promoted and it looks like it's still in active development.
Voice Control was supposed to be the Future. Is Linux lagging behind?
4 projects | /r/linux | 3 Dec 2022

TBF Microsoft dropped IE, windows phone... that is not uncommon. But the OP is right, maybe not much for voice control but for dictation certainly. The FLOSS community is always far behind and thus always struggle with new technologies. We should be prepared. Since you've mentioned small open source project here's a demo of NerdDitaction. FYI Linux do have mobile devices developing.
I've made voice input for Linux that I use instead of a keyboard and mouse
1 project | /r/RSI | 6 Nov 2022

Yeah you get me. I did have RSI which was amplified by my other issue, but it was that issue that progressed and why can't type now, not RSI. I'd be interested in hearing about using numen in combination with typing, but it's likely not ideal yet. Maybe just using speech to text for some things could help? It's not my project but there's: https://github.com/ideasman42/nerd-dictation that uses the same speech recognition as numen.
Voice to text for Linux
1 project | /r/privacy | 1 Nov 2022
nerd-dictation: Simple, hackable offline speech to text - using the VOSK-API.
1 project | /r/planetemacs | 28 Sep 2022

STT-examples

Posts with mentions or reviews of STT-examples. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-12-04.

Web Speech API is not available in the Quest browser
2 projects | /r/WebXR | 4 Dec 2022

You're welcome! Your post actually got me a little interested in seeing what's new with Coqui STT (where all the old Mozilla STT folks moved to) and it seems someone was working on a WebAssembly binding for it, so one could probably finagle something themselves for testing purposes (the bandwidth of loading the model for every user on every page load is unfeasible from a production cost standpoint though)
DeepSpeech 60x Smaller, 9x faster, and 2x accuracy
6 projects | news.ycombinator.com | 9 Mar 2022

I will add https://github.com/coqui-ai/STT, which is a continuation of DeepSpeech. Also, I've been messing around with https://github.com/ideasman42/nerd-dictation, which works on a VOSK backend - accuracy is decent, especially with the bigger model.
Any privacy friendly automated transcript app?
1 project | /r/privacy | 17 Feb 2022

I don't know of a complete app, but https://github.com/coqui-ai/STT, which grew out of the now-unmaintained Mozilla Deepspeech project, works well and is easy to use. It could be a good starting point if you're comfortable writing a little code.
[N] 🐸Coqui and OVHCloud are organizing an open-source Speech Recognition Hackaton
1 project | /r/MachineLearning | 11 Nov 2021

👉CoquiSTT - https://github.com/coqui-ai/STT
Coqui, a startup providing open speech tech for everyone
5 projects | news.ycombinator.com | 14 Apr 2021

https://github.com/coqui-ai/STT-examples
If you have any more specific requirements then we can point you in the right direction. Or just join us on Matrix: https://app.element.io/#/room/#coqui-ai_STT:gitter.im :)

What are some alternatives?

When comparing nerd-dictation and STT-examples you can also consider the following projects:

vosk-api - Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

recasepunc - Model for recasing and repunctuating ASR transcripts

LocalSTT - Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech

cursorless - Don't let the cursor slow you down

STT - 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

tortoise-tts - A multi-voice TTS system trained with an emphasis on quality

speech-to-text-benchmark - speech to text benchmark framework

kaldi-active-grammar - Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

leopard - On-device speech-to-text engine powered by deep learning

monkeytype - The most customizable typing website with a minimalistic design and a ton of features. Test yourself in various modes, track your progress and improve your speed.

TTS - 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

nerd-dictation vs vosk-api STT-examples vs vosk-api nerd-dictation vs recasepunc STT-examples vs LocalSTT nerd-dictation vs cursorless STT-examples vs STT nerd-dictation vs tortoise-tts STT-examples vs speech-to-text-benchmark nerd-dictation vs kaldi-active-grammar STT-examples vs leopard nerd-dictation vs monkeytype STT-examples vs TTS

Compare nerd-dictation vs STT-examples and see what are their differences.

nerd-dictation

STT-examples

nerd-dictation

STT-examples

What are some alternatives?