nerd-dictation vs leopard

nerd-dictation

Simple, hackable offline speech to text - using the VOSK-API. (by ideasman42)

Suggest topics

Source Code

Suggest alternative

Edit details

leopard

On-device speech-to-text engine powered by deep learning (by Picovoice)

stt speech-to-text Asr automatic-speech-recognition on-device speech-recognition transcription voice-recognition voice-to-text

Source Code

picovoice.ai

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

WorkOS - The modern identity platform for B2B SaaS

The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

workos.com

featured

nerd-dictation		leopard
	Project
28	Mentions	15
1,164	Stars	406
-	Growth	2.7%
2.9	Activity	8.6
about 1 month ago	Latest Commit	12 days ago
Python	Language	Python
GNU General Public License v3.0 only	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

nerd-dictation

Posts with mentions or reviews of nerd-dictation. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-07-07.

why nerd-dictation support in NixOS is stuck ?
2 projects | /r/NixOS | 7 Jul 2023
Is anyone doing always-on voice to text with a local llama at home?
5 projects | /r/LocalLLaMA | 25 Jun 2023
Apollo dev posts backend code to Git to disprove Reddit’s claims of scrapping and inefficiency
4 projects | /r/webdev | 9 Jun 2023

nerd-dictation
How to use notion in gnome
1 project | /r/gnome | 1 May 2023

There's no built-in way of doing this in GNOME, but you might already get a bit further with tools like https://github.com/ideasman42/nerd-dictation
What voice transcriber do you use?
1 project | /r/foss | 20 Apr 2023
Disability accessibility tools for Linux such as eyetrackers and voice commands?
2 projects | /r/linux | 19 Feb 2023

I'm not familiar with Talon so I don't know if this is a suitable suggestion but nerd-dictation seemed to have been well received here when it was last promoted and it looks like it's still in active development.
Voice Control was supposed to be the Future. Is Linux lagging behind?
4 projects | /r/linux | 3 Dec 2022

TBF Microsoft dropped IE, windows phone... that is not uncommon. But the OP is right, maybe not much for voice control but for dictation certainly. The FLOSS community is always far behind and thus always struggle with new technologies. We should be prepared. Since you've mentioned small open source project here's a demo of NerdDitaction. FYI Linux do have mobile devices developing.
I've made voice input for Linux that I use instead of a keyboard and mouse
1 project | /r/RSI | 6 Nov 2022

Yeah you get me. I did have RSI which was amplified by my other issue, but it was that issue that progressed and why can't type now, not RSI. I'd be interested in hearing about using numen in combination with typing, but it's likely not ideal yet. Maybe just using speech to text for some things could help? It's not my project but there's: https://github.com/ideasman42/nerd-dictation that uses the same speech recognition as numen.
Voice to text for Linux
1 project | /r/privacy | 1 Nov 2022
nerd-dictation: Simple, hackable offline speech to text - using the VOSK-API.
1 project | /r/planetemacs | 28 Sep 2022

leopard

Posts with mentions or reviews of leopard. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-01.

Automatic Speech Recognition with AWS Lambda and Leopard
2 projects | dev.to | 1 Feb 2023

Take a look at Leopard GitHub Repository or Leopard Docs Page to learn more about Leopard.
Day 19: Local Transcription w .NET
1 project | dev.to | 26 Jan 2023

Looking for more: Open-source demo code Leopard GitHub repository Speech-to-text Benchmark
Day 13: Voice Recognition with Ubuntu
1 project | dev.to | 18 Jan 2023

Voila! Reach out to Picovoice team on GitHub if you have any questions
Day 8: Making Cool Raspberry Pi Projects even Cooler with Voice AI (3/4)
1 project | dev.to | 11 Jan 2023

This tutorial is intended for Raspberry Pi 4. If you're looking for Raspberry Pi 3 or Raspberry Pi 400 or Raspberry Pi 4 (64-bit) check out Leopard C Demos on GitHub
Day5: Building a local audio transcription engine running on your web browser with JavaScript
1 project | dev.to | 6 Jan 2023

2. Serving the Model Leopard is an on-device speech-to-text solution. So we need to transfer the model (deep neural network) to the client to enable voice processing within the browser.
Making a Podcast Transcription Server with Express.js and Picovoice Leopard
1 project | news.ycombinator.com | 19 May 2022

How does Picovoice Leopard compare to other speech-to-text options?
https://github.com/Picovoice/leopard
Making a Podcast Transcription Server with Express.js (source code in comments)
2 projects | /r/javascript | 19 May 2022

Check out the source code here
On-device speech-to-text engine powered by deep learning
1 project | /r/deeplearning | 12 Mar 2022
[P] On-device speech-to-text engine powered by deep learning
1 project | /r/MachineLearning | 12 Mar 2022
picovoice/leopard - DeepSpeech 60x Smaller, 9x faster, and 2x accuracy
1 project | /r/realtech | 9 Mar 2022

What are some alternatives?

When comparing nerd-dictation and leopard you can also consider the following projects:

vosk-api - Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

speech-to-text-benchmark - speech to text benchmark framework

recasepunc - Model for recasing and repunctuating ASR transcripts

cursorless - Don't let the cursor slow you down

cheetah - On-device streaming speech-to-text engine powered by deep learning

tortoise-tts - A multi-voice TTS system trained with an emphasis on quality

STT-examples - 🐸STT integration examples

kaldi-active-grammar - Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

werpy - 🐍📦 Rapidly calculate and analyze the Word Error Rate (WER) with this powerful yet lightweight Python package.

monkeytype - The most customizable typing website with a minimalistic design and a ton of features. Test yourself in various modes, track your progress and improve your speed.

serverless-leopard