vosk-browser vs ovos-stt-plugin-vosk

Project         vosk-browser          ovos-stt-plugin-vosk
Mentions        3                     1
Stars           326                   14
Growth          -                     -
Activity        0.0                   2.9
Latest Commit   4 months ago          4 months ago
Language        JavaScript            Python
License         Apache License 2.0    Apache License 2.0

Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub.
Growth - month-over-month growth in stars.
Activity - a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones. For example, an activity of 9.0 indicates that a project is among the top 10% of the most actively developed projects that we are tracking.
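
The page does not publish how Activity is calculated, so the following is only a rough sketch of how a recency-weighted, percentile-based score of this kind could be computed. The exponential decay, the 90-day half-life, and both function names are assumptions made for illustration; they are not the site's actual method.

```python
from datetime import date


def recency_weighted_commits(commit_dates, today, half_life_days=90.0):
    """Sum commit weights so that recent commits count more than older ones.

    Assumption: a commit's weight halves every `half_life_days` days.
    """
    return sum(0.5 ** ((today - d).days / half_life_days) for d in commit_dates)


def activity_score(raw, all_raw):
    """Map a raw value onto a 0-10 scale by percentile rank.

    The score is 10 times the fraction of tracked projects whose raw value
    is strictly lower, so a score of 9.0 means the project is ahead of 90%
    of the others (that is, in the top 10%).
    """
    below = sum(1 for r in all_raw if r < raw)
    return round(10.0 * below / len(all_raw), 1)


if __name__ == "__main__":
    today = date(2024, 3, 31)
    # Hypothetical commit history: one commit today, one 90 days ago.
    raw = recency_weighted_commits([date(2024, 3, 31), date(2024, 1, 1)], today)
    print(raw)
    # Hypothetical raw values for ten tracked projects.
    print(activity_score(9, list(range(10))))
```

Under these assumptions, a project with no recent commits drifts toward a raw value of zero and therefore toward a low score, which is consistent with a repository whose last commit is months old showing an Activity near 0.0.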

vosk-browser

Posts with mentions or reviews of vosk-browser. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-11-15.

Show HN: I record myself on audio 24x7 and use an AI to process the information
13 projects | news.ycombinator.com | 15 Nov 2022

Not the OP but I've been tinkering with the same concept (24/7 processing).
I'm using vosk-browser (https://github.com/ccoreilly/vosk-browser) to do speech-to-text locally, and it works very well for English.

Speech-to-Text Client-Side?
1 project | news.ycombinator.com | 19 Aug 2022

On-device browser translations with Firefox Translations
5 projects | news.ycombinator.com | 10 Jul 2022

I believe this is called the Bergamot project; more can be found here: https://browser.mt/
The GitHub repo for it is here: https://github.com/browsermt/bergamot-translator
The repo contains some details about how to run it in WASM which is quite interesting for embedding it in pages. I've been playing around with using WASM to capture speech to text (https://github.com/ccoreilly/vosk-browser) and automatically translating it using Bergamot.
Results have been OK. I don't think the tech is quite there yet, and the speech-to-text obviously struggles with multiple speakers.

ovos-stt-plugin-vosk

Posts with mentions or reviews of ovos-stt-plugin-vosk. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-03-30.

Slow responses from picroft
2 projects | /r/Mycroftai | 30 Mar 2021

For STT, there is streaming support, which should improve things. Google Cloud is supported in mycroft-core, but there are also some plugins out there that support streaming, like Vosk.

What are some alternatives?

When comparing vosk-browser and ovos-stt-plugin-vosk you can also consider the following projects:

cheetah - On-device streaming speech-to-text engine powered by deep learning

vosk-server - WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

vosk-api - Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

wenet - Production First and Production Ready End-to-End Speech Recognition Toolkit

pykaldi - A Python wrapper for Kaldi

react-native-vosk - Speech recognition module for react native using Vosk library

werpy - 🐍📦 Rapidly calculate and analyze the Word Error Rate (WER) with this powerful yet lightweight Python package.

haven - Haven is for people who need a way to protect their personal spaces and possessions without compromising their own privacy, through an Android app and on-device sensors

mock-backend - A Flask personal backend alternative for running your own version of https://home.mycroft.ai

whisper - Robust Speech Recognition via Large-Scale Weak Supervision

elograf - Utility for launching and configuring nerd-dictation


