Our great sponsors
-
DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
IIRC every browser that supports the Web Speech API does so via cloud services. Mozilla being the only major browser maker without it's own cloud services and having slightly fewer phone-home features didn't want to do that. Mozilla has been doing quite a bit of work in the area though (for example https://github.com/mozilla/DeepSpeech), hopefully to enable these features locally in the future.
-
Nerd dictation is a purely on-device speech to text program that works pretty well if your computer is fast enough.
https://github.com/ideasman42/nerd-dictation
get speech models here:
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node