Ask HN: Any technical reasons Google Docs can't do voice typing in Firefox?

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • DeepSpeech

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

    IIRC every browser that supports the Web Speech API does so via cloud services. Mozilla being the only major browser maker without it's own cloud services and having slightly fewer phone-home features didn't want to do that. Mozilla has been doing quite a bit of work in the area though (for example https://github.com/mozilla/DeepSpeech), hopefully to enable these features locally in the future.

  • nerd-dictation

    Simple, hackable offline speech to text - using the VOSK-API.

    Nerd dictation is a purely on-device speech to text program that works pretty well if your computer is fast enough.

    https://github.com/ideasman42/nerd-dictation

    get speech models here:

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

  • vosk-api

    Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts