DeepSpeech 60x Smaller, 9x faster, and 2x accuracy

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  1. speech-to-text-benchmark

    speech to text benchmark framework

    The Mozilla DeepSpeech tests on LibreSpeech listed in your link were out of date back in 2020[1], and Coqui.ai (the continuation of Mozilla DeepSpeech) isn't even benchmarked.

    https://github.com/Picovoice/speech-to-text-benchmark/issues...

  2. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  3. leopard

    On-device speech-to-text engine powered by deep learning

  4. STT-examples

    🐸STT integration examples

    I will add https://github.com/coqui-ai/STT, which is a continuation of DeepSpeech. Also, I've been messing around with https://github.com/ideasman42/nerd-dictation, which works on a VOSK backend - accuracy is decent, especially with the bigger model.

  5. nerd-dictation

    Simple, hackable offline speech to text - using the VOSK-API.

    I will add https://github.com/coqui-ai/STT, which is a continuation of DeepSpeech. Also, I've been messing around with https://github.com/ideasman42/nerd-dictation, which works on a VOSK backend - accuracy is decent, especially with the bigger model.

  6. vosk-api

    Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Making a Podcast Transcription Server with Express.js (source code in comments)

    2 projects | /r/javascript | 19 May 2022
  • Cohere Transcribe: Speech Recognition

    2 projects | news.ycombinator.com | 31 Mar 2026
  • Audio-to-Text Transcriber Automated via Termux

    1 project | dev.to | 1 Oct 2025
  • Infini-Gram: Scaling unbounded n-gram language models to a trillion tokens

    4 projects | news.ycombinator.com | 5 May 2024
  • VOSK Offline Speech Recognition API

    1 project | news.ycombinator.com | 13 Apr 2024

Did you know that Python is
the 1st most popular programming language
based on number of references?