Universal Speech Model

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com.

  • languagetool

    Style and Grammar Checker for 25+ Languages

    Not OP, but they probably meant LanguageTool [0], an open-source Grammarly alternative.

    [0] https://languagetool.org/

  • whisperX

    WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

    just found out about this today, maybe it's helpful:


  • stable-ts

    ASR with reliable word-level timestamps using OpenAI's Whisper

    I like the output from https://github.com/jianfch/stable-ts way more.

  • faster-whisper

    Faster Whisper transcription with CTranslate2

    Faster Whisper is 8x faster than real time on CPU and even faster on GPU. https://github.com/guillaumekln/faster-whisper

    Vocode uses Whisper for real-time, zero-latency voice chat with ChatGPT. Give their demo line a call to see how well it works: +1-650-729-9536
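
To put the "8x faster than real time" figure quoted for faster-whisper in concrete terms, here is a minimal sketch of the arithmetic. The helper names and the 10-minute recording are hypothetical and are not part of faster-whisper; the sketch assumes "8x faster than real time" means the audio duration divided by the processing time equals 8.

```python
def speed_factor(audio_seconds: float, processing_seconds: float) -> float:
    """Return how many times faster than real time a transcription ran."""
    if processing_seconds <= 0:
        raise ValueError("processing_seconds must be positive")
    return audio_seconds / processing_seconds


def processing_time(audio_seconds: float, factor: float) -> float:
    """Return the wall-clock seconds needed at a given speed factor."""
    if factor <= 0:
        raise ValueError("factor must be positive")
    return audio_seconds / factor


# Hypothetical input: a 10-minute (600 s) recording at the quoted 8x speed.
print(processing_time(600.0, 8.0))  # 75.0 seconds of processing
print(speed_factor(600.0, 75.0))    # 8.0, back to the quoted factor
```

Under that reading, a 10-minute recording would take roughly 75 seconds to transcribe on CPU. Actual throughput depends on the model size and hardware, which the comment does not specify.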
