Universal Speech Model

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • languagetool

    Style and Grammar Checker for 25+ Languages

  • Not op, but probably he literally meant LanguageTool [0], an open-source grammarly alternative.

    [0] https://languagetool.org/

  • whisperX

    WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

  • just found out about this today, maybe it's helpful:

    https://github.com/m-bain/whisperX

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • stable-ts

    Transcription, forced alignment, and audio indexing with OpenAI's Whisper

  • I like output from https://github.com/jianfch/stable-ts way more

  • faster-whisper

    Faster Whisper transcription with CTranslate2

  • Faster Whisper is 8x faster than real time on CPU and even faster on GPU. https://github.com/guillaumekln/faster-whisper

    Vocode uses Whisper for real-time zero latency voicechat with chatGPT. Give their demo line a call to see how well it works: +1-650-729-9536

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts