Whisper Turbo: transcribe 20x faster than realtime using Rust and WebGPU

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io
featured
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured
  1. whisper-turbo

    Cross-Platform, GPU Accelerated Whisper 🏎️

  2. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  3. faster-whisper

    Faster Whisper transcription with CTranslate2

    Neat to see a new implementation, although I'll note that for those looking for a drop-in replacement for the whisper library, I believe that both faster-whisper https://github.com/guillaumekln/faster-whisper and https://github.com/m-bain/whisperX are easier (PyTorch-based, doesn't require a web browser), and a lot faster (WhisperX is up to 70X realtime).

  4. whisperX

    WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

    Neat to see a new implementation, although I'll note that for those looking for a drop-in replacement for the whisper library, I believe that both faster-whisper https://github.com/guillaumekln/faster-whisper and https://github.com/m-bain/whisperX are easier (PyTorch-based, doesn't require a web browser), and a lot faster (WhisperX is up to 70X realtime).

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Now I Can Just Print That Video

    5 projects | news.ycombinator.com | 4 Dec 2023
  • LeMUR: LLMs for Audio and Speech

    1 project | news.ycombinator.com | 27 Jul 2023
  • Faster Whisper Transcription with CTranslate2

    1 project | /r/hypeurls | 24 Jul 2023
  • Faster Whisper Transcription with CTranslate2

    5 projects | news.ycombinator.com | 20 Jul 2023
  • OpenAI Whisper Audio Transcription Benchmarked on 18 GPUs: Up to 3,000 WPM | Tom's Hardware

    1 project | /r/hardware | 11 May 2023