How can I get word-level timestamps in OpenAI's Whisper ASR?

This page summarizes the projects mentioned and recommended in the original post on reddit.com/r/LanguageTechnology.

  • whisper

    Robust Speech Recognition via Large-Scale Weak Supervision

    Great work u/jina890! Here's another one for word-level timestamps: https://github.com/openai/whisper/discussions/209

  • stable-ts

    Stabilizing timestamps of OpenAI's Whisper outputs down to word-level

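For readers who want to try the word-level timestamps discussed for whisper above, here is a minimal sketch. It assumes a recent openai-whisper release in which `transcribe()` accepts `word_timestamps=True` and attaches a `words` list to each segment; that option is newer than the linked discussion. The helper names `flatten_words` and `transcribe_with_words`, the `base` model size, and the sample data are illustrative assumptions, not part of either project. stable-ts has its own interface, which is not shown here.

```python
from typing import Any, Dict, List


def flatten_words(result: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Collect per-word entries from a Whisper-style result dict.

    Assumes each segment may carry a "words" list whose items have
    "word", "start", and "end" keys (times in seconds).
    """
    words = []
    for segment in result.get("segments", []):
        for w in segment.get("words", []):
            words.append(
                {
                    "word": w["word"].strip(),  # tokens often carry a leading space
                    "start": float(w["start"]),
                    "end": float(w["end"]),
                }
            )
    return words


def transcribe_with_words(audio_path: str, model_name: str = "base") -> List[Dict[str, Any]]:
    """Hypothetical wrapper: transcribe a file and return a flat word list."""
    # Imported lazily so the helper above can be used without whisper installed.
    import whisper

    model = whisper.load_model(model_name)
    # Assumption: word_timestamps=True is available in the installed release.
    result = model.transcribe(audio_path, word_timestamps=True)
    return flatten_words(result)


# Made-up result in the assumed shape, so the flattening step can be tried offline.
sample = {
    "segments": [
        {
            "words": [
                {"word": " Hello", "start": 0.0, "end": 0.42},
                {"word": " world", "start": 0.42, "end": 0.9},
            ]
        }
    ]
}

for w in flatten_words(sample):
    print(f"{w['start']:.2f}-{w['end']:.2f} {w['word']}")
```

With a real recording, `transcribe_with_words("audio.mp3")` would return the same flat list, where `audio.mp3` stands in for your own file.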

