[Discussion] Looking for an open-source speech-to-text model (English) that captures filler words and pauses and records a timestamp for each word

This page summarizes the projects mentioned and recommended in the original post on /r/LocalLLaMA

  • pocketsphinx

    A small speech recognizer

  • whisper

    Robust Speech Recognition via Large-Scale Weak Supervision

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives, so a higher number indicates a more popular project.

Related posts

  • "Why not just transcribe the audio?" I thought

    1 project | /r/ANMAPodcast | 22 Jan 2023
  • Speech recognition library for financial markets

    1 project | /r/algotrading | 25 Jul 2021
  • Disabled computer science student ISO advice about single-handed keyboards

    5 projects | /r/ErgoMechKeyboards | 21 Apr 2021
  • Speech recognition

    2 projects | /r/selfhosted | 25 Dec 2020
  • Creating Automatic Subtitles for Videos with Python, Faster-Whisper, FFmpeg, Streamlit, Pillow

    7 projects | dev.to | 29 Apr 2024