Otter.ai has saved reporters hours transcribing interviews. Caveat emptor

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  1. silero-models

    Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

    Silero[0] seems to have decent performance (although you will have to do some minimal coding). I believe there are better ones if you're willing to tinker a bit more.

    [0]: https://github.com/snakers4/silero-models

  3. vscode-ltex

    LTeX: Grammar/spell checker 🔍✔ for VS Code using LanguageTool with support for LaTeX 🎓, Markdown 📝, and others

  4. mp4grep

    mp4grep is a CLI for transcribing and searching audio/video files

    The output is intended more for captioning, so it's lots of short phrases with timestamps and no punctuation, but it'll give you a quick taste of what Vosk can do.

    [1] https://github.com/o-oconnell/mp4grep

  5. TTS

    🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

    The Mozilla DeepSpeech spin-off Coqui has an STT that is locally installable:

    https://coqui.ai/
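As a rough illustration of the "transcribe, then search" workflow described for mp4grep above, here is a minimal Python sketch that searches caption-style output (short, unpunctuated phrases with timestamps). The tab-separated `<seconds>\t<phrase>` layout, the `Phrase` type, and the helper names are assumptions made for this example; they are not mp4grep's or Vosk's actual output format or API.

```python
import re
from typing import List, NamedTuple


class Phrase(NamedTuple):
    start: float  # seconds from the start of the recording
    text: str     # short, unpunctuated phrase


def parse_transcript(raw: str) -> List[Phrase]:
    """Parse lines of the assumed form '<seconds>\\t<phrase>'."""
    phrases = []
    for line in raw.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        stamp, _, text = line.partition("\t")
        phrases.append(Phrase(float(stamp), text.strip()))
    return phrases


def search(phrases: List[Phrase], pattern: str) -> List[Phrase]:
    """Return every phrase whose text matches the regex, ignoring case."""
    rx = re.compile(pattern, re.IGNORECASE)
    return [p for p in phrases if rx.search(p.text)]


def format_timestamp(seconds: float) -> str:
    """Render whole seconds as HH:MM:SS (fractions are dropped)."""
    minutes, secs = divmod(int(seconds), 60)
    hours, minutes = divmod(minutes, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


# Made-up sample transcript, for illustration only.
SAMPLE = "\n".join([
    "0.0\tthanks for joining me today",
    "4.2\tlets start with the city council vote",
    "3725.5\tthe budget was never approved",
])

if __name__ == "__main__":
    for hit in search(parse_transcript(SAMPLE), r"budget"):
        print(format_timestamp(hit.start), hit.text)
```

Keeping the timestamp next to each phrase is what makes this useful for interviews: a keyword hit points straight to the spot in the recording to re-listen to, rather than relying on the transcript alone.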


