whisper.cpp VS whisper

Compare whisper.cpp vs whisper and see how they differ.

whisper

Robust Speech Recognition via Large-Scale Weak Supervision (by openai)
               whisper.cpp    whisper
Mentions       199            373
Stars          41,464         84,697
Growth         2.7%           2.7%
Activity       9.9            7.4
Last commit    4 days ago     21 days ago
Language       C++            Python
License        MIT License    MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
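
As a rough illustration of the Growth figure, the sketch below estimates how many stars each project gained over the last month. It assumes the percentage is measured relative to the previous month's star count; the page does not state the exact formula, so treat both the function name `stars_gained` and the calculation as hypothetical.

```python
def stars_gained(stars_now: int, growth: float) -> float:
    """Estimate stars gained in the last month.

    Assumption (not stated by the page): growth = gained / previous_month_stars.
    """
    previous = stars_now / (1.0 + growth)  # implied star count a month ago
    return stars_now - previous


# Figures taken from the comparison table above.
for name, stars in [("whisper.cpp", 41_464), ("whisper", 84_697)]:
    print(f"{name}: roughly {stars_gained(stars, 0.027):.0f} stars gained")
```

Under that assumption, the same 2.7% growth corresponds to about twice as many new stars for whisper as for whisper.cpp, simply because its base is about twice as large.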

whisper.cpp

Posts with mentions or reviews of whisper.cpp. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2025-06-09.

whisper

Posts with mentions or reviews of whisper. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2025-07-14.
  • Summarization experiments with Hugging Face Transformers - part 1
    2 projects | dev.to | 14 Jul 2025
    Now, I can't copy the YouTube video descriptions verbatim for SEO purposes: search engines don't like duplicate content. The obvious solution is to do some kind of summarization. First of all, I always use whisper and proofread the SRT file containing the recognized audio before uploading it to YouTube. I then copy the SRT file verbatim into an LLM chat, in chunks, and ask ChatGPT to generate a summary once I say so. This last step is tedious and you always rely on a SaaS. Some alternatives include the use of local AI models via Ollama, or more specialized software that can run different tasks besides text generation.
  • Show HN: TokenDagger – A tokenizer 2-4x faster than OpenAI's Tiktoken
    14 projects | news.ycombinator.com | 30 Jun 2025
    > has a great package ecosystem

    So great there are 8 of them. 800% better than all the rest!

    > If you think Python is a bad language for AI integrations, try writing one in a compiled language.

    I'll take this challenge, all day, every day, so long as I and the hypothetical 'move fast and break things' have equal "must run in prod" and "must be understandable by some other human" qualifiers

    What type is `array`? Don't worry your pretty head about it, feed it whatever type you want and let Sentry's TypeError sort it out <https://github.com/openai/whisper/blob/v20250625/whisper/aud...> Oh, sorry, and you wanted to know what `pad_or_trim` returns? Well that's just, like, your opinion man

  • DeepSpeech Is Discontinued
    4 projects | news.ycombinator.com | 25 Jun 2025
    It seems that the team that used to work on DeepSpeech then worked on coqui-ai STT https://github.com/coqui-ai/STT and now recommends using OpenAI Whisper (https://github.com/openai/whisper)
  • OpenAI Charges by the Minute, So Make the Minutes Shorter
    5 projects | news.ycombinator.com | 25 Jun 2025
    It's a very simple change in a vanilla python implementation. The encoder is a set of attention blocks, and the length of the attention can be changed without changing the calculation at all.

    Here (https://github.com/openai/whisper/blob/main/whisper/model.py...) is the relevant code in the whisper repo. You'd just need to change the for loop to an enumerate and subsample the context along its length at the point you want. I believe it would be:

    for i, block in enumerate(self.blocks):

  • Show HN: WhisperBuddy, Privacy-first AI-transcription app built after my layoff
    1 project | news.ycombinator.com | 8 Jun 2025
    I was laid off recently and, instead of looking for another job right away, I decided to build something I always wanted: a transcription tool that respects user privacy.

    Existing transcription tools often require internet connectivity, send your private audio to cloud servers, or lock you into monthly subscriptions. I wanted something different—so I built WhisperBuddy, a privacy-first AI transcription app that runs entirely on your machine.

    WhisperBuddy uses OpenAI’s Whisper model (see: https://github.com/openai/whisper). While the model may not always deliver the absolute best accuracy, it provides solid performance and, most importantly, 100% privacy—no audio ever leaves your device.

    I optimized the app for local performance and usability, and I’m eager to hear feedback from the community.

    Thanks for checking it out!

  • Auto-Generating Clips for Social Media from Live Streams with the Strands Agents SDK
    3 projects | dev.to | 6 Jun 2025
    To accomplish this task, I decided to try out the new Strands Agents SDK. It's a fairly new framework for building agents that has a simple way to define tools that the agent can use to assist in responding to prompts. For this solution, we'll need FFMPEG and Whisper installed on the machine where the agent runs. I'll be working locally, but this could easily be converted to a server-based solution using FastAPI or another web framework and deployed to the cloud in a Docker/Podman container.
  • From Voice to Text: Exploring Speech-to-Text Tools and APIs for Developers
    3 projects | dev.to | 19 May 2025
    Link: Whisper GitHub
  • 15 AI tools that almost replace a full dev team but please don’t fire us yet
    8 projects | dev.to | 3 May 2025
    Whisper: OpenAI’s speech-to-text.
  • The ultimate open source stack for building AI agents
    13 projects | dev.to | 30 Apr 2025
    Start by hooking up speech-to-text (STT) using something like OpenAI’s Whisper if you’re going open source, or Deepgram if you want a super-accurate plug-and-play API.
  • How to create video transcription with ffmpeg and whisper
    1 project | dev.to | 9 Apr 2025
    # Install Homebrew if you don't have it
    /bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
    # Install ffmpeg
    brew install ffmpeg
    # Install Python (if needed)
    brew install python
    # Install Whisper
    pip3 install --upgrade pip
    pip3 install git+https://github.com/openai/whisper.git
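
The encoder change described in the "OpenAI Charges by the Minute" excerpt above (turning the block loop into an `enumerate` and subsampling the context partway through) might look roughly like the toy sketch below. This is not Whisper's code: `run_encoder`, `toy_block`, `subsample_at`, and `stride` are hypothetical names, and the blocks are plain functions on lists of floats standing in for attention blocks, which likewise accept any sequence length.

```python
from typing import Callable, List

Block = Callable[[List[float]], List[float]]


def run_encoder(x: List[float], blocks: List[Block],
                subsample_at: int, stride: int = 2) -> List[float]:
    """Apply blocks in order, shortening the sequence once before block `subsample_at`."""
    for i, block in enumerate(blocks):
        if i == subsample_at:
            # Keep every `stride`-th position along the sequence length.
            x = x[::stride]
        x = block(x)
    return x


def toy_block(x: List[float]) -> List[float]:
    """Stand-in for an attention block: length-agnostic and length-preserving."""
    mean = sum(x) / len(x)
    return [v + mean for v in x]


blocks = [toy_block] * 4
out = run_encoder([float(t) for t in range(8)], blocks, subsample_at=2)
print(len(out))  # 8 input positions, halved once before the third block
```

Because each block works on whatever length it receives, the blocks after the subsampling point simply process a shorter sequence; whether that preserves transcription quality in a real model is a separate question the sketch does not address.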

What are some alternatives?

When comparing whisper.cpp and whisper you can also consider the following projects:

bark - 🔊 Text-Prompted Generative Audio Model

vosk-api - Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

faster-whisper - Faster Whisper transcription with CTranslate2

silero-vad - Silero VAD: pre-trained enterprise-grade Voice Activity Detector

whisperX - WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
