mmdeploy vs whisper.cpp

| | mmdeploy | whisper.cpp |
|---|---|---|
| Mentions | 4 | 201 |
| Stars | 3,023 | 42,817 |
| Growth | 1.6% | 4.0% |
| Activity | 4.7 | 9.9 |
| Latest commit | 11 months ago | 8 days ago |
| Language | Python | C++ |
| License | Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
mmdeploy
- [D] Object detection models that can be easily converted to CoreML
- Orange Pi 5 Plus Koboldcpp Demo (MPT, Falcon, Mini-Orca, Openllama)
The RK3588 also has an NPU for accelerating neural networks. The bad news is that the API is not supported by any of the inference engines (as far as I know), but the NPU can directly run models that have been converted to the RKNN format. It is a long shot, but you can find details here.
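That RKNN conversion can be sketched with the Python API of Rockchip's rknn-toolkit2. This is a minimal sketch, not a tested recipe: the file names and target platform are assumptions, and the toolkit must actually be installed for it to do anything (the import is guarded so the sketch degrades gracefully without it).

```python
# Sketch of converting an ONNX model to RKNN for the RK3588's NPU.
# Assumes rknn-toolkit2 is installed; "model.onnx" and "model.rknn"
# are placeholder paths.
try:
    from rknn.api import RKNN
except ImportError:
    RKNN = None  # toolkit not installed; conversion is unavailable


def convert_to_rknn(onnx_path: str = "model.onnx",
                    out_path: str = "model.rknn"):
    """Convert an ONNX model to RKNN format, or return None without the toolkit."""
    if RKNN is None:
        return None
    rknn = RKNN()
    rknn.config(target_platform="rk3588")   # target the RK3588's NPU
    rknn.load_onnx(model=onnx_path)
    rknn.build(do_quantization=False)       # quantization needs a calibration dataset
    rknn.export_rknn(out_path)
    rknn.release()
    return out_path
```

The exported `.rknn` file is then loaded on-device with the separate rknn runtime rather than a general-purpose inference engine, which is the point the comment above is making.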
- MMDeploy: Deploy All the Algorithms of OpenMMLab
BibTeX:
@misc{mmdeploy,
  title={OpenMMLab's Model Deployment Toolbox},
  author={MMDeploy Contributors},
  howpublished={\url{https://github.com/open-mmlab/mmdeploy}},
  year={2021}
}
- Removing the bounding box generated by OnnxRuntime segmentation
I have a semantic segmentation model trained using the mmdetection repo. It is then converted to the ONNX format using the mmdeploy repo.
whisper.cpp
- Show HN: OWhisper – Ollama for realtime speech-to-text
Thank you for taking the time to build something and share it. However, what is the advantage of using this over whisper.cpp's stream example, which can also do real-time transcription?
https://github.com/ggml-org/whisper.cpp/tree/master/examples...
- Kitten TTS: 25MB CPU-Only, Open-Source Voice Model
Whisper and the many variants. Here's a good implementation.
https://github.com/ggml-org/whisper.cpp
- Ask HN: What API or software are people using for transcription?
Whisper large v3 from openai, but we host it ourselves on Modal.com. It's easy, fast, no rate limits, and cheap as well.
If you want to run it locally, I'd still go with whisper, then I'd look at something like whisper.cpp https://github.com/ggml-org/whisper.cpp. Runs quite well.
- Whispercpp – Local, Fast, and Private Audio Transcription for Ruby
- Build Your Own Siri. Locally. On-Device. No Cloud
Not the GP, but I found this: https://github.com/ggml-org/whisper.cpp/blob/master/models/c...
- Run LLMs on Apple Neural Engine (ANE)
Actually that's a really good question, I hadn't considered that the comparison here is just CPU vs using Metal (CPU+GPU).
To answer the question though - I think this would be used for cases where you are building an app that wants to utilize a small AI model while at the same time having the GPU free to do graphics related things, which I'm guessing is why Apple stuck these into their hardware in the first place.
Here is an interesting comparison between the two from a whisper.cpp thread - ignoring startup times - the CPU+ANE seems about on par with CPU+GPU: https://github.com/ggml-org/whisper.cpp/pull/566#issuecommen...
- Building a personal, private AI computer on a budget
A great thread with the type of info you're looking for lives here: https://github.com/ggerganov/whisper.cpp/issues/89
But you can likely find similar threads for the llama.cpp benchmark here: https://github.com/ggerganov/llama.cpp/tree/master/examples/...
These are good examples because the llama.cpp and whisper.cpp benchmarks take full advantage of the Apple hardware but also take full advantage of non-Apple hardware with GPU support, AVX support etc.
It's been true for a while now that the memory bandwidth of modern Apple systems, in tandem with the neural cores and GPU, has made them very competitive with Nvidia for local inference and even training.
- Whisper.cpp: Looking for Maintainers
- Show HN: Galene-stt: automatic captioning for the Galene videoconferencing system
- Show HN: Transcribe YouTube Videos
Not as convenient, but you could also have the user manually install the model, like whisper does.
Just forward the error message output by whisper, or even make a more user-friendly error message with instructions on how/where to download the models.
Whisper does provide a simple bash script to download models: https://github.com/ggerganov/whisper.cpp/blob/master/models/...
(As a Windows user, I can run bash scripts via Git Bash for Windows[1])
[1]: https://git-scm.com/download/win
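For anyone who would rather skip bash entirely, the download step boils down to fetching one fixed URL per model name. A minimal Python sketch using only the standard library; the Hugging Face URL pattern mirrors what the whisper.cpp download script points at today, but treat the script itself as the source of truth.

```python
import os
import urllib.request

# Base URL whisper.cpp's download-ggml-model.sh resolves models against
# (an assumption -- check the script for the current location).
BASE = "https://huggingface.co/ggerganov/whisper.cpp/resolve/main"


def model_url(name: str) -> str:
    """Map a model name like 'base.en' to its ggml download URL."""
    return f"{BASE}/ggml-{name}.bin"


def download_model(name: str, dest_dir: str = "models") -> str:
    """Download the model if not already present; return its local path."""
    os.makedirs(dest_dir, exist_ok=True)
    path = os.path.join(dest_dir, f"ggml-{name}.bin")
    if not os.path.exists(path):
        urllib.request.urlretrieve(model_url(name), path)
    return path


print(model_url("base.en"))
# → https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-base.en.bin
```

Forwarding a friendly error when the file is missing, as the comment above suggests, is then just a matter of checking `os.path.exists(path)` before launching whisper and printing the URL if it isn't there.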
What are some alternatives?
- FastDeploy - High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle
- bark - Text-Prompted Generative Audio Model
- mmfewshot - OpenMMLab FewShot Learning Toolbox and Benchmark
- faster-whisper - Faster Whisper transcription with CTranslate2
- mmselfsup - OpenMMLab Self-Supervised Learning Toolbox and Benchmark
- whisper - Robust Speech Recognition via Large-Scale Weak Supervision