Porcupine vs whisper.cpp

Our great sponsors

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

SaaSHub - Software Alternatives and Reviews

Our great sponsors

Porcupine		whisper.cpp
	Project
31	Mentions	187
3,424	Stars	31,174
2.1%	Growth	-
9.1	Activity	9.8
9 days ago	Latest Commit	about 21 hours ago
Python	Language	C
Apache License 2.0	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

Porcupine

Posts with mentions or reviews of Porcupine . We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-02-16.

I made a ChatGPT virtual assistant that you can talk to
1 project | /r/ArtificialInteligence | 5 Apr 2023

I call it DaVinci. DaVinci uses Picovoice (https://picovoice.ai/) solutions for wake word and voice activity detection and for converting speech to text, Amazon Polly to convert its responses into a natural sounding voice, and OpenAI’s GPT 3.5 to do the heavy lifting. It’s all contained in about 300 lines of Python code.
Speech Recognition in Unity: Adding Voice Input
3 projects | dev.to | 16 Feb 2023

Download pre-trained models: "Porcupine" from Porcupine Wake Word and Video Player Context from Rhino Speech-to-Intent repositories - You can also train a custom models on Picovoice Console.
Speech Recognition with SwiftUI
5 projects | dev.to | 13 Feb 2023

Below are some useful resources: Open-source code Picovoice Platform SDK Picovoice website
Speech Recognition with Angular
1 project | dev.to | 8 Feb 2023

Download the Porcupine model and turn the binary model into a base64 string.
OK Google, Add Hotword Detection to Chrome
1 project | dev.to | 3 Feb 2023

Download Porcupine (i.e. Deep Neural Network). Run the following to turn the binary model into a base64 string, from the project folder.
Hotword Detection for MCUs
1 project | dev.to | 31 Jan 2023

Porcupine SDK Porcupine SDK is on GitHub. Find libraries for supported MCUs on the Porcupine GitHub repository. Arduino libraries are available via a specialized package manager offered by Arduino.
Day 12: Always Listening Voice Commands with React.js
1 project | dev.to | 17 Jan 2023

Looking for more? Explore other languages on the Picovoice Console and check out for fully-working demos with Porcupine on GitHub.
Day 6: Making Cool Raspberry Pi Projects even Cooler with Voice AI (1/4)
1 project | dev.to | 9 Jan 2023

Don't forget to visit Porcupine's Wake Word's Github repository to see Python demos. If you want to do something similar to the video above, find the open-source codes here
Voice Assistant app in Haskell
8 projects | /r/haskell | 3 Jan 2023
What does "end-to-end" mean?
1 project | /r/embedded | 17 Dec 2022

I sometimes see the term "end-to-end", and it always passes right by my ears as marketing jargon. For example, there was a recent post today that linked to this page: https://picovoice.ai/, and you'll find the statement "... end-to-end platform for adding voice to anything on your terms". I did a quick Google search and it seems like the term is used in many different contexts (e.g., encryption, enterprise software for product development, etc.), but to be honest, I'm just not getting it. Maybe someone can explain here within the realm of embedded software? Could you provide some examples as well?

whisper.cpp

Posts with mentions or reviews of whisper.cpp. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-31.

Show HN: I created automatic subtitling app to boost short videos
1 project | news.ycombinator.com | 9 Apr 2024

whisper.cpp [1] has a karaoke example that uses ffmpeg's drawtext filter to display rudimentary karaoke-like captions. It also supports diarisation. Perhaps it could be a starting point to create a better script that does what you need.
--
1: https://github.com/ggerganov/whisper.cpp/blob/master/README....
LLaMA Now Goes Faster on CPUs
16 projects | news.ycombinator.com | 31 Mar 2024
LLMs on your local Computer (Part 1)
7 projects | dev.to | 11 Mar 2024

The ggml library is one of the first library for local LLM interference. It’s a pure C library that converts models to run on several devices, including desktops, laptops, and even mobile device - and therefore, it can also be considered as a tinkering tool, trying new optimizations, that will then be incorporated into other downstream projects. This tool is at the heart of several other projects, powering LLM interference on desktop or even mobile phones. Subprojects for running specific LLMs or LLM families exists, such as whisper.cpp.
Voxos.ai – An Open-Source Desktop Voice Assistant
7 projects | news.ycombinator.com | 19 Jan 2024

I'm not sure if it is _fully_ openai compatible, but whispercpp has a server bundled that says it is "OAI-like": https://github.com/ggerganov/whisper.cpp/tree/master/example...
I don't have any direct experience with it... I've only played around with whisper locally, using scripts.
Jarvis: A Voice Virtual Assistant in Python (OpenAI, ElevenLabs, Deepgram)
7 projects | news.ycombinator.com | 18 Dec 2023

unless i'm misunderstanding `whisper.cpp` seems to support streaming & the repository includes a native example[0] and a WASM example[1] with a demo site[2].
[0]: https://github.com/ggerganov/whisper.cpp/tree/master/example...
Wchess
1 project | news.ycombinator.com | 14 Dec 2023
I've open sourced my Flutter plugin to run on-device LLMs on any platform. TestFlight builds available now.
9 projects | /r/FlutterDev | 8 Dec 2023

Usage 1: Good to transcribe audio. An example use case could be to summarize YouTube videos or long courses. Usage 2: You talk with voice to your AI that responds with text (later with audio too). - https://github.com/ggerganov/whisper.cpp
Scrybble is the ReMarkable highlights to Obsidian exporter I have been looking for
9 projects | /r/RemarkableTablet | 7 Dec 2023

🗣️🎙️ whisper.cpp (offline speech-to-text transcription, models trained by OpenAI, CLI based, browser based)
Whisper.wasm
1 project | news.ycombinator.com | 13 Nov 2023
Whisper C++ not working for me. Anyone else?
1 project | /r/Xcode | 11 Nov 2023

Has anyone played around with Whisper C++ for swift? I'm hitting a snag even on the demo. I've downloaded the github repo and everything matches up with this video [ https://youtu.be/b10OHCDHDQ4 ] but when he hits the transcribe button, it actually prints out the captioning. When I do it, it skips that part and just says "Done...". But it, does everything else - plays the audio, says it's transcribing.. just doesn't show me the transcription: and it's not in the debug window either. But the demo isn't throwing any errors, and I haven't messed with the code really so this is their example. https://github.com/ggerganov/whisper.cpp

What are some alternatives?

When comparing Porcupine and whisper.cpp you can also consider the following projects:

snowboy - Future versions with model training module will be maintained through a forked version here: https://github.com/seasalt-ai/snowboy

faster-whisper - Faster Whisper transcription with CTranslate2

mycroft-precise - A lightweight, simple-to-use, RNN wake word listener

Whisper - High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

Caffe - Caffe: a fast open framework for deep learning.

bark - 🔊 Text-Prompted Generative Audio Model

DeepSpeech - DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

whisper - Robust Speech Recognition via Large-Scale Weak Supervision

mxnet - Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

whisperX - WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Caffe2

llama.cpp - LLM inference in C/C++

Porcupine vs snowboy whisper.cpp vs faster-whisper Porcupine vs mycroft-precise whisper.cpp vs Whisper Porcupine vs Caffe whisper.cpp vs bark Porcupine vs DeepSpeech whisper.cpp vs whisper Porcupine vs mxnet whisper.cpp vs whisperX Porcupine vs Caffe2 whisper.cpp vs llama.cpp

Compare Porcupine vs whisper.cpp and see what are their differences.

Porcupine

whisper.cpp

Porcupine

whisper.cpp

What are some alternatives?