Top 10 C speech-recognition Projects

whisper.cpp

187 31,174 9.8 C

Port of OpenAI's Whisper model in C/C++

Project mention: Show HN: I created automatic subtitling app to boost short videos | news.ycombinator.com | 2024-04-09

whisper.cpp [1] has a karaoke example that uses ffmpeg's drawtext filter to display rudimentary karaoke-like captions. It also supports diarisation. Perhaps it could be a starting point to create a better script that does what you need.
--
1: https://github.com/ggerganov/whisper.cpp/blob/master/README....

pocketsphinx

6 3,736 7.4 C

A small speech recognizer

Project mention: [Discussion] Looking for an Open-Source Speech to Text model (english) that captures filler words, pauses and also records timestamps for each word. | /r/LocalLLaMA | 2023-07-07

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
willow

37 2,361 9.6 C

Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative

Project mention: ESPHome | news.ycombinator.com | 2024-04-23

Fair points but with all due respect completely misses the point and context. My comment was a reply to a new user interested in esphome on a post about esphome.
You're talking about CircuitPython, 35KB web replies, PSRAM, UF2 bootloader, etc. These are comparatively very advanced topics and you didn't mention esphome once.
The comfort and familiarity of Amazon for what is already a new, intimidating, and challenging subject is of immeasurable value for a novice. They can click those links, fill a cart, and have stuff show up tomorrow with all of the usual ease, friendliness, and reliability of Amazon. If they get frustrated or it doesn't work out they can shove it in the box and get a full refund Amazon-style.
You're suggesting wandering all over the internet, ordering stuff from China, multiple vendors, etc while describing a bunch of things that frankly just won't matter to them. I say this as someone who has been an esphome and home assistant user since day one. The approach I described has never failed or remotely bothered me and over the past ~decade I've seen it suggested to new users successfully time and time again.
In terms of PSRAM to my knowledge the only thing it is utilized for in the esphome ecosystem is higher resolution displays and more advanced voice assistant scenarios that almost always require -S3 anyway and are a very advanced, challenging use cases. I'm very familiar with displays, voice, the S3, and PSRAM but more on that in a second...
> live with one less LX7 core and no Bluetooth
I'm the founder of Willow[0] and when comparing Willow to esphome the most frequent request we get is supporting bluetooth functionality i.e. esphome bluetooth proxy[1]. This is an extremely popular use case in the esphome/home assistant community. Not having bluetooth while losing a core and paying more is a bigger issue than pin spacing.
It's also a pretty obscure board and while not a big deal to you and I if you look around at docs, guides, etc, etc you'll see the cheap-o boards from Amazon are by far the most popular and common (unsurprisingly). Another plus for a new user.
Speaking of Willow (and back to PSRAM again) even the voice assistant satellite functionality of Home Assistant doesn't fundamentally require it - the most popular device doesn't have it either[2].
Very valuable comment with a lot of interesting information, just doesn't apply to context.
[0] - https://heywillow.io/
[1] - https://esphome.io/components/bluetooth_proxy.html
[2] - https://www.home-assistant.io/voice_control/thirteen-usd-voi...

julius

1 1,773 0.0 C

Open-Source Large Vocabulary Continuous Speech Recognition Engine (by julius-speech)
tevr-asr-tool

9 408 5.9 C

State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.
spchcat

2 392 0.0 C

Speech recognition tool to convert audio to text transcripts, for Linux and Raspberry Pi.

Project mention: Is there such a thing as self hosting cell phone voicemail transcription? | /r/selfhosted | 2023-06-27

Check this out https://github.com/petewarden/spchcat

vosk

2 354 0.0 C

VOSK Speech Recognition Toolkit
WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
whisper.rn

3 276 9.0 C

React Native binding of whisper.cpp.

Project mention: How would you record hours of audio to be transcribed? | /r/reactnative | 2023-07-11

there's https://github.com/mybigday/whisper.rn for react native, haven't tried it yet though

react-native-vosk

1 32 6.1 C

Speech recognition module for react native using Vosk library

Project mention: What Speech To Text library does Joplin use? | /r/joplinapp | 2023-10-27

Update: found out they use react-native-vosk https://github.com/riderodd/react-native-vosk thank you regardless

SLID-on-Microcontrollers

1 1 10.0 C

Speech Classification using a Convolutional Neural Network running on a Microcontroller

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

C speech-recognition related posts

[Discussion] Looking for an Open-Source Speech to Text model (english) that captures filler words, pauses and also records timestamps for each word.
2 projects | /r/LocalLLaMA | 7 Jul 2023
Is there such a thing as self hosting cell phone voicemail transcription?
2 projects | /r/selfhosted | 27 Jun 2023
Show HN: Willow Inference Server: Optimized ASR/TTS/LLM for Willow/WebRTC/REST
3 projects | news.ycombinator.com | 23 May 2023
Show HN: Willow – Open-Source Privacy-Focused Voice Assistant Hardware
13 projects | news.ycombinator.com | 15 May 2023
"Why not just transcribe the audio?" I thought
1 project | /r/ANMAPodcast | 22 Jan 2023
Ask HN: Why is pay so much higher in the US? (or is it?)
1 project | news.ycombinator.com | 12 Sep 2022
Show HN: 用284行C++语言实现最先进的德语语音识别 (Show HN: State-of-the-art German speech recognition in 284 lines of C++)
1 project | /r/hnzh | 11 Aug 2022
A note from our sponsor - InfluxDB
www.influxdata.com | 27 Apr 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source speech-recognition projects in C? This list will help you:

	Project	Stars
1	whisper.cpp	31,174
2	pocketsphinx	3,736
3	willow	2,361
4	julius	1,773
5	tevr-asr-tool	408
6	spchcat	392
7	vosk	354
8	whisper.rn	276
9	react-native-vosk	32
10	SLID-on-Microcontrollers	1