C++ speech-recognition

Open-source C++ projects categorized as speech-recognition

Top 11 C++ speech-recognition Projects

  • DeepSpeech

    DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.

  • Project mention: ESpeak-ng: speech synthesizer with more than one hundred languages and accents | news.ycombinator.com | 2024-05-01

    As I understand it DeepSpeech is no longer actively maintained by Mozilla: https://github.com/mozilla/DeepSpeech/issues/3693

    For Text To Speech, I've found Piper TTS useful (for situations where "quality"=="realistic"/"natural"): https://github.com/rhasspy/piper

    For Speech to Text (which AIUI DeepSpeech provided), I've had some success with Vosk: https://github.com/alphacep/vosk-api

  • wav2letter

    Facebook AI Research's Automatic Speech Recognition Toolkit

  • openvino

    OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

  • Project mention: FLaNK Stack 05 Feb 2024 | dev.to | 2024-02-05
  • STT

    🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

  • subsync

    Subtitle Speech Synchronizer

  • Project mention: Using Whisper to transcribe the entire Forensic Files series | /r/DataHoarder | 2023-06-04

    I've found subsync to work flawlessly at timing subtitles, even with whisper transcripts.

  • athena

    An open-source implementation of a sequence-to-sequence based speech processing engine (by athena-team)

  • dsnote

    Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

  • Project mention: Speech Note: offline Linux app for note taking, reading and translating | news.ycombinator.com | 2023-08-30
  • CIDLib

    The CIDLib general purpose C++ development environment

  • Project mention: Remaining Relevant Over Four Decades | /r/programming | 2023-06-03
  • RuntimeSpeechRecognizer

    Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on OpenAI's Whisper, via whisper.cpp.

  • Project mention: Runtime Speech Recognizer - Open-source Whisper OpenAI Plugin for Unreal Engine | /r/unrealengine | 2023-06-03

  • fstalign

    An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.

  • react-native-wenet

    WeNet speech-to-text for React Native

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Index

What are some of the best open-source speech-recognition projects in C++? This list will help you:

#    Project                   Stars
1    DeepSpeech                24,508
2    wav2letter                6,340
3    openvino                  6,104
4    STT                       2,162
5    subsync                   1,219
6    athena                    944
7    dsnote                    362
8    CIDLib                    208
9    RuntimeSpeechRecognizer   214
10   fstalign                  140
11   react-native-wenet        10
