Top 14 automatic-speech-recognition Open-Source Projects

wenet

5 3,691 9.6 Python

Production First and Production Ready End-to-End Speech Recognition Toolkit

Project mention: Open Source Libraries | /r/AudioAI | 2023-10-02

wenet-e2e/wenet

awesome-speech-recognition-speech-synthesis-papers

0 2,870 3.5

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
STT

11 2,131 0.6 C++

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

Project mention: Rest in Peas: The Unrecognized Death of Speech Recognition (2010) | news.ycombinator.com | 2023-05-04

What has happened since then? I know Common Voice has come and gone https://en.wikipedia.org/wiki/Common_Voice https://github.com/coqui-ai/STT
And I've seen some neural approaches too
No idea where to look for comparisons though.

whisper-asr-webservice

11 1,617 7.7 Python

OpenAI Whisper ASR Webservice API

Project mention: How I converted a podcast into a knowledge base using Orama search and OpenAI whisper and Astro | dev.to | 2023-05-23

cheetah

5 552 8.3 Python

On-device streaming speech-to-text engine powered by deep learning (by Picovoice)
leopard

15 406 8.6 Python

On-device speech-to-text engine powered by deep learning
whisper-youtube

3 319 2.9 Jupyter Notebook

🔉 Youtube Videos Transcription with OpenAI's Whisper

Project mention: Magyar Youtube feliratozo v.1.1 (ingyenes colab) | /r/hungary | 2023-05-03

Source: https://colab.research.google.com/github/ArthurFDLR/whisper-youtube/blob/main/whisper_youtube.ipynb

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
soxan

1 218 10.0 Jupyter Notebook

Wav2Vec for speech recognition, classification, and audio classification
FAST-RIR

1 137 5.0 Python

This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
Auto-Subtitled-Video-Generator

3 65 3.7 Python

Input a YouTube video link or upload a video file and get a video with subtitles.
go-subgen

3 52 7.1 Go

Automatically generate subtitles for your media using whisper.cpp via webhooks with support for Radarr & Sonarr

Project mention: Self hosted call transcribing and searchable text solution | /r/selfhosted | 2023-06-09

you might be able to use go-subgen (it's my project) if you're willing/able to trigger it via http posts. There might also be a whisper frontend that can do filesystem watching that could work better, but I don't know of any off the top of my head.

ovos-stt-plugin-vosk

1 14 2.9 Python

vosk STT plugin for mycroft
werpy

0 9 9.3 Python

🐍📦 Rapidly calculate and analyze the Word Error Rate (WER) with this powerful yet lightweight Python package.
joureka-app

1 5 10.0 Python

joureka - Mit mehr Muße vom Interview zum Artikel!
SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

automatic-speech-recognition related posts

Magyar Youtube feliratozo v.1.1 (ingyenes colab)
2 projects | /r/hungary | 3 May 2023
Telex videón magyar és angol felirat demó (whisper-ctranslate2)
1 project | /r/hungary | 28 Apr 2023
Numen - FOSS voice control for handsfree computing
2 projects | /r/linux | 15 Mar 2023
I've built a few tools on top of GPT-3.5 (text generation, q&a with embeddings). AMA about resources and AI dev stacks for building with OpenAI's APIs
2 projects | /r/learnmachinelearning | 23 Feb 2023
I've built an Auto Subtitled Video Generator using Streamlit and OpenAI Whisper, hosted on HuggingFace spaces. All you have to do is input a YouTube video link and get a video with subtitles (alongside with .txt, .vtt, .srt files).
1 project | /r/programming | 13 Oct 2022
[Project] I've built an Auto Subtitled Video Generator using Streamlit and OpenAI Whisper, hosted on HuggingFace spaces.
1 project | /r/MachineLearning | 13 Oct 2022
Introducing Whisper
2 projects | /r/artificial | 22 Sep 2022
A note from our sponsor - InfluxDB
www.influxdata.com | 29 Apr 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source automatic-speech-recognition projects? This list will help you:

	Project	Stars
1	wenet	3,691
2	awesome-speech-recognition-speech-synthesis-papers	2,870
3	STT	2,131
4	whisper-asr-webservice	1,617
5	cheetah	552
6	leopard	406
7	whisper-youtube	319
8	soxan	218
9	FAST-RIR	137
10	Auto-Subtitled-Video-Generator	65
11	go-subgen	52
12	ovos-stt-plugin-vosk	14
13	werpy	9
14	joureka-app	5